ISO 10754:1996(E)

Foreword

ISO (the International Organization for Standardization) is a worldwide federation of national standards bodies (ISO member bodies). The work of preparing International Standards is normally carried out through ISO technical committees. Each member body interested in a subject for which a technical committee has been established has the right to be represented on that committee. International organizations, governmental and non-governmental, in liaison with ISO, also take part in the work. ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of electrotechnical standardization.

Draft International Standards adopted by the technical committees are circulated to the member bodies for voting. Publication as an International Standard requires approval by at least 75 % of the member bodies casting a vote.

International Standard ISO 10754 was prepared by Technical Committee ISO/TC 46, Information and documentation, Subcommittee SC 4, Computer applications in information and documentation.

Annexes A, B and C of this International Standard are for information only.

© ISO 1996
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from the publisher.
International Organization for Standardization
Case Postale 56 • CH-1211 Genève 20 • Switzerland
Printed in Switzerland

INTERNATIONAL STANDARD ISO 10754:1996(E)

Information and documentation - Extension of the Cyrillic alphabet coded character set for non-Slavic languages for bibliographic information interchange

1 Scope

1.1 This International Standard specifies a set of 93 graphic characters with their coded representations. It consists of a code table and a legend showing each graphic, its use and its name. Explanatory notes are also included. The character set is primarily intended for the interchange of information among data processing systems and within message transmission systems.

1.2 These characters, together with characters in the basic Cyrillic set, registered as number 37 in the ISO international register, constitute a character set for the international interchange of bibliographic citations, including their annotations, in the non-Slavic Cyrillic alphabets for the languages specified in 1.3.

1.3 This character set is intended to handle information in the following language groups:

Abazian, Abkhasian, Adyghe, Aisor, Altaic, Avar, Azerbaijani, Balkar, Bashkir, Buryat, Chechen, Chukchi, Chuvash, Dargwa, Dungan, Eskimo, Even, Evenki, Gagauzi, Ingush, Kabardian, Kalmyk, Karachay, Kara-Kalpak, Karelian, Kazakh, Khakass, Khanty, Kirghiz, Komi, Koryak, Kumyk, Kurdish, Lak, Lezghian, Lithuanian, Mansi, Mari, Moldavian, Mongolian, Mordvin, Nenets, Nivkh, Nogai, Ossetic, Romany, Sami, Selkup, Shor, Tabasaran, Tajik, Tat, Tatar, Turkmen, Tuvinian, Udekhe, Udmurt, Uighur, Uzbek, Yakut

1.4 This coded character set contains characters used since the Russian Revolution (1917). Some letters which appear to be unrepresented in the character table are actually graphic variants. Obsolete letters, those used for only a brief period in the late 19th century, have been excluded from this International Standard. This applies chiefly to early letters used in Chechen, Chuvash, Dargwa, Lak and Lezghian. Letters from their 20th century alphabets are included.

2 Normative references

The following standards contain provisions which, through reference in this text, constitute provisions of this
International Standard. At the time of publication, the editions indicated were valid. All standards are subject to revision, and parties to agreements based on this International Standard are encouraged to investigate the possibility of applying the most recent editions of the standards indicated below. Members of IEC and ISO maintain registers of currently valid International Standards.

ISO/IEC 646:1991, Information technology - ISO 7-bit coded character set for information interchange.

ISO/IEC 2022:1994, Information technology - Character code structure and extension techniques.

International register of character sets to be identified by means of escape sequences. 1)

3 Implementation

3.1 The implementation of this coded character set in physical media and for transmission, taking into account the need for error checking, is the subject of other International Standards (see annex C).

3.2 The implementation of this International Standard is in accordance with the provisions of ISO/IEC 2022 2) and is identified by an escape sequence. (To be assigned.)

3.3 The unassigned positions in the code shall not be utilized in the international interchange of bibliographic information.
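
Clause 3.2 identifies this set by an ISO/IEC 2022 escape sequence whose final character is still to be assigned. The following Python sketch shows one way the G1, G2 and G3 designation sequences (ESC 2/9 F, ESC 2/10 F, ESC 2/11 F) might be assembled from column/row notation. The function names and the final byte 4/0 used in the usage note are illustrative assumptions, not values taken from this International Standard.

```python
ESC = 0x1B  # ESCAPE, position 1/11 in ISO/IEC 646


def position(column: int, row: int) -> int:
    """Convert column/row notation (for example 2/9) to a byte value."""
    if not (0 <= column <= 15 and 0 <= row <= 15):
        raise ValueError("column and row must each be in the range 0..15")
    return (column << 4) | row


# Intermediate bytes for designating a 94-character set to G1, G2 and G3.
INTERMEDIATE = {
    "G1": position(2, 9),
    "G2": position(2, 10),
    "G3": position(2, 11),
}


def designate(element: str, final_byte: int) -> bytes:
    """Build ESC I F for the given code element (hypothetical helper).

    final_byte is the "F" of the escape sequence. This standard leaves it
    unassigned, so the caller must supply whatever value applies.
    """
    if element not in INTERMEDIATE:
        raise ValueError("element must be one of G1, G2, G3")
    if not 0x30 <= final_byte <= 0x7E:
        raise ValueError("final byte must lie between 3/0 and 7/14")
    return bytes([ESC, INTERMEDIATE[element], final_byte])
```

With a placeholder final byte of 4/0, `designate("G2", position(4, 0))` yields the three bytes 1B 2A 40. The real final byte has to come from the registration of this set.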
1) Available on application to the Secretariat of the Registration Authority: ECMA, 114 rue du Rhône, CH-1204 Gen [...]

2) [...] G1: ESC 2/9 F; G2: ESC 2/10 F; G3: ESC 2/11 F ("F" represents the final character of the escape sequence).

4 Code table for extended Cyrillic characters of non-Slavic languages

Table 1 is the code table for extended Cyrillic characters of non-Slavic languages.

Table 1
[Code table not recoverable from the source. Surviving fragments show column bits b7, b6, b5, row values 5 to E, and the legend "Reserved for future standardization".]

5 Legend

Table 2 gives the code, graphic and name of each character and comments on usage.

Table 2

Code  Name
21    COMBINING ACUTE
22    COMBINING DIAERESIS (Dialytika)
23    COMBINING OGONEK
24    COMBINING RIGHT DESCENDER
25    COMBINING BREVE
26    COMBINING CEDILLA
27    COMBINING GRAVE
28    CYRILLIC SMALL LETTER A IE
29    CYRILLIC SMALL LETTER GHE WITH STROKE
2A    CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK
2B    CYRILLIC SMALL LETTER KOMI DE
2C    CYRILLIC SMALL LETTER KOMI DJE
2D    CYRILLIC SMALL LETTER ABKHASIAN DZE
2E    CYRILLIC SMALL LETTER KOMI DZE
2F    CYRILLIC SMALL LETTER KOMI ZJE

[The Graphic column and the entries for codes 30 to 3F onward are not recoverable from the source.]

[...] I) must not be confused with the Latin script capital "I". This sign is used in many Caucasian languages. It always follows a consonant and has the same form regardless of the case of the other letters in a word (e.g. rIanca). Although technically a sign (like a percent "%" sign), this character is given as the last letter of most Cyrillic-based alphabets. The notion of capitalization is
not applied to this sign, thus, it is assigned only one code in this International Standard.

6.4 The 14 characters coded in columns 2 and 3 of table 2 (positions 21-27 and 31-37) represent combining marks which are non-spacing characters, that is, characters whose use is not followed by the forward movement of an output device. In a character string, these non-spacing characters are input before the characters they modify. Multiple combining marks associated with one letter are to be encoded in the order in which they appear, reading left to right or top to bottom. They are intended to be combined with other spacing characters in this International Standard or characters from the basic Cyrillic set. These combining marks (e.g. diaeresis) are used liberally in the non-Slavic languages that have Cyrillic-based alphabets. The BACKSPACE character (hexadecimal code 08 in ISO/IEC 646) should not be used when encoding non-spacing characters.

6.5 The rendering of graphic characters is intended solely to identify uniquely the additional Cyrillic script letters used by non-Slavic languages. The graphics used do not necessarily represent the most desirable calligraphic forms.

6.6 The names of
characters (but not codes) have been made to correspond as much as possible to those assigned in ISO/IEC 10646-1.

Annex A
(informative)

Non-Slavic languages using Cyrillic script characters from this International Standard

Tables A.1 and A.2 show the usage by non-Slavic languages of Cyrillic script characters defined in this International Standard. Only lower-case letters are shown. Variant graphics are enclosed in parentheses. Table A.1 is a listing by non-Slavic language, presented alphabetically according to their English-language names. Table A.2 is a listing of non-spacing characters and identifies with which non-Slavic language each is used. Characters are presented in ascending order according to the codes assigned in this International Standard.

Table A.1 - Listing by non-Slavic language

Columns: Language; Characters and combined forms; Combining marks

Languages listed: Abazian, Abkhasian, Adygei, Aisor, Altai, Avar, Azerbaijani, Balkar, Bashkir, Buryat, Chechen, Chukchi, Chuvash, Dargwa, Dungan, Eskimo (Yuit), Even, Evenki, Gagauz, Ingush, Kabardian, Kalmyk, Karachay, Kara-Kalpak, Karelian, Kazakh, Khakass, Khanty (Kazym), Khanty (Shuryshkary)

[The Cyrillic characters and combining marks given for each language are not recoverable from the source.]
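
The input order described in 6.4, where non-spacing characters are entered before the character they modify, can be illustrated with a short Python sketch. It groups a sequence of code positions into clusters of one spacing character plus its preceding combining marks, and can reorder them so the marks follow the base. It assumes every item is a code position from this extension set (21-27 and 31-37 being combining marks); characters from the basic Cyrillic set would have to be tagged by set in a real system. All function names are hypothetical.

```python
# Combining (non-spacing) positions named in 6.4: hex 21-27 and 31-37.
COMBINING = set(range(0x21, 0x28)) | set(range(0x31, 0x38))
BACKSPACE = 0x08


def group_clusters(codes):
    """Return a list of (base, marks) tuples.

    Marks are kept in the order they were encoded, which 6.4 defines as
    left to right or top to bottom.
    """
    clusters = []
    pending = []  # combining marks waiting for their spacing character
    for code in codes:
        if code == BACKSPACE:
            raise ValueError("BACKSPACE should not be used with non-spacing characters")
        if code in COMBINING:
            pending.append(code)
        else:
            clusters.append((code, tuple(pending)))
            pending = []
    if pending:
        raise ValueError("combining mark without a following spacing character")
    return clusters


def marks_after_base(codes):
    """Reorder so each spacing character comes first, followed by its marks."""
    reordered = []
    for base, marks in group_clusters(codes):
        reordered.append(base)
        reordered.extend(marks)
    return reordered
```

For example, the input `[0x22, 0x28]` (COMBINING DIAERESIS followed by CYRILLIC SMALL LETTER A IE) forms one cluster with base 28 and mark 22. Reordering is only needed when converting to a convention that places combining marks after the base character.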