INTERNATIONAL STANDARD ISO/IEC 10367
First edition 1991-12-15

Information technology - Standardized coded graphic character sets for use in 8-bit codes

Technologies de l'information - Jeux de caractères graphiques cod[...]

[...] code
code table
control function
device
escape sequence
graphic character
graphic symbol
position
repertoire
user
5 Notation, code tables and names
5.1 Notation
5.2 Layout of the code tables
5.3 Names
5.3.1 NO-BREAK SPACE (NBSP)
5.3.2 SOFT HYPHEN (SHY)
6 Specification of the character sets
Code tables and lists of character names
Basic G0 Set
Latin Alphabet No. 1 Supplementary Set
Latin Alphabet No. 2 Supplementary Set
Latin Alphabet No. 3 Supplementary Set
Latin Alphabet No. 4 Supplementary Set
Latin Alphabet No. 5 Supplementary Set
Cyrillic Supplementary Set
Arabic Supplementary Set
Greek Supplementary Set
Hebrew Supplementary Set
Supplementary Set for Latin Alphabets No. 1 or No. 5, and No. 2
Basic Box Drawing Set

Annexes
A - Specification and use of the Supplementary Set of ISO/IEC 6937
B - Font design
C - List of the standardized character names

© ISO/IEC 1991
All rights reserved. No part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from the publisher.
ISO/IEC Copyright Office • Case Postale 56 • CH-1211 Genève 20 • Switzerland
Printed in Switzerland

ISO/IEC 10367:1991

Foreword

ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC participate in
the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the work.

In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1. Draft International Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication as an International Standard requires approval by at least 75 % of the national bodies casting a vote.

International Standard ISO/IEC 10367 was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology.

Annexes A and C form an integral part of this International Standard. Annex B is for information only.

Introduction

In the course of the past years two different 8-bit codes were developed by ISO/IEC JTC 1/SC 2.

An International Standard, ISO/IEC 6937, was developed with a view to satisfying the needs of CCITT-defined Telematic services: Teletex
and Videotex. It is based on a primary and supplementary set of graphic characters, the latter containing a series of so-called non-spacing diacritical marks for the generation of accented letters. Thus, the coded representation of a graphic character may consist of one or more bit combinations. The standard specifies the allowed repertoire of 333 characters. ISO/IEC 6937 is applicable not only to 8-bit coding but also to 7-bit coding.

A family of 8-bit code tables, ISO 8859, was also developed to satisfy a need for single-byte coded graphic characters, in particular in data processing applications. It consists of several parts, each specifying an 8-bit single-byte coded graphic character set of up to 191 characters. The selection of characters for each set is such that it satisfies the needs of several languages of a large, relatively coherent, geographical area. The different parts of ISO 8859 cover not only the Latin script but also the Arabic, Cyrillic, Greek and Hebrew scripts.

Many applications need a code structure which permits more than one script to be represented in information interchange, for example Latin Alphabet No. 1 (“Western” Europe) with the Greek script or Latin Alphabet No. 2 (“Eastern” Europe) with the Cyrillic script. For this purpose ISO/IEC 4873 is suitable since it allows up to 382 graphic characters grouped in four G sets, by using a small selection of the facilities offered by ISO 2022. Within this structure the graphic character sets from the various parts of ISO 8859 may be used in conjunction with each other. Since some characters appear in more than one of these sets, rules are needed to avoid violation of the principle of one-to-one relationship between a character and its coded representation when such sets are used together. The new
version of ISO/IEC 4873 (3rd edition of 1991) contains such rules.

In a similar way ISO/IEC 6937 may be used in conjunction with additional non-Latin graphic character sets taken from the parts of ISO 8859.

Thus this International Standard specifies a collection of coded graphic character sets usable within the structure of ISO/IEC 4873.

INTERNATIONAL STANDARD ISO/IEC 10367:1991(E)

Information technology - Standardized coded graphic character sets for use in 8-bit codes

1 Scope

This International Standard specifies a unique coded graphic character set for use as G0 set and a series of coded graphic character sets of up to 96 characters for use as the G1, G2 and G3 sets in versions of ISO/IEC 4873. All sets specified in this International Standard are shown as elements of an 8-bit code.

These sets are intended for use in data and text processing applications and may also be used for information interchange. They contain graphic characters used for general purpose applications in typical office environments.

This International Standard does not specify the control functions to be allocated to the C0 and C1 sets of versions of ISO/IEC 4873. ISO/IEC 6429 specifies these control functions; the required control functions shall be selected from that International Standard depending on the application considered.

2 Conformance

2.1 Conformance of information interchange

A coded-character-data-element (CC-data-element) within coded information for interchange is in conformance with this International Standard if all the coded representations of characters within that CC-data-element conform to the requirements of clause 6 and of ISO/IEC 4873. A claim of conformance shall identify the selected character sets.

2.2 Conformance of devices

A device is in conformance with this International Standard if it conforms to the requirements of 2.2.1, and either or both of 2.2.2 and 2.2.3. A claim of conformance shall identify the document which contains the description specified in 2.2.1, and shall identify the selected character sets.

2.2.1 Device description

A device that conforms to this International Standard shall be the subject of a description that identifies the means by which the user may supply characters to the device, or may recognize them when they are made available to him, as specified respectively in 2.2.2 and 2.2.3.

2.2.2 Originating devices

An originating device
shall allow its user to supply any sequence of characters from the selected character sets, and shall be capable of transmitting their coded representations within a CC-data-element.

2.2.3 Receiving devices

A receiving device shall be capable of receiving and interpreting any coded representations of characters that are within a CC-data-element, and that conform to 2.1, and shall make the corresponding characters available to its user in such a way that the user can identify them from among those from the selected character sets, and can distinguish them from each other.

3 Normative references

The following standards contain provisions which, through reference in this text, constitute provisions of this International Standard. At the time of publication, the editions indicated were valid. All standards are subject to revision, and parties to agreements based on this International Standard are encouraged to investigate the possibility of applying the most recent editions of the standards listed below. Members of IEC and ISO maintain registers of currently valid International Standards.

ISO 2022:1986, Information technology - ISO 7-bit and 8-bit coded character sets - Code extension techniques.

ISO/IEC 4873:1991, Information technology - 8-bit code for information interchange - Structure and rules for implementation.

ISO/IEC 6429:-1), Information technology - Control functions for 7-bit and 8-bit coded character sets.

ISO/IEC 6937:-1), Information technology - Coded graphic character set for the communication of texts using the Latin alphabet.

ISO/IEC 8859, Information processing - 8-bit single-byte coded graphic character sets.

International Register of Coded Character Sets to Be Used with Escape Sequences. (ISO 2375)

4 Definitions

For the purpose of
this International Standard the following definitions apply.

4.1 bit combination: An ordered set of bits used for the representation of characters.

4.2 character: A member of a set of elements used for the organization, control or representation of data.

4.3 coded-character-data-element (CC-data-element): An element of interchanged information that is specified to consist of a sequence of coded representations of characters, in accordance with one or more identified standards for coded character sets.

NOTES
1. In a communication environment according to the Reference Model for Open Systems Interconnection (ISO 7498), a CC-data-element will form all or part of the information that corresponds to the Presentation-Protocol-Data-Unit (PPDU) defined in that International Standard.
2. When information interchange is accomplished by means of interchangeable media, a CC-data-element will form all or part of the information that corresponds to the user data, and not that recorded during formatting and initialization.

4.4 coded character set; code: A set of unambiguous rules that establishes a character set and the one-to-one relationship between the characters of the set and their coded representation by one or more bit combinations.

4.5 code table: A table showing the character allocated to each bit combination in a code.

4.6 control function: An action that affects the recording, processing, transmission, or interpretation of data, and that has a coded representation consisting of one or more bit combinations.

4.7 device: A component of information processing equipment which can transmit and/or receive coded information within CC-data-elements.

NOTE It may be an input/output device in the conventional sense, or a process such as an application program or a gateway function.

4.8 escape sequence: A string of bit combinations that are used for control purposes in code extension procedures. The first of these bit combinations represents the control function ESCAPE.

1) To be published.

4.9 graphic character: A character, other than a control function, that has a visual representation normally handwritten, printed or displayed, and that has a coded representation consisting of one or more bit combinations.

4.10 graphic symbol: A visual representation of a graphic character or of a control function.

4.11 position: That part of a code table identified by its column and row co-ordinates.

4.12 repertoire: A specified set of characters that are represented by means of one or more bit combinations of a coded character set.

4.13 user: A person or other entity that invokes the services provided by a device.

NOTES
1. This entity may be a process such as an application program if the “device” is a code convertor or a gateway function, for example.
2. The characters, as supplied by the user or made available to him, may be in the form of codes local to the device, or of non-conventional visible representations, provided that 2.2 above is satisfied.

5 Notation, code tables and names

Each coded character set is represented as a part of an 8-bit code table. These sets are identical with those of the International Register where they are shown in a 7-bit environment.

5.1 Notation

The bits of the bit combination of the 8-bit code are identified
by b8, b7, b6, b5, b4, b3, b2 and b1, where b8 is the highest-order, or most-significant bit and b1 is the lowest-order, or least-significant bit.

The bit combinations may be interpreted to represent integers in the range 0 to 255 in binary notation by attributing the following weights to the individual bits.

Bit      b8   b7   b6   b5   b4   b3   b2   b1
Weight   128  64   32   16   8    4    2    1

Using these weights, these 8-bit combinations represent numbers in the range 0 to 255.

The bit combinations are identified by notations of the form xx/yy, where xx and yy are numbers in the range 00 to 15. The correspondence between the notations of the form xx/yy and the bit combinations consisting of the bits b8 to b1 is as follows:

- xx is the number represented by b8, b7, b6 and b5 where these bits are given the weights 8, 4, 2 and 1 respectively;
- yy is the number represented by b4, b3, b2 and b1 where these bits are given the weights 8, 4, 2 and 1 respectively.

The notations of the form xx/yy are the same as those used to identify code table positions, where xx is the column number and yy is the row number (see 5.2).

5.2 Layout of the code tables

An [... text and code table illegible]
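As an illustration of the notation defined in 5.1, the following Python sketch applies the bit weights and converts between 8-bit values and the xx/yy form. It is a minimal example only: the names `BIT_WEIGHTS`, `bits_to_value`, `to_position` and `from_position` are assumptions made for this illustration and are not part of this International Standard.

```python
# Illustrative sketch of the notation in 5.1. All names below are
# hypothetical helpers, not identifiers defined by ISO/IEC 10367.

BIT_WEIGHTS = (128, 64, 32, 16, 8, 4, 2, 1)  # weights of b8, b7, ..., b1


def bits_to_value(bits):
    """Return the integer 0..255 for eight bits given as (b8, ..., b1)."""
    if len(bits) != 8 or any(b not in (0, 1) for b in bits):
        raise ValueError("expected eight bits, each 0 or 1")
    return sum(weight * bit for weight, bit in zip(BIT_WEIGHTS, bits))


def to_position(value):
    """Format an 8-bit value in the xx/yy (column/row) notation."""
    if not 0 <= value <= 255:
        raise ValueError("value must be in the range 0 to 255")
    # b8..b5 (weights 8, 4, 2, 1) give xx; b4..b1 (weights 8, 4, 2, 1) give yy.
    column, row = divmod(value, 16)
    return f"{column:02d}/{row:02d}"


def from_position(notation):
    """Parse an xx/yy notation back into the 8-bit value."""
    column_text, row_text = notation.split("/")
    column, row = int(column_text), int(row_text)
    if not (0 <= column <= 15 and 0 <= row <= 15):
        raise ValueError("xx and yy must be in the range 00 to 15")
    return column * 16 + row


if __name__ == "__main__":
    print(bits_to_value((1, 0, 1, 0, 0, 0, 0, 0)))  # 160
    print(to_position(173))                          # 10/13
    print(from_position("12/05"))                    # 197
```

Splitting the value with `divmod(value, 16)` is equivalent to weighting b8 to b5 and b4 to b1 separately with 8, 4, 2 and 1, so the column and row numbers can be read directly from the two halves of the bit combination.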
Table 10 - Names of the characters of table 9

Bit comb.  Name
10/00      NO-BREAK SPACE
10/01      LATIN CAPITAL LETTER A WITH OGONEK
10/02      LATIN SMALL LETTER KRA (Greenlandic)
10/03      LATIN CAPITAL LETTER R WITH CEDILLA
10/04      CURRENCY SIGN
10/05      LATIN CAPITAL LETTER I WITH TILDE
10/06      LATIN CAPITAL LETTER L WITH CEDILLA
10/07      SECTION SIGN
10/08      DIAERESIS
10/09      LATIN CAPITAL LETTER S WITH CARON
10/10      LATIN CAPITAL LETTER E WITH MACRON
10/11      LATIN CAPITAL LETTER G WITH CEDILLA
10/12      LATIN CAPITAL LETTER T WITH STROKE
10/13      SOFT HYPHEN
10/14      LATIN CAPITAL LETTER Z WITH CARON
10/15      MACRON
11/00      DEGREE SIGN
11/01      LATIN SMALL LETTER A WITH OGONEK
11/02      OGONEK
11/03      LATIN SMALL LETTER R WITH CEDILLA
11/04      ACUTE ACCENT
11/05      LATIN SMALL LETTER I WITH TILDE
11/06      LATIN SMALL LETTER L WITH CEDILLA
11/07      CARON
11/08      CEDILLA
11/09      LATIN SMALL LETTER S WITH CARON
11/10      LATIN SMALL LETTER E WITH MACRON
11/11      LATIN SMALL LETTER G WITH CEDILLA
11/12      LATIN SMALL LETTER T WITH STROKE
11/13      LATIN CAPITAL LETTER ENG (Lappish)
11/14      LATIN SMALL LETTER Z WITH CARON
11/15      LATIN SMALL LETTER ENG (Lappish)
12/00      LATIN CAPITAL LETTER A WITH MACRON
12/01      LATIN CAPITAL LETTER A WITH ACUTE
12/02      LATIN CAPITAL LETTER A WITH CIRCUMFLEX
12/03      LATIN CAPITAL LETTER A WITH TILDE
12/04      LATIN CAPITAL LETTER A WITH DIAERESIS
12/05      LATIN CAPITAL LETTER A WITH RING ABOVE
12/06      LATIN