1、INTERNATIONAL STANDARD ISOIIEC 10367 First edition 1991-12-15 -_ Information technology - Standardized coded graphic character sets for use in 8-bit codes Technologies de /information - Jeux de caracteres graphiques cod code code table control function device escape sequence graphic character graphi
2、c symbol position repertoire user 5 Notation, code tables and names 5.1 Notation 5.2 Layout of the code tables 5.3 Names 5.3.1 NO-BREAK SPACE (NBSP) 5.3.2 SOFT HYPHEN (SHY) 6 Specification of the character sets 4 Code tables and lists of character names 5 Basic GO Set 6 Latin Alphabet No. 1 Suppleme
3、ntary Set 8 2 2 2 2 2 2 2 2 2 2 3 3 3 3 3 4 4 8 ISOIIEC 1991 All rights reserved. No part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permlsslon In writing from the publisher. ISO/IEC Copyrlght
4、 Office l Case Fostale 56 l Cl-i-1211 Genkve 20 l Switzerland Printed in Switzerland ii Copyright American National Standards Institute Provided by IHS under license with ANSINot for ResaleNo reproduction or networking permitted without license from IHS-,-,-ISO/IEC 10367:1991 Latin Alphabet No. 2, S
5、upplementary Set Latin Alphabet No. 3, Supplementary Set Latin Alphabet No. 4, Supplementary Set Latin alphabet No. 5, Supplementary Set Cyrillic Supplementary Set Arabic Supplementary Set Greek Supplementary Set Hebrew Supplementary Set Supplementary Set for Latin Alphabets No. 1 or No. 5, and No.
6、2 Basic Box Drawing Set Annexes A - Specification and use of the Supplementary Set of ISO/IEC 6937 B - Font design C - List of the standardized character names 10 12 14 16 18 20 22 24 26 28 30 34 35 . . . III Copyright American National Standards Institute Provided by IHS under license with ANSINot
7、for ResaleNo reproduction or networking permitted without license from IHS-,-,-ISO/IEC 10367:1991 Foreword IS0 (the International organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that
8、are members of IS0 or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. IS0 and IEC technical committees collaborate in fields of mutual interest. Other internati
9、onal organizations, governmental and non- governmental, in liaison with IS0 and IEC, also take part in the work. In the field of information technology, IS0 and IEC have established a joint technical committee, ISO/IEC JTCl. Draft International Standards adopted by the joint technical committee are
10、circulated to national bodies for voting. Publication as an International Standard requires approval by at least 75% of the national bodies casting a vote. International Standard ISO/IEC 10367 was prepared by Joint Technical Committee ISO/IEC JTCI, Information technology. Annexes A and C form an int
11、egral part of this International Standard. Annex B is for information only. iv Copyright American National Standards Institute Provided by IHS under license with ANSINot for ResaleNo reproduction or networking permitted without license from IHS-,-,-ISO/IEC 10367:1991 Introduction In the course of th
12、e past years two different g-bit codes were developed by ISO/IEC/JTCl/SCZ. An International Standard, ISO/IEC 6937, was developed with a view to satisfying the needs of CCITT-defined Telematic services: Teletex and Videotex. It is based on a primary and supplementary set of graphic characters, the l
13、atter containing a series of so-called non-spacing diacritical marks for the generation of accented letters. Thus, the coded representation of a graphic character may consist of one or more bit combinations. The standard specifies the allowed repertoire of 333 characters. ISO/IEC 6937 is applicable
14、not only to 8-bit coding but also to 7-bit coding. A family of S-bit code tables, IS0 8859, was also developed to satisfy a need for single-byte coded graphic characters in particular in data processing applications. It consists of several parts, each specifying an 8-bit single- byte coded graphic c
15、haracter set of up to 191 characters. The selection of characters for each set is such that it satisfies the needs of several languages of a large, relatively coherent, geographical area. The different parts of 1SO 8859 cover not only the Latin script but also the Arabic, Cyrillic, Greek and Hebrew
16、scripts. Many applications need a code structure which permits more than one script to be represented in information interchange, for example Latin Alphabet No. 1 (“Western” Europe) with the Greek script or Latin Alphabet No. 2 (“Eastern” Europe) with the Cyrillic script. For this purpose ISO/IEC 48
17、73 is suitable since it allows up to 382 graphic characters grouped in four G sets, by using a small selection of the facilities offered by IS0 2022. Within this structure the graphic character sets from the various parts of IS0 8859 may be used in conjunction with each other. Since some characters
18、appear in more than one of these sets, rules are needed to avoid violation of the principle of one-to-one relationship between a character and its coded representation when such sets are used together. The new version of ISO/lEC 4873 (3rd edition of 1991) contains such rules. In a similar way ISO/IE
19、C 6937 may be used in conjunction with additional non-Latin graphic character sets taken from the parts of IS0 8859. Thus this International Standard specifies a collection of coded graphic character sets usable within the structure of ISO/IEC 4873. Copyright American National Standards Institute Pr
20、ovided by IHS under license with ANSINot for ResaleNo reproduction or networking permitted without license from IHS-,-,-INTERNATIONAL STANDARD ISOllEC 10367:1991(E) Information technology - Standardized coded graphic character sets for use in 8-bit codes 1 Scope This International Standard specifies
21、 a unique coded graphic character set for use as GO set and a series of coded graphic character sets of up to 96 characters for use as the Cl, G2 and G3 sets in versions of lSO/lEC 4873. All sets specified in this International Standard are shown as elements of an g-bit code. These sets are intended
22、 for use in data and text processing applications and may also be used for information interchange. They contain graphic characters used for general purpose applications in typical office environments. This International Standard does not specify the control functions to be allocated to the CO and C
23、l sets of versions of ISO/IEC 4873. ISO/IEC 6429 specifies these control functions; the required control functions shall be selected from that International Standard depending on the application considered. 2 Conformance 2.1 Conformance of information interchange A coded-character-data-element (CC-d
24、ata-element) within coded information for interchange is in conformance with this International Standard if all the coded representations of characters within that CC-data-element conform to the requirements of clause 6 and of ISOllEC 4873. A claim of conformance shall identify the selected characte
25、r sets. 2.2 Conformance of devices A device is in conformance with this International Standard if it conforms to the requirements of 2.2.1, and either or both of 2.2.2 and 2.2.3. A claim of conformance shall identify the document which contains the description specified in 2.2.1, and shall identify
26、the selected character sets. 2.2.1 Device description A device that conforms to this International Standard shall be the subject of a description that identifies the means by which the user may supply characters to the device, or may recognize them when they are made available to him, as specified r
27、espectively in 2.2.2 and 2.2.3. 2.2.2 Originating devices An originating device shall allow its user to supply any sequence of characters from the selected character sets, and shall be capable of transmitting their coded representations within a CC-data-element. 2.2.3 Receiving devices A receiving d
28、evice shall be capable of receiving and interpreting any coded representations of characters that are within a CC-data-element, and that conform to 2.1, and shall make the corresponding characters available to its user in such a way that the user can identify them from among those from the selected
29、character sets, and can distinguish them from each other. Copyright American National Standards Institute Provided by IHS under license with ANSINot for ResaleNo reproduction or networking permitted without license from IHS-,-,-ISO/IEC 10367:1991 3 Normative references The following standards contai
30、n provisions which, through reference in this text, constitute provisions of this International Standard. At the time of publication, the editions indicated were valid. All standards are subject to revision, and parties to agreements based on this International Standard are encouraged to investigate
31、 the possi- bility of applying the most recent editions of the standards listed below. Members of IEC and IS0 maintain registers of currently valid lnternational Standards. IS0 2022: 1986, Information technology - IS0 7-bit and 8-bir coded character sets - Code extension techniques ISO/lEC 4873:1991
32、, Information technology - g-bit code for information interchange - Structure and rules for implementation. ISO/lEC 6429: ), Information rechnology - Control functions for 7-bit and 8-bit coded character sets. ISO/IEC 6937: , ) Information technology - Coded graphic character set for the communicati
33、on of texts using lhe Latin alphabet. lSO/IEC 8859, Information processing - 8-bil single-byte coded graphic character sets. lnternarional Register of Coded Character Sets to Be Used wifh Escape Sequences. (IS0 2375) 4 Definitions For the purpose of this International Standard the following definiti
34、ons apply. 4.1 bit combination: An ordered set of bits used for the representation of characters. 4.2 character: A member of a set of elements used for the organization, control or representation of data. 4.3 coded-character-data-element (CC-data-element): An element of interchanged information that
35、 is specified to consist of a sequence of coded representations of characters, in accordance with one or more identified standards for coded character sets. NOTES 1. In a communication environment according to the Reference Model for Open Systems Interconnection (IS0 74%). a CC- data-element will fo
36、rm all or part of the information that corresponds to the Presentation-Protocol-Data-Unit (PPDU) defined in that International Standard. 2. When information interchange is accomplished by means of interchangeable media, a CC-data-element will form all or part of the information that corresponds to t
37、he user data, and not that recorded during formatting and initialization. 4.4 coded character set; code: A set of unambiguous rules that establishes a character set and the one-to-one relationship between the characters of the set and their coded representation by one or more bit combinations. 4.5 c
38、ode table: A table showing the character allocated to each bit combination in a code. 4.6 control function: An action that affects the recording, processing, transmission, or interpretation of data, and that has a coded representation consisting of one or more bit combinations. 4.7 device: A compone
39、nt of information processing equipment which can transmit and/or receive coded information within CC-data-elements. NOTE It may be an input/output device in the conventional sense. or a process such as an application program or a gateway function. 4.8 escape sequence: A string of bit combinations th
40、at are used for control purposes in code extension procedures. The first of these bit combinations represents the control function ESCAPE. 1) To be published. 2 Copyright American National Standards Institute Provided by IHS under license with ANSINot for ResaleNo reproduction or networking permitte
41、d without license from IHS-,-,-ISO/IEC 10367:1991 4.9 graphic character: A character, other than a control function, that has a visual representation normally handwritten, printed or displayed, and that has a coded representation consisting of one or more bit combina- tions. 4.10 graphic symbol: A v
42、isual representation of a graphic character or of a control function. 4.11 position: That part of a code table identified by its column and row co-ordinates. 4.12 repertoire: A specified set of characters that are represented by means of one or more bit combinations of a coded character set. 4.13 us
43、er: A person or other entity that invokes the services provided by a device. NOTES I. This entity may be a process such as an application program if the “device ” is a code convertor or a gateway function, for example. 2. The characters, as supplied by the user or made available to him, may be in th
44、e form of codes local to the device, or of non- conventional visible representations. provided that 2.2 above is satisfied. 5 Notation, code tables and names Each coded character set is represented as a part of an S-bit code table. These sets are identical with those of the International Register wh
45、ere they are shown in a 7-bit environment. 5.1 Notation The bits of the bit combination of the S-bit code are identified by b8, b7, b6, b5, b4, b3, b2 and bl, where b8 is the highest-order, or most-significant bit and bl is the lowest-order, or least-significant bit. The bit combinations may be inte
46、rpreted to represent integers in the range 0 to 255 in binary notation by attributing the following weights to the individual bits. Bit b8 b7 b6 b5 b4 b3 b2 bl Weight 128 64 32 16 8 4 2 1 Using these weights, these 8-bit combinations represent numbers in the range 0 to 255. The bit combinations are
47、identified by notations of the form xx/yy, where xx and yy are numbers in the range 00 to 15. The correspondence between the notations of the form xx/yy and the bit combinations consisting of the bits b8 to bl is as follows: - xx is the number represented by b8, b7, b6 and b5 where these bits are gi
48、ven the weights 8, 4, 2 and 1 respectively; - yy is the number represented by b4, b3, b2 and bl where these bits are given the weights 8, 4, 2 and 1 respectively. The notations of the form xx/yy are the same as those used to identify code table positions, where xx is the column number and yy is the
49、row number (see 5.2). 5.2 Layout of the code tables An r - 6 2 zi a - 8 k 6 6 ae T 7 i A . 08 c i e 5 6 - I E F i! ij - cl - iI - . I I / I i i f i 14 Copyright American National Standards Institute Provided by IHS under license with ANSINot for ResaleNo reproduction or networking permitted without license from IHS-,-,-lSO/lEC 10367: 1991 Table 10 - Names of the characters of table 9 -ir comb. - 1 o/o0 lO/Ol 10102 10103 1 o/o4 lo/o5 1 O/O6