1、BRITISH STANDARD BS ISO/IEC 8859-9:1999 Information technology 8-bit single-byte coded graphic character sets Part 9: Latin alphabet No. 5 ICS 35.040BSISO/IEC8859-9:1999 This British Standard, having been prepared under the directionof the DISC Board, waspublished under the authorityof the Standards
2、 Committee and comes intoeffecton 15April1999 BSI 02-2000 ISBN 0 580 29210 X National foreword This British Standard reproduces verbatim ISO/IEC8859-9:1999 and implements it as the UK national standard. The UK participation in its preparation was entrusted to Technical Committee IST/2, Character Set
3、s and Information Coding, which has the responsibility to: aid enquirers to understand the text; present to the responsible international/European committee any enquiries on the interpretation, or proposals for change, and keep the UK interests informed; monitor related international and European de
4、velopments and promulgate them in the UK. A list of organizations represented on this committee can be obtained on request to its secretary. Cross-references The British Standards which implement international or European publications referred to in this document may be found in the BSI Standards Ca
5、talogue under the section entitled “International Standards Correspondence Index”, or by using the “Find” facility of the BSI Standards Electronic Catalogue. A British Standard does not purport to include all the necessary provisions of a contract. Users of British Standards are responsible for thei
6、r correct application. Compliance with a British Standard does not of itself confer immunity from legal obligations. Summary of pages This document comprises a front cover, an inside front cover, pagesi andii, theISO/IEC title page, pagesii toiv, pages1 to10 and a back cover. This standard has been
7、updated (see copyright date) and may have had amendments incorporated. This will be indicated in the amendment table on the inside front cover. Amendments issued since publication Amd. No. Date CommentsBSISO/IEC8859-9:1999 BSI 02-2000 i Contents Page National foreword Inside front cover Foreword iii
8、 Text of ISO/IEC 8859-9 1ii blankBSISO/IEC8859-9:1999 ii BSI 02-2000 Contents Page Foreword iii Introduction 1 1 Scope 1 2 Conformance 1 3 Normative references 1 4 Definitions 2 5 Notation, code table and names 2 6 Specification of the coded character set 3 7 Identification of the character set 8 An
9、nex A (informative) Coverage of languages by parts 1 to 10 of ISO/IEC 8859 9 Annex B (informative) Main differences between the first edition and this second edition of this part of ISO/IEC 8859 10 Annex C (informative) Bibliography 10 Table 1 Character set, coded representation 4 Table 2 Code table
10、 of Latin alphabet No. 5 7 Table A.1 Language coverage 9BSISO/IEC8859-9:1999 BSI 02-2000 iii Foreword ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are m
11、embers of ISOorIEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISOandIEC technical committees collaborate in fields of mutual interest. Other international orga
12、nizations, governmental and nongovernmental, in liaison with ISOandIEC, also take part in the work. In the field of information technology, ISOandIEC have established a joint technical committee, ISO/IECJTC1. Draft International Standards adopted by the joint technical committee are circulated to na
13、tional bodies for voting. Publication as an International Standard requires approval by at least75% of the national bodies casting a vote. International Standard ISO/IEC8859-9 was prepared by Joint Technical Committee ISO/IECJTC1, Information technology, Subcommittee SC2, Coded character sets. This
14、second edition cancels and replaces the first edition (ISO/IEC8859-9:1989) which has been technically revised. ISO/IEC 8859 consists of the following parts, under the general title Information technology 8-bit single-byte coded graphic character sets: Part 1: Latin alphabet No. 1; Part 2: Latin alph
15、abet No. 2; Part 3: Latin alphabet No. 3; Part 4: Latin alphabet No. 4; Part 5: Latin/Cyrillic alphabet; Part 6: Latin/Arabic alphabet; Part 7: Latin/Greek alphabet; Part 8: Latin/Hebrew alphabet; Part 9: Latin alphabet No. 5; Part 10: Latin alphabet No. 6. Annex A to Annex C of this part of ISO/IEC
16、8859 are for information only.iv blankBSISO/IEC8859-9:1999 BSI 02-2000 1 Introduction ISO/IEC 8859 consists of several parts. Each part specifies a set of up to191 graphic characters and the coded representation of these characters by means of a single 8-bit byte. Each set is intended for use for a
17、particular group of languages. 1 Scope This part of ISO/IEC8859 specifies a set of191 coded graphic characters identified as Latin alphabet No.5. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains g
18、raphic characters used for general purpose applications in typical office environments in at least the following languages: Albanian, Basque, Breton, Catalan, Danish, Dutch, English, Faroese, Finnish, French (with restrictions, seeAnnexA.1, Notes), Frisian, Galician, German, Greenlandic, Irish Gaeli
19、c (new orthography), Italian, Latin, Luxemburgish, Norwegian, Portuguese, Rhaeto-Romanic, Scottish Gaelic, Spanish, Swedish and Turkish. This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC2022 or ISO/IEC4873 at level1. This part of ISO/IEC8859 may
20、not be used in conjunction with any other parts of ISO/IEC8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC10367 should be used instead within a version of ISO/IEC4873 at level2 or level3
21、. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause6). NOTEISO/IEC8859 is not intended for use with Telematic
22、services defined by ITU-T. If information coded according to ISO/IEC8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point. 2 Conformance 2.1 Conformance of information interchange A coded-character-data-element (CC-data-element)
23、 within coded information for interchange is in conformance with this part of ISO/IEC8859 if all the coded representations of graphic characters within that CC-data-element conform to the requirements of clause6. 2.2 Conformance of devices A device is in conformance with this part of ISO/IEC8859 if
24、it conforms to the requirements of2.2.1, and either or both of2.2.2 and2.2.3. Aclaim of conformance shall identify the document which contains the description specified in2.2.1. 2.2.1 Device description A device that conforms to this part of ISO/IEC8859 shall be the subject of a description that ide
25、ntifies the means by which the user may supply characters to the device, or may recognize them when they are made available to him, as specified respectively in2.2.2 and2.2.3. 2.2.2 Originating devices An originating device shall allow its user to supply any sequence of characters from those specifi
26、ed in clause6, and shall be capable of transmitting their coded representations within a CC-data-element. 2.2.3 Receiving devices A receiving device shall be capable of receiving and interpreting any coded representations of characters that are within a CC-data-element, and that conform to clause6,
27、and shall make the corresponding characters available to its user in such a way that the user can identify them from among those specified there, and can distinguish them from each other. 3 Normative references The following standards contain provisions which, through reference in this text, constit
28、ute provisions of this part of ISO/IEC8859. At the time of publication, the editions indicated were valid. All standards are subject to revision, and parties to agreements based on this part of ISO/IEC8859 are encouraged to investigate the possibility of applying the most recent editions of the stan
29、dards indicated below. Members of IECandISO maintain registers of currently valid International Standards. ISO/IEC 2022:1994, Information technology Character code structure and extension techniques. ISO/IEC 4873:1991, Information technology ISO8-bit code for information interchange Structure and ru
30、les for implementation. ISO/IEC 8824-1:1995, Information technology Abstract Syntax Notation One (ASN.1): Specification of basic notation. BSISO/IEC8859-9:1999 2 BSI 02-2000 4 Definitions For the purposes of this part of ISO/IEC8859 the following definitions apply. 4.1 bit combination an ordered set
31、 of bits used for the representation of characters 4.2 byte a bit string that is operated upon as a unit 4.3 character a member of a set of elements used for the organization, control, or representation of data 4.4 code table a table showing the characters allocated to each bit combination in a code
32、 4.5 coded character set; code a set of unambiguous rules that establishes a character set and the one-to-one relationship between the characters of the set and their bit combinations 4.6 coded-character-data-element (CC-data-element) an element of interchanged information that is specified to consi
33、st of a sequence of coded representations of characters, in accordance with one or more identified standards for coded character sets 4.7 graphic character a character, other than a control function, that has a visual representation normally handwritten, printed or displayed, and that has a coded re
34、presentation consisting of one or more bit combinations NOTEIn ISO/IEC8859 a single bit combination is used to represent each character. 4.8 graphic symbol a visual representation of a graphic character or of a control function 4.9 position that part of a code table identified by its column and row
35、coordinates 5 Notation, code table and names 5.1 Notation The bits of the bit combinations of the 8-bit code are identified by b 8 , b 7 , b 6 , b 5 , b 4 , b 3 , b 2 , and b 1 , where b 8is the highest-order, or most-significant bit and b 1is the lowest-order, or least-significant bit. The bit comb
36、inations may be interpreted to represent numbers in binary notation by attributing the following weights to the individual bits: Using these weights, the bit combinations are identified by notations of the form xx/yy, where xx and yy are numbers in the range 00to15. The correspondence between the no
37、tations of the form xx/yy and the bit combinations consisting of the bits b 8to b 1is as follows: xx is the number represented by b 8 , b 7 , b 6and b 5where these bits are given the weights8,4,2, and1 respectively. yy is the number represented by b 4 , b 3 , b 2and b 1where these bits are given the
38、 weights8,4,2, and1 respectively. The bit combinations are also identified by notations of the form hk, where h and k are numbers in the range0 toF in hexadecimal notation. The number h is the same as the number xx described above, and the numberk the same as the number yy described above. 5.2 Layou
39、t of the code table An 8-bit code table consists of256 positions arranged in16 columns and16 rows. The columns and the rows are numbered00 to15. In hexadecimal notation the columns and the rows are numbered0 toF. The code table positions are identified by notations of the form xx/yy, where xx is the
40、 column number and yy is the row number. The column and row numbers are shown at the top and left edges of the table respectively. The code table positions are also identified by notations of the form hk, where h is the column number and k is the row number in hexadecimal notation. The column and ro
41、w numbers are shown at the bottom and right edges of the table respectively. The positions of the code table are in one-to-one correspondence with the bit combinations of the code. The notation of a code table position, of the form xx/yy, or of the form hk, is the same as that of the corresponding b
42、it combination. Bit b 8 b 7 b 6 b 5 b 4 b 3 b 2 b 1 Weight 128 64 32 16 8 4 2 1BSISO/IEC8859-9:1999 BSI 02-2000 3 5.3 Names and meanings This part of ISO/IEC8859 assigns a unique name and a unique identifier to each graphic character. These names and identifiers have been taken from ISO/IEC10646-1 (
43、E). This part of ISO/IEC8859 also specifies an acronym for each of the characters SPACE, NO-BREAK SPACE and SOFT HYPHEN. For acronyms only Latin capital letters A to Z are used. It is intended that the acronyms be retained in all translations of the text. Except for SPACE (SP), NO-BREAK SPACE (NBSP)
44、 and SOFT HYPHEN (SHY), this part of ISO/IEC8859 does not define and does not restrict the meanings of graphic characters. This part of ISO/IEC8859 specifies a graphic symbol for each graphic character. This symbol is shown in the corresponding position of the code table. However, this part, or any
45、other part, of ISO/IEC8859 does not specify a particular style or font design for imaging graphic characters. Annex B of ISO/IEC10367 gives further information on this subject. 5.3.1 SPACE (SP) A graphic character the visual representation of which consists of the absence of a graphic symbol. 5.3.2
46、NO-BREAK SPACE (NBSP) A graphic character the visual representation of which consists of the absence of a graphic symbol, for use when a line break is to be prevented in the text as presented. 5.3.3 SOFT HYPHEN (SHY) A graphic character that is imaged by a graphic symbol identical with, or similar t
47、o, that representing HYPHEN, for use when a line break has been established within a word. 6 Specification of the coded character set This part of ISO/IEC8859 specifies 191 characters allocated to the bit combinations of the code table (Table 2). None of these characters are combining characters. NO
48、TECombining characters are described in ISO/IEC2022:1994 subclause 6.3.3. Control functions, such as BACKSPACE or CARRIAGE RETURN, shall not be used to create composite graphic symbols, which are made up from the graphic representations of two or more characters. 6.1 Characters of the set and their
49、coded representation SeeTable 1.BSISO/IEC8859-9:1999 4 BSI 02-2000 Table 1 Character set, coded representation Bit combination Hex Identifier Name 02/00 20 U+0020 SPACE 02/01 21 U+0021 EXCLAMATION MARK 02/02 22 U+0022 QUOTATION MARK 02/03 23 U+0023 NUMBER SIGN 02/04 24 U+0024 DOLLAR SIGN 02/05 25 U+0025 PERCENT SIGN 02/06 26 U+0026 AMPERSAND 02/07 27 U+0027 APOSTROPHE 02/08 28 U+0028 LEFT PARENTHESIS 02/09 29 U+0029 RIGHT PARENTHESIS 02/10 2A U+002A ASTERISK 02/11 2B U+002B PLUS SIGN 02/12 2C U+002C COMMA 02/13 2D U+002D HYPHEN-