1、INTERNATIONAL STANDARD ISOAEC 8859-10 Second edition 1998-07-01 Information technology - code: A set of unambiguous rules that establishes a character set and the one-to-one relationship between the characters of the set and their bit combinations. 4.6 coded-character-data-element (CC-data- element)
2、: An element of interchanged information that is specified to consist of a sequence of coded representations of characters, in accordance with one or more identified standards for coded character sets. 4.7 graphic character: A character, other than a control function, that has a visual representatio
3、n normally handwritten, printed or displayed, and that has a coded representation consisting of one or more bit combinations. NOTE - In ISO/IEC 8859 a single bit combination is used to represent each character. 4.8 graphic symbol: A visual representation of a graphic character or of a control functi
4、on. The bit combinations may be interpreted to represent numbers in binary notation by attributing the following weights to the individual bits: Bit b6 b7 b6 b5 b4 b, b, b, Weight 128 64 32 16 8 4 2 1 I 1 I I I ! I I 1 Using these weights, the bit combinations are identified by notations of the form
5、 xx/yy, where xx and yy are numbers in the range 00 to 15. The correspondence between the notations of the form xx/yy and the bit combinations consisting of the bits b, to b, is as follows: - xx is the number represented by b, b, b, and b, where these bits are given the weights 8, 4, 2, and 1 respec
6、tively. - yy is the number represented by b, b, b, and b, where these bits are given the weights 8, 4, 2, and 1 respectively. The bit combinations are also identified by notations of the form hk, where h and k are numbers in the range 0 to F in hexadecimal notation. The number h is the same as the n
7、umber xx described above, and the number k the same as the number yy described above. 5.2 Layout of the code table An 8-bit code table consists of 256 positions arranged in 16 columns and 16 rows. The columns and the rows are numbered 00 to 15. In hexa- decimal notation the columns and the rows are
8、numbered 0 to F. The code table positions are identified by notations of the form xx/yy, where xx is the column number and yy is the row number. The column and row numbers are shown at the top and left edges of the table respectively. The code table positions are also identified by notations of the
9、form hk, where h is the column number and k is the row number in hexadecimal notation. The column and row numbers are shown at the bottom and right edges of the table respectively. 4.9 position: That part of a code table identified by its column and row coordinates. 5 Notation, code table and names
10、5.1 Notation The bits of the bit combinations of the 8-bit code are identified by b, b, b, b, b, b, b, and b, where b, is the highest-order, or most-significant bit and b, is the lowest-order, or least-significant bit. The positions of the code table are in one-to-one correspondence with the bit com
11、binations of the code. .The notation of a code table position, of the form xx/vv. or of the form hk. is the same as that of ,I the corresponding bit combination. 5.3 Names and meanings This part of ISO/IEC 8859 assigns a unique name and a unique identifier to each graphic character. These names and
12、identifiers have been taken from 2 0 ISO/IEC ISO/IEC 10646-l (E). This part of ISO/IEC 8859 also specifies an acronym for each of the characters SPACE, NO-BREAK SPACE and SOFT HYPHEN. For acronyms only Latin capital letters A to Z are used. It is intended that the acronyms be retained in all transla
13、tions of the text. Except for SPACE (SP), NO-BREAK SPACE (NBSP) and SOFT HYPHEN (SHY), this part of ISO/IEC 8859 does not define and does not restrict the meanings of graphic characters. This part of ISO/IEC 8859 specifies a graphic symbol for each graphic character. This symbol is shown in the corr
14、esponding position of the code table. However, this part, or any other part, of ISO/IEC 8859 does not specify a particular style or font design for imaging graphic characters. Annex B of ISO/IEC 10367 gives further information on this subject. 5.3.1 SPACE (SP) A graphic character the visual represen
15、tation of which consists of the absence of a graphic symbol. 5.3.2 NO-BREAK SPACE (NBSP) A graphic character the visual representation of which consists of the absence of a graphic symbol, for use when a line break is to be prevented in the text as presented. 5.3.3 SOFT HYPHEN (SHY) A graphic charac
16、ter that is imaged by a graphic symbol identical with, or similar to, that representing HYPHEN, for use when a line break has been established within a word. 6 Specification of the coded character set This part of ISO/IEC 8859 specifies 191 characters allocated to the bit combinations of the code ta
17、ble (table 2). None of these characters are combining characters. NOTE - Combining characters are described in ISO/IEC 2022:1994 subclause 6.3.3. Control functions, such as BACKSPACE or CARRIAGE RETURN, shall not be used to create composite graphic symbols, which are made up from the graphic represe
18、ntations of two or more characters. 6.1 Characters of the set and their coded representation See ,table 1. Table l- Character set, coded representation ISOAEC 8859-lo:1998 (E) 3il :ombi- Hex Identifier Name lation 02100 20 U+OO20 SPACE 02/01 21 UtO021 EXCLAMATION MARK 02/02 22 UtO022 QUOTATION MARK
19、02103 23 UtO023 NUMBERSIGN 02104 24 UtO024 DOLLARSIGN 02105 25 UtO025 PERCENTSIGN 02106 26 UtO026 AMPERSAND 0207 27 UtO027 APOSTROPHE 0208 28 UtO028 LEFT PARENTHESIS 02/09 29 UtO029 RIGHT PARENTHESIS 02/10 2A Ut002A ASTERISK 0201 28 UtOO2B PLUSSIGN 02/12 2C UtOO2C COMMA 02113 2D Ut002D HYPHEN-MINUS
20、0204 2E Ut002E FULLSTOP 02/15 2F UtOO2F SOLIDUS 03/00 30 U+OO30 DIGITZERO 03/01 31 UtO031 DIGITONE 03/02 32 UtO032 DIGITTWO 03/03 33 UtO033 DIGITTHREE 03/04 34 UtO034 DIGIT FOUR 03/05 35 UtO035 DIGIT FIVE 03106 36 UtO036 DIGIT SIX 03107 37 UtO037 DIGITSEVEN 03108 38 UtO038 DIGIT EIGHT 03109 39 UtO03
21、9 DIGIT NINE 03/10 3A Ut003A COLON 03111 36 UtOO3B SEMICOLON 03/12 3C UtOO3C LESS-THAN SIGN 03/13 3D UtOO3D EQUALSSIGN 03/14 3E Ut003E GREATER-THAN SIGN 03/15 3F UtOO3F QUESTIONMARK 04/00 40 UtOO40 COMMERCIAL AT 04/01 41 UtO041 LATIN CAPITAL LETTER A 04/02 42 UtO042 LATIN CAPITAL LETTER B 04/03 43 U
22、tO043 LATIN CAPITAL LETTER C 04/04 44 UtO044 LATIN CAPITAL LETTER D 04105 45 UtO045 LATIN CAPITAL LETTER E 04/06 46 UtO046 LATIN CAPITAL LETTER F 04107 47 UtO047 LATINCAPITALLETTERG 04/08 48 UtO048 LATIN CAPITAL LETTER H 04/09 49 UtO049 LATIN CAPITAL LETTER I 04/10 4A Ut004A LATINCAPITALLETTERJ 0411
23、1 49 UtOO4B LATIN CAPITAL LETTER K 0402 4C UtOO4C LATIN CAPITAL LETTER L 04/13 4D Ut004D LATIN CAPITAL LETTER M 04/14 4E Ut004E LATIN CAPITAL LETTER N 04115 4F UtOO4F LATIN CAPITALLETTER 0 05100 50 UtOO50 LATIN CAPITALLETTER P 05101 51 UtO051 LATIN CAPITAL LETTER C 05102 52 UtO052 LATIN CAPITAL LETT
24、ER R 05/03 53 UtO053 LATIN CAPITALLETTER S 05/04 54 UtO054 LATIN CAPITALLETTERT 05/05 55 UtO055 LATIN CAPITALLETTER U 05/06 56 UtO056 LATIN CAPITAL LETTER V 05107 57 UtO057 LATIN CAPITAL LETTER W D5108 58 UtO058 LATIN CAPITAL LETTER X 35109 59 UtO059 LATIN CAPITAL LETTER Y 35HO 5A Ut005A LATIN CAPIT
25、AL LETTER Z 35Hl 5B UtOO5B LEFTSQUAREBRACKET 35/12 5C UtOO5C REVERSE SOLIDUS 35113 50 Ut005D RIGHT SQUARE BRACKET 35114 5E Ut005E CIRCUMFLEX ACCENT 35115 5F UtOO5F LOW LINE ISOAEC 8859-lo:1998 (E) 0 ISO/IEC lit ombi- lation 06100 06/01 06/02 06103 06104 06/05 06/06 06/07 06/08 06109 06110 06111 06/l
26、 2 06/13 06114 06115 07/00 07101 07102 07103 07104 07105 07106 07107 07108 07109 07110 07/l 1 07112 07113 07114 1 o/o0 10101 10102 10103 10104 10105 1 O/O6 10107 1 O/O8 10109 lO/lO 10111 10112 10113 10/14 10115 1 l/O0 11101 1 l/O2 11103 11104 11/05 11/06 1 l/07 1 l/O8 11109 1 l/l0 1 l/l 1 11112 1111
27、3 11/14 11/15 - iex Identifier Name - 60 61 62 63 64 65 66 67 68 69 6A 6B 6C 6D 6E 6F 70 71 72 73 74 75 76 77 78 79 7A 78 7c 7D 7E UtOO60 UtO061 UtO062 UtO063 UtO064 UtO065 UtO066 Ut0067 UtO068 UtO069 Ut006A Ut006B UtOO6C Ut006D Ut006E UtOO6F Ut0070 utoo71 utoo72 utoo73 utoo74 utoo75 UtO076 Ut0077 U
28、tOO78 utoo79 Ut007A Ut007B utoo7c Ut007D Ut007E GRAVE ACCENT LATIN SMALL LETTER A LATIN SMALL LETTER B LATIN SMALL LETTER C LATIN SMALL LETTER D LATIN SMALL LETTER E LATIN SMALL LETTER F LATIN SMALL LETTER G LATIN SMALL LETTER H LATIN SMALL LETTER I LATIN SMALL LETTER J LATIN SMALL LETTER K LATIN SM
29、ALL LETTER L LATIN SMALL LETTER M LATIN SMALL LETTER N LATIN SMALL LETTER 0 LATIN SMALL LETTER P LATIN SMALL LETTER Q LATIN SMALL LETTER R LATIN SMALL LETTER S LATIN SMALL LETTER T LATIN SMALL LETTER U LATIN SMALL LETTER V LATIN SMALL LETTER W LATIN SMALL LETTER X LATIN SMALL LETTER Y LATIN SMALL LE
30、TTER Z LEFTCURLYBRACKET VERTICAL LINE RIGHT CURLY BRACKET TILDE A0 Al A2 A3 A4 A5 A6 A7 A8 A9 AA AB AC AD AE AF 90 91 B2 B3 84 85 B6 87 B8 89 BA BB 3C 3D BE BF - UtOOAO uto104 uto112 uto122 Ut012A Ut0128 Ut0136 UtOOA7 Ut013B uto11o UtO160 Ut0166 Ut017D UtOOAD Ut016A Ut014A UtOOBO uto105 uto113 uto12
31、3 UtO12B uto129 uto137 UtOOB7 uto13c UtOlll UtO161 Ut0167 Ut017E ut2015 UtO16B UtO14B NO-BREAK SPACE LATIN CAPITAL LETTER A WITH OGONEK LATIN CAPITAL LETTER E WITH MACRON LATIN CAPITAL LETTER G WITH CEDILLA LATIN CAPITAL LETTER I WITH MACRON LATIN CAPITAL LETTER I WITH TILDE LATIN CAPITAL LETTER K W
32、ITH CEDILLA SECTION SIGN LATIN CAPITAL LETTER L WITH CEDILLA LATIN CAPITAL LETTER D WITH STROKE LATIN CAPITAL LETTER S WITH CARON LATIN CAPITAL LETTER T WITH STROKE LATIN CAPITAL LETTER Z WITH CARON SOFT HYPHEN LATIN CAPITAL LETTER U WITH MACRON LATIN CAPITAL LETTER ENG (Semi) DEGREE SIGN LATIN SMAL
33、L LETTER A WITH OGONEK LATIN SMALL LETTER E WITH MACRON LATIN SMALL LETTER G WITH CEDILLA LATIN SMALL LETTER I WITH MACRON LATIN SMALL LETTER I WITH TILDE LATIN SMALL LETTER K WITH CEDILLA MIDDLE DOT LATIN SMALL LETTER L WITH CEDILLA LATIN SMALL LETTER D WITH STROKE LATIN SMALL LETTER S WITH CARON L
34、ATIN SMALL LETTER T WITH STROKE LATIN SMALL LETTER Z WITH CARON HORIZONTAL BAR LATIN SMALL LETTER U WITH MACRON LATIN SMALL LETTER ENG (Semi) Table 1 (continued) Table 1 (concluded) lit ombi- Hex Identifier Name lation 12/00 CO UtOlOO LATIN CAPITAL LETTER A WITH MACRON 12/01 Cl UtOOCl LATIN CAPITAL
35、LETTER A WITH ACUTE 12102 C2 UtOOC2 LATIN CAPITAL LETTER A WITH CIRCUMFLEX 12103 C3 UtOOC3 LATIN CAPITAL LETTER A WITH TILDE 1204 C4 UtOOC4 LATIN CAPITAL LETTER A WITH DIAERESIS 12/05 C5 UtOOC5 LATIN CAPITAL LETTER A WITH RING ABOVE 12106 C6 UtOOC6 LATIN CAPITAL LETTER AE 12/07 C7 Ut012E LATIN CAPIT
36、AL LETTER I WITH OGONEK 12108 C8 UtOlOC LATIN CAPITAL LETTER C WITH CARON 12/09 C9 UtOOC9 LATIN CAPITAL LETTER E WITH ACUTE 12110 CA Ut0118 LATIN CAPITAL LETTER E WITH OGONEK 12/l 1 CB UtOOCB LATIN CAPITAL LETTER E WITH DIAERESIS 12/12 CC UtO116 LATIN CAPITAL LETTER E WITH DOT ABOVE 12/13 CD UtOOCD
37、LATIN CAPITAL LETTER I WITH ACUTE 12/14 CE UtOOCE LATIN CAPITAL LETTER I WITH CIRCUMFLEX 12/15 CF UtOOCF LATIN CAPITAL LETTER I WITH DIAERESIS 13100 DO UtOODO LATIN CAPITAL LETTER ETH (Icelandic) 13101 Dl Ut0145 LATIN CAPITAL LETTER N WITH CEDILLA 13102 D2 UtO14C LATIN CAPITAL LETTER 0 WITH MACRON 1
38、3103 D3 UcOOD3 LATIN CAPITAL LETTER 0 WITH ACUTE 13104 D4 UtOOD4 LATIN CAPITAL LETTER 0 WITH CIRCUMFLEX 13/05 D5 UtOOD5 LATIN CAPITAL LETTER 0 WITH TILDE 13106 D6 UtOOD6 LATIN CAPITAL LETTER 0 WITH DIAERESIS 13107 D7 Ut0168 LATIN CAPITAL LETTER U WITH TILDE 13/08 D8 UtOOD8 LATIN CAPITAL LETTER 0 WIT
39、H STROKE 13109 D9 Ut0172 LATIN CAPITAL LETTER U WITH OGONEK 13110 DA UtOODA LATIN CAPITAL I.ETTER U WITH ACUTE 13111 DB UtOODB L4TIN CAPITAL LETTER U WITH CIRCUMFLEX 13112 DC UtOODC t 4TIN CAPITAL LETTER U WITH DIAERESIS 13113 DD UtOODD LATIN CAPITAL LETTER Y WITH ACUTE 13114 DE UtOODE LATIN CAPITAL
40、 LETTER THORN (Icelandic) 13/15 DF UtOODF LATIN SMALL LETTER SHARP S (German) 14100 EO UtOlOl LATIN SMALL LETTER A WITH MACRON 14101 El UtOOEl LATIN SMALL LETTER A WITH ACUTE 14/02 E2 UtOOE2 LATIN SMALL LETTER A WITH CIRCUMFLEX 14103 E3 UtOOE3 LATIN SMALL LETTER A WITH TILDE 14/04 E4 UtOOE4 LATIN SM
41、ALL LETTER A WITH DIAERESIS 14105 E5 UtOOE5 LATIN SMALL LETTER A WITH RING ABOVE 14106 E6 UtOOE6 LATIN SMALL LETTER AE 14107 E7 UtO12F LATIN SMALL LETTER I WITH OGQNEK 14108 E8 UtOlOD LATIN SMALL LETTER C WITH CARON 14109 E9 UtOOE9 LATIN SMALL LETTER E WITH ACUTE 14/10 EA UtO119 LATIN SMALL LETTER E
42、 WITH OGONEK 14/l 1 EB UtOOEB LATIN SMALL LETTER E WITH DIAERESIS 14/12 EC UtO117 LATIN SMALL LETTER E WITH DOT ABOVE 14/13 ED UtOOED LATIN SMALL LETTER I WITH ACUTE 14/14 EE UtOOEE LATIN SMALL LETTER I WITH CIRCUMFLEX 14115 EF UtOOEF LATIN SMALL LETTER I WITH DIAERESIS 15/00 FO UtOOFO LATIN SMALL L
43、ETTER ETH (Icelandic) 15101 Fl Ut0146 LATIN SMALL LETTER N WITH CEDILLA 15102 F2 Ut014D LATIN SMALL LETTER 0 WITH MACRON 15103 F3 UtOOF3 LATIN SMALL LETTER 0 WITH ACUTE 15104 F4 UtOOF4 LATIN SMALL LETTER 0 WITH CIRCUMFLEX 15105 F5 UtOOF5 LATIN SMALL LETTER 0 WITH TILDE 15/06 F6 UtOOF6 LATIN SMALL LE
44、TTER 0 WITH DIAERESIS 15107 F7 Ut0169 LATIN SMALL LETTER U WITH TILDE 15108 F8 UtOOF8 LATIN SMALL LETTER 0 WITH STROKE 15109 F9 Ut0173 LATIN SMALL LETTER U WITH OGONEK 15110 FA UtOOFA LATIN SMALL LETTER U WITH ACUTE 15/l 1 FB UtOOFB LATIN SMALL LETTER U WITH CIRCUMFLEX 15/12 FC UtOOFC LATIN SMALL LE
45、TTER U WITH DIAERESIS 15/13 FD UtOOFD LATIN SMALL LETTER Y WITH ACUTE 15/14 FE UtOOFE LATIN SMALL LETTER THORN (Icelandic) 15/15 FF Ut0138 LATIN SMALL LETTER KRA (Greenlandic) 4 0 ISO/IEC ISO/IEC 8859-l 0:1998 (E) 6.2 Code table For each character in the set the code table (table 2) shows a graphic
46、symbol at the position in the code table corresponding to the bit combination specified in table 1. The shaded positions in the code table correspond to bit combinations that do not represent graphic characters. Their use is outside the scope of ISOAEC 8859; it is specified in other International St
47、andards, for example ISOAEC 6429. Table 2 - Code table of Latin alphabet No. 6 i -0 -1 0 1 I -0 -1 3d01Oi030i05Oi070809101112131i15 NBSP o 6 - 7 8 lSO/lEC 8859-l 0: 1998 (E) 0 ISO/IEC 7 Identification of the character set 7.1 Identification according to ISO/IEC 2022 and ISO/IEC 4873 When the identif
48、ication methods of ISO/IEC 8824-l are used this part of ISO/IEC 8859 shall be identified by the following object identifiers: The graphic characters of this part of ISO/IEC 8859 constitute a single coded character set. However in accordance with ISO/IEC 2022 and ISO/IEC 4873 the code table of this p
49、art of ISO/IEC 8859 may be considered to consist of the following components: - The character SPACE represented by bit combination 02/00; - character set iso standard 8859 10 abstract-syntax (1) ) _ coded representations iso standard 8859 10 transfer-syntax (0) The corresponding object descriptors shall be: - character set “IS0 8859 part 10 repertoire” - a 94-character GO graphic character set represented by bit combinations 02/01 to 07/14; - a 96-character Gl graphic character set represented by bit combinations 1 O/O0 to 15/l 5. When the identification methods of ISO
copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1