1、INTERNATIONAL STANDARD ISOAEC 8859-4 First edition 1998-07-01 Information technology - 8-bit single-byte coded graphic character sets - Part 4: Latin alphabet No. 4 Technologies de /information - Jeux de caractkres graphiques cod code: A set of unambiguous rules that establishes a character set and
2、the one-to-one relationship between the characters of the set and their bit combinations. 4.6 coded-character-data-element (CC-data- element): An element of interchanged information that is specified to consist of a sequence of coded representations of characters, in accordance with one or more iden
3、tified standards for coded character sets. 4.7 graphic character: A character, other than a control function, that has a visual representation normally handwritten, printed or displayed, and that has a coded representation consisting of one or more bit combinations. NOTE - In ISO/IEC 8859 a single b
4、it combination is used to represent each character. 4.8 graphic symbol: A visual representation of a graphic character or of a control function. 4.9 position: That part of a code table identified by its column and row coordinates. 5 Notation, code table and names 5.1 Notation The bits of the bit com
5、binations of the 8-bit code are identified by b, b, b, b, b, b, b, and b, where b, is the highest-order, or most-significant bit and b, is the lowest-order, or least-significant bit. The bit combinations may be interpreted to represent numbers in binary notation by attributing the following weights
6、to the individual bits: Bit b8 b7 b6 bs b4 b3 b2 b, Weight 128 64 32 16 8 4 2 1 Using these weights, the bit combinations are identified by notations of the form xx/yy, where xx and yy are numbers in the range 00 to 15. The correspondence between the notations of the form xx/yy and the bit combinati
7、ons consisting of the bits b, to b, is as follows: - xx is the number represented by b, b, b, and b, where these bits are given the weights 8, 4, 2, and 1 respectively. - yy is the number represented by b, b, b, and b, where these bits are given the weights 8, 4, 2, and 1 respectively. The bit combi
8、nations are also identified by notations of the form hk, where h and k are numbers in the range 0 to F in hexadecimal notation. The number h is the same as the number xx described above, and the number k the same as the number yy described above. 5.2 Layout of the code table An 8-bit code table cons
9、ists of 256 positions arranged in 16 columns and 16 rows. The columns and the rows are numbered 00 to 15. In hexa- decimal notation the columns and the rows are numbered 0 to F. The code table positions are identified by notations of the form xx/yy, where xx is the column number and yy is the row nu
10、mber. The column and row numbers are shown at the top and left edges of the table respectively. The code table positions are also identified by notations of the form hk, where h is the column number and k is the row number in hexadecimal notation. The column and row numbers are shown at the bottom a
11、nd right edges of the table respectively. The positions of the code table are in one-to-one correspondence with the bit combinations of the code. The notation of a code table position, of the form xx/yy, or of the form hk, is the same as that of the corresponding bit combination. 5.3 Names and meani
12、ngs This part of ISO/IEC 8859 assigns a unique name and a unique identifier to each graphic character. These names and identifiers have been taken from 2 0 ISO/IEC ISO/IEC 10646-l (E). This part of ISO/IEC 8859 also specifies an acronym for each of the characters SPACE, NO-BREAK SPACE and SOFT HYPHE
13、N. For acronyms only Latin capital letters A to Z are used. It is intended that the acronyms be retained in all translations of the text. Except for SPACE (SP), NO-BREAK SPACE (NBSP) and SOFT HYPHEN (SHY), this part of ISO/IEC 8859 does not define and does not restrict the meanings of graphic charac
14、ters. This part of ISO/IEC 8859 specifies a graphic symbol for each graphic character. This symbol is shown in the corresponding position of the code table. However, this part, or any other part, of ISO/IEC 8859 does not specify a particular style or font design for imaging graphic characters. Annex
15、 B of ISO/IEC 10367 gives further information on this subject. 5.3.1 SPACE (SP) A graphic character the visual representation of which consists of the absence of a graphic symbol. 5.3.2 NO-BREAK SPACE (NBSP) A graphic character the visual representation of which consists of the absence of a graphic
16、symbol, for use when a line break is to be prevented in the text as presented. 5.3.3 SOFT HYPHEN (SHY) A graphic character that is imaged by a graphic symbol identical with, or similar to, that representing HYPHEN, for use when a line break has been established within a word. 6 Specification of the
17、coded character set This part of ISO/IEC 8859 specifies 191 characters allocated to the bit combinations of the code table (table 2). None of these characters are combining characters. NOTE - Combining characters are described in ISO/IEC 2022:1994 subclause 6.3.3. Control functions, such as BACKSPAC
18、E or CARRIAGE RETURN, shall not be used to create composite graphic symbols, which are made up from the graphic representations of two or more characters. 6.1 Characters of the set and their coded representation See table 1. ISOAEC 8859-4:1998 (E) Table 1 - Character set, coded representation 3it :o
19、mbi- Hex Identifier Name iation 02/00 20 UtOO.20 SPACE 02101 21 UtO021 EXCLAMATION MARK 02/02 22 UtO022 QUOTATION MARK 02/03 23 UtO023 NUMBERSIGN 02/04 24 UtO024 DOLLARSIGN 02/05 25 UtO025 PERCENTSIGN 02/06 26 UtO026 AMPERSAND 02/07 27 UtO027 APOSTROPHE 02/08 28 UtO028 LEFTPARENTHESIS 02/09 29 UtO02
20、9 RIGHT PARENTHESIS 0200 2A Ut002A ASTERISK 02/11 28 Ut002B PLUSSIGN 02/12 2C UtOO2C COMMA 0203 2D Ut002D HYPHEN-MINUS 02/14 2E Ut002E FULLSTOP 0205 2F UtOO2F SOLIDUS 03/00 30 UtOO30 DIGITZERO 03101 31 UtO031 DIGITONE 03102 32 UtO032 DIGITTWO 03/03 33 UtO033 DIGITTHREE 03/04 34 UtO034 DIGIT FOUR 03/
21、05 35 UtO035 DIGIT FIVE 03106 36 UtO036 DIGITSIX 03107 37 UtO037 DIGITSEVEN 03108 38 UtO038 DIGIT EIGHT 03/09 39 UtO039 DIGITNINE 03110 3A Ut003A SOLON 03111 38 UtOO3B SEMICOLON 03/12 3C UtOO3C ILESS-THAN SIGN 03113 3D Ut003D EQUALSSIGN 03/14 3E Ut003E GREATER-THAN SIGN 03/15 3F UtOO3F QUESTION MARK
22、 04100 40 UtOO40 COMMERCIALAT 04/01 41 UtO041 LATIN CAPITAL LETTER A 04102 42 UtO042 LATIN CAPITALLETTER B 04/03 43 UtO043 LATIN CAPITAL LETTER C 04104 44 UtO044 LATIN CAPITAL LETTER D 04/05 45 UtO045 LATIN CAPITAL LETTER E 04106 46 UtO046 LATIN CAPITAL LETTER F 04/07 47 UtO047 LATIN CAPlTALLEllER G
23、 04108 48 UtO048 LATIN CAPITALLEITER H 04/09 49 UtO049 LATIN CAPITALLETTERI 04HO 4A Ut004A LATINCAPITALLETTERJ 04111 4B UtOO4B LATIN CAPITAL LElTER K 04112 4C UtOO4C LATlNCAPlTALLEllER L 04/13 4D Ut004D LATIN CAPITALLETTER M 04114 4E Ut004E LATINCAPITALLETTERN 04/15 4F UtOO4F LATIN CAPlTALLEllER 0 0
24、5/00 50 UtOO50 LATIN CAPlTALLEllER P 05/01 51 UtO051 LATIN CAPITAL LETTER Q 05102 52 UtO052 LATIN CAPITAL LETTER R 05103 53 UtO053 LATIN CAPITALLETTER S 05104 54 UtO054 LATIN CAPITAL LETTER T 05105 55 UtO055 LATIN CAPITAL LETTER U 05106 56 UtO056 LATIN CAPITAL LETTER V 05107 57 UtO057 LATIN CAPITAL
25、LETTER W 05108 58 UtO058 LATIN CAPITAL LETTER X 05109 59 UtO059 LATIN CAPITALLETTER Y 05/10 5A UtOO5A LATIN CAPITALLETTERZ 05/11 58 UtOO5B LEFTSQUAREBRACKET 05/12 5C UtOO5C REVERSESOLIDUS 05113 5D Ut005D RIGHTSQUARE BRACKET 05114 5E Ut005E CIRCUMFLEX ACCENT 05115 5F UtOO5F LOWLINE 3 ISOAEC 8859-4:19
26、98 (E) Table 1 (continued) 0 ISOIIEC Table 1 (concluded) 3it :ombi- Hex Identifier Name lation 06100 60 UtOO60 GRAVE ACCENT 06101 61 UtO061 LATIN SMALL LETTER A 06/02 62 UtO062 LATIN SMALL LETTER B 06103 63 UtO063 LATIN SMALL LETTER C 06/04 64 UtO064 LATIN SMALL LETTER D 06/05 65 UtO065 LATIN SMALL
27、LETTER E 06106 66 Ut0066 LATIN SMALL LETTER F 06/07 67 UtO067 LATIN SMALL LETTER G 06/08 68 UtO068 LATIN SMALL LETTER H 06109 69 UtO069 LATIN SMALL LETTER I 06110 6A Ut006A LATIN SMALL LETTER J 06/l 1 6B UtOO6B LATIN SMALL LETTER K 06112 6C UtOO6C LATIN SMALL LETTER L 06113 6D Ut006D LATIN SMALL LET
28、TER M 06114 6E Ut006E LATIN SMALL LETTER N 06115 6F UtOO6F LATIN SMALL LETTER 0 07/00 70 UtOO70 LATIN SMALL LETTER P 07101 71 UtOO71 LATIN SMALL LETTER Q 07102 72 UtO072 LATIN SMALL LETTER R 07103 73 UtO073 LATIN SMALL LETTER S 07104 74 UtO074 LATIN SMALL LETTER T 07/05 75 UtO075 LATIN SMALL LETTER
29、U 07106 76 UtO076 LATIN SMALL LETTER V 07107 77 UtO077 LATIN SMALL LETTER W 07/08 78 UtO078 LATIN SMALL LETTER X 07109 79 UtO079 LATIN SMALL LETTER Y 07110 7A Ut007A LATIN SMALL LETTER Z 07111 78 UtOO7B LEFT CURLY BRACKET 07/12 7C UtOO7C VERTICAL LINE 07113 7D Ut007D RIGHT CURLY BRACKET 07/14 7E Ut0
30、07E TILDE IO/O0 A0 UtOOAO NO-BREAK SPACE 10101 Al UtO104 LATIN CAPITAL LETTER A WITH OGONEK lo/O2 A2 Ut0138 LATIN SMALL LETTER KRA (Greenlandic) lo/O3 A3 Ut0156 LATIN CAPITAL LETTER R WITH CEDILLA 10104 A4 UtOOA4 CURRENCY SIGN 10105 A5 UtO128 LATIN CAPITAL LETTER I WITH TILDE 10106 A6 UtOl3B LATIN C
31、APITAL LETTER L WITH CEDILLA 10107 A7 UtOOA7 SECTION SIGN 10108 A8 UtOOA8 DIAERESIS 10109 A9 UtOl60 LATIN CAPITAL LETTER S WITH CARON 10110 AA Ut0112 LATIN CAPITAL LETTER E WITH MACRON lO/ll AB UtO122 LATIN CAPITAL LETTER G WITH CEDILLA lo/l2 AC Ut0166 LATIN CAPITAL LETTER T WITH STROKE lo/l3 AD UtO
32、OAD SOFT HYPHEN 10114 AE Ut017D LATIN CAPITAL LETTER Z WITH CARON 10115 AF UtOOAF MACRON 11100 BO UtOOBO DEGREE SIGN 11101 Bl UtO105 LATIN SMALL LETTER A WITH OGONEK 11102 B2 Ut02DB OGONEK 11103 83 UtO157 LATIN SMALL LETTER R WITH CEDILLA 11104 84 UtOOB4 ACUTE ACCENT 11105 85 UtO129 LATIN SMALL LETT
33、ER I WITH TILDE 1 l/O6 B6 UtOl3C LATIN SMALL LETTER L WITH CEDILLA 11107 B7 UtO2C7 CARON 11108 B8 UtOOB8 CEDILLA 11109 B9 UtO161 LATIN SMALL LETTER S WITH CARON 1 l/l0 BA UtO113 LATIN SMALL LETTER E WITH MACRON 11111 BB Ut0123 LATIN SMALL LETTER G WITH CEDILLA 11112 BC UtO167 LATIN SMALL LETTER T WI
34、TH STROKE 11113 BD Ut014A LATIN CAPITAL LETTER ENG (Semi) llH4 BE Ut017E LATIN SMALL LETTER Z WITH CARQN 11115 EF Ut014B LATIN SMALL LETTER ENG (Semi) 3il :ombi- Hex Identifier Name iation 12/00 CO UtOlOO LATIN CAPITAL LETTER A WITH MACRON 12101 Cl UtOOCl LATIN CAPITAL LETTER A WITH ACUTE 12/02 C2 U
35、tOOC2 LATIN CAPITAL LETTER A WITH CIRCUMFLEX 12/03 C3 UtOOC3 LATIN CAPITAL LETTER A WITH TILDE 12/04 C4 UtOOC4 LATIN CAPITAL LETTER A WITH DIAERESIS 12105 C5 UtOOC5 LATIN CAPITAL LETTER A WITH RING ABOVE 12/06 C6 UtOOC6 LATIN CAPITAL LElTER AE 12107 C7 Ut012E LATIN CAPITAL LETTER I WITH OGONEK 12108
36、 C8 UtOlOC LATIN CAPITAL LETTER C WITH CARON 12109 C9 UtOOC9 LATIN CAPITAL LETTER E WITH ACUTE 12110 CA UtOl18 LATIN CAPITAL LETTER E WITH OGONEK 12111 CB UtOOCB LATIN CAPITAL LETTER E WITH DIAERESIS 12112 CC UtOl16 LATIN CAPITAL LETTER E WITH DOT ABOVE 12.03 CD UtOOCD LATIN CAPITAL LETTER I WITH AC
37、UTE 12114 CE UtOOCE LATIN CAPITAL LETTER I WITH CIRCUMFLEX 1215 CF Ut012A LATIN CAPITAL LETTER I WITH MACRON 13100 DO UtOllO LATIN CAPITAL LETTER D WITH STROKE 13101 Dl UtO145 LATIN CAPITAL LETTER N WITH CEDILLA 13/02 D2 UtOl4C LATIN CAPITAL LETTER 0 WITH MACRON 13/03 D3 UtO136 LATIN CAPITAL LETTER
38、K WITH CEDILLA 13104 D4 UtOOD4 LATIN CAPITAL LETTER 0 WITH CIRCUMFLEX 13/05 D5 UtOOD5 LATIN CAPITAL LETTER 0 WITH TILDE 13106 D6 UtOOD6 LATIN CAPITAL LETTER 0 WITH DIAERESIS 13107 D7 UtOOD7 MULTIPLICATION SIGN 13/08 D8 UtOOD8 LATIN CAPITAL LETTER 0 WITH STROKE 13109 D9 UtO172 LATIN CAPITAL LETTER U
39、WITH OGONEK 13/10 DA UtOODA LATIN CAPITAL LETTER U WITH ACUTE 13111 DB UtOODB LATIN CAPITAL LETTER U WITH CIRCUMFLEX 13/12 DC UtOODC LATIN CAPITAL LETTER U WITH DIAERESIS 13113 DD UtO168 LATIN CAPITAL LETTER U WITH TILDE 13/14 DE Ut016A LATIN CAPITAL LETTER U WITH MACRON 13H5 DF UtOODF LATIN SMALL L
40、ETTER SHARP S (German) 14/00 EO UtOlOl LATIN SMALL LETTER A WITH MACRON 14101 El UtOOEl LATIN SMALL LETTER A WITH ACUTE 14102 E2 UtOOE2 LATIN SMALL LETTER A WITH CIRCUMFLEX 14/03 E3 UtOOE3 LATIN SMALL LETTER A WITH TILDE 14/04 E4 UtOOE4 LATIN SMALL LETTER A WITH DIAERESIS 14105 E5 UtOOE5 LATIN SMALL
41、 LETTER A WITH RING ABOVE 14/06 E6 UtOOE6 LATIN SMALL LETTER AE 14/07 E7 UtOl2F LATIN SMALL LETTER I WITH OGONEK 14108 E8 UtOlOD LATIN SMALL LETTER C WITH CARON 14109 E9 UtOOE9 LATIN SMALL LETTER E WITH ACUTE 14110 EA UtOll9 LATIN SMALL LETTER E WITH OGONEK 14111 EB UtOOEB LATIN SMALL LETTER E WITH
42、DIAERESIS 14112 EC UtOl17 LATIN SMALL LETTER E WITH DOT ABOVE 14113 ED UtOOED LATIN SMALL LETTER I WITH ACUTE 14114 EE UtOOEE LATIN SMALL LETTER I WITH CIRCUMFLEX 14115 EF Ut012B LATIN SMALL LETTER I WITH MACRON 15100 FO UtOlll LATIN SMALL LETTER D WITH STROKE 15101 Fl UtO146 LATIN SMALL LETTER N WI
43、TH CEDILLA 15/02 F2 Ut014D LATIN SMALL LETTER 0 WITH MACRON 15103 F3 UtO137 LATIN SMALL LETTER K WITH CEDILLA 15104 F4 UtOOF4 LATIN SMALL LETTER 0 WITH CIRCUMFLEX 15105 F5 UtOOF5 LATIN SMALL LETTER 0 WITH TILDE 15106 F6 UtOOF6 LATIN SMALL LETTER 0 WITH DIAERESIS 15107 F7 UtOOF7 DIVISION SIGN 15/08 F
44、8 UtOOF8 LATIN SMALL LETTER 0 WITH STROKE 15109 F9 Ut0173 LATIN SMALL LETTER U WITH OGONEK 15110 FA UtOOFA LATIN SMALL LETTER U WITH ACUTE 15111 FB UtOOFB LATIN SMALL LETTER U WITH CIRCUMFLEX 15/12 FC UtOOFC LATIN SMALL LETTER U WITH DIAERESIS 15113 FD Ut0169 LATIN SMALL LETTER U WITH TILDE 15114 FE
45、 UtOl6B LATIN SMALL LETTER U WITH MACRON 15115 FF Ut02D9 DOT ABOVE 4 0 ISOAEC ISOAEC 8859-4:1998 (E) 6.2 Code table For each character in the set the code table (table 2) shows a graphic symbol at the position in the code table corresponding to the bit combination specified in table 1. The shaded po
46、sitions in the code table correspond to bit combinations that do not represent graphic characters. Their use is outside the scope of ISOAEC 8859; it is specified in other International Standards, for example ISOAEC 6429. Table 2 - Code table of Latin alphabet No. 4 OOOl()l IlAQaq I qqiif)ldr;ll 0010
47、()2 u 2 B R b r K 1 ii0 2 6 2 0011()3 #3CScs Frit$Sk3 0 10004 ,$4DT dt xl ihi 0101()5 %5EUeu ITWuiG5 OI I 006 K C k N n n“ ?ZTUiiE 111115 ; ? 0 I 0 -0jOTF - Tl23456789ABCDETk 5 ISO/IEC 8859-4: 1998 (E) 0 ISO/IEC 7 Identification of the character set 7.1 Identification according to ISO/IEC 2022 and I
48、SO/IEC 4873 When the identification methods of ISO/IEC 8824-l are used this part of ISO/IEC 8859 shall be identified by the following object identifiers: The graphic characters of this part of ISO/IEC 8859 constitute a single coded character set. However in accordance with ISO/IEC 2022 and ISO/IEC 4
49、873 the code table of this part of ISO/IEC 8859 may be considered to consist of the following components: - The character SPACE represented by bit combination 02/00; - a 94-character GO graphic character set represented by bit combinations 02101 to 07/14; - character set iso standard 8859 4 abstract-syntax (1) - coded representations iso standard 8859 4 transfer-syntax (0) The corresponding object descriptors shall be: - character set “IS0 8859 part 4 repertoire” - coded representations “IS0 8859 part 4 code” - a 96-character Gl graphic character set repr