ECMA 118-1986 8-Bit Single-Byte Coded Graphic Character Sets Latin Greek Alphabet《8-位单字节编码的图形字符集 拉丁 希腊字母》.pdf

上传人:arrownail386 文档编号:704606 上传时间:2019-01-03 格式:PDF 页数:15 大小:625.07KB
下载 相关 举报
ECMA 118-1986 8-Bit Single-Byte Coded Graphic Character Sets Latin Greek Alphabet《8-位单字节编码的图形字符集 拉丁 希腊字母》.pdf_第1页
第1页 / 共15页
ECMA 118-1986 8-Bit Single-Byte Coded Graphic Character Sets Latin Greek Alphabet《8-位单字节编码的图形字符集 拉丁 希腊字母》.pdf_第2页
第2页 / 共15页
ECMA 118-1986 8-Bit Single-Byte Coded Graphic Character Sets Latin Greek Alphabet《8-位单字节编码的图形字符集 拉丁 希腊字母》.pdf_第3页
第3页 / 共15页
ECMA 118-1986 8-Bit Single-Byte Coded Graphic Character Sets Latin Greek Alphabet《8-位单字节编码的图形字符集 拉丁 希腊字母》.pdf_第4页
第4页 / 共15页
ECMA 118-1986 8-Bit Single-Byte Coded Graphic Character Sets Latin Greek Alphabet《8-位单字节编码的图形字符集 拉丁 希腊字母》.pdf_第5页
第5页 / 共15页
点击查看更多>>
资源描述

1、ECMA ECMA*118 86 I 3404593 O001192 8 I ECMA EUROPEAN CO M PUTER MAN U FACTURE RS ASSOCIATION STANDARD ECMLA-118 8-BIT SINGLE-BYTE CODED GRAPHIC CHARACTER SETS I LATIN/GREEK ALPHABET December 1986 ECMA ECMA*LLB Bb m 3404593 OOOLL93 T m i- BRIEF HISTORY The adoption of ECMA-6 (IS0 646) as the agreed i

2、nternational 7-bit code for information interchange had led to the development of many national, international and application-oriented versions of this code which are in wide use today. These versions have a number of limitations generally inherent to the size of the code: - they do not provide all

3、 graphic characters which may be needed, - for some characters, specially for accented letters, it is neces- sary to resort to BACKSPACE sequences, which creates problems when processing data containing such composite characters, - interchange among different versions is practically limited to the 8

4、2 common graphic characters. With the advent of 8-bit coding it was possible to increase the num- ber of graphic characters. IS0 6937/2, for example, provides a char- acter set covering the requirements of most languages based on the Latin alphabet. This character set, although well suited for text

5、communication, is difficult to use for processing as some graphic characters are represented by one and others by two bit combina- tions. Thus the need was recognized for coded graphic character sets, each of which: - is the same for all users of a given area, - provides single-byte coding of all gr

6、aphic characters thus permit- - takes into account character sets used in the industry. Since 1982 the urgency of the need for an 8-bit single-byte coded character set was recognized in ECMA as well as in ANSI/X3L2 and nu- merous working papers were exchanged between the two groups. In February 1984

7、 ECMA TC1 submitted to ISO/TC97/SC2 a proposal for such a coded character set. At its meeting of April 1984 SC2 decided to submit to TC97 a proposal for a new item of work for this topic. Technical discussions during and after this meeting led TC1 to adopt the coding scheme proposed by X3L2. Interna

8、tional Standard IS0 8859/1 is based on this joint ANSI/ECMA proposal. ECMA published the 1st edition of its corresponding Standard ECMA-94 in March 1985. After this first publication, the work of ECMA TC1 on further coded graphic character sets has led to the following results: ting easy processing,

9、 i) The present Standard for a Latin/Greek coded graphic character set. This set has been agreed.by ELOT, the Greek Standardiza- tion Institution. It will be submitted to IS0 for processing under the fast-track procedure. ECMA ECMA*LLB 86 3404593 OOOLL94 L E ii) The second Edition of Standard ECMA-9

10、4, dated June 1986, com- prising four coded graphic character sets for the Latin script, identified as Latin Alphabets No 1 to No 4. These alphabets have a number of characters in common, in particular those al- located to columns 02 to 07. Latin Alphabet No 2 has been sub- mitted to IS0 and is the

11、subject of IS0 8859/2. Latin Alphabets No. 3 and No. 4 are processed as IS0 DP 8859/3 and DP 8859/4. iii) A series of ECMA Standards for coded graphic character sets comprising those characters of the Latin Alphabets allocated to columns 02 to 07 and characters of another script for multi- ple-langu

12、age applications. These ECMA standards cover the Cyrillic and Arabic scripts. They have been submitted to IS0 as DIS 8859/5 and DIS 8859/6, respectively, for fast-track pro- cessing as IS0 standards. Adopted as an ECMA Standard by the General Assembly of December 12, 1986. 1. 2. 3. 4. 5. 6. 7. 8. 9.

13、 ECMA ECMA*LL8 86 3LiO4573 OOOLL75 3 TABLE OF CONTENTS Page SCOPE FIELD OF APPLICATION CONFORMANCE REFERENCES DEFINITIONS 5.1 Bit Combination; Byte 5.2 Character 5.3 Coded Character Set; Code 5.4 Code Table 5.5 Graphic Character 5.6 Graphic Symbol 5.7 Position NOTATION, CODE TABLE AND NAMES 6.1 Nota

14、tion 6.2 Layout of the Code Table 6.3 Names and Meanings 6.3.1 SPACE (SP) 6.3.2 NO-BREAK SPACE (NBSP) 6.3.3 SOFT HYPHEN (SHY) SPECIFICATION OF THE CODED CHARACTER SET 1 1 1 1 1 1 2 2 2 2 2 2 2 2 3 3 3 3 3 4 4 4 7.1 7.2 Code Table Characters of the Set and their Coded Representation DESIGNATION OF TH

15、E CHARACTER SET BIT COMBINATIONS NOT TO BE USED 9 * 10 ECMA ECMA*LIB-Bb H 3404593 OOOI19b 5 U -1- 1. SCOPE This ECMA Standard defines a set of 185 graphic characters iden- tified as Latin/Greek Alphabet, and specifies the coded repre- sentation of each of these characters by means of a single 8-bit

16、byte. None of these characters are lvnon-spacing“. The use of control functions, such as BACKSPACE or CARRIAGE RETURN for the coded representation of composite characters is prohibited by this Standard. 2. FIELD OF APPLICATION This set of graphic characters, the Latin/Greek Alphabet, is in- tended f

17、or use in data and text processing applications and may also be used for information interchange. This set is suited for multiple-language applications involving the Latin and the Greek scripts. It allows handling of data and text expressed in Greek. This set of graphic characters is suitable for us

18、e in a version of an 8-bit code according to ECMA-35 or ECMA-43. 3. CONFORMANCE A set of graphic characters is in conformance with this Standard if it comprises all graphic characters specified herein to the exclusion of any other and if their coded representations are those specified by this Standa

19、rd. 4. REFERENCES ECMA-6 : 7-bit Input/Output Coded Character Set ECMA-35 : Code Extension Techniques ECMA-43 : 8-bit Coded Character Set - Structure and Rules ECMA-48 : Control Functions ECMA-94 : 8-bit Single-Byte Coded Graphic Character Sets - ECMA-113 : 8-bit Single-Byte Coded Graphic Character

20、Sets - ECMA-114 : 8-bit Single-Byte Coded Graphic Character Sets - Latin Alphabets No 1 to No 4. Latin/Cyrillic Alphabet Latin/Arabic Alphabet 5. DEFINITIONS For the purpose of this Standard the following definitions apply: 5.1 Bit Combination; Byte An ordered set of bits that represents a character

21、 or is used as a part of the representation of a character. _ ECMA ECMA*LLB Bb 3404593 OOOLL97 7 I 5.2 5.3 5.4 5.5 5.6 5.7 -2- Character A member of a set of elements used for the organization, con- trol or representation of data. Coded Character Set; Code A set of unambiguous rules that establishes

22、 a character set and the one-to-one relationship between each character of the set and its coded representation. Code Table A table showing the character allocated to each bit combina- tion in a code. Graphic Character A character, other than a control function, that has a visual representation norm

23、ally handwritten, printed or displayed, and that has a coded representation consisting of one or more bit combinations. Note 1 In this Standard a single bit combination is used to represent each character. Graphic Symbol A visual representation of a graphic character. Position That part of a code ta

24、ble identified by its column and row Co-ordinates. 6. NOTATION, CODE TABLE AND NAMES 6.1 Notation The bits of the bit combinations of the 8-bit code are identi- fied by b, b, b6, b5, b, b, b, and b, where b, is the high- est-order, or most-significant bit and b, is the lowest-order, or least-signifi

25、cant bit. The bit Combinations may be interpreted to represent numbers in binary notation by attributing the following weights to the individual bits: Bit bl3 b7 b6 b5 b4 b3 bz bi Weight 128 64 32 16 8 4 2 1 . Using these weights, the bit combinations of the 8-bit code represent numbers in the range

26、 .O to 255. In this Standard, the bit combinations are identified by nota- tions of the form xx/yy, where xx and n are numbers in the range O0 to 15. The correspondence between the notations of the form xx/yy and the bit combinations consisting of the bits b, to b, is as fOllOWS: ECMA ECMA*KLKLB ib

27、a 3404593 0001L98 9 m -3- - xx is the number represented by b, b, b, and b, where these - yy is the number represented by b , b, b, and b, where these bits are given the weights 8, 4, 2 and 1 respectively; bits are given the weights 8, 4, 4 and 1 respectively. 6.2 Layout of the Code Table An 8-bit c

28、ode table consists of 256 positions arranged in 16 columns and 16 rows. The columns and the rows are numbered O0 to 15. 6.3 The code tible positions are identified by notations of the form xx/yy, where xx is the column number and yy is the row number. The positions of the code table are in one-to-on

29、e correspon- dence with the bit combinations of the code. The notation of a code table position, of the form xx/yy, is the same as that of the corresponding bit combination. Names and Meaninqs This Standard assigns at least one name to each character. In addition, it specifies a graphic symbol for e

30、ach graphic char- acter. By convention only capital letters, the graphic symbols of small letters and hyphens are used for writing the names of the characters. The names chosen to denote graphic characters are intended to reflect their customary meaning. However, except for SPACE (SP), NO-BREAK SPAC

31、E (NBSP) and SOFT HYPHEN (SHY), this Stan- dard does not define and does not restrict the meanings of graphic characters. Neither does it specify a particular style or font design for imaging graphic characters. 6.3.1 SPACE (SP) This character may be interpreted as a graphic character, a control cha

32、racter or as both. As a graphic character it has the visual representation consisting of the absence of a graphic symbol. A graphic character the visual representation of which con- sists of the absence of a graphic symbol, for use when a line break is to be prevented in the text as presented. 6.3.2

33、 NO-BREAK?SPACE (NBSP) 6.3.3 SOFT HYPHEN (SHY) A graphic character that is imaged by a graphic symbol identical with, or similar to, that representing HYPHEN, for use when a line break has been established within a word. -4- 7. SPECIFICATION OF THE CODED CHARACTER SET This Standard specifies 185 cha

34、racters allocated to the bit com- binations of the Code Table. 7.1 Characters of the Set and their Coded Representation I Bit Combination Name 02/00 02/01 02/02 02/03 02/04 02/05 02/06 02/07 02/08 02/09 02/10 02/11 02/12 02/13 02/14 02/15 03/00 03/01 03/02 03/03 03/04 03/05 03/06 03/07 03/08 03/09 0

35、3/10 03/11 03/12 SPACE EXCLAMATION MARK QUOTATION MARK NUMBER SIGN DOLLAR SIGN PERCENT SIGN AMPERSAND APOSTROPHE LEFT PARENTHESIS RIGHT PARENTHESIS ASTERISK PLUS SIGN COMMA HYPHEN, MINUS SIGN FULL STOP SOLIDUS DIGIT ZERO DIGIT ONE DIGIT TWO DIGIT THREE DIGIT FOUR DIGIT FIVE DIGIT SIX DIGIT SEVEN DIG

36、IT EIGHT DIGIT NINE COLON SEMICOLON (Eromatiko) LESS-THAN SIGN ECMA ECMA*LL8 86 i 3404593 O001200 3 H -5- Bit Combination Name 03/13 03/14 03/15 04/00 04/01 04/02 04/03 04/04 04/05 04/06 04/07 04/08 04/09 04/10 04/11 04/12 04/13 04/14 04/15 05/00 05/01 05/02 05/03 05/04 05/05 05/06 05/07 05/08 05/09

37、 05/10 05/11 05/12 05/13 05/14 05/15 06/00 EQUALS SIGN GREATER-THAN SIGN QUESTION MARK COMMERCIAL AT CAPITAL LETTER A CAPITAL LETTER B CAPITAL LETTER C CAPITAL LETTER D CAPITAL LETTER E CAPITAL LETTER F CAPITAL LETTER G CAPITAL LETTER H CAPITAL LETTER I CAPITAL LETTER J CAPITAL LETTER K CAPITAL LETT

38、ER L CAPITAL LETTER M CAPITAL LETTER N CAPITAL LETTER O CAPITAL LETTER P CAPITAL LETTER Q CAPITAL LETTER R CAPITAL LETTER S CAPITAL LETTER T CAPITAL LETTER U CAPITAL LETTER V CAPITAL LETTER W CAPITAL LETTER X CAPITAL LETTER Y CAPITAL LETTER 2 LEFT SQUARE BRACKET REVERSE SOLIDUS . RIGHT SQUARE BRACKE

39、T CIRCUMFLEX ACCENT LOW LINE GRAVE ACCENT SMALL LETTER a SMALL LETTER b SMALL LETTER C SMALL LETTER d SMALL LETTER e SMALL LETTER f SMALL LETTER g SMALL LETTER h SMALL LETTER i SMALL LETTER j SMALL LETTER k SMALL LETTER 1 SMALL LETTER m SMALL LETTER n SMALL LETTER o SMALL LETTER p SMALL LETTER g SMA

40、LL LETTER r SMALL LETTER S SMALL LETTER t SMALL LETTER u SMALL LETTER v SMALL LETTER W SMALL LETTER X SMALL LETTER y SMALL LETTER z LEFT CURLY BRACKET VERTICAL LINE RIGHT CURLY BRACKET TILDE NO-BREAK SPACE LEFT SINGLE QUOTATION MARK RIGHT SINGLE QUOTATION MARK POUND SIGN This position shall not be u

41、sed This position shall not be used - ECMA ECMA*LLB 86 E 3404593 000120L 5 -6- Bit Combination Name 06/01 06/02 06/03 06/04 06/05 06/06 06/07 06/08 06/09 06/10 06/11 06/12 06/13 06/14 06/15 07/00 07/01 07/02 07/03 07/04 07/05 07/06 07/07 07/08 07/09 07/10 07/11 07/12 07/13 07/14 10/00 10/01 10/02 10

42、/03 10/04 10/05 I- - Bit Combination Name 10/06 10/07 10/08 10/09 10/10 10/11 10/12 10/13 10/14 10/15 11/00 11/01 11/02 11/03 11/04 11/05 11/06 11/07 11/08 11/09 11/10 11/11 11/12 11/13 11/14 11/15 12/00 12/01 12/02 12/03 12/04 12/05 12/06 12/07 12/08 12/09 BROKEN BAR PARAGRAPH SIGN DIAERESIS (Dialy

43、tika) COPYRIGHT SIGN This position shall not be used LEFT ANGLE QUOTATION MARK NOT SIGN SOFT HYPHEN This position shall not be used HORIZONTAL BAR (Parenthetiki pavla) DEGREE SIGN PLUS-MINUS SIGN SUPERSCRIPT TWO SUPERSCRIPT THREE ACCENT (Tonos) DIAERESIS AND ACCENT (Dialytika and Tonos) MIDDLE DOT (

44、Ano Teleial CAPITAL GREEK LETTER ALPHA WITH ACCENT CAPITAL GREEK LETTER EPSILON WITH ACCENT CAPITAL GREEK LETTER ETA WITH ACCENT CAPITAL GREEK LETTER IOTA WITH ACCENT RIGHT ANGLE QUOTATION MARK CAPITAL GREEK LETTER OMICRON WITH ACCENT VULGAR FRACTION ONE HALF CAPITAL GREEK LETTER UPSILON WITH ACCENT

45、 CAPITAL GREEK LETTER OMEGA WITH ACCENT SMALL GREEK LETTER IOTA WITH DIAERESIS AND ACCENT CAPITAL GREEK LETTER ALPHA CAPITAL GREEK LETTER BETA CAPITAL GREEK LETTER GAMMA CAPITAL GREEK LETTER DELTA CAPITAL GREEK LETTER EPSILON CAPITAL GREEK LETTER ZETA CAPITAL GREEK LETTER ETA CAPITAL GREEK LETTER TH

46、ETA CAPITAL GREEK LETTER IOTA 12/10 12/11 12/12 12/13 12/14 12/15 13/00 13/01 13/02 13/03 13/04 13/05 13/06 13/07 13/08 13/09 13/10 13/11 13/12 13/13 13/14 13/15 14/00 14/01 14/02 14/03 14/04 14/05 14/06 14/07 14/08 14/09 14/10 14/11 14/12 14/13 ECMA ECMAP11A 86 sl 3404593 0001203 9 M -8- CAPITAL GR

47、EEK LETTER KAPPA CAPITAL GREEK LETTER LAMDA CAPITAL GREEK LETTER MU CAPITAL GREEK LETTER NU CAPITAL GREEK LETTER KSI CAPITAL GREEK LETTER OMICRON CAPITAL GREEK LETTER PI CAPITAL GREEK LETTER RHO This position shall not be used CAPITAL GREEK LETTER SIGMA CAPITAL GREEK LETTER TAU CAPITAL GREEK LETTER

48、UPSILON CAPITAL GREEK LETTER PHI CAPITAL GREEK LETTER KHI CAPITAL GREEK LETTER PSI CAPITAL GREEK LETTER OMEGA CAPITAL GREEK LETTER IOTA WITH DIAERESIS CAPITAL GREEK LETTER UPSILON WITH DIAERESIS SMALL GREEK LETTER ALPHA WITH ACCENT SMALL GREEK LETTER EPSILON WITH ACCENT SMALL GREEK LETTER ETA WITH A

49、CCENT SMALL GREEK LETTER IOTA WITH ACCENT SMALL GREEK LETTER UPSILON WITH DIAERESIS AND ACCEN SMALL GREEK LETTER ALPHA SMALL GREEK LETTER BETA SMALL GREEK LETTER GAMMA SMALL GREEK LETTER DELTA SMALL GREEK LETTER EPSILON SMALL GREEK LETTER ZETA SMALL GREEK LETTER ETA SMALL GREEK LETTER THETA SMALL GREEK LETTER IOTA SMALL GREEK LETTER KAPPA SMALL GREEK LETTER LAMDA SMALL GREEK LETTER MU SMALL GREEK LETTER NU ECMA ECMA*11I 86 3404593 0001204 O -9- Bit.Combination Name 14/14 14/15 15/00 15/01 15/02 15/03 15/04 15/05 15/06 15/07 15/08 15/09 15/10 15/11 15/12 15/13 15

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 标准规范 > 国际标准 > 其他

copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1