1、BRITISH STANDARD BS ISO/IEC 8859-6:1999 Information technology 8-bit single-byte coded graphic character sets Part 6: Latin/Arabic alphabet ICS 35.040BSISO/IEC8859-6:1999 This British Standard, having been prepared under the directionof the DISC Board, waspublished under the authorityof the Standard
2、s Committee and comes into effect on 15 April 1999 BSI 02-2000 ISBN 0 580 29209 6 National foreword This British Standard reproduces verbatim ISO/IEC8859-6:1999 and implements it as the UK national standard. The UK participation in its preparation was entrusted to Technical Committee IST/2, Characte
3、r sets and information coding, which has the responsibility to: aid enquirers to understand the text; present to the responsible international/European committee any enquiries on the interpretation, or proposals for change, and keep the UK interests informed; monitor related international and Europe
4、an developments and promulgate them in the UK. A list of organizations represented on this committee can be obtained on request to its secretary. Cross-references The British Standards which implement international or European publications referred to in this document may be found in the BSI Standar
5、ds Catalogue under the section entitled “International Standards Correspondence Index”, or by using the “Find” facility of the BSI Standards Electronic Catalogue. A British Standard does not purport to include all the necessary provisions of a contract. Users of British Standards are responsible for
6、 their correct application. Compliance with a British Standard does not of itself confer immunity from legal obligations. Summary of pages This document comprises a front cover, an inside front cover, pages i and ii, the ISO/IEC title page, pages ii to iv, pages 1 to 11 and a back cover. This standa
7、rd has been updated (see copyright date) and may have had amendments incorporated. This will be indicated in the amendment table on the inside front cover. Amendments issued since publication Amd. No. Date CommentsBSISO/IEC8859-6:1999 BSI 02-2000 i Contents Page National foreword Inside front cover
8、Foreword iii Text of ISO/IEC 8859-6 1ii blankBSISO/IEC8859-6:1999 ii BSI 02-2000 Contents Page Foreword ii Introduction 1 1 Scope 1 2 Conformance 1 3 Normative references 1 4 Definitions 1 5 Notation, code table and names 2 6 Specification of the coded character set 3 7 Identification of the charact
9、er set 9 Annex A (informative) Coverage of languages by parts 1 to 10 of ISO/IEC8859 10 Annex B (informative) Main differences between ISO 8859-6:1987 and this first edition of this part of ISO/IEC 8859 11 Annex C (informative) Bibliography 11 Table 1 Character set, coded representation 4 Table 2 Co
10、de table of Latin/Arabic alphabet 8 Table A.1 Language coverage 10BSISO/IEC8859-6:1999 BSI 02-2000 iii Foreword ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies
11、that are members of ISO or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other inte
12、rnational organizations, governmental and nongovernmental, in liaison with ISO and IEC, also take part in the work. In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IECJTC 1. Draft International Standards adopted by the joint technical committee a
13、re circulated to national bodies for voting. Publication as an International Standard requires approval by at least75% of the national bodies casting a vote. International Standard ISO/IEC8859-6 was prepared by Joint Technical CommitteeISO/IECJTC1, Information technology, Subcommittee SC2, Coded cha
14、racter sets. This edition cancels and replaces ISO8859-6:1987 which has been technically revised. ISO/IEC8859 consists of the following parts, under the general title Information technology 8-bit single-byte coded graphic character sets: Part 1: Latin alphabet No. 1; Part 2: Latin alphabet No. 2; Pa
15、rt 3: Latin alphabet No. 3; Part 4: Latin alphabet No. 4; Part 5: Latin/Cyrillic alphabet; Part 6: Latin/Arabic alphabet; Part 7: Latin/Greek alphabet; Part 8: Latin/Hebrew alphabet; Part 9: Latin alphabet No. 5; Part 10: Latin alphabet No. 6. Annex A to Annex C of this part of ISO/IEC8859 are for i
16、nformation only.iv blankBSISO/IEC8859-6:1999 BSI 02-2000 1 Introduction ISO/IEC8859 consists of several parts. Each part specifies a set of up to191 graphic characters and the coded representation of these characters by means of a single8-bit byte. Each set is intended for use for a particular group
17、 of languages. 1 Scope This part of ISO/IEC8859 specifies a set of146 coded graphic characters identified as Latin/Arabic alphabet. This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange. The set contains graphic charact
18、ers used for general purpose applications in typical office environments in at least the following languages: Arabic, English and Latin. Some of the characters in this set are combining characters (seeclause6). This set of coded graphic characters may be regarded as a version of an8-bit code accordi
19、ng to ISO/IEC2022 or ISO/IEC4873 at level1. This part of ISO/IEC8859 may not be used in conjunction with any other parts of ISO/IEC8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC10367 s
20、hould be used instead within a version of ISO/IEC4873 at level2 or level3. The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC6429. However, control functions are not used to create composite graphic symbols from two or more graphic characte
21、rs (seeclause6). NOTEISO/IEC8859 is not intended for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point. 2 Conformance 2.1 Conformance of
22、information interchange A coded-character-data-element (CC-data-element) within coded information for interchange is in conformance with this part of ISO/IEC8859 if all the coded representations of graphic characters within that CC-data-element conform to the requirements of clause6. 2.2 Conformance
23、 of devices A device is in conformance with this part of ISO/IEC8859 if it conforms to the requirements of2.2.1, and either or both of2.2.2 and2.2.3. A claim of conformance shall identify the document which contains the description specified in2.2.1. 2.2.1 Device description A device that conforms t
24、o this part of ISO/IEC8859 shall be the subject of a description that identifies the means by which the user may supply characters to the device, or may recognize them when they are made available to him, as specified respectively in2.2.2 and2.2.3. 2.2.2 Originating devices An originating device sha
25、ll allow its user to supply any sequence of characters from those specified in clause6, and shall be capable of transmitting their coded representations within a CC-data-element. 2.2.3 Receiving devices A receiving device shall be capable of receiving and interpreting any coded representations of ch
26、aracters that are within a CC-data-element, and that conform to clause6, and shall make the corresponding characters available to its user in such a way that the user can identify them from among those specified there, and can distinguish them from each other. 3 Normative references The following st
27、andards contain provisions which, through reference in this text, constitute provisions of this part of ISO/IEC8859. At the time of publication, the editions indicated were valid. All standards are subject to revision, and parties to agreements based on this part of ISO/IEC8859 are encouraged to inv
28、estigate the possibility of applying the most recent editions of the standards indicated below. Members of IEC and ISO maintain registers of currently valid International Standards. ISO/IEC 2022:1994, Information technology Character code structure and extension techniques. ISO/IEC 4873:1991, Inform
29、ation technology ISO 8-bit code for information interchange Structure and rules for implementation. ISO/IEC 8824-1:1995, Information technology Abstract Syntax Notation One (ASN.1): Specification of basic notation. 4 Definitions For the purposes of this part of ISO/IEC8859 the following definitions
30、apply: 4.1 bit combination an ordered set of bits used for the representation of characters 4.2 byte a bit string that is operated upon as a unitBSISO/IEC8859-6:1999 2 BSI 02-2000 4.3 character a member of a set of elements used for the organization, control, or representation of data 4.4 code table
31、 a table showing the characters allocated to each bit combination in a code 4.5 coded character set; code a set of unambiguous rules that establishes a character set and the one-to-one relationship between the characters of the set and their bit combinations 4.6 coded-character-data-element (CC-data
32、-element) an element of interchanged information that is specified to consist of a sequence of coded representations of characters, in accordance with one or more identified standards for coded character sets 4.7 graphic character a character, other than a control function, that has a visual represe
33、ntation normally handwritten, printed or displayed, and that has a coded representation consisting of one or more bit combinations NOTEIn ISO/IEC8859 a single bit combination is used to represent each character. 4.8 graphic symbol a visual representation of a graphic character or of a control functi
34、on 4.9 position that part of a code table identified by its column and row coordinates 5 Notation, code table and names 5.1 Notation The bits of the bit combinations of the8-bit code are identified by b 8 , b 7 , b 6 , b 5 , b 4 , b 3 , b 2 , and b 1 , where b 8is the highest-order, or most-signific
35、ant bit and b 1is the lowest-order, or least-significant bit. The bit combinations may be interpreted to represent numbers in binary notation by attributing the following weights to the individual bits: Using these weights, the bit combinations are identified by notations of the form xx/yy, where xx
36、 and yy are numbers in the range00 to15. The correspondence between the notations of the form xx/yy and the bit combinations consisting of the bits b 8to b 1is as follows: xx is the number represented by b 8 , b 7 , b 6andb 5where these bits are given the weights8,4,2, and1 respectively. yy is the n
37、umber represented by b 4 , b 3 , b 2and b 1where these bits are given the weights8,4,2, and1 respectively. The bit combinations are also identified by notations of the form hk, where h and k are numbers in the range 0 to F in hexadecimal notation. The number h is the same as the number xx described
38、above, and the number k the same as the number yy described above. 5.2 Layout of the code table An 8-bit code table consists of256 positions arranged in16columns and16rows. The columns and the rows are numbered00 to15. In hexadecimal notation the columns and the rows are numbered0 toF. The code tabl
39、e positions are identified by notations of the form xx/yy, where xx is the column number and yy is the row number. The column and row numbers are shown at the top and left edges of the table respectively. The code table positions are also identified by notations of the form hk, where h is the column
40、 number and k is the row number in hexadecimal notation. The column and row numbers are shown at the bottom and right edges of the table respectively. The positions of the code table are in one-to-one correspondence with the bit combinations of the code. The notation of a code table position, of the
41、 form xx/yy, or of the form hk, is the same as that of the corresponding bit combination. 5.3 Names and meanings This part of ISO/IEC8859 assigns a unique name and a unique identifier to each graphic character. These names and identifiers have been taken from ISO/IEC10646-1 (E). This part of ISO/IEC
42、8859 also specifies an acronym for each of the characters SPACE, NO-BREAK SPACE and SOFT HYPHEN. For acronyms only Latin capital letters A to Z are used. It is intended that the acronyms be retained in all translations of the text. Except for SPACE (SP), NO-BREAK SPACE (NBSP) and SOFT HYPHEN (SHY),
43、this part of ISO/IEC8859 does not define and does not restrict the meanings of graphic characters. Bit b 8 b 7 b 6 b 5 b 4 b 3 b 2 b 1 Weight 128 64 32 16 8 4 2 1BSISO/IEC8859-6:1999 BSI 02-2000 3 This part of ISO/IEC8859 specifies a graphic symbol for each graphic character. This symbol is shown in
44、 the corresponding position of the code table. However, this part, or any other part, of ISO/IEC8859 does not specify a particular style or font design for imaging graphic characters. Annex B of ISO/IEC10367 gives further information on this subject. 5.3.1 SPACE (SP) A graphic character the visual r
45、epresentation of which consists of the absence of a graphic symbol. 5.3.2 NO-BREAK SPACE (NBSP) A graphic character the visual representation of which consists of the absence of a graphic symbol, for use when a line break is to be prevented in the text as presented. 5.3.3 SOFT HYPHEN (SHY) A graphic
46、 character that is imaged by a graphic symbol identical with, or similar to, that representing HYPHEN, for use when a line break has been established within a word. 6 Specification of the coded character set This part of ISO/IEC8859 specifies146 characters allocated to the bit combinations of the co
47、de table (Table 2). Some of these characters are combining characters. They are identified in Table 1 as such. NOTECombining characters are described in ISO/IEC2022:1994 subclause6.3.3. The coded representation of a combining character shall follow that of the base character with which it is associa
48、ted. Any combining character may be associated with any non-combining character in the ranges12/01 to13/10 and14/01 to14/10 (hexadecimal C1 to DA and E1 to EA). Control functions, such as BACKSPACE or CARRIAGE RETURN, shall not be used to create composite graphic symbols, which are made up from the
49、graphic representations of two or more characters. NOTEThere is only one set of DIGITS in this part. How these will be imaged, is a matter of local conventions. In the code table, graphic symbols for the most common styles of writing digits are given next to each other. In this way data communication between various Arabic writing countries remains possible without code conversion. 6.1 Characters of the set and their coded representation See Table 1.BSISO/IEC8859-6:1999 4 BSI 02-2000 Table 1 Character set, coded representation Bi