INCITS/ISO/IEC 8859-8:1999 R2010 (ISO/IEC 8859-8:1999, IDT)

Information technology - 8-bit single-byte coded graphic character sets - Part 8: Latin/Hebrew alphabet

Reaffirmed as INCITS/ISO/IEC 8859-8:1999 R2015

PDF disclaimer

This PDF file may contain embedded typefaces.
In accordance with Adobe's licensing policy, this file may be printed or viewed but shall not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In downloading this file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat accepts no liability in this area.

Adobe is a trademark of Adobe Systems Incorporated.

Details of the software products used to create this PDF file can be found in the General Info relative to the file; the PDF-creation parameters were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In the unlikely event that a problem relating to it is found, please inform the Central Secretariat at the address given below.

Adopted by INCITS (InterNational Committee for Information Technology Standards) as an American National Standard.

Date of ANSI Approval: 1/4/2010

Published by American National Standards Institute, 25 West 43rd Street, New York, New York 10036

Copyright 2010 by Information Technology Industry Council (ITI). All rights reserved.

These materials are subject to copyright claims of International Standardization Organization (ISO), International Electrotechnical Commission (IEC), American National Standards Institute (ANSI), and Information Technology Industry Council (ITI). Not for resale. No part of this publication may be reproduced in any form, including an electronic retrieval system, without the prior written permission of ITI. All requests pertaining to this standard should be submitted to ITI, 1101 K Street NW, Suite 610, Washington DC 20005.

Printed in the United States of America

ISO/IEC 8859-8:1999 (E)

Foreword

ISO
(the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations, governmental and nongovernmental, in liaison with ISO and IEC, also take part in the work.

In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1. Draft International Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication as an International Standard requires approval by at least 75% of the national bodies casting a vote.

International Standard ISO/IEC 8859-8 was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology, Subcommittee SC 2, Coded character sets.

This edition cancels and replaces ISO 8859-8:1988, which has been technically revised.

ISO/IEC 8859 consists of the following parts, under the general title Information technology - 8-bit single-byte coded graphic character sets:

- Part 1: Latin alphabet No. 1
- Part 2: Latin alphabet No. 2
- Part 3: Latin alphabet No. 3
- Part 4: Latin alphabet No. 4
- Part 5: Latin/Cyrillic alphabet
- Part 6: Latin/Arabic alphabet
- Part 7: Latin/Greek alphabet
- Part 8: Latin/Hebrew alphabet
- Part 9: Latin alphabet No. 5
- Part 10: Latin alphabet No. 6

Annexes A to D of this part of ISO/IEC 8859 are for information only.

Introduction

ISO/IEC 8859 consists of several parts. Each part specifies a set of up to 191 graphic characters and the coded representation of these characters by means of a single 8-bit byte. Each set is intended for use for a particular group of languages.

INTERNATIONAL STANDARD

Information technology - 8-bit single-byte coded graphic character sets - Part 8: Latin/Hebrew alphabet

1 Scope

This part of ISO/IEC 8859 specifies a set of 155 coded graphic characters identified as Latin/Hebrew alphabet.

This set of coded graphic characters is intended for use in data and text processing applications and also for information interchange.

The set contains graphic characters used for general purpose applications in typical office environments in at least the following languages: English, Hebrew, Latin.

It is not intended for pointed Hebrew.

This set of coded graphic characters may be regarded as a version of an 8-bit code according to ISO/IEC 2022 or ISO/IEC 4873 at level 1.

This
part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If coded characters from more than one part are to be used together, by means of code extension techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a version of ISO/IEC 4873 at level 2 or level 3.

The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two or more graphic characters (see clause 6).

NOTE ISO/IEC 8859 is not intended
for use with Telematic services defined by ITU-T. If information coded according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of those services at the access-point.

2 Conformance

2.1 Conformance of information interchange

A coded-character-data-element
(CC-data-element) within coded information for interchange is in conformance with this part of ISO/IEC 8859 if all the coded representations of graphic characters within that CC-data-element conform to the requirements of clause 6.

2.2 Conformance of devices

A device is in conformance with this part of ISO/IEC 8859 if it conforms to the requirements of 2.2.1, and either or both of 2.2.2 and 2.2.3. A claim of conformance shall identify the document which contains the description specified in 2.2.1.

2.2.1 Device description

A device that conforms to this part of ISO/IEC 8859 shall be the subject of a description that identifies the means by which the user may supply characters to the device, or may recognize them when they are made available to him, as specified respectively in 2.2.2 and 2.2.3.

2.2.2 Originating devices

An originating device shall allow its user to supply any sequence of characters from those
specified in clause 6, and shall be capable of transmitting their coded representations within a CC-data-element.

2.2.3 Receiving devices

A receiving device shall be capable of receiving and interpreting any coded representations of characters that are within a CC-data-element, and that conform to clause 6, and shall make the corresponding characters available to its user in such a way that the user can identify them from among those specified there, and can distinguish them from each other.

3 Normative references

The following standards contain provisions which, through reference in this text, constitute provisions of this part of ISO/IEC 8859. At the time of publication, the editions indicated were valid. All standards are subject to revision, and parties to agreements based on this part of ISO/IEC 8859 are encouraged to investigate the possibility of applying the most recent editions of the standards indicated below. Members of IEC and ISO maintain registers of currently valid International Standards.

ISO/IEC 2022:1994, Information technology - Character code structure and extension techniques.

ISO/IEC 4873:1991, Information technology - ISO 8-bit code for information interchange - Structure and rules for implementation.

ISO/IEC 8824-1:1995, Information technology - Abstract Syntax Notation One (ASN.1): Specification of basic notation.

4 Definitions

For the purposes of this part of ISO/IEC 8859 the following definitions apply:

4.1 bi-directional text: A text which may contain strings of characters with left-to-right and right-to-left directions.

4.2 bit combination: An ordered set of bits used for the representation of characters.

4.3 byte: A bit string that is operated upon as a unit.

4.4 character: A member of a set of elements used for the organization, control, or representation of data.

4.5 code table: A table showing the characters allocated to each bit combination in a code.

4.6 coded character set; code: A set of unambiguous rules that establishes a character set and the one-to-one relationship between the characters of the set and their bit combinations.

4.7 coded-character-data-element (CC-data-element): An element of interchanged information that is specified to consist of a sequence of coded representations of characters, in accordance with one or more identified standards for coded character sets.

4.8 directional character properties: A set of mutually exclusive properties which may qualify the members of a character set. These properties are used by algorithms which transform text from processing sequence into presentation sequence. Examples of values for directional character properties are "right-to-left", "left-to-right", "digit", "numeric separator", "neutral".

4.9 graphic character: A character, other than a control function, that has a visual representation normally handwritten, printed or displayed, and that has a coded representation consisting of one or more bit combinations.

NOTE In ISO/IEC 8859 a single bit combination is used to represent each character.

4.10 graphic symbol: A visual representation of a graphic character or of a control function.

4.11 implicit directionality: A text presentation method in which the direction is determined by an algorithm. The algorithm is based on the directional character properties of the character, its position relative to the preceding and following character and to the primary direction.

4.12 left-to-right character: A character specific to a script written from left to right like the Latin script or the Greek script. Typical examples are the letters A to Z.

4.13 position: That part of a code table identified by its column and row coordinates.

4.14 right-to-left character: A character specific to a script written from right to left like the Arabic script or the Hebrew script. Typical examples are the letters of the Hebrew alphabet.

5 Notation, code table and names

5.1 Notation

The bits of the bit combinations of the 8-bit
code are identified by b8, b7, b6, b5, b4, b3, b2, and b1, where b8 is the highest-order, or most-significant bit and b1 is the lowest-order, or least-significant bit.

The bit combinations may be interpreted to represent numbers in binary notation by attributing the following weights to the individual bits:

Bit      b8   b7   b6   b5   b4   b3   b2   b1
Weight   128  64   32   16   8    4    2    1

Using these weights, the bit combinations are identified by notations of the form xx/yy, where xx and yy are numbers in the range 00 to 15. The correspondence between the notations of the form xx/yy and the bit combinations consisting of the bits b8 to b1 is as follows:

- xx is the number represented by b8, b7, b6 and b5 where these bits are given the weights 8, 4, 2, and 1 respectively.
- yy is the number represented by b4, b3, b2 and b1 where these bits are given the weights 8, 4, 2, and 1 respectively.

The bit combinations are also identified by notations of the form hk, where h and k are numbers in the range 0 to F in hexadecimal notation. The number h is the same as the number xx described above, and the number k the same as the number yy described above.

5.2 Layout of the code table

An 8-bit code table consists of 256 positions arranged in
16 columns and 16 rows. The columns and the rows are numbered 00 to 15. In hexadecimal notation the columns and the rows are numbered 0 to F.

The code table positions are identified by notations of the form xx/yy, where xx is the column number and yy is the row number. The column and row numbers are shown at the top and left edges of the table respectively. The code table positions are also identified by notations of the form hk, where h is the column number and k is the row number in hexadecimal notation. The column and row numbers are shown at the bottom and right edges of the table respectively.

The positions of the code table are in one-to-one correspondence with the bit combinations of the code. The notation of a code table position, of the form xx/yy, or of the form hk, is the same as that of the corresponding bit combination.

5.3 Names and meanings

This part of ISO/IEC 8859 assigns a unique name and a unique identifier to each graphic character. These names and identifiers have been taken from ISO/IEC 10646-1 (E). This part of ISO/IEC 8859 also specifies an acronym for each of the characters SPACE, NO-BREAK SPACE, SOFT HYPHEN, LEFT-TO-RIGHT MARK and RIGHT-TO-LEFT MARK. For acronyms only Latin capital letters A to Z are used. It is intended that the acronyms be retained in all translations of the text.

Except for SPACE (SP), NO-BREAK SPACE (NBSP), SOFT HYPHEN (SHY), LEFT-TO-RIGHT MARK (LRM) and RIGHT-TO-LEFT MARK (RLM), this part of ISO/IEC 8859 does not define and does not restrict the meanings of graphic characters.

This part of ISO/IEC 8859 specifies a graphic symbol for each graphic character. This symbol is shown in the corresponding position of the code table. However, this part, or any other part, of ISO/IEC 8859 does not specify a particular style or font design for imaging graphic characters. Annex B of ISO/IEC 10367 gives further information on this subject.

5.3.1 SPACE (SP)

A graphic character the visual representation of which consists of the absence of a graphic symbol.

5.3.2 NO-BREAK SPACE (NBSP)

A graphic character the visual representation of which consists of the absence of a graphic symbol, for use when a line break is to be prevented in the text as presented.

5.3.3 SOFT HYPHEN (SHY)

A graphic character that is imaged by a graphic symbol identical with, or similar to, that representing HYPHEN, for use when a line break has been established within a word.

5.3.4 LEFT-TO-RIGHT MARK (LRM)

A graphic character the visual representation of which consists of the absence of a graphic symbol, which acts like a left-to-right character in a bi-directional text (such as LATIN SMALL LETTER A).

5.3.5 RIGHT-TO-LEFT MARK (RLM)

A graphic character the visual representation of which consists of the absence of a graphic symbol, which acts like a right-to-left character in a bi-directional text (such as HEBREW LETTER ALEF).

6 Specification of the coded character set

This part of ISO/IEC 8859 specifies 155 characters allocated to the bit combinations of the code table (table 2).

Control functions, such as BACKSPACE or CARRIAGE RETURN, shall not be used to create composite
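
EXAMPLE (informative sketch, not part of the text of ISO/IEC 8859-8) The xx/yy and hk notations of 5.1 and 5.2 can be derived mechanically from a bit combination: the four high-order bits b8 to b5 give the column and the four low-order bits b4 to b1 give the row. The Python sketch below illustrates this under stated assumptions. The function names `position_notations` and `notations_for_character` are hypothetical, and the use of Python's built-in `iso8859_8` codec to obtain the bit combination of a character is an assumption about the reader's environment rather than anything this part of ISO/IEC 8859 prescribes.

```python
from typing import Tuple


def position_notations(bit_combination: int) -> Tuple[str, str]:
    """Return the (xx/yy, hk) notations for one 8-bit combination.

    Hypothetical helper: xx is the column (bits b8..b5) and yy is the
    row (bits b4..b1), each read with the weights 8, 4, 2 and 1.
    """
    if not 0 <= bit_combination <= 0xFF:
        raise ValueError("an 8-bit combination lies in the range 0 to 255")
    column = bit_combination >> 4    # b8, b7, b6, b5
    row = bit_combination & 0x0F     # b4, b3, b2, b1
    decimal_form = f"{column:02d}/{row:02d}"  # xx/yy, each 00 to 15
    hex_form = f"{column:X}{row:X}"           # hk, each 0 to F
    return decimal_form, hex_form


def notations_for_character(character: str) -> Tuple[str, str]:
    """Encode one character with Python's iso8859_8 codec (an assumption)
    and return the notations of its code table position."""
    encoded = character.encode("iso8859_8")
    if len(encoded) != 1:
        raise ValueError("expected exactly one character")
    return position_notations(encoded[0])


if __name__ == "__main__":
    print(position_notations(0x41))           # ('04/01', '41')
    print(notations_for_character("\u05d0"))  # ('14/00', 'E0')
```

In this sketch LATIN CAPITAL LETTER A (bit combination 0100 0001) falls at position 04/01, written 41 in hexadecimal notation, and HEBREW LETTER ALEF falls at 14/00, written E0. Both notations name the same position, which matches the statement in 5.2 that the notation of a code table position is the same as that of the corresponding bit combination.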