NISO Z39 47-1993 Extended Latin Alphabet Coded Character Set for Bibliographic Use《文献用扩展拉丁字母编码字符集》.pdf

上传人:吴艺期 文档编号:1008932 上传时间:2019-03-19 格式:PDF 页数:28 大小:2.86MB
下载 相关 举报
NISO Z39 47-1993 Extended Latin Alphabet Coded Character Set for Bibliographic Use《文献用扩展拉丁字母编码字符集》.pdf_第1页
第1页 / 共28页
NISO Z39 47-1993 Extended Latin Alphabet Coded Character Set for Bibliographic Use《文献用扩展拉丁字母编码字符集》.pdf_第2页
第2页 / 共28页
NISO Z39 47-1993 Extended Latin Alphabet Coded Character Set for Bibliographic Use《文献用扩展拉丁字母编码字符集》.pdf_第3页
第3页 / 共28页
NISO Z39 47-1993 Extended Latin Alphabet Coded Character Set for Bibliographic Use《文献用扩展拉丁字母编码字符集》.pdf_第4页
第4页 / 共28页
NISO Z39 47-1993 Extended Latin Alphabet Coded Character Set for Bibliographic Use《文献用扩展拉丁字母编码字符集》.pdf_第5页
第5页 / 共28页
点击查看更多>>
资源描述

1、Extended Latin AlphabetCoded Character Setfor Bibliographic UseAbstract: This standard establishes computer codes for an extended Latinalphabet character set to be used in bibliographic work when handling non-English items. The standard addresses special characters in languages using theLatin alphab

2、et as well as combining marks (diacritics) required for romanizationand transliteration. This standard establishes the 7-bit and 8-bit code values.ANSI/NISO Z39.47-1993 (R2003) ISSN: 1041-5653Developed by theNational Information Standards OrganizationApproved May 3, 1993 by theAmerican National Stan

3、dards InstituteBethesda, Maryland, U.S.A.P r e s sPublished by NISO Press P.O. Box 1056 Bethesda, MD 20827 Copyright 01993 by the National Information Standards Organization All rights reserved under International and Pan-American Copyright Conventions. No part of this book may be reproduced or tran

4、smitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage or retrieval system, without prior permission in writing from the publisher. All inquiries should be addressed to NISO Press, PO. Box 1056, Bethesda, MD 20827. ISSN: 1041-5653 IS

5、BN: l-880124-02-5 Printed in the United States of America 0 m This paper meets the requirements of ANSI/NISO 239.48-1992 (Permanence of Paper). Library of Congress Cataloging-in-Publication Data Extended Latin alphabet coded character set for bibliographic use : American national standard extended L

6、atin alphabet coded character set for bibliographic use : approved May 3, 1993, by the American National Standards Institute / developed by the National Information Standards Organization. p. cm. - (National information standards series, ISSN 1041-5653) “This standard may be identified by the use of

7、 the notation ANSEL”-Foreword. ISBN l-880124-02-5 (pbk) 1. Character sets (Data processing)-Standards-United States. 2. Machine-readable bibliographic data-Standards-United States. 3. Cataloging-Data processing-Stan- dards-united States. I. American National Standards Institute. II. National Informa

8、tion Standards Organization (U.S.). III. Title: American national standard extended Latin alphabet coded character set for bibliographic use. IV. Title: ANSEL. V. Series. 2699.35.C48E98 1993 025.3 16- Greek characters a, b, and d and superscripts and subscripts for O-9, (, ), +, - are added through

9、escape sequences. The set just described constitutes what is commonly called the “ALA character set.” It is also the USMARC character set, and as such it is fully described in the publication USMARC Specifications for Record Structure, Character Sets, Tapes. It should be noted that the USMARC (or “A

10、LA”) set could be changed from the above descrip- tion, for example, to incorporate the characters in column 12, so the latest edition of the USMARC specifications document should be consulted for the exact specification. At the time of reaffirmation, the text of ANSI/NISO 239.47 was revised to a) d

11、elete references to six ANSI romanization standards which have been retired, b) incorporate three new definitions, c) clarify the lan- guage of the section dealing with character modifiers, and d) update/correct the appendixes. In the absence of published ANSI standards, it is recommended that the p

12、ublication ALA-LC Romanization Tables be con- sulted for guidance on romanization and translitera- tion of non-roman scripts. NISO acknowledges with thanks and appreciation the contributions of Randall K. Barry of the Library of Congress, Network Development and MARC Stan- dards Office, in revising

13、this standard. Suggestions for improving this standard are wel- come. They should be sent to the National Informa- tion Standards Organization, PO. Box 1056, Bethesda, MD 20827, (301) 975-2814. This standard was processed and approved for submittal to ANSI by the National Information Standards Organ

14、ization. NISO approval of this stan- dard does not necessarily imply that all Voting Mem- bers voted for its approval. At the time it approved this standard, NISO had the following members: American Society for Information Science Nolan Pope American Society of Indexers Jessica Milstead Patricia S.

15、Kuhr (Alt) American Theological Library Association Myron Chace Apple Computer, Inc. Karen Higginbottom Page iv ANWNISO 239.47-1993 Art Libraries Society of North America Patricia J. Barnett Pamela J. Parry (Ah) Association of American Mary Lou Menches University Presses Association of Information a

16、nd Dissemination Centers Bruce H. Kiesel Association for Information Management and Image Management Marilyn Courtot Association of Jewish Libraries Bella Hass Weinberg Pearl Berger (Alt) The Association for Recorded Sound Collections Donald McCormick Barbara Sawka (Alt) As 8sociation of Research Du

17、ane Web ster Libraries AT for example, . : Spacing graphic character. A graphic character whose use is followed by the forward movement of the output device to the next character position. For the purposes of this standard, the term includes special characters, special symbols, and punctuation marks

18、. Special character- An alphabetic character other than A-Z or other spacing graphic character; for example, cle. Special symbol -A conventional sign used in of words or word groups; for example, 0 % I place 1414 fl 14/5 Cl 14/6 B 1417 fi 1418 Ei 1419 fi 14/10 ?j L. 14/l 1 fi 14/12 ?I i 14/13 9 F G

19、14/14 _ 1517 g 1518 f-l Y 1519 a 15/10 fi 15/11 ?q L. 15114 ? Q low rising tone mark grave accent acute accent circumflex accent tilde matron breve dot above umlaut (diaeresis) haEek (caron) circle above (angstrom) ligature, left half ligature, right half high comma, off center double acute accent c

20、andrabindu cedilla right hook dot below double dot below circle below double underscore underscore left hoof right cedilla half circle below (upadhmaniy a) double tilde, left half double tilde, right half high comma, centered cui regle esta meme niiio -. -. gaJ eJs alta iaba iippna vzdy bar akademiE

21、 akademiZ rozdelovac idoszaki Alirev ca vietg teda kh tbah San;skrta Ghulam iamar dZrziga khQng humantus galan alan geotermika Page 7 ANSIRVISO 239.47-1993 Table 7. ASCII control characters Column/row Mnemonic o/o NUL O/l SOH o/2 STX o/3 ETX o/4 EOT o/5 ENQ O/6 ACK o/7 BEL O/8 BS o/9 HT o/10 LF o/11

22、 VT o/12 FF o/13 CR o/14 so o/15 SI l/O DLE l/l DC1 l/2 DC2 l/3 DC3 l/4 DC4 l/5 NAK l/6 SYN l/7 ETB l/8 CAN l/9 EM l/10 SUB l/11 ESC l/12 FS l/13 GS l/14 RS l/15 us 7/15 DEL Meaning null start of heading start of text end of text end of transmission enquiry acknowledge bell backspace horizontal tabu

23、lation line feed vertical tabulation form feed carriage return shift out shift in data link escape device control 1 device control 2 device control 3 device control 4 negative acknowledge synchronous idle end of transmission block cancel end of medium substitute escape file separator group separator

24、 record separator unit separator delete Page 8 . ANWNISO 239.47-1993 Table 8. ASCII graphic characters Column/row 2/o 2/l 212 213 214 215 216 217 218 219 2110 2111 2112 2113 2114 2115 Graphic . ! U # $ % acute accent) opening parenthesis closing parenthesis asterisk plus L comma (cedilla) hyphen (mi

25、nus) period (decimal point) slant (slash) 3/o to 3/9 3110 3/l 1 3112 3113 3114 3115 0.9 . . . 9 ? digits 0 through 9 colon semicolon less than equals greater than question mark 4/o 4/l to 5110 A.Z 5111 5112 5113 1 5114 A 5115 - commercial at Latin letters A through Z - uppercase opening bracket reve

26、rse slant (backslash) closing bracket circumflex underline 6/O 6/l to 7110 7111 7112 7113 7114 opening single quotation mark (grave accent) Latin letters a through z - lowercase opening brace (opening curly bracket) vertical line (pipe) closing brace (closing curly bracket) tilde ANWNISO 239.47-1993

27、 Appendix A Languages using ANSEL character modifiers or special characters (This appendix is not part of American National Standard ANWNISO 239.47-1993 and is included for information only.) In Tables Al and A2, the Latin alphabet script languages and transliterated non-Latin alphabet script langua

28、ges are listed with ANSEL character modifiers and special characters found in each language. Only lowercase letters are shown unless the uppercase form would not be obvious, in which case, it is shown in parentheses. In cases in which character modifiers are used as pronunciation signs (as in Afrika

29、ans or Vietnamese), only a few of the possible character modifiers with alphabetic character combinations are shown. The transliteration characters are those required by the ALA-LC romanization tables.2 ?he romanization schemes adopted by the American Library Association and the Library of Congress

30、are published under the title: ALA-LC Romanization Tables. For fkther information contact the Cataloging Policy and Support Office, Library of Congress, Washington, DC 20540-4305. Table Al. Latin alphabet script languages Language Afrikaans Albanian Anglo-Saxon Catalan Croatian Czech Danish Dutch Es

31、peranto Estonian Faroese Finnish French German Hawaiian Hungarian Icelandic (Modern) Indonesian Italian Latvian Special characters zxE6 . d ZI30 S&5 oe 6 9 (Continued) Page 10 ANSI/NISO 239.47-1993 Table Al. Latin alphabet script languages (Concluded) Language Character modifiers Special characters

32、Lithuanian Navaho Norwegian Polish Portuguese Romanian Slovak Slovene Spanish Swedish Tagalog Turkish (Modern) Vietnamese Wendic Page 11 ANSUNISO 239.47-1993 Table A2. Transliterated non-Latin alphabet script languages Language Character modifiers Special characters Amharic Arabic Armenian Assamese

33、Belorussian Bengali . BraJ Bulgarian Burmese Chinese Church Slavic Dogri Georgian Greek Gujarati Hebrew Hindi Japanese Kannada Khmer Konkani Korean Lahnda Page 12 ANSUNISO 239.47-1993 Page 13 ANSUNISO 239.47-1993 Page 14 ANWNISO 239.47-1993 Appendix B ANSEL Character modifiers and special characters

34、 with languages of occurrence (This appendix is not part of American National Standard ANWNISO 239.47-1993 and is included for information only.) In the following tables, ANSEL character modifiers and special characters are listed with the languages in which each occurs. Double character modifiers t

35、hat are sometimes used in Vietnamese and other languages are not given a separate entry. The transliterations take into consideration the tables included in Appendix A. Table Bl. Character modifiers Name Examples Languages acute accent Albanian, Catalan, Croatian, Czech, Dutch Faroese, French, Hawai

36、ian, Hungarian, Icelandic (Modern), Navaho, Polish, Portuguese, Slovak, Slovene, Spanish, Tagalog, Vietnamese, Wendic Transliterated: Church Slavic, Gujarati, Hebrew, Hindi, Kannada, Khmer, Konkani, Maithili, Malayalam, Manipuri, Marathi, Mewari, Nepali, Oriya, Pahari, Persian, Prakrit, Pushto, Raja

37、sthani, Sanskrit, Serbian, Sindhi, Sinhalese, Tamil, Telugu, Tibetan, Urdu, Yiddish breve Transliterated: Belorussian, Braj, Bulgarian, Chinese, Church, Slavic, Dogri, Georgian, Hindi, Khmer, Korean, Lahnda, Maithili, Mewari, Nepali, Pahari, Panjabi, Rajasthani, Russian, Sindhi, Ukrainian (Continued

38、) Page 15 ANSUNISO 239.47-1993 Table Bl. Character modifiers (Continued) Name Examples candrabindu mfifi cedilla cs circle above (angstrom) 0 AU circle below lr 0 0 circumflex accent 2eeghJ” ssutiy dot above Languages _I Transliterated: Assamese, Bengali, Braj, Bulgarian, Hindi Maithili, Manipuri, M

39、ewari, Nepali, Oriya, Pahari d cP (Continued) Page ANWNISO 239.47-1993 k w * (0) / Table B2. Special characters (ConcZuded) Name Examples ligature ae = ( Languages Anglo-Saxon, Danish, Faroese, Icelandic (Modern), Navaho, Norwegian Transliterated: Lao, Thai ligature oe Anglo-Saxon, French Transliter

40、ated: Lao, Thai middle dot . Catalan Transliterated: Khmer slash d Croatian, Vietnamese Transliterated: Macedonian, Serbian slash 1 slash o soft sign (msgkii znak) Navaho, Polish, Wendic Danish, Faroese, Norwegian Transliterated: Arabic, Belorussian, Church, Slavic, Hebrew, Khmer, Persian, Pushto, R

41、ussian, Tibetan, Ukrainian, Yiddish thorn b (7) Anglo-Saxon, Icelandic (Modern) Page 21 ANSI/NISO Z39.47-1993 (R2003)ERRATA SHEET1. ANSI/NISO Z39.47-1993 is Registration # 231 in the ISO International Register ofCoded Character Sets to be Used with Escape Sequences. It is available at this url:http:/www.itscj.ipsj.or.jp/ISO-IR/231.pdf2. Page 7, Table 6 the character encoded as “7/7“ (7-bit) and 15/7 (8-bit) is named “lefthook“ (not “left hoof“).3. Page 11, Table A1 for the Vietnamese language delete the character modifier a with amacron.

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 标准规范 > 国际标准 > 其他

copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1