1、BSI Standards Publication BS ISO 7098:2015 Information and documentation Romanization of ChineseBS ISO 7098:2015 BRITISH STANDARD National foreword This British Standard is the UK implementation of ISO 7098:2015. The UK participation in its preparation was entrusted to Technical Committee IDT/2, Inf
2、ormation and documentation. A list of organizations represented on this committee can be obtained on request to its secretary. This publication does not purport to include all the necessary provisions of a contract. Users are responsible for its correct application. The British Standards Institution
3、 2015. Published by BSI Standards Limited 2015 ISBN 978 0 580 79721 7 ICS 01.140.10 Compliance with a British Standard cannot confer immunity from legal obligations. This British Standard was published under the authority of the Standards Policy and Strategy Committee on 31 December 2015. Amendments
4、/corrigenda issued since publication Date T e x t a f f e c t e dBS ISO 7098:2015 ISO 2015 Information and documentation Romanization of Chinese Information et documentation Romanisation du chinois INTERNATIONAL STANDARD ISO 7098 Third edition 2015-12-15 Reference number ISO 7098:2015(E)BS ISO 7098:
5、2015ISO 7098:2015(E)ii ISO 2015 All rights reserved COPYRIGHT PROTECTED DOCUMENT ISO 2015, Published in Switzerland All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including ph
6、otocopying, or posting on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below or ISOs member body in the country of the requester. ISO copyright office Ch. de Blandonnet 8 CP 401 CH-1214 Vernier, Geneva, Switzerland Tel. +41
7、 22 749 01 11 Fax +41 22 749 09 47 copyrightiso.org www.iso.orgBS ISO 7098:2015ISO 7098:2015(E)Foreword iv Introduction v 1 Scope . 1 2 T erms and definitions . 1 3 General principles of conversion of writing systems . 2 4 Principles for converting ideophonographic characters 3 5 Pinyin 4 6 Syllabic
8、 forms . 4 7 Tones . 5 8 Punctuation 7 9 Numerals . 7 10 Chinese Pinyin Orthography 7 11 Transcription rules for named entities 8 12 Automatic transcription for named entities .12 12.1 Fully automatic syllable transcription .12 12.2 Rule-based and semi-automatic word transcription 12 Annex A (normat
9、ive) Table of Chinese syllable forms 14 Annex B (normative) Table of hexadecimal codes of Chinese vowels with tones .16 Annex C (normative) Ambiguity index for Chinese syllables .17 Bibliography .18 ISO 2015 All rights reserved iii Contents PageBS ISO 7098:2015ISO 7098:2015(E) Foreword ISO (the Inte
10、rnational Organization for Standardization) is a worldwide federation of national standards bodies (ISO member bodies). The work of preparing International Standards is normally carried out through ISO technical committees. Each member body interested in a subject for which a technical committee has
11、 been established has the right to be represented on that committee. International organizations, governmental and non-governmental, in liaison with ISO, also take part in the work. ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of electrotechnical s
12、tandardization. The procedures used to develop this document and those intended for its further maintenance are described in the ISO/IEC Directives, Part 1. In particular the different approval criteria needed for the different types of ISO documents should be noted. This document was drafted in acc
13、ordance with the editorial rules of the ISO/IEC Directives, Part 2 (see www.iso.org/directives). Attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Deta
14、ils of any patent rights identified during the development of the document will be in the Introduction and/or on the ISO list of patent declarations received (see www.iso.org/patents). Any trade name used in this document is information given for the convenience of users and does not constitute an e
15、ndorsement. For an explanation on the meaning of ISO specific terms and expressions related to conformity assessment, as well as information about ISOs adherence to the WTO principles in the Technical Barriers to Trade (TBT) see the following URL: Foreword - Supplementary information The committee r
16、esponsible for this document is ISO/TC 46, Information and documentation. This third edition cancels and replaces the second edition (ISO 7098:1991), which has been technically revised. Annexes A, B and C form the integral parts of this International Standard.iv ISO 2015 All rights reservedBS ISO 70
17、98:2015ISO 7098:2015(E) Introduction The first edition of ISO 7098 was published in 1982 after ISO/TC 46 recognized the need for an International Standard specifying the Chinese phonetic alphabet. The second edition was published in 1991. This third edition is in response to new application needs, f
18、or instance to reflect current Chinese romanization practice and new developments in China and the rest of the world. ISO 2015 All rights reserved vBS ISO 7098:2015BS ISO 7098:2015Information and documentation Romanization of Chinese 1 Scope This International Standard explains the principles of the
19、 Romanization of Modern Chinese Putonghua (Mandarin Chinese), the official language of the Peoples Republic of China as defined in the Directives for the Promotion of Putonghua, promulgated on 1956-02-06 by the State Council of China. This International Standard can be applied in documentation of bi
20、bliographies, catalogues, indices, toponymic lists, etc. 2 T erms a nd definiti ons For the purposes of this document, the following terms and definitions apply. 2.1 character element of a writing system, whether or not alphabetical, that represents a phoneme, a syllable, a word or even prosodic cha
21、racteristics of the language, by using graphical symbols (letters, diacritical marks, syllabic signs, punctuation marks, prosodic accents, etc.) or a combination of these signs (a letter having an accent or a diacritical mark) EXAMPLE a, B, or are, therefore, characters as well as basic letters. 2.2
22、 alphabets ordered character set, the order of which has been agreed upon 2.3 alphabetical characters character set that contains letters (2.8) 2.4 alphanumeric characters character set that contains both letters (2.8) and digits 2.5 graphic character character that has a visual representation and i
23、s normally produced by writing, printing or displaying 2.6 ideophonographical character graphic character (2.6) that represents an object or a concept and is associated with a sound element in a natural language EXAMPLE Chinese hanzi 鹤(crane), Japanese kanji 戦(war) and Korean hanja 册(book) are ideop
24、honographical characters. 2.7 Chinese characters ideophonographical character set for recording the Chinese language Note 1 to entry: Chinese characters (hanzi) are also used in the writing systems of other languages. 2.8 letter graphic character (2.6) that, when appearing alone or combined with oth
25、ers, is primarily used to represent a sound element of a spoken language INTERNATIONAL ST ANDARD ISO 7098:2015(E) ISO 2015 All rights reserved 1BS ISO 7098:2015ISO 7098:2015(E) 2.9 word segmentation process of splitting text into a sequence of word segmentation unit SOURCE: ISO 24614-1:2010, 2.25 3
26、General principles of conversion of writing systems 3.1 The words in a language, which are written according to a given script (the converted system), sometimes have to be rendered according to a different system (the conversion system), normally used for a different language. This operation is ofte
27、n performed for historical or geographical texts, cartographical documents and, in particular, for bibliographical work in every case where it is necessary to write words supplied in various alphabets in a manner that allows intercalation with other words in a single alphabet so as to enable a unifo
28、rm alphabetization to be made in bibliographies, catalogues, indices, toponymic lists, etc. It is indispensable in that it permits the univocal transmission of a written message between two countries using different writing systems or exchanging a message, the writing of which is different from thei
29、r own. It, thereby, permits transmission by manual as well as mechanical or electronic means. The two basic methods of conversion of a system of writing are transliteration and transcription. 3.2 Transliteration is the operation which consists of representing the characters of an entirely alphabetic
30、al character or alphanumeric character system of writing by the characters of the conversion alphabet. In principle, this conversion should be made character by character: each character of the converted alphabet is rendered by one character, and one only of the conversion alphabet, to ensure the co
31、mplete and unambiguous reversibility of the conversion alphabet into the converted alphabet. When the number of characters used in the conversion system is smaller than the number of characters of the converted system, it is necessary to use digraphs or diacritical marks. In this case, one shall avo
32、id as far as possible arbitrary choices and the use of purely conventional marks and try to maintain a certain phonetic logic in order to give the system a wide acceptance. However, it shall be accepted that the graphism obtained may not always be correctly pronounced according to the phonetic habit
33、s of the language (or of all the languages) which usually use(s) the conversion alphabet. On the other hand, this graphism shall be such that the reader who knows the converted language may mentally restore unequivocally the original graphism and, thus, pronounce it correctly. 3.3 Retransliteration
34、is the operation which consists of converting the characters of a conversion alphabet to those of the converted alphabet. This operation is the exact opposite of transliteration; it is carried out by applying the rules of a system of transliteration in reverse order so as to reconstitute the transli
35、terated word to its original form. 3.4 Transcription is the operation which consists of representing the characters of a language, whatever the original system of writing, by the phonetic system of letters or signs of the conversion language. A transcription system is of necessity based on the ortho
36、graphical conventions of a conversion language and its alphabet. The users of a transcription system shall, therefore, have a knowledge of the conversion language to be able to pronounce the characters correctly. Transcription is not strictly reversible. Transcription may be used for the conversion
37、of all writing systems. It is the only method that can be used for systems that are not entirely alphabetical and for all ideophonographic writing systems (Chinese, Japanese, etc.).2 ISO 2015 All rights reservedBS ISO 7098:2015ISO 7098:2015(E) 3.5 Romanization is the conversion of non-Latin writing
38、systems to the Latin alphabet by means of transliteration or transcription. To carry out Romanization, it is possible to use either transliteration or transcription or a combination of these two methods, according to the nature of the converted system. 3.6 A conversion system proposed for internatio
39、nal use may call for compromise and the sacrifice of certain national customs. It is, therefore, necessary for each national community of users to accept concessions, fully abstaining in every case from imposing as a matter of course solutions that are actually justified only by national practice (f
40、or example, regarding pronunciation, orthography, etc.). However, these concessions would obviously not relate to the use that a country makes of its national writing system: when this national system is not converted, the characters constituting it shall be accepted in the form in which they are wr
41、itten in the national language. When a country uses two systems univocally, converting one into the other to write its own language, the system of transliteration thus implemented shall be taken a priori as a basis for the international standardized system, as far as it is compatible with the other
42、principles mentioned hereafter. 3.7 Where necessary, the conversion systems should specify an equivalent for each character, not only the letters but also the punctuation marks, numbers, etc. They should similarly take into account the arrangement of the sequence of characters that make up the text,
43、 for example, the direction of the script, and specify the way of distinguishing words and of using separation signs and capital letters, following as closely as possible the customs of the language(s) which use the converted writing system. 4 Principles for converting ideophonographic characters 4.
44、1 The structure of ideophonographic characters, where conveyance of meaning is of greater importance than that of pronunciation, entails the existence of a large number of characters (more than 60 000 in the case of Chinese), thus, making sign by sign transliteration impossible and resulting in the
45、need to devise a system of transcription. Each character shall, therefore, be transcribed by one or more Latin letters standing for the pronunciation or pronunciations of the character in question. This means that the transcriber shall be familiar with the reading or readings of the text to be trans
46、cribed. 4.2 In as much as the transcription of ideophonographic characters is merely a matter of phonetic notation in Latin letters of characters of the languages which use them, identical characters will require different transcriptions depending on whether they are found in Chinese, Japanese or Ko
47、rean texts. 4.3 On the other hand, the same character within the same language shall always be transcribed in the same way regardless of the type of graphic representation utilized (traditional form or simplified form of a Chinese character), except where a single character has more than one pronunc
48、iation. 4.4 Reversibility of Romanization systems of ideophonographic characters is impossible due to the following factors: the disparity in pronunciation of a given character in two different languages or within a single language; the high frequency of homophones within the same language (see Anne
49、x C); the possible coexistence of several writing systems within a given text. ISO 2015 All rights reserved 3BS ISO 7098:2015ISO 7098:2015(E) 4.5 In the case of those languages which use, even within the same text, more than one kind of script (for example Kana and Chinese characters in Japanese, Hangul and Chinese characters in Korean), both the transcription of the ideophonographic characters and the conversion of the other types of characters (for example Kana/Hangul) should yield a consistent and homogeneous system of Romanization.