1、 Reference number ISO 30042:2008(E) ISO 2008INTERNATIONAL STANDARD ISO 30042 First edition 2008-12-15 Systems to manage terminology, knowledge and content TermBase eXchange (TBX) Systmes de gestion de la terminologie, de la connaissance et du contenu TermBase eXchange (TBX) ISO 30042:2008(E) PDF dis
2、claimer This PDF file may contain embedded typefaces. In accordance with Adobes licensing policy, this file may be printed or viewed but shall not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In downloading this file, parties
3、 accept therein the responsibility of not infringing Adobes licensing policy. The ISO Central Secretariat accepts no liability in this area. Adobe is a trademark of Adobe Systems Incorporated. Details of the software products used to create this PDF file can be found in the General Info relative to
4、the file; the PDF-creation parameters were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In the unlikely event that a problem relating to it is found, please inform the Central Secretariat at the address given below. COPYRIGHT PRO
5、TECTED DOCUMENT ISO 2008 All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISOs
6、member body in the country of the requester. ISO copyright office Case postale 56 CH-1211 Geneva 20 Tel. + 41 22 749 01 11 Fax + 41 22 749 09 47 E-mail copyrightiso.org Web www.iso.org Published in Switzerland ii ISO 2008 All rights reservedISO 30042:2008(E) Contents 1 Scope 1 2 Normative references
7、 1 3 Terms and definitions 2 4 Relationship to other standards .5 5 Applications of TBX .5 6 Fundamental principles .5 6.1 General .5 6.2 Principles relating to grouping and representing data-categories .6 7 Requirements for TBX files .7 7.1 Compliance requirements 7 7.2 Examples of non-compliance .
8、8 7.3 Implementation levels .8 8 The core-structure module 8 8.1 Introduction .8 8.2 Hierarchy .9 8.3 Components of a terminological entry 10 8.4 Elements that can appear at multiple levels of the entry .10 8.5 Elements that occur only at the term level or lower 11 8.6 Handling of text .12 8.7 Meta
9、data elements .14 8.8 Attributes .15 8.9 Character set issues .16 8.10 Language .16 9 The default data-category constraints .16 9.1 Introduction .16 9.2 Data-categories built into the core structure DTD of TBX .17 9.3 Data-categories specialized from meta data-categories through the default XCS file
10、 .17 10 Examples 21 10.1 Example of a typical TBX file .21 10.2 Examples of encoding TBX elements .22 10.3 Examples of TBX entries 23 11 Referencing objects .25 11.1 General information about referencing .25 11.2 Referencing a file that is embedded in the back matter of a TBX file .26 11.3 Referenci
11、ng a file from the back matter 26 11.4 Referencing a file directly in the entry 27 11.5 Referencing an external source .27 11.6 Referencing and documenting a bibliographic source 27 11.7 Referencing and documenting information about a responsible person or organization 28 11.8 Referencing an externa
12、l concept system, classification system, or thesaurus 29 11.9 Referencing a TBX entry from within a corpus 29 12 Creating customized TBX TMLs .29 12.1 General information about TMLs .29 12.2 Example of an XCS file for a user-defined TBX TML 30 12.3 Creating customized picklist display names 31 Annex
13、 A (Normative) DTD for the core structure module .33 Annex B (Normative) DTD for the data-category constraints (XCS file) .38 Annex C (Normative) Default XCS file .40 C.1 Introduction .40 C.2 XCS file for the default data-categories and constraints 40 Annex D (Normative) Descriptions of the core str
14、ucture elements and attributes and the default data- categories 48 Annex D.1 General information about the descriptions 48 D.2 Macros .48 ISO 2008 All rights reserved iii ISO 2008 All rights reserved iiiISO 30042:2008(E) D.3 Attribute classes 49 D.4 Elements .50 D.5 Default data-categories 61 Annex
15、E (Normative) Descriptions of elements and attributes for the XCS file 73 E.1 Introduction .73 E.2 Attribute classes .73 E.3 Elements 73 Annex F (Informative) Integrated schema and other TBX resources .81 Annex G (Informative) TBX-Basic 82 Annex H (Informative) Summary of changes 83 Annex I (Informa
16、tive) Indexes 88 I.1 Core-module DTD .88 I.2 XCS DTD .89 I.3 Terminological data-categories 89 Bibliography 91 iv ISO 2008 All rights reservedISO 30042:2008(E) Foreword The International Organization for Standardization (ISO) is a worldwide federation of national standards bodies (ISO member bodies)
17、. The work of preparing International Standards is normally carried out through ISO technical committees. Each member body interested in a subject for which a technical committee has been established has the right to be represented on that committee. International organizations, governmental and non
18、-governmental, in liaison with ISO, also take part in the work. ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of electrotechnical standardization. International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part
19、 2. The main task of ISO technical committees is to prepare International Standards. Draft International Standards adopted by the technical committees are circulated to the member bodies for voting. Publication as an International Standard requires approval by at least 75 % of the member bodies cast
20、ing a vote. Attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. ISO shall not be held responsible for identifying any or all such patent rights. ISO 30042 was prepared by LISA OSCAR and was adopted, under a special “fast-track procedu
21、re“, by Technical Committee ISO/TC 37, Terminology and other language and content resources , Subcommittee SC 3, Systems to manage terminology, knowledge and content , in parallel with its approval by the ISO member bodies. The Localization Industry Standards Association (LISA - www.lisa.org ) is th
22、e standards organization for the globalization industry. Within LISA, the OSCAR (Open Standards for Container/content Allowing Reuse) Special Interest Group develops XML-based standards for automated language-processing in the areas of globalization, internationalization, localization, and translati
23、on, including standards for translation memory, terminology, text memory, word/character counts, and other related areas. The main task of the OSCAR Special Interest Group is to develop standards to facilitate and automate the globalization of products and services in a way that supports local langu
24、age and culture conventions. Publication as an OSCAR standard requires approval by the OSCAR steering committee. An earlier version of TBX was developed and published by LISA in 2002. TBX and the TBX logo are registered trademarks of LISA, and the TBX logo is subject to terms of use as defined by LI
25、SA. LISA maintains copyright on the TBX specification that is available on the LISA Web site, and ISO maintains copyright on the TBX specification that it distributes as ISO 30042. The technical content of these two documents is identical, and is subject to joint maintenance by a team of ISO TC 37 a
26、nd LISA OSCAR members. ISO 2008 All rights reserved vISO 30042:2008(E) Introduction This International Standard defines an XML-based framework for representing structured terminological data referred to as TermBase eXchange (TBX). Within this framework, a variety of terminological markup languages (
27、TMLs) can be defined. A TML defined by TBX can facilitate the interchange of terminological data between users, which include people such as translators and writers, and applications and systems, such as Computer Assisted Translation tools and controlled authoring software. Therefore, it can be used
28、 for both human-oriented and machine- oriented terminological data. In this manner, it can enable the flow of terminological information throughout the information production cycle, both inside an organization and with outside service providers. The intended audience for this document consists of tw
29、o groups: (1) programmers and analysts who wish to develop software applications that process TBX-compliant data files; (2) terminologists and other language specialists who wish to analyse a terminological data collection for representation in TBX or to understand a TBX file. This version of TBX is
30、 an update of a version that was published by the Localization Industry Standards Association (LISA) in 2002. Among other enhancements, the current version provides reference to an integrated schema that includes the core-structure module and the data-category constraints in combined declarations us
31、ing the Relax NG and Schematron languages. It also provides reference to a TBX-compliant TML called TBX-Basic. Users of this International Standard should first study the body (clauses 1-12). The suggested use of annexes A-I is described below. (1) The core-structure module of TBX All TMLs within th
32、e TBX framework have the same core structure. The core-structure module is described in Clause 8. A DTD for the core-structure module is found in Annex A. The elements, attributes, and data types are described in Annex D, and listed alphabetically in Annex I. (2) The XCS module TMLs may differ with
33、respect to which data-categories are allowed, and at what levels of a terminological entry these data-categories can occur. These constraints on the core structure, which define a particular TML, are formally represented in an XCS file. A DTD for the XCS module is found in Annex B. The elements and
34、attributes are described in Annex E, and listed alphabetically in Annex I. (3) The default XCS of TBX The TBX-default TML is constrained by the default XCS file. The TBX default XCS is described in Clause 9. The default XCS file is provided in Annex C. The data-categories are described in Annex D, a
35、nd listed alphabetically in Annex I. (4) Compliance checking of TBX document instances Once a TBX TML has been defined by an XCS, a TBX document instance can be checked for compliance with that TML. The requirements for compliance are found in Clause 7. One can use a variety of methods and schema de
36、finition languages to check compliance. In particular, the Relax NG schema referred to in Annex F can be used to check whether a TBX document instance is compliant with the TBX-default TML. Annex F also indicates where a TBX user can find additional resources for compliance checking. Another TBX TML
37、, called TBX-Basic, is referred to in Annex G. (5) Changes that have been made to TBX since its submission to ISO in February 2007 are summarized in Annex H. Summary of annexes: A: DTD for core-structure module B: DTD for XCS module C: Default XCS that defines the TBX-default TML D: Descriptions of
38、core structure elements and attributes D.5: Descriptions of default data-categories E: Descriptions of XCS elements and attributes F: Relax NG schema and other resources for compliance checking G: Reference to TBX-Basic H: Summary of changes to TBX I: Indexes (alphabetical lists of elements and data
39、-categories) vi ISO 2008 All rights reservedSystems to manage terminology, knowledge, and content - TermBase eXchange (TBX) 1 Scope The TBX framework defined by this International Standard is designed to support various types of processes involving terminological data, including analysis, descriptiv
40、e representation, dissemination, and interchange (exchange), in various computer environments. The primary purpose of TBX is for interchange of terminological data. It is limited in its ability to represent presentational markup. Intended application areas include translation and authoring. TBX is m
41、odular in order to support the varying types of terminological data, or data-categories , that are included in different terminological databases (termbases). TBX includes two modules: a core structure, and a formalism for identifying a set of data-categories and their constraints, both expressed in
42、 XML. The term TBX , when used alone, refers to the framework consisting of these two interacting modules. To maximize interoperability of the actual terminological data, TBX also provides a default set of data-categories that are commonly used in terminological databases. However, subsets or supers
43、ets of the default set of data-categories can be used within the TBX framework to support specific user requirements. TBX, when used with its default set of data-categories, qualifies as a terminological markup language (TML) as defined in ISO 16642, which will be referred to as the TBX-default TML
44、in this International Standard. Likewise, other markup languages that comply with TBX and use a subset of the default set of data-categories are also TMLs, but may go by other names, such as the one referred to in Annex G (Informative) TBX-Basic . 2 Normative references The following referenced docu
45、ments are indispensable for the application of this document. For dated references, only the edition cited applies. For undated references, the latest edition of the referenced document (including any amendments) applies. ISO 639-1:2002, Codes for the representation of names of languages Part 1: Alp
46、ha-2 code ISO 639-2:1998, Codes for the representation of names of languages Part 2: Alpha-3 code ISO 639-3:2007, Codes for the representation of names of languages Part 3: Alpha-3 code for comprehensive coverage of languages ISO/IEC 646:1991, Information technology ISO 7-bit coded character set for
47、 information interchange ISO 3166-1:2006, Codes for the representation of names of countries and their subdivisions Part 1: Country codes ISO 8601:2004, Data elements and interchange formats Information interchange Representation of dates and times ISO/IEC 10646, Information technology Universal Mul
48、tiple-Octet Coded Character Set (UCS) ISO 12200:1999, Computer applications in terminology Machine-readable terminology interchange format (MARTIF) Negotiated interchange ISO 12620, Computer applications in terminology Data categories ISO 16642:2003, Computer applications in terminology Terminologic
49、al markup framework ISO 2008 All rights reserved 1 INTERNATIONAL STANDARD ISO 30042:2008(E)ISO 30042:2008(E) 3 Terms and definitions For the purposes of this International Standard, the following terms and definitions apply: 3.1 analysis identification of the elements and structure of a terminological data collection in order to make explicit the data fields, their types, and their relationships 3.2 blindness property of a data format indicating the degree to which the data are sufficiently defined that it is unnecessary for the
copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1