BRITISH STANDARD
BS ISO 30042:2008

Systems to manage terminology, knowledge and content - TermBase eXchange (TBX)

ICS 01.020; 35.240.30

NO COPYING WITHOUT BSI PERMISSION EXCEPT AS PERMITTED BY COPYRIGHT LAW

This British Standard was published under the authority of the Standards Policy and Strategy Committee on 31 July 2009.
© BSI 2009
ISBN 978 0 580 64860 1

Amendments/corrigenda issued since publication
Date    Comments

National foreword

This British Standard is the UK implementation of ISO 30042:2008.

The UK participation in its preparation was entrusted to Technical Committee TS/1, Terminology.

A list of organizations represented on this committee can be obtained on request to its secretary.

This publication does not purport to include all the necessary provisions of a contract. Users are responsible for its correct application.

Compliance with a British Standard cannot confer immunity from legal obligations.

INTERNATIONAL STANDARD
ISO 30042
First edition 2008-12-15
Reference number ISO 30042:2008(E)
© ISO 2008

Systems to manage terminology, knowledge and content - TermBase eXchange (TBX)
Systèmes de gestion de la terminologie, de la connaissance et du contenu - TermBase eXchange (TBX)

PDF disclaimer

This PDF file may contain embedded typefaces. In accordance with Adobe's licensing policy, this file may be printed or viewed but shall not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In downloading this file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat accepts no liability in this area.

Adobe is a trademark of Adobe Systems Incorporated.

Details of the software products used to create this PDF file can be found in the General Info relative to the file; the PDF-creation parameters were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In the unlikely event that a problem relating to it is found, please inform the Central Secretariat at the address given below.

COPYRIGHT PROTECTED DOCUMENT

© ISO 2008

All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISO's member body in the country of the requester.

ISO copyright office
Case postale 56, CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.org
Web www.iso.org

Published in Switzerland

Contents

1 Scope
2 Normative references
3 Terms and definitions
4 Relationship to other standards
5 Applications of TBX
6 Fundamental principles
  6.1 General
  6.2 Principles relating to grouping and representing data-categories
7 Requirements for TBX files
  7.1 Compliance requirements
  7.2 Examples of non-compliance
  7.3 Implementation levels
8 The core-structure module
  8.1 Introduction
  8.2 Hierarchy
  8.3 Components of a terminological entry
  8.4 Elements that can appear at multiple levels of the entry
  8.5 Elements that occur only at the term level or lower
  8.6 Handling of text
  8.7 Meta data elements
  8.8 Attributes
  8.9 Character set issues
  8.10 Language
9 The default data-category constraints
  9.1 Introduction
  9.2 Data-categories built into the core structure DTD of TBX
  9.3 Data-categories specialized from meta data-categories through the default XCS file
10 Examples
  10.1 Example of a typical TBX file
  10.2 Examples of encoding TBX elements
  10.3 Examples of TBX entries
11 Referencing objects
  11.1 General information about referencing
  11.2 Referencing a file that is embedded in the back matter of a TBX file
  11.3 Referencing a file from the back matter
  11.4 Referencing a file directly in the entry
  11.5 Referencing an external source
  11.6 Referencing and documenting a bibliographic source
  11.7 Referencing and documenting information about a responsible person or organization
  11.8 Referencing an external concept system, classification system, or thesaurus
  11.9 Referencing a TBX entry from within a corpus
12 Creating customized TBX TMLs
  12.1 General information about TMLs
  12.2 Example of an XCS file for a user-defined TBX TML
  12.3 Creating customized picklist display names
Annex A (Normative) DTD for the core structure module
Annex B (Normative) DTD for the data-category constraints (XCS file)
Annex C (Normative) Default XCS file
  C.1 Introduction
  C.2 XCS file for the default data-categories and constraints
Annex D (Normative) Descriptions of the core structure elements and attributes and the default data-categories
  D.1 General information about the descriptions
  D.2 Macros
  D.3 Attribute classes
  D.4 Elements
  D.5 Default data-categories
Annex E (Normative) Descriptions of elements and attributes for the XCS file
  E.1 Introduction
  E.2 Attribute classes
  E.3 Elements
Annex F (Informative) Integrated schema and other TBX resources
Annex G (Informative) TBX-Basic
Annex H (Informative) Summary of changes
Annex I (Informative) Indexes
  I.1 Core-module DTD
  I.2 XCS DTD
  I.3 Terminological data-categories
Bibliography

Foreword

The International Organization for Standardization (ISO) is a worldwide federation of national standards bodies (ISO member bodies). The work of preparing International Standards is normally carried out through ISO technical committees. Each member body interested in a subject for which a technical committee has been established has the right to be represented on that committee. International organizations, governmental and non-governmental, in liaison with ISO, also take part in the work. ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of electrotechnical standardization.

International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.

The main task of ISO technical committees is to prepare International Standards. Draft International Standards adopted by the technical committees are circulated to the member bodies for voting. Publication as an International Standard requires approval by at least 75 % of the member bodies casting a vote.

Attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. ISO shall not be held responsible for identifying any or all such patent rights.

ISO 30042 was prepared by LISA OSCAR and was adopted, under a special "fast-track procedure", by Technical Committee ISO/TC 37, Terminology and other language and content resources, Subcommittee SC 3, Systems to manage terminology, knowledge and content, in parallel with its approval by the ISO member bodies.

The Localization Industry Standards Association (LISA - www.lisa.org) is the standards organization for the globalization industry. Within LISA, the OSCAR (Open Standards for Container/content Allowing Reuse) Special Interest Group develops XML-based standards for automated language-processing in the areas of globalization, internationalization, localization, and translation, including standards for translation memory, terminology, text memory, word/character counts, and other related areas. The main task of the OSCAR Special Interest Group is to develop standards to facilitate and automate the globalization of products and services in a way that supports local language and culture conventions. Publication as an OSCAR standard requires approval by the OSCAR steering committee. An earlier version of TBX was developed and published by LISA in 2002.

TBX and the TBX logo are registered trademarks of LISA, and the TBX logo is subject to terms of use as defined by LISA. LISA maintains copyright on the TBX specification that is available on the LISA Web site, and ISO maintains copyright on the TBX specification that it distributes as ISO 30042. The technical content of these two documents is identical, and is subject to joint maintenance by a team of ISO TC 37 and LISA OSCAR members.

Introduction

This International Standard defines an XML-based framework for representing structured terminological data referred to as TermBase eXchange (TBX). Within this framework, a variety of terminological markup languages (TMLs) can be defined. A TML defined by TBX can facilitate the interchange of terminological data between users, which include people such as translators and writers, and applications and systems, such as Computer Assisted Translation tools and controlled authoring software. Therefore, it can be used for both human-oriented and machine-oriented terminological data. In this manner, it can enable the flow of terminological information throughout the information production cycle, both inside an organization and with outside service providers.

The intended audience for this document consists of two groups: (1) programmers and analysts who wish to develop software applications that process TBX-compliant data files; (2) terminologists and other language specialists who wish to analyse a terminological data collection for representation in TBX or to understand a TBX file.

This version of TBX is an update of a version that was published by the Localization Industry Standards Association (LISA) in 2002. Among other enhancements, the current version provides reference to an integrated schema that includes the core-structure module and the data-category constraints in combined declarations using the Relax NG and Schematron languages. It also provides reference to a TBX-compliant TML called TBX-Basic.

Users of this International Standard should first study the body (clauses 1-12). The suggested use of annexes A-I is described below.

(1) The core-structure module of TBX
All TMLs within the TBX framework have the same core structure. The core-structure module is described in Clause 8. A DTD for the core-structure module is found in Annex A. The elements, attributes, and data types are described in Annex D, and listed alphabetically in Annex I.

(2) The XCS module
TMLs may differ with respect to which data-categories are allowed, and at what levels of a terminological entry these data-categories can occur. These constraints on the core structure, which define a particular TML, are formally represented in an XCS file. A DTD for the XCS module is found in Annex B. The elements and attributes are described in Annex E, and listed alphabetically in Annex I.

(3) The default XCS of TBX
The TBX-default TML is constrained by the default XCS file. The TBX default XCS is described in Clause 9. The default XCS file is provided in Annex C. The data-categories are described in Annex D, and listed alphabetically in Annex I.

(4) Compliance checking of TBX document instances
Once a TBX TML has been defined by an XCS, a TBX document instance can be checked for compliance with that TML. The requirements for compliance are found in Clause 7. One can use a variety of methods and schema definition languages to check compliance. In particular, the Relax NG schema referred to in Annex F can be used to check whether a TBX document instance is compliant with the TBX-default TML. Annex F also indicates where a TBX user can find additional resources for compliance checking. Another TBX TML, called TBX-Basic, is referred to in Annex G.

(5) Changes that have been made to TBX since its submission to ISO in February 2007 are summarized in Annex H.

Summary of annexes:
A: DTD for core-structure module
B: DTD for XCS module
C: Default XCS that defines the TBX-default TML
D: Descriptions of core structure elements and attributes
D.5: Descriptions of default data-categories
E: Descriptions of XCS elements and attributes
F: Relax NG schema and other resources for compliance checking
G: Reference to TBX-Basic
H: Summary of changes to TBX
I: Indexes (alphabetical lists of elements and data-categories)

Systems to manage terminology, knowledge, and content - TermBase eXchange (TBX)

1 Scope

The TBX framework defined by this International Standard is designed to support various types of processes involving terminological data, including analysis, descriptive representation, dissemination, and interchange (exchange), in various computer environments. The primary purpose of TBX is for interchange of terminological data. It is limited in its ability to represent presentational markup. Intended application areas include translation and authoring.

TBX is modular in order to support the varying types of terminological data, or data-categories, that are included in different terminological databases (termbases). TBX includes two modules: a core structure, and a formalism for identifying a set of data-categories and their constraints, both expressed in XML. The term TBX, when used alone, refers to the framework consisting of these two interacting modules.

To maximize interoperability of the actual terminological data, TBX also provides a default set of data-categories that are commonly used in terminological databases. However, subsets or supersets of the default set of data-categories can be used within the TBX framework to support specific user requirements.

TBX, when used with its default set of data-categories, qualifies as a terminological markup language (TML) as defined in ISO 16642, which will be referred to as the TBX-default TML in this International Standard. Likewise, other markup languages that comply with TBX and use a subset of the default set of data-categories are also TMLs, but may go by other names, such as the one referred to in Annex G (Informative) TBX-Basic.

2 Normative references

The following referenced documents are indispensable for the application of this document. For dated references, only the edition cited applies. For undated references, the latest edition of the referenced document (including any amendments) applies.

ISO 639-1:2002, Codes for the representation of names of languages - Part 1: Alpha-2 code
ISO 639-2:1998, Codes for the representation of names of languages - Part 2: Alpha-3 code
ISO 639-3:2007, Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages
ISO/IEC 646:1991, Information technology - ISO 7-bit coded character set for information interchange
ISO 3166-1:2006, Codes for the representation of names of countries and their subdivisions - Part 1: Country codes
ISO 8601:2004, Data elements and interchange formats - Information interchange - Representation of dates and times
ISO/IEC 10646, Information technology - Universal Multiple-Octet Cod