BS ISO IEC 13250-4-2009 Information technology - Topic maps - Canonicalization.pdf

Uploaded by: eastlab115 | Document ID: 396287 | Upload date: 2018-10-18 | Format: PDF | Pages: 24 | Size: 2 MB

BS ISO/IEC 13250-4:2009
ICS 35.240.30
NO COPYING WITHOUT BSI PERMISSION EXCEPT AS PERMITTED BY COPYRIGHT LAW

BRITISH STANDARD
Information technology - Topic Maps - Part 4: Canonicalization

This British Standard was published under the authority of the Standards Policy and Strategy Committee on 30 September 2009.
© BSI 2009
ISBN 978 0 580 54609 9

Amendments/corrigenda issued since publication
Date    Comments

National foreword

This British Standard is the UK implementation of ISO/IEC 13250-4:2009. The UK participation in its preparation was entrusted to Technical Committee IST/41, Document description and processing language. A list of organizations represented on this committee can be obtained on request to its secretary.

This publication does not purport to include all the necessary provisions of a contract. Users are responsible for its correct application. Compliance with a British Standard cannot confer immunity from legal obligations.

INTERNATIONAL STANDARD ISO/IEC 13250-4
First edition 2009-03-01
Reference number ISO/IEC 13250-4:2009(E)
© ISO/IEC 2009

Information technology - Topic Maps - Part 4: Canonicalization
Technologies de l'information - Plans relatifs des sujets - Partie 4: Canonicalisation

PDF disclaimer

This PDF file may contain embedded typefaces. In accordance with Adobe's licensing policy, this file may be printed or viewed but shall not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In downloading this file, parties accept therein the responsibility of not infringing Adobe's licensing policy. The ISO Central Secretariat accepts no liability in this area.

Adobe is a trademark of Adobe Systems Incorporated.

Details of the software products used to create this PDF file can be found in the General Info relative to the file; the PDF-creation parameters were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In the unlikely event that a problem relating to it is found, please inform the Central Secretariat at the address given below.

COPYRIGHT PROTECTED DOCUMENT

© ISO/IEC 2009
All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISO's member body in the country of the requester.

ISO copyright office
Case postale 56, CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 749 09 47
E-mail copyright@iso.org
Web www.iso.org
Published in Switzerland

Contents

Foreword
Introduction
1 Scope
2 Normative references
3 Canonicalization
  3.1 Introduction
  3.2 Notational conventions
  3.3 CXTM document information item
  3.4 Constructing a representation of a topic map item
  3.5 Constructing a representation of a topic item
  3.6 Constructing a representation of the topic name item
  3.7 Constructing a representation of a variant item
  3.8 Constructing a representation of an occurrence item
  3.9 Constructing a representation of an association item
  3.10 Constructing a representation of the association role item
  3.11 Constructing a representation of the reifier property
  3.12 Constructing a representation of the scope property
  3.13 Constructing a representation of the item identifiers property
  3.14 Constructing a representation of the datatype property
  3.15 Constructing a representation of the type property
  3.16 Constructing a representation of the value property
  3.17 Constructing a representation of locator values
  3.18 Normalizing locator values
  3.19 Constructing the number attribute
  3.20 Encoding of string properties
  3.21 Encoding of positional values
  3.22 Default property values for element information items
  3.23 Default property values for attribute information items
4 Canonical sort order
  4.1 Introduction
  4.2 Information type and basic type sort order
  4.3 Comparison of strings
  4.4 Comparison of sets
  4.5 Comparison order for locators
  4.6 Canonical sort order for topic items
  4.7 Canonical sort order for topic name items
  4.8 Canonical sort order for variant items
  4.9 Canonical sort order for occurrence items
  4.10 Canonical sort order for association items
  4.11 Canonical sort order for association role items
Annex A (informative) RELAX-NG schema for CXTM
Bibliography

Foreword

ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the work. In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1.

International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.

The main task of the joint technical committee is to prepare International Standards. Draft International Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication as an International Standard requires approval by at least 75 % of the national bodies casting a vote.

Attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.

ISO/IEC 13250-4 was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology, Subcommittee SC 34, Document description and processing languages.

ISO/IEC 13250 consists of the following parts, under the general title Information technology - Topic Maps:
- Part 2: Data model
- Part 3: XML syntax

The following parts are under preparation.
- Part 1: Overview and basic concepts
- Part 4: Canonicalization
- Part 5: Reference model
- Part 6: Compact syntax
- Part 7: Graphical notation

Introduction

This part of ISO/IEC 13250 defines a format known as Canonical XTM, or CXTM for short. The format is an XML format, and has the property that it guarantees that two equivalent Topic Maps Data Model instances (ISO/IEC 13250-2) will always produce byte-by-byte identical serializations, and that non-equivalent instances will always produce different serializations. CXTM thus enables direct comparison of two topic maps to determine equality by comparison of their canonical serializations.

The purpose of CXTM is to allow the creation of test suites for various Topic Maps-related technologies that are easily portable between different Topic Maps implementations, so long as these support CXTM.

CXTM is not intended to be used for the interchange of topic maps, although this is possible. The standard format for interchange of topic maps is XTM (ISO/IEC 13250-3).
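
Because equality of two topic maps reduces to equality of their canonical serializations, a test suite can compare a produced CXTM file against a stored baseline as plain bytes. The following is only a minimal sketch of that idea; the helper name `first_difference` is hypothetical and is not defined by the standard.

```python
from typing import Optional


def first_difference(actual: bytes, expected: bytes) -> Optional[int]:
    """Return the offset of the first differing byte, or None if identical.

    Hypothetical test-suite helper: 'actual' is the CXTM output of the
    implementation under test, 'expected' is the stored baseline.
    """
    for offset, (a, b) in enumerate(zip(actual, expected)):
        if a != b:
            return offset
    # One input is a strict prefix of the other: they differ at the
    # end of the shorter one.
    if len(actual) != len(expected):
        return min(len(actual), len(expected))
    return None


def same_topic_map(actual: bytes, expected: bytes) -> bool:
    """Two topic maps are treated as equal when their CXTM bytes match."""
    return first_difference(actual, expected) is None
```

Reporting the offset rather than a bare boolean is a design choice for debugging failed test cases; a plain `actual == expected` would decide equality just as well.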

1 Scope

This part of ISO/IEC 13250 defines the CXTM format, and specifies how CXTM files are produced from topic maps by means of a transformation from the Topic Maps Data Model (ISO/IEC 13250-2) to the XML Infoset [XML Infoset].

2 Normative references

The following referenced documents are indispensable for the application of this document. For dated references, only the edition cited applies. For undated references, the latest edition of the referenced document (including any amendments) applies.

NOTE Each of the following documents has a unique identifier that is used to cite the document in the text. The unique identifier consists of the part of the reference up to the first comma.

ISO/IEC 10646, Information technology - Universal Multiple-Octet Coded Character Set (UCS)

Unicode, The Unicode Standard, Version 5.0.0, The Unicode Consortium, Reading, Massachusetts, USA, Addison-Wesley Developers Press, 2007, ISBN 0-321-48091-0, http://www.unicode.org/versions/Unicode5.0.0/

RFC 3986, Uniform Resource Identifier (URI): Generic Syntax, Internet Standards Track Specification, January 2005, http://www.ietf.org/rfc/rfc3986.txt

XML-C14N, Canonical XML, Version 1.0, World Wide Web Consortium, 15 March 2001

XML Infoset, XML Information Set (Second Edition), World Wide Web Consortium, 4 February 2004

ISO/IEC 13250-2, Information technology - Topic Maps - Part 2: Data model

XMLSCHEMA-2, XML Schema Part 2: Datatypes Second Edition, World Wide Web Consortium, 28 October 2004

3 Canonicalization

3.1 Introduction

The canonicalization process takes two parameters: a topic map item (that is, an instance of the Topic Maps Data Model, defined in ISO/IEC 13250-2) and a base locator. The process produces a canonicalization of the topic map, with all locators in the topic map rewritten to be relative to the given base locator. The purpose of the base locator is to allow references to the local filesystem to be stripped out, thus making CXTM test cases portable between different systems.

Canonicalization is performed in three steps:

1. A document information item representing the CXTM document is produced from the topic map item as described in 3.3.
2. For each element information item that is a descendant of the document information item from the previous step, the following operations are performed:
   - A character information item is added to the [[children]] property of the information item in the element's [[parent]] property immediately after the element itself. The character information item's [[character code]] property is set to #x0A.

   - If the element's [[local name]] property is set to "topicMap", "topic", "name", "variant", "occurrence", "association", "role", "scope", "itemIdentifiers", "subjectLocators", or "subjectIdentifiers", a character information item is added to the [[children]] property of the element as the first element. The character information item's [[character code]] property is set to #x0A.
3. The document information item is serialized to a Canonical XML representation as described in XML-C14N.

3.2 Notational conventions

Information item properties from W3C XML-Infoset are referred to using [[property name]], in order to distinguish them from properties from ISO/IEC 13250-2.

3.3 CXTM document information item

There is exactly one CXTM document information item in the XML Infoset generated by the canonicalization of the topic map item. The CXTM document information item has the following properties:

1. [[children]] A list containing only the representation of the topic map item
2. [[document element]] The element information item that represents the topic map item
3. [[notations]] The empty set
4. [[unparsed entities]] The empty set
5. [[base URI]] No value
6. [[standalone]] No value
7. [[version]] No value
8. [[all declarations processed]] False

3.4 Constructing a representation of a topic map item

A topic map item is represented by an element information item with the following properties:

1. [[local name]] The string "topicMap"
2. [[children]] A list of element information items in the following order:
   1. A representation of the item identifiers property, if any
   2. A representation of each topic item in the topics property of the topic map item in canonical sort order
   3. A representation of each association item in the associations property of the topic map item in canonical sort order
3. [[attributes]] A representation of the reifier property

3.5 Constructing a representation of a topic item

A topic item is represented by an element information item with the following properties:

1. [[local name]] The string "topic"
2. [[children]] A list of element information items in the following order:
   1. If the value of the subject identifiers property of the topic item is not the empty set, then an element information item with the following properties:
      1. [[local name]] The string "subjectIdentifiers"
      2. [[children]] A representation of each locator in the subject identifiers property in canonical sort order
      3. [[attributes]] The empty set
   2. If the value of the subject locators property of the topic item is not the empty set, then an element information item with the following properties:
      1. [[local name]] The string "subjectLocators"
      2. [[children]] A representation of each locator in the subject locators property in canonical sort order
      3. [[attributes]] The empty set
   3. A representation of the item identifiers property, if any
   4. A representation of each of the topic name items of the topic names property in canonical sort order
   5. A representation of each of the occurrence items of the occurrences property in canonical sort order
   6. For each of the association role items of the roles played property in canonical sort order, an element information item with the following properties:
      1. [[local name]] set to the string "rolePlayed"
      2. [[children]] An empty list
      3. [[attributes]] A set containing one attribute information item as follows:
         1. [[local name]] set to the string "ref"
         2. [[normalized value]] A sequence of character information items representing a string value constructed by the c
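
As an illustration of steps 2 and 3 of the procedure in 3.1, the following Python sketch inserts the #x0A character items into an already constructed element tree and then serializes it. It is a sketch under stated assumptions, not the standard's reference implementation: ElementTree's `text` and `tail` fields stand in for character information items, `ET.tostring` stands in for a real Canonical XML (XML-C14N) serializer, the function names are hypothetical, and the `locator` element name used in the usage note is assumed for illustration only.

```python
import xml.etree.ElementTree as ET

# Element names that receive a line feed as their first child (3.1, step 2).
CONTAINER_NAMES = {
    "topicMap", "topic", "name", "variant", "occurrence", "association",
    "role", "scope", "itemIdentifiers", "subjectLocators",
    "subjectIdentifiers",
}


def add_line_feeds(root: ET.Element) -> ET.Element:
    """Apply step 2: add #x0A after every element, and as the first
    child of every container element. Modifies the tree in place."""
    for elem in root.iter():  # iter() yields the root and all descendants
        # Line feed immediately after the element in its parent's children.
        elem.tail = "\n" + (elem.tail or "")
        if elem.tag in CONTAINER_NAMES:
            # Line feed as the first child of the element.
            elem.text = "\n" + (elem.text or "")
    return root


def serialize_sketch(root: ET.Element) -> str:
    """Stand-in for step 3. Assumption: for a tree without namespaces or
    comments this is close to Canonical XML, but it is not a C14N
    implementation. Empty elements are written with explicit end tags."""
    return ET.tostring(root, encoding="unicode", short_empty_elements=False)
```

A usage example with a one-topic map:

```python
root = ET.Element("topicMap")
topic = ET.SubElement(root, "topic")
sids = ET.SubElement(topic, "subjectIdentifiers")
ET.SubElement(sids, "locator").text = "http://example.org/a"  # assumed element name
print(serialize_sketch(add_line_feeds(root)), end="")
```

Each start tag of a container element and each end tag then sits on its own line, which is what makes CXTM files convenient to diff line by line.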
