1、 ISO 2014 Language resource management Semantic annotation framework Part 7: Spatial information (ISOspace) Gestion des ressources linguistiques Cadre dannotation smantique Partie 7: Information spatiale (ISOspace) INTERNATIONAL STANDARD ISO 24617-7 First edition 2014-12-15 Reference number ISO 2461
2、7-7:2014(E) ISO 24617-7:2014(E)ii ISO 2014 All rights reserved COPYRIGHT PROTECTED DOCUMENT ISO 2014 All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized otherwise in any form or by any means, electronic or mechanical, including photocopying, or
3、posting on the internet or an intranet, without prior written permission. Permission can be requested from either ISO at the address below or ISOs member body in the country of the requester. ISO copyright office Case postale 56 CH-1211 Geneva 20 Tel. + 41 22 749 01 11 Fax + 41 22 749 09 47 E-mail c
4、opyrightiso.org Web www.iso.org Published in Switzerland ISO 24617-7:2014(E)Contents Page Foreword iv Introduction v 1 Scope . 1 2 Normative references 1 3 T erms and definitions . 1 4 List of tags 4 5 Overview . 5 6 Motivation and requirements . 6 7 Specification of ISOspac e for spatial annotation
5、 7 7.1 Overview: annotation vs. representation. 7 7.2 Abstract syntax for the ISOspace annotation structure . 7 8 Representation of ISOspace-conformant annotations 8 8.1 XML-based concrete syntax: outline 8 8.1.1 Overview . 8 8.1.2 Basic entitles . 9 8.1.3 Signals . 9 8.1.4 Links . 9 8.1.5 Root elem
6、ent . 9 8.2 Conventions for tagging 9 8.2.1 Naming conventions . 9 8.2.2 Convention for inline tagging extents .10 8.3 Basic entity tags 11 8.3.1 11 8.3.2 13 8.3.3 .13 8.3.4 .14 8.3.5 for non-motion event 15 8.3.6 15 8.3.7 .16 8.3.8 16 8.4 Link tags .17 8.4.1 17 8.4.2 .17 8.4.3 18 8.4.4 .20 8.5 Root
7、 tag: .20 8.6 Summary .21 8.6.1 Identifier 21 8.6.2 Shared attributes 22 8.6.3 IDRef as value 23 Annex A (normative) Core annotation guidelines .24 Bibliography .52 ISO 2014 All rights reserved iii ISO 24617-7:2014(E) Foreword ISO (the International Organization for Standardization) is a worldwide f
8、ederation of national standards bodies (ISO member bodies). The work of preparing International Standards is normally carried out through ISO technical committees. Each member body interested in a subject for which a technical committee has been established has the right to be represented on that co
9、mmittee. International organizations, governmental and non-governmental, in liaison with ISO, also take part in the work. ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of electrotechnical standardization. The procedures used to develop this document
10、 and those intended for its further maintenance are described in the ISO/IEC Directives, Part 1. In particular the different approval criteria needed for the different types of ISO documents should be noted. This document was drafted in accordance with the editorial rules of the ISO/IEC Directives,
11、Part 2 (see www.iso.org/directives). Attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. ISO shall not be held responsible for identifying any or all such patent rights. Details of any patent rights identified during the development o
12、f the document will be in the Introduction and/or on the ISO list of patent declarations received (see www.iso.org/patents). Any trade name used in this document is information given for the convenience of users and does not constitute an endorsement. For an explanation on the meaning of ISO specifi
13、c terms and expressions related to conformity assessment, as well as information about ISOs adherence to the WTO principles in the Technical Barriers to Trade (TBT) see the following URL: Foreword - Supplementary information The committee responsible for this document is ISO/TC 37, Terminology and o
14、ther language and content resources, Subcommittee SC 4, Language resource management. ISO 24617 consists of the following parts, under the general title Language resource management Semantic annotation framework (semAF): Part 1: Time and events (SemAF-Time, ISO-TimeML) Part 2: Dialogue acts Part 4:
15、Semantic roles (SemAF-SR) Part 5: Discourse structures (SemAF-DS) Part 6: Principles of semantic annotation (SemAF-Basics) Part 7: Spatial information (ISOspace) Part 8: Semantic relations in discourse (SemAF-DRel)iv ISO 2014 All rights reserved ISO 24617-7:2014(E) Introduction The automatic recogni
16、tion of spatial information in natural language is currently attracting considerable attention in the fields of computational linguistics and artificial intelligence. The development of algorithms that exhibit “spatial awareness” promises to add needed functionality to NLP systems, from named entity
17、 recognition to question-answering and text-based inference. However, in order for such systems to reason spatially, they require the enrichment of textual data with the annotation of spatial information in language. This involves a large range of linguistic constructions, including spatially anchor
18、ing events, descriptions of objects in motion, viewer-relative descriptions of scenes, absolute spatial descriptions of locations, and many other constructions. This part of ISO 24617 was developed in collaboration with the ISOspace working group at Brandeis University with the aim to provide an Int
19、ernational Standard for the representation of spatial information relating to locations, motions and non-motion events in language. NOTE The ISOspace Working Group is headed by James Pustejovsky, jampespcs.brandeis.edu, Brandeis University, Waltham, MA, U.S.A. This part of ISO 24617 provides normati
20、ve specifications and guidelines not only for spatial information, but also for information content in motion and various other types of event in language. The main parts of this part of ISO 24617 consist of the following: a) Scope; b) Normative references; c) Terms and definitions; d) List of tags
21、or names of elements; e) Overview; f) Motivation and requirements; g) Specification of the ISOspace annotation structure; h) Representation of ISOspace-conformant annotations. Clause 8 introduces an XML-based concrete syntax for representing spatial-related or motion-related annotations based on the
22、 annotation structure of ISOspace that is presented in Clause 7 with a UML- based metamodel. A formal semantics for ISOspace will be provided as part of a future new work item within the semantic annotation framework. This will be coordinated with the temporal semantics and specification of ISO 2461
23、7-1 (SemAF-Time, ISO-TimeML), thereby producing a rich semantics that will be directly useable by practitioners in computational linguistics and other communities (see Clause 6). The multilingual extension of ISOspace will also be treated in a separate part of the ISO 24617- series in the near futur
24、e. NOTE Although the schema and DTD are not part of the present document as normative annexes, they will both be found in a webpage relating to the ISOspace specification. Normative Annex A is an integral part of ISO 24617 and provides core annotation guidelines. ISO 2014 All rights reserved v Langu
25、age resource management Semantic annotation framework Part 7: Spatial information (ISOspace) 1 Scope This part of ISO 24617 provides a framework for encoding a broad range not only of spatial information, but also of spatiotemporal information relating to motion as expressed in natural language text
26、s. This part of ISO 24617 includes references to locations, general spatial entities, spatial relations (involving topological, orientational, and metric values), dimensional information, motion events, and paths. 2 Normative references The following documents, in whole or in part, are normatively r
27、eferenced in this document and are indispensable for its application. For dated references, only the edition cited applies. For undated references, the latest edition of the referenced document (including any amendments) applies. ISO 24617-1, Language resource management Semantic annotation framewor
28、k (SemAF) Part 1: Time and events (SemAF-Time, ISO-TimeML) ISO/IEC 14977, Information technology Syntactic metalanguage Extended BNF 3 T erms a nd definiti ons For the purposes of this document, the terms and definitions given in ISO 24617-1 and the following apply. 3.1 document creation location dc
29、l unique place or set of places associated with a document that represents the location (3.7) in which the document was created Note 1 to entry: Some collaboratively written documents, such as GoogleDoc 1)documents and chat logs, might refer not only to a single location but also to a set of locatio
30、ns spread out across the world. Besides, for example, the creation place of the Hebrew bible or the creation place of each of the books in it is uncertain. The attribute dcl will, therefore, have the value “false” which is to be understood to mean “unspecified”, while the value “true” is to be under
31、stood to mean “specified”. 3.2 event eventuality something that can be said to obtain or hold true, to happen or to occur Note 1 to entry: This is a very broad notion of event, also known in the literature as “eventuality” and includes all kinds of actions, states, processes, etc. It is not to be co
32、nfused with the narrower notion of event (as opposed to the notion of “state”) as something that happens at a certain point in time (e.g. the clock striking two or waking up) or during a short period of time (e.g. laughing). In ISO-TimeML, the term event is used in a broader sense and is equivalent
33、to the term eventuality. 1) GoogleDoc is an example of a suitable product available commercially. This information is given for the convenience of users of this document and does not constitute an endorsement by ISO of these products. INTERNATIONAL ST ANDARD ISO 24617-7:2014(E) ISO 2014 All rights r
34、eserved 1 ISO 24617-7:2014(E) SOURCE: ISO 24617-1:2012 3.3 event-path path (3.13) or trajectory followed by a spatial entity (3.17) coincident with a motion-event (3.9) 3.4 extent textual segment which is string of character segments in text to be annotated EXAMPLE Tokens, words, and non-contiguous
35、phrases (e.g. a complex verb like “look . up”) are extents. 3.5 f ig u r e spatial entity (3.17) that is considered to be the focal object, which is related to some reference object 3.6 ground spatial entity (3.17) that acts as reference for a figure (3.5) 3.7 location point or finite area that is p
36、ositioned within a space (3.16) 3.8 measure magnitude of a spatial dimension or relation EXAMPLE Distance is a spatial relation. 3.9 motion motion-event action or process involving the translocation of a spatial object, transformation of some spatial property of an object, or change in the conformat
37、ion of an object Note 1 to entry: A motion (3.9) in ISOspace is a particular kind of event (3.2). 3.10 motion-signal adjunct motion-adjunct path (3.13) of motion and/or manner of motion information contributed by a particle or by a prepositional, adverbial phrase, in conjunction with a motion (3.9)-
38、related text Note 1 to entry: This terminology is specific to ISOspace and is different from the general term “adjunct” which is used to describe optional syntactic elements. 3.11 non-cosuming tag tag (3.19) that has no associated extent (3.4) Note 1 to entry: The extent (3.4) of a non-consuming tag
39、 is a “null” string. EXAMPLE In John ate an apple but Mary a pear, there are at least two ways of marking up the non- consuming tag: a) John ate e1an apple, but Mary e2a pear; b) 1) 2) (non-consuming tag)2 ISO 2014 All rights reserved ISO 24617-7:2014(E) 3.12 orientation orientation(al) relation rel
40、ation between a figure (3.5) and a ground (3.6) that expresses the spatial disposition or direction of a spatial object within a frame of reference 3.13 path location (3.7) that consists of a series of locations (3.7) Note 1 to entry: A spatial object path is a location where the focus is on the pot
41、ential for traversal or which functions as a boundary. This includes common nouns like road, coastline, and river and proper names like Route 66 and Kangamangus Highway. Some nouns, such as valley, can be ambiguous. It can be understood as a path in we walked down the valley or as a place (3.14) in
42、we live in the valley. Note 2 to entry: A path might be represented as an undirected graph whose vertices are locations (3.7) and whose edges signify continuity; that is to say, a path has no inherent directionality. 3.14 place geographic or administrative entity that is situated at a location (3.7)
43、 3.15 region connected, non-empty point-set defined by a domain and its boundary points Note 1 to entry: The term “region” as defined here does not refer to a political or administrative region such as “the Canary Islands” or “Hong Kong, SAR”, where SAR is the acronym of “Special Administrative Regi
44、on”. 3.16 space dimensional extent in which objects and events (3.2) have a relative position and direction 3.17 spatial entity object that is situated at a unique location (3.7) for some period of time, and typically has the potential to undergo translocation Note 1 to entry: A spatial entity can a
45、lso be understood as an object that participates in a spatial relation. In John is sitting in a car, both John and car could be understood as spatial entities or as being the figure (3.5) and the ground (3.6), respectively, of the sitting-in situation. 3.18 spatial signal segment or series of segmen
46、ts of a text that rebounds to orientational (3.12) or topological relations (3.20) 3.19 tag element name name associated with textual segments for annotation or for a relation between these segments Note 1 to entry: The following are two kinds of tag for annotation: a) extent tag, which is associate
47、d with textual segments referring to basic entities or signals; b) link tag, for representing spatial relations. 3.20 topological relation relation that expresses the connectedness or continuity of spaces (3.16) ISO 2014 All rights reserved 3 ISO 24617-7:2014(E) 4 List of tags 4.1 General The tag in
48、 angled brackets stands for the name of an XML element. See 8.2. 4.2 Extent tags: Basic entities and signals 4.2.1 measure extent tag representing some measure (3.8) 4.2.2 motion extent tag representing a motion (3.9) 4.2.3 motionSignal extent tag representing a motion-signal (3.10) 4.2.4 non-motion
49、 event extent tag representing a non-motion event (3.9) 4.2.5 path extent tag that represents a path (3.13) 4.2.6 place extent tag that represents a place (3.14) 4.2.7 spatialEntity extent tag that represents a spatial entity (3.17) 4.2.8 spatialSignal extent tag that represents a spatial signal (3.18) 4.3 Link tags 4.3.1 mLink linking tag that represents some measure (3.8)4 ISO 2014 All rights reserved ISO 24617-7:2014(E) 4.3.2 moveLink linking tag that represents a relation between a motion (3.9) and participa