1、 Reference numberISO/IEC 15938-3:2002(E)ISO/IEC 2002INTERNATIONAL STANDARD ISO/IEC15938-3First edition2002-05-15Information technology Multimedia content description interface Part 3: Visual Technologies de linformation Interface de description du contenu multimdia Partie 3: Visuel Adopted by INCITS
2、 (InterNational Committee for Information Technology Standards) as an American National Standard.Date of ANSI Approval: 12/3/2002Published by American National Standards Institute,25 West 43rd Street, New York, New York 10036Copyright 2002 by Information Technology Industry Council (ITI).All rights
3、reserved.These materials are subject to copyright claims of International Standardization Organization (ISO), InternationalElectrotechnical Commission (IEC), American National Standards Institute (ANSI), and Information Technology Industry Council(ITI). Not for resale. No part of this publication ma
4、y be reproduced in any form, including an electronic retrieval system, withoutthe prior written permission of ITI. All requests pertaining to this standard should be submitted to ITI, 1250 Eye Street NW,Washington, DC 20005.Printed in the United States of AmericaISO/IEC 15938-3:2002(E) PDF disclaime
5、r This PDF file may contain embedded typefaces. In accordance with Adobes licensing policy, this file may be printed or viewed but shall not be edited unless the typefaces which are embedded are licensed to and installed on the computer performing the editing. In downloading this file, parties accep
6、t therein the responsibility of not infringing Adobes licensing policy. The ISO Central Secretariat accepts no liability in this area. Adobe is a trademark of Adobe Systems Incorporated. Details of the software products used to create this PDF file can be found in the General Info relative to the fi
7、le; the PDF-creation parameters were optimized for printing. Every care has been taken to ensure that the file is suitable for use by ISO member bodies. In the unlikely event that a problem relating to it is found, please inform the Central Secretariat at the address given below. ISO/IEC 2002 All ri
8、ghts reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISOs member body in the country of t
9、he requester. ISO copyright office Case postale 56 CH-1211 Geneva 20 Tel. + 41 22 749 01 11 Fax + 41 22 749 09 47 E-mail copyrightiso.ch Web www.iso.ch Printed in Switzerland ii ISO/IEC 2002 All rights reserved ISO/IEC 15938-3:2002(E) ISO/IEC 2002 All rights reserved iiiContents Page Forewordv Intro
10、ductionvi 1 Scope 1 1.1 Organization of the document1 1.2 Overview of Visual Description Tools .1 2 Terms and Definitions.2 2.1 Default reference axis .2 2.2 DCT coefficients 2 2.3 Data element 3 3 Abbreviations and Symbols .3 3.1 General3 3.2 Abbreviations.3 3.3 Arithmetic operators .3 3.4 Logical
11、operators.3 3.5 Relational operators3 3.6 Bitwise operators.4 3.7 Conditional operator .4 3.8 Assignment 4 3.9 Mnemonics .4 3.10 Constants .4 3.11 Functions4 4 Conventions .5 4.1 Method of describing the DDL representation syntax.5 4.2 Method of describing the binary representation syntax .5 4.3 Met
12、hod of describing the descriptor semantics 8 5 Basic structures.8 5.1 Introduction8 5.2 Grid layout8 5.3 Time series .11 5.4 Multiple view 15 5.5 Spatial 2D coordinates16 5.6 Temporal interpolation23 6 Color29 6.1 Introduction29 6.2 Color space 29 6.3 Color quantization .33 6.4 Dominant color 35 6.5
13、 Scalable color 37 6.6 Color layout42 6.7 Color structure.50 6.8 GoF/GoP Color.56 7 Texture57 7.1 Introduction57 7.2 Homogeneous texture.57 7.3 Texture browsing.61 7.4 Edge histogram63 8 Shape 66 8.1 Introduction66 ISO/IEC 15938-3:2002(E) iv ISO/IEC 2002 All rights reserved 8.2 Region shape .66 8.3
14、Contour shape68 8.4 Shape 3D.71 9 Motion .73 9.1 Introduction73 9.2 Camera motion.73 9.3 Motion trajectory81 9.4 Parametric motion .84 9.5 Motion activity87 10 Localization 92 10.1 Introduction92 10.2 Region locator92 10.3 Spatio-temporal locator 96 11 Others .103 11.1 Introduction103 11.2 Face reco
15、gnition 103 Annex A (normative) Basis functions for FaceRecognition .105 A.1 Basis matrix105 A.2 Mean face 169 Annex B (normative) Binary representation of media time tools.171 B.1 Introduction 171 B.2 Binary representation syntax172 B.3 Descriptor components semantics 173 Annex C (informative) Pate
16、nt statements .174 ISO/IEC 15938-3:2002(E) ISO/IEC 2002 All rights reserved vForeword ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are members of ISO or
17、 IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations, g
18、overnmental and non-governmental, in liaison with ISO and IEC, also take part in the work. In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1. International Standards are drafted in accordance with the rules given in the ISO/IEC Directives
19、, Part 3. The main task of the joint technical committee is to prepare International Standards. Draft International Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication as an International Standard requires approval by at least 75 % of the nati
20、onal bodies casting a vote. Attention is drawn to the possibility that some of the elements of this part of ISO/IEC 15938 may be the subject of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights. ISO/IEC 15938-3 was prepared by Joint Technical Comm
21、ittee ISO/IEC JTC 1, Information technology, Subcommittee SC 29, Coding of audio, picture, multimedia and hypermedia information. ISO/IEC 15938 consists of the following parts, under the general title Information technology Multimedia content description interface: Part 1: Systems Part 2: Descriptio
22、n definition language Part 3: Visual Part 4: Audio Part 5: Multimedia description schemes Part 6: Reference software Part 7: Conformance testing Part 8: Extraction and use of MPEG-7 descriptions Annexes A and B form a normative part of this part of ISO/IEC 15938. Annex C is for information only. ISO
23、/IEC 15938-3:2002(E) vi ISO/IEC 2002 All rights reserved Introduction This standard, also known as “Multimedia Content Description Interface,“ provides a standardized set of technologies for describing multimedia content. The standard addresses a broad spectrum of multimedia applications and require
24、ments by providing a metadata system for describing the features of multimedia content. The following are specified in this standard: Description Schemes (DS) describe entities or relationships pertaining to multimedia content. Description Schemes specify the structure and semantics of their compone
25、nts, which may be Description Schemes, Descriptors, or datatypes. Descriptors (D) describe features, attributes, or groups of attributes of multimedia content. Datatypes are the basic reusable datatypes employed by Description Schemes and Descriptors Description Definition Language (DDL) defines Des
26、cription Schemes, Descriptors, and Datatypes by specifying their syntax, and allows their extension. Systems tools support delivery of descriptions, multiplexing of descriptions with multimedia content, synchronization, file format, and so forth. This standard is subdivided into eight parts: Part 1
27、Systems: specifies the tools for preparing descriptions for efficient transport and storage, compressing descriptions, and allowing synchronization between content and descriptions. Part 2 Description definition language: specifies the language for defining the standard set of description tools (DSs
28、, Ds, and datatypes) and for defining new description tools. Part 3 Visual: specifies the description tools pertaining to visual content. Part 4 Audio: specifies the description tools pertaining to audio content. Part 5 Multimedia description schemes: specifies the generic description tools pertaini
29、ng to multimedia including audio and visual content. Part 6 Reference software: provides a software implementation of the standard. Part 7 Conformance testing: specifies the guidelines and procedures for testing conformance of implementations of the standard. Part 8 Extraction and use of MPEG-7 desc
30、riptions: provides guidelines and examples of the extraction and use of descriptions. This document contains the visual elements (Descriptors and Description Schemes) that are considered for being part of the standard. All these Descriptive Structures are classified according to the types of visual
31、features they describe. For each Descriptive Structure, there is one corresponding section in this document. The section specifies textual and binary syntax and semantics of the structures. INTERNATIONAL STANDARD ISO/IEC 15938-3:2002(E) ISO/IEC 2002 All rights reserved 1Information technology Multim
32、edia content description interface Part 3: Visual 1 Scope 1.1 Organization of the document The structure of this document is as follows. Clauses 2-4 specify the terms, abbreviations, symbols and conventions used throughout the document. Clauses 5-11 contain definitions of the description tools stand
33、ardized by 15938-3 grouped by the visual features they are associated with, starting with basic structures and containers in Clause 5, through color, texture, shape, motion, localization in Clause 10. Clause 11 contains the remaining, unclassified items. Each description tool is described by the fol
34、lowing subclauses: Syntax: Normative DDL specification of the Ds or DSs. Binary Syntax: Normative binary representation of the Ds or DSs. Semantic: Normative definition of the semantics of all the components of the corresponding D or DS. 1.2 Overview of Visual Description Tools This part of ISO/IEC
35、15938 specifies tools for description of visual content, including still images, video and 3D models. These tools are defined by their syntax in DDL and binary representations and semantics associated with the syntactic elements. They enable description of the visual features of the visual material,
36、 such as color, texture, shape and motion, as well as localization of the described objects in the image or video sequence. An overview of the visual description tools is shown in Figure 1. The basic structure description tools include five supporting tools of visual descriptions defined in clauses
37、611. They are categorized into two groups, descriptor containers and basic supporting tools. The former consists of three datatypes, GridLayout providing efficient representations of visual features on grids, TimeSeries representing temporal arrays of several descriptions, and MultipleView describin
38、g a 3D object using several pictures captured from different view angles. The latter contains two tools, Spatial2DCoordinateSystem used to specify the 2D coordinate system and TemporalInterpolation indicating the interpolation method between two samples on a time axis. The remaining description tool
39、s, except for the FaceRecognition descriptor, are associated with visual features and are grouped into five feature categories: Color, Texture, Shape, Motion and Localization. The color description tools include four color descriptors to represent different aspects of color features: representative
40、colors (DominantColor), color distribution (ScalableColor), spatial distribution of colors (ColorLayout and ColorStructure). It also contains two supporting tools, ColorSpace and ColorQuantization used in DominantColor and an extension of ScalableColor to a group of frames or pictures (GoFGoPColor).
41、 All the color descriptors can be extracted from arbitrarily shaped regions. The texture description tools facilitate browsing (TextureBrowsing) and similarity retrieval (HomogeneousTexture and EdgeHistogram) using the texture of a still or moving image region. All the texture descriptors can be ext
42、racted from arbitrarily shaped regions. The shape description tools include two descriptors that characterize different shape features of a 2D object or region. The RegionShape descriptor captures the distribution of all pixels within a region and the Contour Shape descriptor characterizes the shape
43、 properties of the contour of an object. The Shape3D descriptor provides an intrinsic shape characterization of 3D mesh models. The motion description tools include four descriptors that characterize various aspects of motion. The CameraMotion descriptor specifies a set of basic camera operations su
44、ch as, for example, panning and tilting. The motion of a key point (pixel) from a moving object or region can be characterized by the MotionTrajectory descriptor. The ParametricMotion descriptor characterizes an evolution of an arbitrarily shaped region over time in terms of a 2D geometric transform
45、ation. Finally, the MotionActivity descriptor captures the pace of the motion in the sequence, as perceived by the viewer. All motion descriptors except for CameraMotion can be extracted from arbitrarily shaped regions. The localization description tools can be used to indicate regions of interest i
46、n the spatial (RegionLocator) and spatio-temporal (SpatioTemporalLocator) domains. ISO/IEC 15938-3:2002(E) 2 ISO/IEC 2002 All rights reserved The FaceRecognition descriptor is not associated with any particular visual feature and can be used to describe a human face for applications requiring the ma
47、tching and retrieval of face images. Basic StructuresDescriptor ContainersGridLayoutTimeSeriesMultipleViewBasic Supporting ToolsTemporalInterpolationSpatial2DcoordinateSystemColorColor Feature DescriptorsDominantColorScalableColorColorLayoutColorStructureGofGopColorColor Supporting ToolsColorSpaceCo
48、lorQuantizationTextureHomogeneousTextureTextureBrowsingEdgeHistogramMotionCameraMotionMotionTrajectoryParametricMotionMotionActivityLocalizationRegionLocatorSpatioTemporalLocatorOtherFaceRecognitionRegionShapeContourShapeShape3DShapeVisual FeaturesFigure 1 Overview of Visual Description Tools 2 Term
49、s and Definitions 2.1 Default reference axis The default reference axis for angle calculation is the positive x (horizontal) axis. Positive angle is calculated anti-clockwise. 2.2 DCT coefficients DCT coefficient The signed amplitude of a specific cosine basis function. AC coefficient Any DCT coefficient for which the frequency in one or both dimensions is non-zero. DC coefficient The DCT coefficient for which the frequency in both dimensions is zero. ISO/IEC 15938-3:2002(E) ISO/IEC 200