ETSI TS 103 223 V1.1.1 (2015-04)
MDA; Object-Based Audio Immersive Sound Metadata and Bitstream
TECHNICAL SPECIFICATION

Reference
DTS/JTC-027

Keywords
audio, broadcast, contribution

ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE
Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16
Siret N° 348 623 562 00017 - NAF 742 C
Association à but non lucratif enregistrée à la Sous-Préfecture de Grasse (06) N° 7803/88

Important notice
The present document can be downloaded from: http://www.etsi.org/standards-search

The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.

Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at http://portal.etsi.org/tb/status/status.asp

If you find errors in the present document, please send your comment to one of the following services: https://portal.etsi.org/People/CommiteeSupportStaff.aspx

Copyright Notification
No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media.

European Telecommunications Standards Institute 2015.
European Broadcasting Union 2015.
All rights reserved.

DECT™, PLUGTESTS™, UMTS™ and the ETSI logo are Trade Marks of ETSI registered for the benefit of its Members. 3GPP™ and LTE are Trade Marks of ETSI registered for the benefit of
its Members and of the 3GPP Organizational Partners. GSM and the GSM logo are Trade Marks registered and owned by the GSM Association.

Contents
Intellectual Property Rights
Foreword
Modal verbs terminology
1 Scope
2 References
2.1 Normative references
2.2 Informative references
3 Definitions and abbreviations
3.1 Definitions
3.2 Abbreviations
4 MDA Core Metadata
4.1 Introduction (informative)
4.2 Timeline
4.3 Audio Objects
4.4 Coordinate System
4.5 Object Model
4.5.1 General
4.5.2 Namespace
4.5.3 Versioning
4.5.4 Program
4.5.5 Header
4.5.5.1 General
4.5.5.2 programURI
4.5.5.3 sampleRate
4.5.5.4 constraintSets
4.5.5.5 extensions
4.5.6 Entities
4.5.6.1 General
4.5.6.2 id
4.5.6.3 extensions
4.5.7 Group
4.5.7.1 General
4.5.8 Switch
4.5.8.1 General
4.5.9 Fragment
4.5.9.1 General
4.5.9.2 offset property
4.5.9.3 duration property
4.5.10 MonoSourceFragment
4.5.10.1 General
4.5.10.2 audioEssence
4.5.10.3 gain
4.5.11 ObjectFragment
4.5.11.1 General
4.5.11.2 position
4.5.11.3 aperture
4.5.11.4 divergence
4.5.11.5 coherent
4.5.11.6 renderingExceptions
4.5.11.7 contentKind
4.5.12 LFEFragment
4.5.12.1 General
4.5.13 AudioSamples
4.5.13.1 assetOffset
4.5.13.2 assetURI
4.5.14 RenderingException
4.5.14.1 General
4.5.14.2 targetConfiguration property
4.5.15 PositionRenderingException
4.5.15.1 General
4.5.15.2 position
4.5.16 ChannelRenderingException
4.5.16.1 General
4.5.16.2 gains
4.5.17 ChannelGain
4.5.17.1 General
4.5.17.2 gain property
4.5.17.3 channel property
4.5.18 Extension
4.5.19 Position
4.5.19.1 General
4.5.19.2 radius
4.5.19.3 azimuth
4.5.19.4 elevation
4.6 Overall Constraints
4.6.1 Aligned Fragment Instances
4.7 URI Constants
4.8 Basic Data Types
4.8.1 General
4.8.2 Real
4.8.3 Rational
4.8.4 Integer
4.8.5 URI
5 MDA Reference Renderer
5.1 Overview
5.2 Configuration
5.2.1 General
5.2.2 soundfieldName
5.2.3 Speakers
5.2.4 patches
5.2.5 Virtual Sources
5.3 Rendering Process
5.3.1 ProcessOffset
5.3.2 Render Object Fragment
5.3.3 RenderPatch
5.3.4 Point Source Rendering
5.4 Extent Rendering
6 MDA Core Bitstream
6.1 Introduction
6.2 Structures
6.2.1 General
6.2.2 Bitstream
6.2.3 Frame
6.2.4 Assets
6.2.5 Slice Structure
6.2.6 Entities
6.2.7 LFEFragment
6.2.8 ObjectFragment
6.2.9 Group
6.2.10 Switch
6.3 Packets
6.3.1 General
6.3.2 Frame Header Packet
6.3.2.1 General
6.3.2.2 fBitstreamVersion
6.3.2.3 fProgramNamespace
6.3.2.4 fProgramURI
6.3.2.5 fSampleRate
6.3.2.6 fExtensions
6.3.2.7 fOffset
6.3.2.8 fDuration
6.3.2.9 fCRC
6.3.3 Asset Frame Packet
6.3.3.1 General
6.3.3.2 fId
6.3.4 fAssetEncoding
6.3.4.1 General
6.3.4.2 fAssetBytes
6.3.5 Frame End Packet
6.3.6 Slice Header Packet
6.3.6.1 General
6.3.6.2 fDuration
6.3.7 Object Fragment Packet
6.3.7.1 General
6.3.7.2 fPosition
6.3.7.3 fAperture
6.3.7.4 fDivergence
6.3.7.5 fCoherent
6.3.7.6 fContentKind
6.3.7.7 fRenderingExceptions
6.3.8 LFE Fragment Packet
6.3.9 Group Start Packet
6.3.10 Group End Packet
6.3.11 SwitchStartPacket
6.3.12 SwitchEndPacket
6.3.13 EntityPacket
6.3.13.1 General
6.3.13.2 fId
6.3.13.3 fExtensions
6.3.14 FragmentPacket
6.3.15 MonoSourceFragmentPacket
6.3.15.1 General
6.3.15.2 fAssetURI
6.3.15.3 fAssetOffset
6.3.15.4 fGain
6.3.16 UnexpectedPacket
6.4 Common Data Structures
6.4.1 PacketHeader
6.4.2 ChannelGain
6.4.2.1 General
6.4.2.2 fGain
6.4.3 RenderingException
6.4.4 Labels
6.4.5 PackedLength
6.4.6 Extension
6.4.7 FixedArray
6.4.8 Position
6.4.8.1 General
6.4.8.2 fRadius
6.4.8.3 fAzimuth
6.4.8.4 fElevation
6.4.9 ByteArray
6.4.10 BERUInt32
6.4.11 PackedUInt64
6.4.12 PackedUInt32
6.4.13 PackedUInt16
6.4.14 OptionalItem
6.4.15 UTF8String
6.5 Constants
6.5.1 Packet Kinds
6.5.2 Bitstream Version
7 MDA Broadcast Extensions
7.1 Summary
7.2 Higher Order Ambisonics
7.2.1 General
7.2.2 HOAMonoFragment
7.2.2.1 General
7.2.2.2 Semantics
7.2.2.2.1 channelNumber
7.2.2.2.2 normalizationType
7.2.3 HOAObjectFragment
7.2.3.1 General
7.2.3.2 Semantics
7.2.3.2.1 HOAOrder
7.2.3.2.2 adaptorMatrix
7.2.3.2.3 Members
7.3 Broadcast Extensions
7.3.1 General
7.3.2 BroadcastExtension
7.3.2.1 General
7.3.3 Program Broadcast Extension
7.3.3.1 General
7.3.3.2 Semantics
7.3.3.2.1 programComplexity
7.3.3.2.2 programLoudness
7.3.3.2.3 targetLoudness
7.3.3.2.4 programDRC
7.3.3.3 Group Broadcast Extension
7.3.3.3.1 General
7.3.3.4 Semantics
7.3.3.4.1 groupKind
7.3.3.4.2 groupKindParameters
7.3.4 Entity Broadcast Extension
7.3.4.1 General
7.3.4.2 Semantics
7.3.4.2.1 entityLoudness
7.3.5 ObjectFragment Broadcast Extension
7.3.5.1 General
7.3.5.2 Semantics
7.3.5.2.1 Interactivity
7.3.5.2.2 Lock
7.3.5.2.3 priority
7.3.5.2.4 snap
7.3.5.2.5 dialogFraction
7.4 Data Types
7.4.1 ComplexityProfileType
7.4.1.1 General
7.4.1.2 Semantics
7.4.1.2.1 maxNumberObjects
7.4.1.2.2 minFragmentLength
7.4.1.2.3 HOAFlag
7.4.2 LoudnessProfileType
7.4.2.1 General
7.4.2.2 Semantics
7.4.2.2.1 measurementConfiguration
7.4.2.2.2 IntegratedLoudness
7.4.2.2.3 IntegratedDialogLoudness
7.4.2.2.4 IntegratedNonDialogLoudness
7.4.2.2.5 ShortTermLoudness
7.4.2.2.6 MomentaryLoudness
7.4.2.2.7 InstantanenousLoudness
7.4.2.2.8 LoudnessRange
7.4.2.2.9 TruePeak
7.4.3 LoudnessType
7.4.3.1 General
7.4.3.2 Semantics
7.4.3.2.1 value
7.4.3.2.2 units
7.4.4 DRCType
7.4.4.1 General
7.4.4.2 Semantics
7.4.4.2.1 DRCProfile
7.4.4.2.2 DialogGain
7.4.5 GroupParameterType
7.4.5.1 General
7.4.5.2 Semantics
7.4.5.2.1 configuration
7.4.6 InteractivityType
7.4.6.1 General
7.4.6.2 Semantics
7.4.6.2.1 azimuthDelta
7.4.6.2.2 elevationDelta
7.4.6.2.3 apertureDelta
7.4.6.2.4 divergenceDelta
7.4.6.2.5 gainDelta
7.4.7 LockType
7.4.7.1 General
7.4.7.2 Semantics
7.4.7.2.1 locker
7.4.7.2.2 keyID
7.5 Constants
7.5.1 Conventions
7.5.1.1 Namespaces
7.5.1.2 Constants
7.5.2 Profile Constants
7.5.3 loudnessUnit Constants
7.5.4 groupKind Constants
7.5.5 HOA Normalization
7.6 BPX Bitstream
7.6.1 General
7.6.2 Higher Order Ambisonics
7.6.2.1 HOAMonoSourceFragment
7.6.2.2 HOAObjectFragment
7.6.3 Broadcast Extensions
7.6.3.1 General
7.6.3.2 ProgramBroadcastExtension
7.6.3.3 Group Broadcast Extension
7.6.3.4 Entity Broadcast Extension
7.6.3.5 ObjectFragment Broadcast Extension
7.6.4 Data Types
7.6.4.1 General
7.6.4.2 ComplexityProfileType
7.6.4.3 LoudnessProfileType
7.6.4.4 LoudnessType
7.6.4.5 DRCType
7.6.4.6 GroupParameterType
7.6.4.7 InteractivityType
7.6.4.8 LockType
Annex A (normative): Structured Specification Language
A.1 General
A.2 Macro
A.3 Structure
A.4 Basic Type
A.5 Type Aliasing
A.6 Control Statements
A.7 Fields
A.8 Constants
Annex B (informative): XML MDA Broadcast Schema
Annex C (informative): Bibliography
Annex D (informative): Change History
History

Intellectual Property Rights
IPRs essential or potentially essential to the present document may have been declared
to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards”, which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (http://ipr.etsi.org).

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become, essential to the present document.

Foreword
This Technical Specification (TS) has been produced by Joint Technical Committee (JTC) Broadcast of the European Broadcasting Union (EBU), Comité Européen de Normalisation ELECtrotechnique (CENELEC) and the European Telecommunications Standards Institute (ETSI).

NOTE: The EBU/ETSI JTC Broadcast was established in 1990 to co-ordinate the drafting of standards in the specific field of broadcasting and related fields. Since 1995 the JTC Broadcast became a tripartite body by including in the Memorandum of Understanding also CENELEC, which is responsible for the standardization of radio and television receivers. The EBU is a professional association of broadcasting organizations whose work includes the co-ordination of its members' activities in the technical, legal, programme-making and programme-exchange domains. The EBU has active members in about 60 countries in the European broadcasting area; its headquarters is in Geneva.

European Broadcasting Union
CH-1218 GRAND SACONNEX (Geneva)
Switzerland
Tel: +41 22 717 21 11
Fax: +41 22 717 24 81

Modal verbs terminology
In the present document “shall”, “shall not”, “should”, “should not”, “may”, “need not”, “will”, “will not”, “can” and “cannot” are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions).

“must” and “must not” are NOT allowed in ETSI deliverables except when used in direct citation.

1 Scope
The present document specifies the object model, reference renderer, bitstream syntax and broadcast extensions for MDA. MDA, short for Multi-Dimension Audio, is a metadata model and bitstream representation of an object-based soundfield for linear content, for use in cinema and broadcast applications. The present document consists of four main clauses. The metadata clause (Clause 4) provides a metadata model independent of (bitstream) representation, with a strong emphasis on cinematic content.
Clause 5 specifies a reference renderer, providing semantics for the MDA metadata model. Clause 6 specifies a preferred bitstream representation of the MDA metadata model. Note that the metadata model allows for more than one bitstream representation. Finally, Clause 7 specifies an extension of the core MDA model to include metadata and bitstream elements specifically suited for broadcast content. This clause includes, among others, metadata for Loudness, Higher Order Ambisonics and Interactivity.

Unless otherwise stated, MDA metadata are specified using Unified Modeling Language [4].

Note that the MDA core metadata, reference renderer and bitstream documents have been submitted to SMPTE 25CSS “Immersive Sound Model and Bitstream” [i.2] for consideration towards an interoperable immersive sound model and bitstream for cinematographic linear content.

2 References

2.1 Normative references
References are either specific (identified by date of publication and/or edition number or version number) or non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the reference document (including any amendments) applies.

Referenced documents which are not found to be publicly available in the expected location might be found at http://docbox.etsi.org/Reference.

NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee their long term validity.

The following referenced documents are necessary for the application of the present document.

[1] IETF RFC 3986 (January 2005): “Uniform Resource Identifier (URI): Generic Syntax”.
[2] Recommendation ITU-R BS.1770-3: “Algorithms to measure audio programme loudness and true-peak audio level”.
[3] Recommendat