1、 ETSI TS 103 589 V1.1.1 (2018-06) Higher Order Ambisonics (HOA) Transport Format TECHNICAL SPECIFICATION ETSI ETSI TS 103 589 V1.1.1 (2018-06)2 Reference DTS/JTC-047 Keywords audio, broadcasting, TV, UHDTV ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fa
2、x: +33 4 93 65 47 16 Siret N 348 623 562 00017 - NAF 742 C Association but non lucratif enregistre la Sous-Prfecture de Grasse (06) N 7803/88 Important notice The present document can be downloaded from: http:/www.etsi.org/standards-search The present document may be made available in electronic ver
3、sions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is
4、 the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat. Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is ava
5、ilable at https:/portal.etsi.org/TB/ETSIDeliverableStatus.aspx If you find errors in the present document, please send your comment to one of the following services: https:/portal.etsi.org/People/CommiteeSupportStaff.aspx Copyright Notification No part may be reproduced or utilized in any form or by
6、 any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all med
7、ia. ETSI 2018. European Broadcasting Union 2018. All rights reserved. DECTTM, PLUGTESTSTM, UMTSTMand the ETSI logo are trademarks of ETSI registered for the benefit of its Members. 3GPPTM and LTETMare trademarks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partner
8、s. oneM2M logo is protected for the benefit of its Members. GSMand the GSM logo are trademarks registered and owned by the GSM Association. ETSI ETSI TS 103 589 V1.1.1 (2018-06)3 Contents Intellectual Property Rights 4g3Foreword . 4g3Modal verbs terminology 4g31 Scope 5g32 References 5g32.1 Normativ
9、e references . 5g32.2 Informative references 5g33 Definitions and abbreviations . 6g33.1 Definitions 6g33.2 Abbreviations . 6g34 Higher Order Ambisonics (HOA) Transport Format . 7g34.1 Introduction 7g34.2 Generic HOA Transport Format . 7g34.3 ISO/IEC 23008-3-based HOA Transport Format (HoaTransportT
10、ype = 1) . 11g34.3.1 Introduction. 11g34.3.2 HOA Transport Format defined in ISO/IEC 23008-3 . 12g34.3.3 Implementation of HOA Transport Encoder (TE) and HOA Emission Encoder (EE) . 12g34.4 ISO/IEC 23008-3-based HOA Transport Format modified for SN3D Normalization (HoaTransportType = 2) . 16g34.5 V-
11、vector based HOA Transport Format (HoaTransportType = 3) . 23g35 HOA Transport Format Audio Stream . 25g35.1 Introduction 25g35.2 Syntax of HOA Transport Format Audio Stream . 26g35.3 Application Examples of HOA Transport Format Audio Stream 28g3Annex A (informative): Example guidelines for implemen
12、ting HOA transport over SDI utilizing communications modem technologies . 30g3Annex B (informative): Example guidelines for HOA production 32g3History 33g3ETSI ETSI TS 103 589 V1.1.1 (2018-06)4 Intellectual Property Rights Essential patents IPRs essential or potentially essential to normative delive
13、rables may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect
14、of ETSI standards“, which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (https:/ipr.etsi.org/). Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of othe
15、r IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become, essential to the present document. Trademarks The present document may include trademarks and/or tradenames which are asserted and/or registered by their owners. ETSI claims no owner
16、ship of these except for any which are indicated as being the property of ETSI, and conveys no right to use or reproduce any trademark and/or tradename. Mention of those trademarks in the present document does not constitute an endorsement by ETSI of products, services or organizations associated wi
17、th those trademarks. Foreword This Technical Specification (TS) has been produced by Joint Technical Committee (JTC) Broadcast of the European Broadcasting Union (EBU), Comit Europen de Normalisation ELECtrotechnique (CENELEC) and the European Telecommunications Standards Institute (ETSI). NOTE: The
18、 EBU/ETSI JTC Broadcast was established in 1990 to co-ordinate the drafting of standards in the specific field of broadcasting and related fields. Since 1995 the JTC Broadcast became a tripartite body by including in the Memorandum of Understanding also CENELEC, which is responsible for the standard
19、ization of radio and television receivers. The EBU is a professional association of broadcasting organizations whose work includes the co-ordination of its members activities in the technical, legal, programme-making and programme-exchange domains. The EBU has active members in about 60 countries in
20、 the European broadcasting area; its headquarters is in Geneva. European Broadcasting Union CH-1218 GRAND SACONNEX (Geneva) Switzerland Tel: +41 22 717 21 11 Fax: +41 22 717 24 81 Modal verbs terminology In the present document “shall“, “shall not“, “should“, “should not“, “may“, “need not“, “will“,
21、 “will not“, “can“ and “cannot“ are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions). “must“ and “must not“ are NOT allowed in ETSI deliverables except when used in direct citation. ETSI ETSI TS 103 589 V1.1.1 (2018-06)5 1 Scope
22、Higher Order Ambisonics (HOA) signals are able to deliver a significantly enhanced immersive sound compared to conventional stereo or 5.1 channel audio signals. However, there are some use cases where HOA signals cannot be transported because of the large number of HOA input channels. The present do
23、cument provides an HOA transport format which allows unrestricted HOA order signals to be transported. 2 References 2.1 Normative references References are either specific (identified by date of publication and/or edition number or version number) or non-specific. For specific references, only the c
24、ited version applies. For non-specific references, the latest version of the referenced document (including any amendments) applies. Referenced documents which are not found to be publicly available in the expected location might be found at https:/docbox.etsi.org/Reference. NOTE: While any hyperlin
25、ks included in this clause were valid at the time of publication, ETSI cannot guarantee their long term validity. The following referenced documents are necessary for the application of the present document. 1 ISO/IEC 23008-3:2015/AMD 1:2016: “Information technology - High efficiency coding and medi
26、a delivery in heterogeneous environments - Part 3: 3D audio, 3D Audio Profile and Levels“. NOTE: Available at https:/www.iso.org/standard/67953.html. 2 ISO/IEC 23008-3:2015/DAM 5: “Information technology - High efficiency coding and media delivery in heterogeneous environments - Part 3: 3D audio, Au
27、dio Metadata Enhancements“. NOTE: Available at https:/www.iso.org/standard/74433.html. 3 ISO/IEC 23008-3:2015: “Information technology - High efficiency coding and media delivery in heterogeneous environments - Part 3: 3D audio“. NOTE: Available at https:/www.iso.org/standard/63878.html. 4 ISO/IEC 2
28、3008-3:2015/AMD 3:2017: “ Information technology - High efficiency coding and media delivery in heterogeneous environments - Part 3: 3D audio, MPEG-H 3D Audio Phase 2“. NOTE: Available at https:/www.iso.org/standard/69561.html. 5 ISO/IEC 13818-1:2015: “Information technology - Generic coding of movi
29、ng pictures and associated audio information - Part 1: Systems“. NOTE: Available at https:/www.iso.org/standard/67331.html. 2.2 Informative references References are either specific (identified by date of publication and/or edition number or version number) or non-specific. For specific references,
30、only the cited version applies. For non-specific references, the latest version of the referenced document (including any amendments) applies. NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee their long term validity. ETSI ETSI TS 103 58
31、9 V1.1.1 (2018-06)6 The following referenced documents are not necessary for the application of the present document but they assist the user with regard to a particular subject area. i.1 SMPTE Motion Imaging Journal: “Building The Worlds Most Complex TV Network: A Test Bed for Broadcasting Immersiv
32、e and Interactive Audio“ R. L. Bleidt et al.: pp. 26-34, 2017. NOTE: Available at http:/ieeexplore.ieee.org/document/7963945/. 3 Definitions and abbreviations 3.1 Definitions For the purposes of the present document, the following terms and definitions apply: MPEG-H Audio Stream (MHAS): self-contain
33、ed stream format to transport ISO/IEC 23008-3 data MPEG-H 3DA: MPEG-H 3D Audio standard defined in ISO/IEC 23008-3 1 to 4. 3.2 Abbreviations For the purposes of the present document, the following abbreviations apply: ACN Ambisonic Channel Number AGC Adaptive Gain Control AU Access Unit BG Backgroun
34、d (audio channel) CRC Cyclic Redundancy Check DAW Digital Audio Workstation FG Foreground (audio channel) HDMI High-Definition Multimedia Interface HD-SDI High-Definition Serial Digital Interface HOA Higher Order Ambisonics HTF HOA Transport Format HTFAS HOA Transport Format Audio Stream ISO Interna
35、tional Organization for Standardization MHAS MPEG-H Audio Stream MMT MPEG media transport MPEG Moving Pictures Experts Group MPEG-H LC MPEG-H Audio Low Complexity profile NOTE: As defined in ISO/IEC 23008-3 1. NOC Network Operation Centre OTA Over The Air (media) OTT Over The Top (media) PCM Pulse C
36、ode Modulation SDI Serial Digital Interface SID Single Index Designation SMPTE Society of Motion Picture InputAudioBitDepth = (InputAudioBitDepthIdx+1)*8; HoaFrameLengthIdx; NumOfHoaCoeffs = ( HoaOrder + 1 )2; NumOfTransportChannels = NumOfHoaCoeffs; HoaNormalization; HoaCoeffOrdering; IsScreenRelat
37、ive; else if (HoaTransportType = 1) HoaNormalization = 1; HoaCoeffOrdering = 0; NumOfTransportChannels = CodedNumOfTransportChannels+1; HOAConfig(); else if (HoaTransportType = 2) HoaNormalization = 0; HoaCoeffOrdering = 0; NumOfTransportChannels = CodedNumOfTransportChannels+1; HOAConfig_SN3D(); is
38、ScreenRelative = isScreenRelative_E; else if (HoaTransportType = 3) InputSamplingFrequency; InputAudioBitDepth = (InputAudioBitDepthIdx+1)*8; HoaFrameLengthIdx; NumOfHoaCoeffs = ( HoaOrder + 1 )2; HoaNormalization = 0; HoaCoeffOrdering = 0; IsScreenRelative; NumOfTransportChannels = CodedNumOfTransp
39、ortChannels+1; if (IsScreenRelative) if (hasNonStandardScreenSize) if (isCenteredInAzimuth) bsScreenSizeAz; else bsScreenSizeLeftAz; bsScreenSizeRightAz; bsScreenSizeTopEl; bsScreenSizeBottomEl; 4 2 3 5 2 2 1 5 5 4 2 3 5 1 5 1 1 9 10 10 9 9 uimsbf uimsbf uimsbf uimsbf uimsbf uimsbf uimsbf uimsbf uim
40、sbf uimsbf uimsbf uimsbf uimsbf uimsbf uimsbf bslbf bslbf uimsbf uimsbf uimsbf uimsbf uimsbf Table 2: Semantics of HOATransportConfig() HoaTransportType This element contains information about HOA transport mode. 0: HOA coefficients (as defined in this clause) 1: ISO/IEC 23008-3-based HOA Transport
41、Format as defined in clause 4.3 2: Modified ISO/IEC 23008-3-based HOA Transport Format for SN3D normalization as defined in clause 4.4 3: V-vector based HOA Transport Format as defined in clause 4.5 InputSamplingFrequency This element contains information about input sampling frequency. 0: 24 kHz 1:
42、 32 kHz 2: 44,1 kHz 3: 48 kHz 4: 96 kHz 5: 192 kHz 6 - 15: reserved ETSI ETSI TS 103 589 V1.1.1 (2018-06)10 InputAudioBitDepthIdx This element determines the input audio bit depth by InputAudioBitDepth = (InputAudioBitDepthIdx+1)*8. HoaOrder This element determines the HOA order of the coded signal.
43、 HoaNormalization This element contains information about HOA coefficient normalization. 0: SN3D normalization 1: N3D normalization 2: FuMa normalization 3: reserved HoaCoeffOrdering This element contains information about HOA coefficient ordering. 0: ACN 1: SID 2-3: reserved IsScreenRelative This e
44、lement contains information about whether the content is: 0: not screen related 1: screen related hasNonStandardScreenSize This flag specifies whether the defined production screen size is different from the default screen size. The definition is done via viewing angles (in degrees) corresponding to
45、 the screen edges. The default screen size is defined with the following values (a 4K display at an optimal viewing distance): g2030g142g135g136g150g3404 g884g891g484g882g953g481g3g2030g148g139g137g138g150g3404 g3398g884g891g484g882g953g2016g150g145g146g3404 g883g889g484g887g953g481g3g2016g132g145g1
46、50g150g145g143g3404 g3398g883g889g484g887g953isCenteredInAzimuth This flag defines whether the production screen is frontal and centered in azimuth (absolute values of the azimuth angles of the left and right screen edge are identical) or not. bsScreenSizeAz This field defines the azimuth angles (in
47、 degree) corresponding to the left and right screen edge: g2030g142g135g136g150g3404 g882g481g887g3g132g149g22g133g148g135g135g144g22g139g156g135g4g156 g2030g142g135g136g150g3404 g143g139g144g3g4666g143g131g154g4666g2030g142g135g136g150g3g481g882g4667g481g883g890g882g4667 g2030g148g139g137g138g150g3
48、404 g3398g882g481g887g3g132g149g22g133g148g135g135g144g22g139g156g135g4g156 g2030g148g139g137g138g150g3404 g143g139g144g4666g143g131g154g4666g2030g148g139g137g138g150g3g481g3398g883g890g882g4667g481g882g4667 bsScreenSizeLeftAz This field defines the azimuth angle (in degree) corresponding to the lef
49、t screen edge: g2030g142g135g136g150g3404 g882g481g887g3g4666g132g149g22g133g148g135g135g144g22g139g156g135g15g135g136g150g4g156g3398g887g883g883g4667 g2030g142g135g136g150g3404 g143g139g144g4666g143g131g154g4666g2030g142g135g136g150g3g481g3398g883g890g882g4667g481g883g890g882g4667 bsScreenSizeRightAz This field defines the azimuth angle (in degree) corresponding to the right screen edge: g2030g148g139g137g138g150g3404 g882g481g887g3g4666g132g149g22g133g148g135g135g144g22g139g156g135g21g139g137g138g150g4g15