ETSI TS 103 190-2 V1.1.1 (2015-09)
Digital Audio Compression (AC-4) Standard
Part 2: Immersive and personalized audio
TECHNICAL SPECIFICATION

Reference
DTS/JTC-029-2

Keywords
audio, broadcasting, codec, content, digital, distribution, object audio, personalization

ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE
Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16
Siret N° 348 623 562 00017 - NAF 742 C
Non-profit association registered at the Sous-Préfecture de Grasse (06) N° 7803/88

Important notice
The present document can be downloaded from: http://www.etsi.org/standards-search
The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.
Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at http://portal.etsi.org/tb/status/status.asp
If you find errors in the present document, please send your comment to one of the following services: https://portal.etsi.org/People/CommiteeSupportStaff.aspx

Copyright Notification
No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media.
© European Telecommunications Standards Institute 2015. All rights reserved.
DECT™, PLUGTESTS™, UMTS™ and the ETSI logo are Trade Marks of ETSI registered for the benefit of its Members. 3GPP™ and
LTE are Trade Marks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. GSM and the GSM logo are Trade Marks registered and owned by the GSM Association.

Contents
Intellectual Property Rights
Foreword
Modal verbs terminology
Introduction
1 Scope
2 References
2.1 Normative references
2.2 Informative references
3 Definitions, symbols, abbreviations and conventions
3.1 Definitions
3.2 Symbols
3.3 Abbreviations
3.4 Conventions
4 Decoding the AC-4 bitstream
4.1 Introduction
4.2 Channels and objects
4.3 Immersive Audio
4.4 Personalized Audio
4.5 AC-4 Bitstream
4.5.1 Bitstream structure
4.5.2 Data dependencies
4.6 Decoder compatibilities
4.7 Decoding modes
4.7.1 Introduction
4.7.2 Full decoding mode
4.7.3 Core decoding mode
4.8 Decoding process
4.8.1 Overview
4.8.2 Selecting a presentation
4.8.3 Decoding of substreams
4.8.3.1 Introduction
4.8.3.2 Identification of substream type
4.8.3.3 Substream decoding overview
4.8.3.4 Decoding of object properties
4.8.3.4.1 Introduction
4.8.3.4.2 Object Audio Metadata location
4.8.3.5 Spectral Frontend(s)
4.8.3.6 Stereo and Multichannel Processing (SMP)
4.8.3.7 Inverse Modified Discrete Cosine Transformation (IMDCT)
4.8.3.8 Simple Coupling (S-CPL)
4.8.3.9 QMF Analysis
4.8.3.10 Companding
4.8.3.11 A-SPX
4.8.3.12 Advanced Joint Channel Coding (A-JCC)
4.8.3.13 Advanced Joint Object Coding (A-JOC)
4.8.3.14 Advanced coupling - A-CPL
4.8.3.15 Dialogue Enhancement
4.8.3.16 Direct DRC bitstream gain application
4.8.3.17 Substream gain application for operation with associated audio
4.8.3.18 Substream gain application for operation with dialogue substreams
4.8.3.19 Substream Rendering
4.8.4 Mixing of decoded substreams
4.8.5 Loudness correction
4.8.5.1 Introduction
4.8.5.2 Dialnorm location
4.8.5.3 Downmix loudness correction
4.8.5.4 Alternative presentation loudness correction
4.8.5.5 Realtime loudness correction data
4.8.6 Dynamic Range Control
4.8.7 QMF Synthesis
4.8.8 Sample rate conversion
5 Algorithmic details
5.1 Bitstream processing
5.1.1 Introduction
5.1.2 Elementary Stream Muxing Tool
5.1.3 Efficient High Frame Rate mode
5.2 Stereo and Multichannel Processing (SMP) for immersive audio
5.2.1 Introduction
5.2.2 Interface
5.2.2.1 Inputs
5.2.2.2 Outputs
5.2.2.3 Controls
5.2.3 Processing the immersive_channel_element
5.2.3.1 Introduction
5.2.3.2 immersive_codec_mode ∈ {SCPL, ASPX_SCPL, ASPX_ACPL_1}
5.2.3.3 immersive_codec_mode = ASPX_ACPL_2
5.2.3.4 immersive_codec_mode = ASPX_AJCC
5.2.4 Processing the 22_2_channel_element
5.3 Simple Coupling (S-CPL)
5.3.1 Introduction
5.3.2 Interface
5.3.2.1 Inputs
5.3.2.2 Outputs
5.3.3 Reconstruction of the output channels
5.3.3.1 Full decoding
5.3.3.2 Core decoding
5.4 Advanced Spectral Extension (A-SPX) post-processing tool
5.4.1 Introduction
5.4.2 Inputs
5.4.3 Processing
5.5 Advanced Coupling (A-CPL) for immersive audio
5.5.1 Introduction
5.5.2 Processing the immersive_channel_element
5.6 Advanced Joint Channel Coding (A-JCC)
5.6.1 Introduction
5.6.2 Interface
5.6.2.1 Inputs
5.6.2.2 Outputs
5.6.2.3 Controls
5.6.3 Processing
5.6.3.1 Parameter band to QMF subband mapping
5.6.3.2 Differential decoding and dequantization
5.6.3.3 Interpolation
5.6.3.4 Decorrelator and transient ducker
5.6.3.5 Reconstruction of the output channels
5.6.3.5.1 Input channels
5.6.3.5.2 A-JCC full decoding mode
5.6.3.5.3 A-JCC core decoding mode
5.7 Advanced Joint Object Coding (A-JOC)
5.7.1 Introduction
5.7.2 Interface
5.7.2.1 Inputs
5.7.2.2 Outputs
5.7.2.3 Controls
5.7.3 Processing
5.7.3.1 Parameter band to QMF subband mapping
5.7.3.2 Differential decoding
5.7.3.3 Dequantization
5.7.3.4 Parameter time interpolation
5.7.3.5 Decorrelator and transient ducker
5.7.3.6 Signal reconstruction using matrices
5.7.3.6.1 Processing
5.7.3.6.2 Decorrelation input matrix
5.8 Dialogue Enhancement (DE) for immersive audio
5.8.1 Introduction
5.8.2 Processing
5.8.2.1 DE for core decoding of A-JCC coded 9.X.4 content
5.8.2.2 DE for core decoding of parametric A-CPL coded 9.X.4 content
5.8.2.3 DE for full decoding of A-JOC coded content
5.8.2.4 DE for core decoding of A-JOC coded content
5.8.2.5 DE for non A-JOC coded object audio content
5.9 Object Audio Metadata Timing
5.9.1 Introduction
5.9.2 Synchronization of object properties
5.10 Rendering
5.10.1 Introduction
5.10.2 Channel Audio Renderer
5.10.2.1 Introduction
5.10.2.2 General rendering matrix
5.10.2.3 Panning of a stereo or mono signal
5.10.2.4 Substream downmix or upmix for full decoding
5.10.2.5 Matrix coefficients for channel based renderer for full decoding
5.10.2.6 Substream downmix or upmix for core decoding
5.10.2.7 Matrix coefficients for channel based renderer for core decoding
5.10.3 Intermediate Spatial Format rendering
5.10.3.1 Introduction
5.10.3.2 Conventions
5.10.3.3 Interface
5.10.3.3.1 Inputs
5.10.3.3.2 Outputs
5.10.3.3.3 Controls
5.10.3.4 Processing
5.11 Accurate frame rate control
6 Bitstream syntax
6.1 Introduction
6.2 Syntax specification
6.2.1 AC-4 frame info
6.2.1.1 ac4_toc
6.2.1.2 ac4_presentation_info
6.2.1.3 ac4_presentation_v1_info
6.2.1.4 frame_rate_fractions_info
6.2.1.5 presentation_config_ext_info
6.2.1.6 ac4_substream_group_info
6.2.1.7 ac4_sgi_specifier
6.2.1.8 ac4_substream_info_chan
6.2.1.9 ac4_substream_info_ajoc
6.2.1.10 bed_dyn_obj_assignment
6.2.1.11 ac4_substream_info_obj
6.2.1.12 ac4_presentation_substream_info
6.2.1.13 oamd_substream_info
6.2.1.14 ac4_hsf_ext_substream_info
6.2.2 AC-4 substreams
6.2.2.1 Introduction
6.2.2.2 ac4_substream
6.2.2.3 ac4_presentation_substream
6.2.2.4 oamd_substream
6.2.3 Audio data
6.2.3.1 audio_data_chan
6.2.3.2 audio_data_objs
6.2.3.3 objs_to_channel_mode
6.2.3.4 audio_data_ajoc
6.2.3.5 ajoc_dmx_de_data
6.2.4 Channel elements
6.2.4.1 immersive_channel_element
6.2.4.2 immers_cfg
6.2.4.3 22_2_channel_element
6.2.4.4 var_channel_element
6.2.5 Advanced Joint Object Coding (A-JOC)
6.2.5.1 ajoc
6.2.5.2 ajoc_ctrl_info
6.2.5.3 ajoc_data
6.2.5.4 ajoc_data_point_info
6.2.5.5 ajoc_huff_data
6.2.6 Advanced Joint Channel Coding (A-JCC)
6.2.6.1 ajcc_data
6.2.6.2 ajcc_framing_data
6.2.6.3 ajced
6.2.6.4 ajcc_huff_data
6.2.7 Metadata
6.2.7.1 metadata
6.2.7.2 basic_metadata
6.2.7.3 further_loudness_info
6.2.7.4 extended_metadata
6.2.7.5 dialog_enhancement
6.2.7.6 de_data
6.2.8 Object Audio Metadata (OAMD)
6.2.8.1 oamd_common_data
6.2.8.2 oamd_timing_data
6.2.8.3 oamd_dyndata_single
6.2.8.4 oamd_dyndata_multi
6.2.8.5 object_info_block
6.2.8.6 object_basic_info
6.2.8.7 object_render_info
6.2.9 Presentation data
6.2.9.1 loud_corr
6.2.9.2 custom_dmx_data
6.2.9.3 cdmx_parameters
6.2.9.4 tool_scr_to_c_l
6.2.9.5 tool_b4_to_b2
6.2.9.6 tool_t4_to_t2
6.2.9.7 tool_t4_to_f_s_b
6.2.9.8 tool_t4_to_f_s
6.2.9.9 tool_t2_to_f_s_b
6.2.9.10 tool_t2_to_f_s
6.3 Description of bitstream elements
6.3.1 Introduction
6.3.2 AC-4 frame info
6.3.2.1 ac4_toc - AC-4 table of contents
6.3.2.1.1 bitstream_version
6.3.2.1.2 br_code
6.3.2.1.3 b_iframe_global
6.3.2.1.4 b_program_id
6.3.2.1.5 short_program_id
6.3.2.1.6 b_program_uuid_present
6.3.2.1.7 program_uuid
6.3.2.1.8 total_n_substream_groups
6.3.2.2 ac4_presentation_v1_info - AC-4 presentation version 1 information
6.3.2.2.1 b_single_substream_group
6.3.2.2.2 presentation_config
6.3.2.2.3 mdcompat
6.3.2.2.4 b_presentation_group_index
6.3.2.2.5 b_presentation_filter
6.3.2.2.6 b_enable_presentation
6.3.2.2.7 b_multi_pid
6.3.2.2.8 n_substream_groups_minus2
6.3.2.3 presentation_version - Presentation version information
6.3.2.3.1 b_tmp
6.3.2.4 frame_rate_fractions_info - Frame rate fraction information
6.3.2.4.1 b_frame_rate_fraction
6.3.2.4.2 b_frame_rate_fraction_is_4
6.3.2.5 ac4_substream_group_info - AC-4 substream group information
6.3.2.5.1 b_substreams_present
6.3.2.5.2 n_lf_substreams_minus2
6.3.2.5.3 b_channel_coded
6.3.2.5.4 sus_ver
6.3.2.5.5 b_oamd_substream
6.3.2.5.6 b_ajoc
6.3.2.6 ac4_sgi_specifier - AC-4 substream group information specifier
6.3.2.6.1 group_index
6.3.2.7 ac4_substream_info_chan - AC-4 substream information for channel based substreams
6.3.2.7.1 Introduction
6.3.2.7.2 channel_mode
6.3.2.7.3 b_4_back_channels_present
6.3.2.7.4 b_centre_present
6.3.2.7.5 top_channels_present
6.3.2.7.6 b_audio_ndot
6.3.2.8 ac4_substream_info_ajoc - AC-4 substream information for object based substreams using A-JOC
6.3.2.8.1 b_lfe
6.3.2.8.2 b_static_dmx
6.3.2.8.3 n_fullband_dmx_signals_minus1
6.3.2.8.4 b_oamd_common_data_present
6.3.2.8.5 n_fullband_upmix_signals_minus1
6.3.2.9 bed_dyn_obj_assignment - Bed and dynamic object assignment
6.3.2.9.1 b_dyn_objects_only
6.3.2.9.2 b_isf
6.3.2.9.3 isf_config
6.3.2.9.4 b_ch_assign_code
6.3.2.9.5 bed_chan_assign_code
6.3.2.9.6 b_chan_assign_mask
6.3.2.9.7 b_nonstd_bed_channel_assignment
6.3.2.9.8 nonstd_bed_channel_assignment_mask
6.3.2.9.9 std_bed_channel_assignment_mask
6.3.2.9.10 n_bed_signals_minus1
6.3.2.9.11 nonstd_bed_channel_assignment
6.3.2.10 ac4_substream_info_obj - AC-4 substream information for object based substreams
6.3.2.10.1 n_objects_code
6.3.2.10.2 b_dynamic_objects
6.3.2.10.3 b_lfe
6.3.2.10.4 b_bed_objects
6.3.2.10.5 b_bed_start
6.3.2.10.6 b_isf_start
6.3.2.10.7 res_bytes
6.3.2.10.8 reserved_data
6.3.2.11 ac4_presentation_substream_info - Presentation substream information
6.3.2.11.1 b_alternative
6.3.2.11.2 b_pres_ndot
6.3.2.12 oamd_substream_info - Object audio metadata substream information
6.3.2.12.1 b_oamd_ndot
6.3.3 AC-4 substreams
6.3.3.1 ac4_presentation_substream - AC-4 presentation substream
6.3.3.1.1 b_name_present
6.3.3.1.2 b_length
6.3.3.1.3 name_len
6.3.3.1.4 presentation_name
6.3.3.1.5 n_targets_minus1
6.3.3.1.6 target_level
6.3.3.1.7 target_device_category
6.3.3.1.8 tdc_extension
6.3.3.1.9 b_ducking_depth_present
6.3.3.1.10 max_ducking_depth
6.3.3.1.11 b_loud_corr_target
6.3.3.1.12 loud_corr_target
6.3.3.1.13 n_substreams_in_presentation
6.3.3.1.14 b_active
6.3.3.1.15 alt_data_set_index
6.3.3.1.16 b_additional_data
6.3.3.1.17 add_data_bytes_minus1
6.3.3.1.18 add_data
6.3.3.1.19 drc_metadata_size_value
6.3.3.1.20 b_more_bits
6.3.3.1.21 drc_frame
6.3.3.1.22 b_substream_group_gains_present
6.3.3.1.23 b_keep
6.3.3.1.24 sg_gain
6.3.3.1.25 b_associated
6.3.3.1.26 b_associate_is_mono
6.3.3.1.27 pres_ch_mode
6.3.3.1.28 pres_ch_mode_core
6.3.3.1.29 b_pres_4_back_channels_present
6.3.3.1.30 pres_top_channel_pairs
6.3.3.1.31 b_pres_has_lfe
6.3.3.2 oamd_substream
6.3.3.2.1 Introduction
6.3.3.2.2 b_oamd_common_data_present
6.3.3.2.3 b_oamd_timing_present
6.3.4 Audio data
6.3.4.1 b_some_signals_inactive
6.3.4.2 dmx_active_signals_mask
6.3.4.3 b_dmx_timing
6.3.4.4 b_oamd_extension_present
6.3.4.5 skip_data
6.3.4.6 b_umx_timing
6.3.4.7 b_derive_timing_from_dmx
6.3.5 Channel elements
6.3.5.1 immersive_codec_mode_code
6.3.5.2 core_channel_config
6.3.5.3 core_5ch_