1、 ETSI TS 103 190-2 V1.2.1 (2018-02) Digital Audio Compression (AC-4) Standard; Part 2: Immersive and personalized audio floppy3TECHNICAL SPECIFICATION ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)2 Reference RTS/JTC-043-2 Keywords audio, broadcasting, codec, content, digital, distribution, object audio, p
2、ersonalization ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret N 348 623 562 00017 - NAF 742 C Association but non lucratif enregistre la Sous-Prfecture de Grasse (06) N 7803/88 Important notice The present document can be down
3、loaded from: http:/www.etsi.org/standards-search The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existi
4、ng or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat. Users of the present document should be aware that the document may be subje
5、ct to revision or change of status. Information on the current status of this and other ETSI documents is available at https:/portal.etsi.org/TB/ETSIDeliverableStatus.aspx If you find errors in the present document, please send your comment to one of the following services: https:/portal.etsi.org/Pe
6、ople/CommiteeSupportStaff.aspx Copyright Notification No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the
7、 written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media. ETSI 2018. All rights reserved. DECTTM, PLUGTESTSTM, UMTSTMand the ETSI logo are trademarks of ETSI registered for the benefit of its Members. 3GPPTM and LTE are trademarks of ETSI regist
8、ered for the benefit of its Members and of the 3GPP Organizational Partners. oneM2M logo is protected for the benefit of its Members. GSM and the GSM logo are trademarks registered and owned by the GSM Association. ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)3 Contents Intellectual Property Rights 17g3Fo
9、reword . 17g3Modal verbs terminology 17g3Introduction 18g3Motivation . 18g3Structure of the present document . 18g31 Scope 19g32 References 19g32.1 Normative references . 19g32.2 Informative references 19g33 Definitions, symbols, abbreviations and conventions 20g33.1 Definitions 20g33.2 Symbols 25g3
10、3.3 Abbreviations . 25g33.4 Conventions 26g34 Decoding the AC-4 bitstream . 28g34.1 Introduction 28g34.2 Channels and objects 28g34.3 Immersive audio . 29g34.4 Personalized Audio. 29g34.5 AC-4 bitstream . 30g34.5.1 Bitstream structure 30g34.5.2 Data dependencies 32g34.5.3 Frame rates 33g34.6 Decoder
11、 compatibilities 33g34.7 Decoding modes . 34g34.7.1 Introduction. 34g34.7.2 Full decoding mode 34g34.7.3 Core decoding mode . 34g34.8 Decoding process . 35g34.8.1 Overview 35g34.8.2 Selecting an audio presentation 36g34.8.3 Decoding of substreams 36g34.8.3.1 Introduction . 36g34.8.3.2 Identification
12、 of substream type 37g34.8.3.3 Substream decoding overview 39g34.8.3.4 Decoding of object properties . 40g34.8.3.4.1 Introduction . 40g34.8.3.4.2 Object audio metadata location . 40g34.8.3.5 Spectral frontends . 41g34.8.3.6 Stereo and multichannel processing (SMP) 42g34.8.3.7 Inverse modified discre
13、te cosine transformation (IMDCT) 42g34.8.3.8 Simple coupling (S-CPL) 42g34.8.3.9 QMF analysis 42g34.8.3.10 Companding 42g34.8.3.10.1 Introduction . 42g34.8.3.10.2 Channel audio substream . 42g34.8.3.10.3 Channel audio substream with an immersive channel element . 42g34.8.3.10.4 Object audio substrea
14、m . 43g34.8.3.11 A-SPX . 43g34.8.3.11.1 Introduction . 43g34.8.3.11.2 Core decoding mode with ASPX_SCPL codec mode . 43g34.8.3.11.3 Full decoding mode with ASPX_SCPL codec mode 44g3ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)4 4.8.3.12 Advanced joint channel coding (A-JCC) 44g34.8.3.13 Advanced joint obj
15、ect coding (A-JOC) . 44g34.8.3.14 Advanced coupling (A-CPL) 45g34.8.3.15 Dialogue enhancement 45g34.8.3.16 Direct dynamic range control bitstream gain application 46g34.8.3.17 Substream gain application for operation with associated audio. 46g34.8.3.18 Substream gain application for operation with d
16、ialogue substreams 47g34.8.3.19 Substream rendering 47g34.8.4 Mixing of decoded substreams . 47g34.8.5 Loudness correction 48g34.8.5.1 Introduction . 48g34.8.5.2 Dialnorm location . 48g34.8.5.3 Downmix loudness correction . 48g34.8.5.4 Alternative audio presentation loudness correction 49g34.8.5.5 R
17、eal-time loudness correction data . 49g34.8.6 Dynamic range control 49g34.8.7 QMF synthesis 49g34.8.8 Sample rate conversion . 50g35 Algorithmic details . 50g35.1 Bitstream processing 50g35.1.1 Introduction. 50g35.1.2 Elementary stream multiplexing tool 50g35.1.3 Efficient high frame rate mode . 52g
18、35.2 Stereo and multichannel processing (SMP) for immersive audio 54g35.2.1 Introduction. 54g35.2.2 Interface 55g35.2.2.1 Inputs. 55g35.2.2.2 Outputs 55g35.2.2.3 Controls. 55g35.2.3 Processing the immersive_channel_element 55g35.2.3.1 Introduction . 55g35.2.3.2 immersive_codec_mode g1488 SCPL, ASPX_
19、SCPL, ASPX_ACPL_1 56g35.2.3.3 immersive_codec_mode = ASPX_ACPL_2 . 57g35.2.3.4 immersive_codec_mode = ASPX_AJCC 57g35.2.4 Processing the 22_2_channel_element . 58g35.3 Simple coupling (S-CPL) . 58g35.3.1 Introduction. 58g35.3.2 Interface 58g35.3.2.1 Inputs. 58g35.3.2.2 Outputs 58g35.3.3 Reconstructi
20、on of the output channels 59g35.3.3.1 Full decoding. 59g35.3.3.2 Core decoding . 59g35.4 Advanced spectral extension (A-SPX) postprocessing tool . 60g35.4.1 Introduction. 60g35.4.2 Interface 60g35.4.2.1 Inputs. 60g35.4.2.2 Outputs 60g35.4.3 Processing. 60g35.5 Advanced coupling (A-CPL) for immersive
21、 audio 61g35.5.1 Introduction. 61g35.5.2 Processing the immersive_channel_element 61g35.6 Advanced joint channel coding (A-JCC) 63g35.6.1 Introduction. 63g35.6.2 Interface 63g35.6.2.1 Inputs. 63g35.6.2.2 Outputs 63g35.6.2.3 Controls. 63g35.6.3 Processing . 64g35.6.3.1 Parameter band to QMF subband m
22、apping . 64g35.6.3.2 Differential decoding and dequantization . 64g3ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)5 5.6.3.3 Interpolation 65g35.6.3.4 Decorrelator and transient ducker . 66g35.6.3.5 Reconstruction of the output channels 67g35.6.3.5.1 Input channels 67g35.6.3.5.2 A-JCC full decoding mode 67g
23、35.6.3.5.3 A-JCC core decoding mode . 71g35.7 Advanced joint object coding (A-JOC) 74g35.7.1 Introduction. 74g35.7.2 Interface 75g35.7.2.1 Inputs. 75g35.7.2.2 Outputs 75g35.7.2.3 Controls. 75g35.7.3 Processing . 75g35.7.3.1 Parameter band to QMF subband mapping . 75g35.7.3.2 Differential decoding 76
24、g35.7.3.3 Dequantization 77g35.7.3.4 Parameter time interpolation . 82g35.7.3.5 Decorrelator and transient ducker . 83g35.7.3.6 Signal reconstruction using matrices . 83g35.7.3.6.1 Processing 83g35.7.3.6.2 Decorrelation input matrix. 86g35.8 Dialogue enhancement for immersive audio 87g35.8.1 Introdu
25、ction. 87g35.8.2 Processing . 87g35.8.2.1 Dialogue enhancement for core decoding of A-JCC coded 9.X.4 content 87g35.8.2.2 Dialogue enhancement for core decoding of parametric A-CPL coded 9.X.4 content . 90g35.8.2.3 Dialogue enhancement for full decoding of A-JOC coded content . 91g35.8.2.4 Dialogue
26、enhancement for core decoding of A-JOC coded content . 92g35.8.2.5 Dialogue enhancement for non A-JOC coded object audio content 93g35.9 Object audio metadata timing . 93g35.9.1 Introduction. 93g35.9.2 Synchronization of object properties 93g35.10 Rendering . 94g35.10.1 Introduction. 94g35.10.2 Chan
27、nel audio renderer . 95g35.10.2.1 Introduction . 95g35.10.2.2 General rendering matrix 96g35.10.2.3 Panning of a stereo or mono signal . 96g35.10.2.4 Substream downmix or upmix for full decoding . 97g35.10.2.5 Matrix coefficients for channel-based renderer for full decoding . 98g35.10.2.6 Substream
28、downmix or upmix for core decoding . 101g35.10.2.7 Matrix coefficients for channel-based renderer for core decoding 101g35.10.3 Intermediate spatial format rendering . 102g35.10.3.1 Introduction . 102g35.10.3.2 Conventions 102g35.10.3.3 Interface 103g35.10.3.3.1 Inputs . 103g35.10.3.3.2 Outputs 103g
29、35.10.3.3.3 Controls . 103g35.10.3.4 Processing . 103g35.11 Accurate frame rate control 103g36 Bitstream syntax . 104g36.1 Introduction 104g36.2 Syntax specification . 107g36.2.1 AC-4 frame info 107g36.2.1.1 ac4_toc 107g36.2.1.2 ac4_presentation_info . 108g36.2.1.3 ac4_presentation_v1_info . 109g36.
30、2.1.4 frame_rate_fractions_info . 110g36.2.1.5 presentation_config_ext_info 111g36.2.1.6 ac4_substream_group_info . 111g3ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)6 6.2.1.7 ac4_sgi_specifier . 112g36.2.1.8 ac4_substream_info_chan . 112g36.2.1.9 ac4_substream_info_ajoc 113g36.2.1.10 bed_dyn_obj_assignme
31、nt 114g36.2.1.11 ac4_substream_info_obj . 114g36.2.1.12 ac4_presentation_substream_info . 115g36.2.1.13 oamd_substream_info . 115g36.2.1.14 ac4_hsf_ext_substream_info . 116g36.2.2 AC-4 substreams . 116g36.2.2.1 Introduction . 116g36.2.2.2 ac4_substream . 116g36.2.2.3 ac4_presentation_substream 117g3
32、6.2.2.4 oamd_substream 118g36.2.3 Audio data . 119g36.2.3.1 audio_data_chan 119g36.2.3.2 audio_data_objs 119g36.2.3.3 objs_to_channel_mode 120g36.2.3.4 audio_data_ajoc 120g36.2.3.5 ajoc_dmx_de_data. 121g36.2.3.6 ajoc_bed_info 121g36.2.4 Channel elements 121g36.2.4.1 immersive_channel_element . 121g3
33、6.2.4.2 immers_cfg 123g36.2.4.3 22_2_channel_element 123g36.2.4.4 var_channel_element. 124g36.2.5 Advanced joint object coding (A-JOC) 124g36.2.5.1 ajoc 124g36.2.5.2 ajoc_ctrl_info 125g36.2.5.3 ajoc_data . 125g36.2.5.4 ajoc_data_point_info . 126g36.2.5.5 ajoc_huff_data . 126g36.2.6 Advanced joint ch
34、annel coding (A-JCC) 127g36.2.6.1 ajcc_data . 127g36.2.6.2 ajcc_framing_data . 128g36.2.6.3 ajced 128g36.2.6.4 ajcc_huff_data . 128g36.2.7 Metadata . 129g36.2.7.1 metadata 129g36.2.7.2 basic_metadata 129g36.2.7.3 further_loudness_info . 130g36.2.7.4 extended_metadata 132g36.2.7.5 dialog_enhancement
35、133g36.2.7.6 de_data 134g36.2.8 Object audio metadata (OAMD) . 135g36.2.8.1 oamd_common_data . 135g36.2.8.2 oamd_timing_data. 135g36.2.8.3 oamd_dyndata_single 136g36.2.8.4 oamd_dyndata_multi. 137g36.2.8.5 object_info_block 137g36.2.8.6 object_basic_info 138g36.2.8.7 object_render_info 138g36.2.8.8 b
36、ed_render_info 140g36.2.8.9 trim 141g36.2.8.10 add_per_object_md . 141g36.2.8.11 ext_prec_pos . 142g36.2.8.12 ext_prec_alt_pos . 142g36.2.8.13 tool_tb_to_f_s_b . 142g36.2.8.14 tool_tb_to_f_s . 143g36.2.8.15 tool_tf_to_f_s_b 143g36.2.8.16 tool_tf_to_f_s 143g36.2.9 Presentation data . 143g36.2.9.1 lou
37、d_corr . 143g3ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)7 6.2.9.2 custom_dmx_data . 145g36.2.9.3 cdmx_parameters 146g36.2.9.4 tool_scr_to_c_l 147g36.2.9.5 tool_b4_to_b2 . 147g36.2.9.6 tool_t4_to_t2 . 147g36.2.9.7 tool_t4_to_f_s_b . 147g36.2.9.8 tool_t4_to_f_s . 148g36.2.9.9 tool_t2_to_f_s_b . 148g36.2.
38、9.10 tool_t2_to_f_s . 149g36.3 Description of bitstream elements 149g36.3.1 Introduction. 149g36.3.2 AC-4 frame information . 149g36.3.2.1 ac4_toc - AC-4 table of contents . 149g36.3.2.1.1 bitstream_version 149g36.3.2.1.2 br_code 149g36.3.2.1.3 b_iframe_global . 150g36.3.2.1.4 b_program_id 150g36.3.
39、2.1.5 short_program_id 150g36.3.2.1.6 b_program_uuid_present . 150g36.3.2.1.7 program_uuid 150g36.3.2.1.8 total_n_substream_groups . 150g36.3.2.2 ac4_presentation_v1_info - AC-4 audio presentation version 1 information 150g36.3.2.2.1 b_single_substream_group 150g36.3.2.2.2 presentation_config . 150g
40、36.3.2.2.3 mdcompat 151g36.3.2.2.4 b_presentation_id 151g36.3.2.2.4a presentation_id 151g36.3.2.2.5 b_presentation_filter 151g36.3.2.2.6 b_enable_presentation . 151g36.3.2.2.7 b_multi_pid . 152g36.3.2.2.8 n_substream_groups_minus2 152g36.3.2.3 presentation_version - presentation version information
41、152g36.3.2.3.1 b_tmp . 152g36.3.2.4 frame_rate_fractions_info - frame rate fraction information 152g36.3.2.4.1 b_frame_rate_fraction . 152g36.3.2.4.2 b_frame_rate_fraction_is_4 . 152g36.3.2.5 ac4_substream_group_info - AC-4 substream group information 152g36.3.2.5.1 b_substreams_present 152g36.3.2.5
42、.2 n_lf_substreams_minus2 . 152g36.3.2.5.3 b_channel_coded . 152g36.3.2.5.4 sus_ver . 152g36.3.2.5.5 b_oamd_substream 152g36.3.2.5.6 b_ajoc 153g36.3.2.6 ac4_sgi_specifier - AC-4 substream group information specifier . 153g36.3.2.6.1 group_index . 153g36.3.2.7 ac4_substream_info_chan - AC-4 substream
43、 information for channel based substreams 153g36.3.2.7.1 Introduction . 153g36.3.2.7.2 channel_mode 153g36.3.2.7.3 b_4_back_channels_present 153g36.3.2.7.4 b_centre_present 154g36.3.2.7.5 top_channels_present 154g36.3.2.7.6 b_audio_ndot . 154g36.3.2.8 ac4_substream_info_ajoc - object type informatio
44、n for A-JOC coded substreams 154g36.3.2.8.0 Introduction . 154g36.3.2.8.1 b_lfe . 155g36.3.2.8.2 b_static_dmx . 155g36.3.2.8.3 n_fullband_dmx_signals_minus1 155g36.3.2.8.4 b_oamd_common_data_present 155g36.3.2.8.5 n_fullband_upmix_signals_minus1 . 155g36.3.2.8.6 bed_dyn_obj_assignment - bed and dyna
45、mic object assignment 155g36.3.2.9 AC-4 substream information for object based substreams using A-JOC 155g36.3.2.10 ac4_substream_info_obj - object type information for direct-coded substreams 155g3ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)8 6.3.2.10.1 Introduction . 155g36.3.2.10.2 n_objects_code 156g
46、36.3.2.10.3 b_dynamic_objects and b_dyn_objects_only 156g36.3.2.10.4 b_lfe . 156g36.3.2.10.5 b_bed_objects 156g36.3.2.10.6 b_bed_start 156g36.3.2.10.7 b_isf_start 156g36.3.2.10.8 Interpreting object position properties . 156g36.3.2.10.9 res_bytes 158g36.3.2.10.10 reserved_data . 158g36.3.2.11 ac4_pr
47、esentation_substream_info - presentation substream information 159g36.3.2.11.1 b_alternative 159g36.3.2.11.2 b_pres_ndot . 159g36.3.2.12 oamd_substream_info - object audio metadata substream information 159g36.3.2.12.1 b_oamd_ndot . 159g36.3.3 AC-4 substreams . 159g36.3.3.1 ac4_presentation_substrea
48、m - AC-4 presentation substream 159g36.3.3.1.1 b_name_present . 159g36.3.3.1.2 b_length . 159g36.3.3.1.3 name_len . 159g36.3.3.1.4 presentation_name . 159g36.3.3.1.5 n_targets_minus1. 159g36.3.3.1.6 target_level 159g36.3.3.1.7 target_device_category . 160g36.3.3.1.8 tdc_extension . 160g36.3.3.1.9 b_
49、ducking_depth_present 160g36.3.3.1.10 max_ducking_depth 160g36.3.3.1.11 b_loud_corr_target 160g36.3.3.1.12 loud_corr_target 160g36.3.3.1.13 n_substreams_in_presentation . 160g36.3.3.1.14 b_active . 160g36.3.3.1.15 alt_data_set_index . 161g36.3.3.1.16 b_additional_data 161g36.3.3.1.17 add_data_bytes_minus1. 161g36.3.3.1.18 add_data. 161g36.3.3.1.19 drc_metadata_size_value . 161g36.3.3.1.20 b_more_bits . 161g36.3.3.1.21 drc_frame . 161g36.3.3.1.22 b_substream_group_gains_present 161g36.3.3.1.23 b_keep . 161g36.3.3.1.24 sg_gain. 162g36.3.3.1.25 b_associated . 162g36.3.3.1.26 b
copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1