ETSI TS 103 190-2-2018 Digital Audio Compression (AC-4) Standard Part 2 Immersive and personalized audio (V1 2 1 Includes Diskette).pdf

上传人:eventdump275 文档编号:739886 上传时间:2019-01-11 格式:PDF 页数:250 大小:2.70MB
下载 相关 举报
ETSI TS 103 190-2-2018 Digital Audio Compression (AC-4) Standard Part 2 Immersive and personalized audio (V1 2 1 Includes Diskette).pdf_第1页
第1页 / 共250页
ETSI TS 103 190-2-2018 Digital Audio Compression (AC-4) Standard Part 2 Immersive and personalized audio (V1 2 1 Includes Diskette).pdf_第2页
第2页 / 共250页
ETSI TS 103 190-2-2018 Digital Audio Compression (AC-4) Standard Part 2 Immersive and personalized audio (V1 2 1 Includes Diskette).pdf_第3页
第3页 / 共250页
ETSI TS 103 190-2-2018 Digital Audio Compression (AC-4) Standard Part 2 Immersive and personalized audio (V1 2 1 Includes Diskette).pdf_第4页
第4页 / 共250页
ETSI TS 103 190-2-2018 Digital Audio Compression (AC-4) Standard Part 2 Immersive and personalized audio (V1 2 1 Includes Diskette).pdf_第5页
第5页 / 共250页
点击查看更多>>
资源描述

1、 ETSI TS 103 190-2 V1.2.1 (2018-02) Digital Audio Compression (AC-4) Standard; Part 2: Immersive and personalized audio floppy3TECHNICAL SPECIFICATION ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)2 Reference RTS/JTC-043-2 Keywords audio, broadcasting, codec, content, digital, distribution, object audio, p

2、ersonalization ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret N 348 623 562 00017 - NAF 742 C Association but non lucratif enregistre la Sous-Prfecture de Grasse (06) N 7803/88 Important notice The present document can be down

3、loaded from: http:/www.etsi.org/standards-search The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existi

4、ng or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat. Users of the present document should be aware that the document may be subje

5、ct to revision or change of status. Information on the current status of this and other ETSI documents is available at https:/portal.etsi.org/TB/ETSIDeliverableStatus.aspx If you find errors in the present document, please send your comment to one of the following services: https:/portal.etsi.org/Pe

6、ople/CommiteeSupportStaff.aspx Copyright Notification No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the

7、 written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media. ETSI 2018. All rights reserved. DECTTM, PLUGTESTSTM, UMTSTMand the ETSI logo are trademarks of ETSI registered for the benefit of its Members. 3GPPTM and LTE are trademarks of ETSI regist

8、ered for the benefit of its Members and of the 3GPP Organizational Partners. oneM2M logo is protected for the benefit of its Members. GSM and the GSM logo are trademarks registered and owned by the GSM Association. ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)3 Contents Intellectual Property Rights 17g3Fo

9、reword . 17g3Modal verbs terminology 17g3Introduction 18g3Motivation . 18g3Structure of the present document . 18g31 Scope 19g32 References 19g32.1 Normative references . 19g32.2 Informative references 19g33 Definitions, symbols, abbreviations and conventions 20g33.1 Definitions 20g33.2 Symbols 25g3

10、3.3 Abbreviations . 25g33.4 Conventions 26g34 Decoding the AC-4 bitstream . 28g34.1 Introduction 28g34.2 Channels and objects 28g34.3 Immersive audio . 29g34.4 Personalized Audio. 29g34.5 AC-4 bitstream . 30g34.5.1 Bitstream structure 30g34.5.2 Data dependencies 32g34.5.3 Frame rates 33g34.6 Decoder

11、 compatibilities 33g34.7 Decoding modes . 34g34.7.1 Introduction. 34g34.7.2 Full decoding mode 34g34.7.3 Core decoding mode . 34g34.8 Decoding process . 35g34.8.1 Overview 35g34.8.2 Selecting an audio presentation 36g34.8.3 Decoding of substreams 36g34.8.3.1 Introduction . 36g34.8.3.2 Identification

12、 of substream type 37g34.8.3.3 Substream decoding overview 39g34.8.3.4 Decoding of object properties . 40g34.8.3.4.1 Introduction . 40g34.8.3.4.2 Object audio metadata location . 40g34.8.3.5 Spectral frontends . 41g34.8.3.6 Stereo and multichannel processing (SMP) 42g34.8.3.7 Inverse modified discre

13、te cosine transformation (IMDCT) 42g34.8.3.8 Simple coupling (S-CPL) 42g34.8.3.9 QMF analysis 42g34.8.3.10 Companding 42g34.8.3.10.1 Introduction . 42g34.8.3.10.2 Channel audio substream . 42g34.8.3.10.3 Channel audio substream with an immersive channel element . 42g34.8.3.10.4 Object audio substrea

14、m . 43g34.8.3.11 A-SPX . 43g34.8.3.11.1 Introduction . 43g34.8.3.11.2 Core decoding mode with ASPX_SCPL codec mode . 43g34.8.3.11.3 Full decoding mode with ASPX_SCPL codec mode 44g3ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)4 4.8.3.12 Advanced joint channel coding (A-JCC) 44g34.8.3.13 Advanced joint obj

15、ect coding (A-JOC) . 44g34.8.3.14 Advanced coupling (A-CPL) 45g34.8.3.15 Dialogue enhancement 45g34.8.3.16 Direct dynamic range control bitstream gain application 46g34.8.3.17 Substream gain application for operation with associated audio. 46g34.8.3.18 Substream gain application for operation with d

16、ialogue substreams 47g34.8.3.19 Substream rendering 47g34.8.4 Mixing of decoded substreams . 47g34.8.5 Loudness correction 48g34.8.5.1 Introduction . 48g34.8.5.2 Dialnorm location . 48g34.8.5.3 Downmix loudness correction . 48g34.8.5.4 Alternative audio presentation loudness correction 49g34.8.5.5 R

17、eal-time loudness correction data . 49g34.8.6 Dynamic range control 49g34.8.7 QMF synthesis 49g34.8.8 Sample rate conversion . 50g35 Algorithmic details . 50g35.1 Bitstream processing 50g35.1.1 Introduction. 50g35.1.2 Elementary stream multiplexing tool 50g35.1.3 Efficient high frame rate mode . 52g

18、35.2 Stereo and multichannel processing (SMP) for immersive audio 54g35.2.1 Introduction. 54g35.2.2 Interface 55g35.2.2.1 Inputs. 55g35.2.2.2 Outputs 55g35.2.2.3 Controls. 55g35.2.3 Processing the immersive_channel_element 55g35.2.3.1 Introduction . 55g35.2.3.2 immersive_codec_mode g1488 SCPL, ASPX_

19、SCPL, ASPX_ACPL_1 56g35.2.3.3 immersive_codec_mode = ASPX_ACPL_2 . 57g35.2.3.4 immersive_codec_mode = ASPX_AJCC 57g35.2.4 Processing the 22_2_channel_element . 58g35.3 Simple coupling (S-CPL) . 58g35.3.1 Introduction. 58g35.3.2 Interface 58g35.3.2.1 Inputs. 58g35.3.2.2 Outputs 58g35.3.3 Reconstructi

20、on of the output channels 59g35.3.3.1 Full decoding. 59g35.3.3.2 Core decoding . 59g35.4 Advanced spectral extension (A-SPX) postprocessing tool . 60g35.4.1 Introduction. 60g35.4.2 Interface 60g35.4.2.1 Inputs. 60g35.4.2.2 Outputs 60g35.4.3 Processing. 60g35.5 Advanced coupling (A-CPL) for immersive

21、 audio 61g35.5.1 Introduction. 61g35.5.2 Processing the immersive_channel_element 61g35.6 Advanced joint channel coding (A-JCC) 63g35.6.1 Introduction. 63g35.6.2 Interface 63g35.6.2.1 Inputs. 63g35.6.2.2 Outputs 63g35.6.2.3 Controls. 63g35.6.3 Processing . 64g35.6.3.1 Parameter band to QMF subband m

22、apping . 64g35.6.3.2 Differential decoding and dequantization . 64g3ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)5 5.6.3.3 Interpolation 65g35.6.3.4 Decorrelator and transient ducker . 66g35.6.3.5 Reconstruction of the output channels 67g35.6.3.5.1 Input channels 67g35.6.3.5.2 A-JCC full decoding mode 67g

23、35.6.3.5.3 A-JCC core decoding mode . 71g35.7 Advanced joint object coding (A-JOC) 74g35.7.1 Introduction. 74g35.7.2 Interface 75g35.7.2.1 Inputs. 75g35.7.2.2 Outputs 75g35.7.2.3 Controls. 75g35.7.3 Processing . 75g35.7.3.1 Parameter band to QMF subband mapping . 75g35.7.3.2 Differential decoding 76

24、g35.7.3.3 Dequantization 77g35.7.3.4 Parameter time interpolation . 82g35.7.3.5 Decorrelator and transient ducker . 83g35.7.3.6 Signal reconstruction using matrices . 83g35.7.3.6.1 Processing 83g35.7.3.6.2 Decorrelation input matrix. 86g35.8 Dialogue enhancement for immersive audio 87g35.8.1 Introdu

25、ction. 87g35.8.2 Processing . 87g35.8.2.1 Dialogue enhancement for core decoding of A-JCC coded 9.X.4 content 87g35.8.2.2 Dialogue enhancement for core decoding of parametric A-CPL coded 9.X.4 content . 90g35.8.2.3 Dialogue enhancement for full decoding of A-JOC coded content . 91g35.8.2.4 Dialogue

26、enhancement for core decoding of A-JOC coded content . 92g35.8.2.5 Dialogue enhancement for non A-JOC coded object audio content 93g35.9 Object audio metadata timing . 93g35.9.1 Introduction. 93g35.9.2 Synchronization of object properties 93g35.10 Rendering . 94g35.10.1 Introduction. 94g35.10.2 Chan

27、nel audio renderer . 95g35.10.2.1 Introduction . 95g35.10.2.2 General rendering matrix 96g35.10.2.3 Panning of a stereo or mono signal . 96g35.10.2.4 Substream downmix or upmix for full decoding . 97g35.10.2.5 Matrix coefficients for channel-based renderer for full decoding . 98g35.10.2.6 Substream

28、downmix or upmix for core decoding . 101g35.10.2.7 Matrix coefficients for channel-based renderer for core decoding 101g35.10.3 Intermediate spatial format rendering . 102g35.10.3.1 Introduction . 102g35.10.3.2 Conventions 102g35.10.3.3 Interface 103g35.10.3.3.1 Inputs . 103g35.10.3.3.2 Outputs 103g

29、35.10.3.3.3 Controls . 103g35.10.3.4 Processing . 103g35.11 Accurate frame rate control 103g36 Bitstream syntax . 104g36.1 Introduction 104g36.2 Syntax specification . 107g36.2.1 AC-4 frame info 107g36.2.1.1 ac4_toc 107g36.2.1.2 ac4_presentation_info . 108g36.2.1.3 ac4_presentation_v1_info . 109g36.

30、2.1.4 frame_rate_fractions_info . 110g36.2.1.5 presentation_config_ext_info 111g36.2.1.6 ac4_substream_group_info . 111g3ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)6 6.2.1.7 ac4_sgi_specifier . 112g36.2.1.8 ac4_substream_info_chan . 112g36.2.1.9 ac4_substream_info_ajoc 113g36.2.1.10 bed_dyn_obj_assignme

31、nt 114g36.2.1.11 ac4_substream_info_obj . 114g36.2.1.12 ac4_presentation_substream_info . 115g36.2.1.13 oamd_substream_info . 115g36.2.1.14 ac4_hsf_ext_substream_info . 116g36.2.2 AC-4 substreams . 116g36.2.2.1 Introduction . 116g36.2.2.2 ac4_substream . 116g36.2.2.3 ac4_presentation_substream 117g3

32、6.2.2.4 oamd_substream 118g36.2.3 Audio data . 119g36.2.3.1 audio_data_chan 119g36.2.3.2 audio_data_objs 119g36.2.3.3 objs_to_channel_mode 120g36.2.3.4 audio_data_ajoc 120g36.2.3.5 ajoc_dmx_de_data. 121g36.2.3.6 ajoc_bed_info 121g36.2.4 Channel elements 121g36.2.4.1 immersive_channel_element . 121g3

33、6.2.4.2 immers_cfg 123g36.2.4.3 22_2_channel_element 123g36.2.4.4 var_channel_element. 124g36.2.5 Advanced joint object coding (A-JOC) 124g36.2.5.1 ajoc 124g36.2.5.2 ajoc_ctrl_info 125g36.2.5.3 ajoc_data . 125g36.2.5.4 ajoc_data_point_info . 126g36.2.5.5 ajoc_huff_data . 126g36.2.6 Advanced joint ch

34、annel coding (A-JCC) 127g36.2.6.1 ajcc_data . 127g36.2.6.2 ajcc_framing_data . 128g36.2.6.3 ajced 128g36.2.6.4 ajcc_huff_data . 128g36.2.7 Metadata . 129g36.2.7.1 metadata 129g36.2.7.2 basic_metadata 129g36.2.7.3 further_loudness_info . 130g36.2.7.4 extended_metadata 132g36.2.7.5 dialog_enhancement

35、133g36.2.7.6 de_data 134g36.2.8 Object audio metadata (OAMD) . 135g36.2.8.1 oamd_common_data . 135g36.2.8.2 oamd_timing_data. 135g36.2.8.3 oamd_dyndata_single 136g36.2.8.4 oamd_dyndata_multi. 137g36.2.8.5 object_info_block 137g36.2.8.6 object_basic_info 138g36.2.8.7 object_render_info 138g36.2.8.8 b

36、ed_render_info 140g36.2.8.9 trim 141g36.2.8.10 add_per_object_md . 141g36.2.8.11 ext_prec_pos . 142g36.2.8.12 ext_prec_alt_pos . 142g36.2.8.13 tool_tb_to_f_s_b . 142g36.2.8.14 tool_tb_to_f_s . 143g36.2.8.15 tool_tf_to_f_s_b 143g36.2.8.16 tool_tf_to_f_s 143g36.2.9 Presentation data . 143g36.2.9.1 lou

37、d_corr . 143g3ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)7 6.2.9.2 custom_dmx_data . 145g36.2.9.3 cdmx_parameters 146g36.2.9.4 tool_scr_to_c_l 147g36.2.9.5 tool_b4_to_b2 . 147g36.2.9.6 tool_t4_to_t2 . 147g36.2.9.7 tool_t4_to_f_s_b . 147g36.2.9.8 tool_t4_to_f_s . 148g36.2.9.9 tool_t2_to_f_s_b . 148g36.2.

38、9.10 tool_t2_to_f_s . 149g36.3 Description of bitstream elements 149g36.3.1 Introduction. 149g36.3.2 AC-4 frame information . 149g36.3.2.1 ac4_toc - AC-4 table of contents . 149g36.3.2.1.1 bitstream_version 149g36.3.2.1.2 br_code 149g36.3.2.1.3 b_iframe_global . 150g36.3.2.1.4 b_program_id 150g36.3.

39、2.1.5 short_program_id 150g36.3.2.1.6 b_program_uuid_present . 150g36.3.2.1.7 program_uuid 150g36.3.2.1.8 total_n_substream_groups . 150g36.3.2.2 ac4_presentation_v1_info - AC-4 audio presentation version 1 information 150g36.3.2.2.1 b_single_substream_group 150g36.3.2.2.2 presentation_config . 150g

40、36.3.2.2.3 mdcompat 151g36.3.2.2.4 b_presentation_id 151g36.3.2.2.4a presentation_id 151g36.3.2.2.5 b_presentation_filter 151g36.3.2.2.6 b_enable_presentation . 151g36.3.2.2.7 b_multi_pid . 152g36.3.2.2.8 n_substream_groups_minus2 152g36.3.2.3 presentation_version - presentation version information

41、152g36.3.2.3.1 b_tmp . 152g36.3.2.4 frame_rate_fractions_info - frame rate fraction information 152g36.3.2.4.1 b_frame_rate_fraction . 152g36.3.2.4.2 b_frame_rate_fraction_is_4 . 152g36.3.2.5 ac4_substream_group_info - AC-4 substream group information 152g36.3.2.5.1 b_substreams_present 152g36.3.2.5

42、.2 n_lf_substreams_minus2 . 152g36.3.2.5.3 b_channel_coded . 152g36.3.2.5.4 sus_ver . 152g36.3.2.5.5 b_oamd_substream 152g36.3.2.5.6 b_ajoc 153g36.3.2.6 ac4_sgi_specifier - AC-4 substream group information specifier . 153g36.3.2.6.1 group_index . 153g36.3.2.7 ac4_substream_info_chan - AC-4 substream

43、 information for channel based substreams 153g36.3.2.7.1 Introduction . 153g36.3.2.7.2 channel_mode 153g36.3.2.7.3 b_4_back_channels_present 153g36.3.2.7.4 b_centre_present 154g36.3.2.7.5 top_channels_present 154g36.3.2.7.6 b_audio_ndot . 154g36.3.2.8 ac4_substream_info_ajoc - object type informatio

44、n for A-JOC coded substreams 154g36.3.2.8.0 Introduction . 154g36.3.2.8.1 b_lfe . 155g36.3.2.8.2 b_static_dmx . 155g36.3.2.8.3 n_fullband_dmx_signals_minus1 155g36.3.2.8.4 b_oamd_common_data_present 155g36.3.2.8.5 n_fullband_upmix_signals_minus1 . 155g36.3.2.8.6 bed_dyn_obj_assignment - bed and dyna

45、mic object assignment 155g36.3.2.9 AC-4 substream information for object based substreams using A-JOC 155g36.3.2.10 ac4_substream_info_obj - object type information for direct-coded substreams 155g3ETSI ETSI TS 103 190-2 V1.2.1 (2018-02)8 6.3.2.10.1 Introduction . 155g36.3.2.10.2 n_objects_code 156g

46、36.3.2.10.3 b_dynamic_objects and b_dyn_objects_only 156g36.3.2.10.4 b_lfe . 156g36.3.2.10.5 b_bed_objects 156g36.3.2.10.6 b_bed_start 156g36.3.2.10.7 b_isf_start 156g36.3.2.10.8 Interpreting object position properties . 156g36.3.2.10.9 res_bytes 158g36.3.2.10.10 reserved_data . 158g36.3.2.11 ac4_pr

47、esentation_substream_info - presentation substream information 159g36.3.2.11.1 b_alternative 159g36.3.2.11.2 b_pres_ndot . 159g36.3.2.12 oamd_substream_info - object audio metadata substream information 159g36.3.2.12.1 b_oamd_ndot . 159g36.3.3 AC-4 substreams . 159g36.3.3.1 ac4_presentation_substrea

48、m - AC-4 presentation substream 159g36.3.3.1.1 b_name_present . 159g36.3.3.1.2 b_length . 159g36.3.3.1.3 name_len . 159g36.3.3.1.4 presentation_name . 159g36.3.3.1.5 n_targets_minus1. 159g36.3.3.1.6 target_level 159g36.3.3.1.7 target_device_category . 160g36.3.3.1.8 tdc_extension . 160g36.3.3.1.9 b_

49、ducking_depth_present 160g36.3.3.1.10 max_ducking_depth 160g36.3.3.1.11 b_loud_corr_target 160g36.3.3.1.12 loud_corr_target 160g36.3.3.1.13 n_substreams_in_presentation . 160g36.3.3.1.14 b_active . 160g36.3.3.1.15 alt_data_set_index . 161g36.3.3.1.16 b_additional_data 161g36.3.3.1.17 add_data_bytes_minus1. 161g36.3.3.1.18 add_data. 161g36.3.3.1.19 drc_metadata_size_value . 161g36.3.3.1.20 b_more_bits . 161g36.3.3.1.21 drc_frame . 161g36.3.3.1.22 b_substream_group_gains_present 161g36.3.3.1.23 b_keep . 161g36.3.3.1.24 sg_gain. 162g36.3.3.1.25 b_associated . 162g36.3.3.1.26 b

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 标准规范 > 国际标准 > 其他

copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1