Information technology - Coding of audio-visual objects - Part 3: Audio - AMENDMENT 2: ALS simple profile and transport of SAOC

Amendment 2:2012 (IDT) to National Standard of Canada CAN/CSA-ISO/IEC 14496-3-10 (ISO/IEC 14496-3:2009, IDT)
March 2012

Reference number: ISO/IEC 14496-3:2009/Amd.2:2010(E)
INTERNATIONAL STANDARD ISO/IEC 14496-3, Fourth edition 2009-09-01, AMENDMENT 2, 2010-10-15
© ISO/IEC 2010. All rights reserved.

Foreword

ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization. National bodies that are members of ISO or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the work. In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1.

International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.

The main task of the joint technical committee is to prepare International Standards. Draft International Standards adopted by the joint technical committee are circulated to national bodies for voting. Publication as an International Standard requires approval by at least 75 % of the national bodies casting a vote.

Attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights.

Amendment 2 to ISO/IEC 14496-3:2009 was prepared by Joint Technical Committee ISO/IEC JTC 1, Information technology, Subcommittee SC 29, Coding of audio, picture, multimedia and hypermedia information.

Changes in existing text and tables are highlighted by gray background.

In 1.2, Normative references, add:

ISO/IEC 23003-2, Information technology - MPEG audio technologies - Part 2: Spatial Audio Object Coding (SAOC)

In 1.3, Terms and definitions, alphabetically incorporate the following into the list and renumber the subsequent index-number-entries:

LD MPEG Surround: Low Delay MPEG Surround
SAOC: Spatial Audio Object Coding

In 1.5.1.1, Audio object type definition, amend Table 1.1 by incorporating the updates below:

Object Type ID   Audio Object Type   Remark
42               (reserved)
43               SAOC
44               LD MPEG Surround
45-95            (reserved)

After 1.5.1.2.37, add the following two new subclauses:

1.5.1.2.38 SAOC object type

The SAOC object type conveys Spatial Audio Object Coding side information (see ISO/IEC 23003-2) in the MPEG-4 Audio framework.

1.5.1.2.39 LD MPEG Surround object type

The LD MPEG Surround
object type conveys Low Delay MPEG Surround Coding side information (see ISO/IEC 23003-2) in the MPEG-4 Audio framework.

In 1.5.2.1 (Profiles), add:

14. The ALS Simple Profile contains the audio object type 36 (ALS).

In 1.5.2.1 (Profiles), Table 1.3 (Audio Profiles definition), add:

Object Type ID   Audio Object Type   ALS Simple Profile
36               ALS                 X
42               (reserved)
43               SAOC
44               LD MPEG Surround

In 1.5.2.3 (Levels within the profiles), add:

Levels for the ALS Simple Profile

Table AMD 2-1 - Level for the ALS Simple Profile

Level  Max. number  Max. sampling  Max. word     Max. number of     Max. prediction  Max. BS*  Max. MCC*
       of channels  rate (kHz)     length (bit)  samples per frame  order            stages    stages
1      2            48             16            4096               15               3         1

* BS: Block switching, MCC: Multi-channel coding

The BGMC tool and the RLS-LMS tool are not permitted. Floating-point audio data is not supported.

In 1.5.2.4 (audioProfileLevelIndication), insert the following new entries into Table 1.14 (audioProfileLevelIndication values) and adapt the “reserved for ISO use” range accordingly:

Value         Profile                 Level
0x3C          ALS Simple Profile      L1
0x3D          SAOC Baseline Profile   L1
0x3E          SAOC Baseline Profile   L2
0x3F          SAOC Baseline Profile   L3
0x40          SAOC Baseline Profile   L4
0x41          SAOC LD Profile         L1
0x42          SAOC LD Profile         L2
0x43          SAOC LD Profile         L3
0x44 - 0x7F   reserved for ISO use    -

In 1.6.2.1, extend Table 1.15 “AudioSpecificConfig()” as follows:

Table 1.15 - Syntax of AudioSpecificConfig()

Syntax                                                  No. of bits  Mnemonic
AudioSpecificConfig ()
{
    sbrPresentFlag = -1;
    psPresentFlag = -1;
    mpsPresentFlag = -1;
    saocPresentFlag = -1;
    ldmpsPresentFlag = -1;
    if ( audioObjectType == 5 || audioObjectType == 29 ) {
    ...
    case 40:
    case 41:
        SymbolicMusicSpecificConfig()
        break;
    case 43:
        saocPresentFlag = 1;
        saocPayloadEmbedding;                           1            uimsbf
        SaocSpecificConfig();
        break;
    case 44:
        ldmpsPresentFlag = 1;
        ldsacPayloadEmbedding;                          1            uimsbf
        LDSpatialSpecificConfig();
        break;
    default:
        /* reserved */
    ...
    extensionChannelConfiguration;                      4            uimsbf
    ...
    if ( extensionIdentifier == -1 ... ) {
        extensionIdentifier;                            11           bslbf
    }
    if ( extensionIdentifier == 0x76a ) {
        extensionIdentifier = -1;
        if ( audioObjectType != 30 ... ) {
            mpsPresentFlag;                             1            uimsbf
            if ( mpsPresentFlag == 1 ) {
                sacPayloadEmbedding = 1;
                sscLen;                                 8            uimsbf
                if ( sscLen == 0xff ) {
                    sscLenExt;                          16           uimsbf
                    sscLen += sscLenExt;
                }
                SpatialSpecificConfig();
            }
        }
    }
    if ( extensionIdentifier == -1 ... ) {
        extensionIdentifier;                            11           bslbf
    }
    if ( extensionIdentifier == 0x7cb ) {
        extensionIdentifier = -1;
        if ( audioObjectType != 43 ... ) {
            saocPresentFlag;                            1            uimsbf
            if ( saocPresentFlag == 1 ) {
                saocPayloadEmbedding = 1;
                saocscLen;                              8            uimsbf
                if ( saocscLen == 0xff ) {
                    saocscLenExt;                       16           uimsbf
                    saocscLen += saocscLenExt;
                }
                SaocSpecificConfig();
            }
        }
    }
    if ( extensionIdentifier == -1 ... ) {
        extensionIdentifier;                            11           bslbf
    }
    if ( extensionIdentifier == 0x7cc ) {
        extensionIdentifier = -1;
        if ( audioObjectType != 44 ... ) {
            ldmpsPresentFlag;                           1            uimsbf
            if ( ldmpsPresentFlag == 1 ) {
                ldsacPayloadEmbedding = 1;
                ldsscLen;                               8            uimsbf
                if ( ldsscLen == 0xff ) {
                    ldsscLenExt;                        16           uimsbf
                    ldsscLen += ldsscLenExt;
                }
                LDSpatialSpecificConfig();
            }
        }
    }
}

After 1.6.2.1.17, add 1.6.2.1.18 and 1.6.2.1.19 as follows:

1.6.2.1.18 SaocSpecificConfig

Defined in 6.1 of ISO/IEC 23003-2.
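As an informal illustration that is not part of the amendment text, the length escape shared by sscLen, saocscLen and ldsscLen in Table 1.15 can be sketched as follows: an 8-bit length is read first, and only when it equals 0xff is a 16-bit extension read and added to it. The names BitReader and read_escaped_length are hypothetical helpers introduced for this sketch; the surrounding presence flags and conditions of AudioSpecificConfig() are not modelled.

```python
class BitReader:
    """Hypothetical MSB-first bit reader over a bytes object (not from the standard)."""

    def __init__(self, data: bytes) -> None:
        self.data = data
        self.pos = 0  # current read position, counted in bits

    def read(self, nbits: int) -> int:
        """Return the next nbits as an unsigned integer, most significant bit first."""
        if self.pos + nbits > 8 * len(self.data):
            raise EOFError("not enough bits left in the buffer")
        value = 0
        for _ in range(nbits):
            byte = self.data[self.pos // 8]
            bit = (byte >> (7 - self.pos % 8)) & 1
            value = (value << 1) | bit
            self.pos += 1
        return value


def read_escaped_length(reader: BitReader) -> int:
    """Read an 8-bit length; if it equals 0xff, add a 16-bit extension to it.

    Mirrors the pattern 'xLen; if ( xLen == 0xff ) { xLenExt; xLen += xLenExt; }'
    used for sscLen, saocscLen and ldsscLen.
    """
    length = reader.read(8)
    if length == 0xFF:
        length += reader.read(16)
    return length


if __name__ == "__main__":
    # A short length needs only the 8-bit field.
    print(read_escaped_length(BitReader(bytes([0x05]))))
    # The escape value 0xff pulls in the 16-bit extension (255 + 16).
    print(read_escaped_length(BitReader(bytes([0xFF, 0x00, 0x10]))))
```

With this scheme, lengths up to 254 bytes cost a single byte, and the largest expressible length is 255 + 65535 bytes.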
1.6.2.1.19 LDSpatialSpecificConfig

Defined in B.2.1 of ISO/IEC 23003-2.

In 1.6.2.2.1, extend Table 1.17 “Audio Object Types” as follows:

Table 1.17 - Audio Object Types

Object Type ID   Audio Object Type   Definition of elementary stream    Mapping of audio payloads to access
                                     payloads and detailed syntax       units and elementary streams
0                NULL
...
41               SMR Main            ISO/IEC 14496-23
42               (reserved)
43               SAOC                ISO/IEC 23003-2
44               LD MPEG Surround    ISO/IEC 23003-2

After 1.6.3.20, add the following new subclauses 1.6.3.21 to 1.6.3.28:

1.6.3.21 saocPayloadEmbedding

The audio Object Type ID 43 SAOC is used to convey spatial audio object coding side information for SAOC decoding as defined in ISO/IEC 23003-2. Depending on this flag, the SAOC data payload, i.e., SAOCFrame(), is available by different means:

Table AMD 2-2 - saocPayloadEmbedding

saocPayloadEmbedding   Meaning
0                      One SAOCFrame() is mapped into one access unit. Subsequent access units form one elementary stream. That elementary stream will always depend on another elementary stream that contains the underlying (downmixed) audio data.
1                      The top level payload is multiplexed into the underlying (downmixed) audio data. The actual multiplexing details depend on the presentation of the audio data (i.e., usually on the AOT). Note that this leads to an elementary stream with no real payload. That elementary stream will always depend on another elementary stream that contains both the underlying (downmixed) audio data and the multiplexed spatial audio data.

1.6.3.22 saocPresentFlag

A one bit field indicating the presence or absence of SAOC data. The value -1 indicates that the saocPresentFlag was not conveyed in the AudioSpecificConfig().

1.6.3.23 saocscLen

A helper variable indicating the number of bytes of the subsequent SaocSpecificConfig() data function including possible fill bits.

1.6.3.24 saocscLenExt

A helper variable indicating the additional number of bytes of the subsequent SaocSpecificConfig() data function including possible fill bits.

1.6.3.25 ldsacPayloadEmbedding

The audio Object Type ID 44 LD MPEG Surround is used to convey low delay spatial audio coding side information for LD MPEG Surround decoding as defined in ISO/IEC 23003-2. Depending on this flag, the LD MPEG Surround data payload, i.e., LDSpatialFrame(), is available by different means:

Table AMD 2-3 - ldsacPayloadEmbedding

ldsacPayloadEmbedding  Meaning
0                      One LDSpatialFrame() is mapped into one access unit. Subsequent access units form one elementary stream. That elementary stream will always depend on another elementary stream that contains the underlying (downmixed) audio data.
1                      The top level payload is multiplexed into the underlying (downmixed) audio data. The actual multiplexing details depend on the presentation of the audio data (i.e., usually on the AOT). Note that this leads to an elementary stream with no real payload. That elementary stream will always depend on another elementary stream that contains both the underlying (downmixed) audio data and the multiplexed spatial audio data.

1.6.3.26 ldmpsPresentFlag

A one bit field indicating the presence or absence of LD MPEG Surround data. The value -1 indicates that the ldmpsPresentFlag was not conveyed in the AudioSpecificConfig().

1.6.3.27 ldsscLen

A helper variable indicating the number of bytes of the subsequent LDSpatialSpecificConfig() data function including possible fill bits.

1.6.3.28 ldsscLenExt

A helper variable indicating the additional number of bytes of the subsequent LDSpatialSpecificConfig() data function including possible fill bits.

In 4.4.2.7, extend Table 4.57 “Syntax of extension_payload()” as follows:

Table 4.57 - Syntax of extension_payload()

Syntax                                                      No. of bits  Mnemonic
extension_payload(cnt)
{
    extension_type;                                         4            uimsbf
    align = 4;
    switch( extension_type ) {
        case EXT_DYNAMIC_RANGE:
            return dynamic_range_info();
        case EXT_SAC_DATA:
            return sac_extension_data(cnt);
        case EXT_SAOC_DATA:
            return saoc_extension_data(cnt);
        case EXT_LDSAC_DATA:
            return ldsac_extension_data(cnt);
        case EXT_SBR_DATA:
            return sbr_extension_data(id_aac, 0);                                     Note 1
        case EXT_SBR_DATA_CRC:
            return sbr_extension_data(id_aac, 1);                                     Note 1
        case EXT_DATA_LENGTH:
            hlp = 1;
            len;                                            4            uimsbf
            if (len == 15) {
                len += add_len;                             8            uimsbf
                hlp += 1;
                if (add_len == 255) {
                    len += add_add_len;                     16           uimsbf
                    hlp += 2;
                }
            }
            return hlp+extension_payload(len);                                        Note 2
        case EXT_FILL_DATA:
            fill_nibble; /* must be '0000' */               4            uimsbf
            for (i=0; i<cnt-1; i++) {
                fill_byte[i]; /* must be '10100101' */      8            uimsbf
            }
            return cnt;
        case EXT_DATA_ELEMENT:
            data_element_version;                           4            uimsbf
            switch( data_element_version ) {
                case ANC_DATA:
                    loopCounter = 0;
                    dataElementLength = 0;
                    do {
                        dataElementLengthPart;              8            uimsbf
                        dataElementLength += dataElementLengthPart;
                        loopCounter++;
                    } while (dataElementLengthPart == 255);
                    for (i=0; i<dataElementLength; i++) {
                        data_element_byte[i];               8            uimsbf
                    }
                    return (dataElementLength+loopCounter+1);
                default:
                    align = 0;
            }
        case EXT_FIL:
        default:
            for (i=0; i<8*(cnt-1)+align; i++) {
                other_bits[i];                              1            uimsbf
            }
            return cnt;
    }
}

Note 1: id_aac is the id_syn_ele of the corresponding AAC element (ID_SCE or ID_CPE) or ID_SCE in case of CCE.
Note 2: The extension_payload() included here must not have extension_type == EXT_DATA_LENGTH.

In 4.4.2.7, after Ta