ETSI TS 126 404 V14.0.0 (2017-04)

Digital cellular telecommunications system (Phase 2+) (GSM); Universal Mobile Telecommunications System (UMTS); LTE; General audio codec audio processing functions; Enhanced aacPlus general audio codec; Enhanced aacPlus encoder Spectral Band Replication (SBR) part
(3GPP TS 26.404 version 14.0.0 Release 14)

TECHNICAL SPECIFICATION

Reference: RTS/TSGS-0426404ve00
Keywords: GSM, LTE, UMTS

ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE
Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16
Siret No. 348 623 562 00017 - NAF 742 C
Non-profit association registered at the Sous-Préfecture de Grasse (06) No. 7803/88

Important notice

The present document can be downloaded from: http://www.etsi.org/standards-search

The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.

Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available
at https://portal.etsi.org/TB/ETSIDeliverableStatus.aspx

If you find errors in the present document, please send your comment to one of the following services: https://portal.etsi.org/People/CommiteeSupportStaff.aspx

Copyright Notification

No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media.

© European Telecommunications Standards Institute 2017. All rights reserved.

DECT™, PLUGTESTS™, UMTS™ and the ETSI logo are Trade Marks of ETSI registered for the benefit of its Members. 3GPP™ and LTE are Trade Marks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. GSM and the GSM logo are Trade Marks registered and owned by the GSM Association.

Intellectual Property Rights

IPRs essential or potentially essential to the present document may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards”, which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (https://ipr.etsi.org/).

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become, essential to the present document.

Foreword

This Technical Specification (TS) has been produced by ETSI 3rd Generation Partnership Project (3GPP).

The present document may refer to technical specifications or reports using their
3GPP identities, UMTS identities or GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables.

The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under http://webapp.etsi.org/key/queryform.asp.

Modal verbs terminology

In the present document “shall”, “shall not”, “should”, “should not”, “may”, “need not”, “will”, “will not”, “can” and “cannot” are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions).

“must” and “must not” are NOT allowed in ETSI deliverables
except when used in direct citation.

Contents

Intellectual Property Rights
Foreword
Modal verbs terminology
Foreword
1 Scope
2 Normative references
3 Definitions, symbols and abbreviations
  3.1 Definitions
  3.2 Symbols
  3.3 Abbreviations
4 Outline description
5 SBR encoder description
  5.1 SBR tools overview
    5.1.1 Enhanced aacPlus synchronisation without parametric stereo
    5.1.2 Enhanced aacPlus synchronisation with parametric stereo
    5.1.3 SBR encoder modules overview
  5.2 Analysis filterbank
  5.3 Frequency band tables
  5.4 Time / frequency grid generation
    5.4.1 Transient detector
    5.4.2 Frame splitter
    5.4.3 Frame generator
  5.5 Envelope estimation
  5.6 Additional control parameters
    5.6.1 Introduction
    5.6.2 Tonality estimation
    5.6.3 Noise-floor estimation
    5.6.4 Inverse filtering estimation
    5.6.5 Additional sines estimation
  5.7 Data quantization
  5.8 Envelope and noise floor coding
6 Bitstream
Annex A (informative): Change history
History

Foreword

The present document describes the detailed mapping of the general audio service employing the aacPlus general audio codec within the 3GPP system.

The contents of the present document are subject to continuing work within the TSG and may
change following formal TSG approval. Should the TSG modify the contents of this TS, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows:

Version x.y.z

where:
x the first digit:
  1 presented to TSG for information;
  2 presented to TSG for approval;
  3 indicates TSG approved document under change control.
y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc.
z the third digit is incremented when editorial only changes have been incorporated in the specification.

1 Scope

This Telecommunication Standard (TS) describes the SBR encoder part of the Enhanced aacPlus general audio codec [3].

2 Normative references

This TS incorporates by dated and undated reference, provisions from other publications. These normative references are cited in the appropriate places in the text and the publications are listed hereafter. For dated references, subsequent amendments to or revisions of any of these publications apply to this TS only when incorporated in it by amendment or revision. For undated references, the latest edition of the publication referred to applies.

[1] ISO/IEC 14496-3:2001/Amd.1:2003, Bandwidth Extension.
[2] ISO/IEC 14496-3:2001/Amd.1:2003/DCOR1.
[3] 3GPP TS 26.401: Enhanced aacPlus general audio codec; General Description.

3 Definitions, symbols and abbreviations

3.1 Definitions

For the purposes of this TS, the following definitions apply:

band: (as in limiter band, noise floor band, etc.) a group of consecutive QMF subbands
chirp factor: the bandwidth expansion factor of the formants described by a LPC polynomial
Down Sampled SBR: the SBR Tool with a modified synthesis filterbank resulting in a down sampled output signal with the same sample rate as the input signal to the SBR Tool. May be used whenever a lower sample rate output is desired.
envelope scalefactor: an element representing the averaged energy of a signal over a region described by a frequency band and a time segment
frequency band: interval in frequency, group of consecutive QMF subbands
frequency border: frequency band delimiter, expressed as a specific QMF subband
noise floor: a vector of noise floor scalefactors
noise floor scalefactor: an element associated with a region described by a frequency band and a time segment, representing the ratio between the energy of the noise to be added to the envelope adjusted HF generated signal and the energy of the same
patch: a number of adjoining QMF subbands moved to a different frequency location
SBR envelope: a vector of envelope scalefactors
SBR frame: time segment associated with one SBR extension data element
SBR range: the frequency range of the signal generated by the SBR algorithm
subband: a frequency range represented by one row in a QMF matrix, carrying a subsampled signal
time border: time segment delimiter, expressed as a specific time slot
time segment: interval in time, group of consecutive time slots
time / frequency grid: a description of SBR envelope time segments and associated frequency resolution tables as well as description of noise floor time segments
time slot: finest resolution in time for SBR envelopes and noise floors. One time slot equals two subsamples in the QMF domain

3.2 Symbols

For the purposes of this TS, the following symbols apply:

Description of variables defined in one subclause and used in other subclauses.
ch: the current channel; when used as an index in vectors, the left channel is represented by ch = 0 and the right channel by ch = 1.
E_Orig: has L_E columns, where each column is of length N_Low or N_High depending on the frequency resolution for each SBR envelope. The elements in E_Orig contain the envelope scalefactors of the original signal.
F = [f_TableLow, f_TableHigh]: has two column vectors containing the frequency border tables for low and high frequency resolution.
Fs_SBR: internal sampling frequency of the SBR Tool, twice the sampling frequency of the core coder (after sampling frequency mapping, Table 4.55). The sampling frequency of the SBR enhanced output signal is equal to the internal sampling frequency of the SBR Tool, unless the SBR Tool is operated in downsampled mode. If the SBR Tool is operated in downsampled mode, the output sampling frequency is equal to the sampling frequency of the core coder.
f_Master: is of length N_Master+1 and contains QMF master frequency grouping information.
f_TableHigh: is of length N_High+1 and contains frequency borders for high frequency resolution SBR envelopes.
f_TableLow: is of length N_Low+1 and contains frequency borders for low frequency resolution SBR envelopes.
f_TableNoise: is of length N_Q+1 and contains frequency borders used by noise floors.
k_x: the first QMF subband in the SBR range.
k_0: the first QMF subband in the f_Master table.
L_E: number of SBR envelopes.
L_Q: number of noise floors.
M: number of QMF subbands in the SBR range.
middleBorder: points to a specific time border.
N_L: number of limiter bands.
N_Master: number of frequency bands in the master frequency resolution table.
N_Q: number of noise floor bands.
n = [N_Low, N_High]: number of frequency bands for low and high frequency resolution.
numPatches: a variable indicating the number of patches in the SBR range.
numTimeSlots: number of SBR envelope time slots that exist within an AAC frame, 16 for a 1024 AAC frame.
panOffset = [24, 12]: offset-values for the SBR envelope and noise floor data, when using coupled channels.
patchBorders: a vector containing the frequency borders of the patches.
patchNumSubbands: a vector holding the number of subbands in every patch.
Q_Orig: has L_Q columns, where each column is of length N_Q and contains the noise floor scalefactors.
r = [r_0, ..., r_{L-1}]: frequency resolution for all SBR envelopes in the current SBR frame, zero for low resolution, one for high resolution.
reset: a variable in the encoder and the decoder set to one if certain bitstream elements have changed from the previous SBR frame, otherwise set to zero.
t_E: is of length L_E+1 and contains start and stop time borders for all SBR envelopes in the current SBR frame.
t_HFAdj: offset for the envelope adjuster module.
t_HFGen: offset for the HF-generation module.
t_Q: is of length L_Q+1 and contains start and stop time borders for all noise floors in the current SBR frame.

3.3 Abbreviations

For the purposes of this TS, the following abbreviations apply.

NA: Not Applicable
aacPlus: Combination of MPEG-4 AAC and MPEG-4 Bandwidth extension (SBR)
Enhanced aacPlus: Combination of MPEG-4 AAC, MPEG-4 Bandwidth extension (SBR) and MPEG-4 Parametric Stereo
QMF: Quadrature Mirror Filter
SBR: Spectral Band Replication

4 Outline description

This TS is structured as follows:

Section 5.1 gives an encoder overview description.
Section 5.2 gives a detailed description of the filterbanks used in the encoder.
Section 5.3 gives a specification of the used frequency band tables.
Section 5.4 gives a detailed description of the time grid calculation and the transient detection.
Section 5.5 gives a detailed description of the envelope estimation.
Section 5.6 gives a detailed description of the estimation of the additional control parameters.
Section 5.7 gives a detailed description of the data quantisation.
Section 5.8 gives a detailed description of the envelope coding.
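The relationships among the clause 3 quantities (core coder frame length, internal SBR sampling frequency, QMF subsamples and time slots) can be checked with a short sketch. It is illustrative only: the function name and dictionary keys are hypothetical, and the 64-band QMF size is an assumption drawn from the usual SBR analysis filterbank, since it is not stated in the clauses above.

```python
# Illustrative sketch only; names are hypothetical, not taken from the TS.
QMF_BANDS = 64           # ASSUMPTION: analysis QMF size, not stated above
SUBSAMPLES_PER_SLOT = 2  # clause 3.1: one time slot = two QMF subsamples


def sbr_frame_layout(core_frame_len, core_fs, downsampled=False):
    """Derive SBR-side frame quantities from the core coder frame."""
    # Clause 3.2: the SBR Tool runs internally at twice the core coder rate.
    fs_sbr = 2 * core_fs
    # At twice the rate, one core frame spans twice as many samples.
    sbr_samples = 2 * core_frame_len
    # Each QMF subsample consumes QMF_BANDS input samples.
    qmf_subsamples = sbr_samples // QMF_BANDS
    return {
        "fs_sbr": fs_sbr,
        # Downsampled mode outputs at the core coder rate instead.
        "fs_out": core_fs if downsampled else fs_sbr,
        "sbr_samples_per_frame": sbr_samples,
        "qmf_subsamples": qmf_subsamples,
        "num_time_slots": qmf_subsamples // SUBSAMPLES_PER_SLOT,
    }


layout = sbr_frame_layout(core_frame_len=1024, core_fs=16000)
print(layout["num_time_slots"])
```

Under these assumptions a 1024-sample AAC frame gives 2048 samples at the SBR rate, 32 QMF subsamples and 16 time slots, which agrees with the numTimeSlots value given in clause 3.2.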
5 SBR encoder description

5.1 SBR tools overview

The encoder part of the SBR tool estimates several parameters used by the high frequency reconstruction method on the decoder side. In order to synchronise the SBR bitstream data with the AAC codec, two different modes of operation have to be considered: normal aacPlus operation and aacPlus parametric stereo operation. In the normal case, the AAC encoder is responsible for downsampling of the input PCM signal, while the SBR encoder works in parallel at twice the sampling frequency of the downsampled signal. When using parametric stereo aacPlus, the SBR tool is also responsible for downsampling of the AAC coder signal. The two modes are outlined in the following sections and illustrated in Figure 1 and Figure 2.

Figure 1: aacPlus block
diagram

Figure 2: Parametric stereo aacPlus block diagram

5.1.1 Enhanced aacPlus synchronisation without parametric stereo

The time domain input PCM signal is assumed to be stored in a buffer x, where 2048 new samples are added to the end of the buffer every frame. Before adding new samples, all samples in the buffer must be left-shifted by 2048 samples. The buffer size amounts to 576 + 2048 + t_inputDelay samples, where t_inputDelay equals the total AAC delay, i.e. the delay for the entire encoder decoder chain, plus the SBR decoder buffer delay minus the SBR encoder buffer delay. All delays are expressed in the SBR input sampling rate.

The PCM buffer x is fed to the analysis QMF bank, where subband filtering is performed. The window stride of the QMF bank is illustrated in Figure 3a, which shows that the first window is applied from sample 0 to sample 639
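The buffer handling described in 5.1.1 can be sketched as follows. This is a minimal illustration under stated assumptions, not the reference implementation: the function names are hypothetical, the buffer is assumed to start out zero-filled, and t_inputDelay is passed in as a parameter because its value is not given in the text above.

```python
# Illustrative sketch only; function names are hypothetical.
FRAME_LEN = 2048      # new samples per frame, at the SBR input rate
HISTORY_LEN = 576     # fixed term in the buffer size given in 5.1.1
QMF_WINDOW_LEN = 640  # the first window covers samples 0..639


def make_buffer(t_input_delay):
    """Allocate x with 576 + 2048 + t_inputDelay samples (assumed zeroed)."""
    return [0.0] * (HISTORY_LEN + FRAME_LEN + t_input_delay)


def push_frame(x, new_samples):
    """Left-shift x by one frame, then append the new frame at the end."""
    if len(new_samples) != FRAME_LEN:
        raise ValueError("expected exactly %d samples" % FRAME_LEN)
    # Dropping the oldest FRAME_LEN samples is the left shift; the buffer
    # length is unchanged because the same number of samples is appended.
    x[:] = x[FRAME_LEN:] + list(new_samples)


def first_window(x):
    """Return the span of x seen by the first QMF analysis window."""
    return x[:QMF_WINDOW_LEN]


x = make_buffer(t_input_delay=100)  # 100 is an arbitrary example value
push_frame(x, [1.0] * FRAME_LEN)
print(len(x), len(first_window(x)))
```

A list is used here for clarity; a real encoder would more likely use a preallocated array and move the samples in place.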