ETSI TS 126 445-2017 Universal Mobile Telecommunications System (UMTS) LTE Codec for Enhanced Voice Services (EVS) Detailed algorithmic description (V14 1 0 3GPP TS 26 445 version .pdf

上传人:outsidejudge265 文档编号:741994 上传时间:2019-01-11 格式:PDF 页数:661 大小:7.60MB
下载 相关 举报
ETSI TS 126 445-2017 Universal Mobile Telecommunications System (UMTS) LTE Codec for Enhanced Voice Services (EVS) Detailed algorithmic description (V14 1 0 3GPP TS 26 445 version .pdf_第1页
第1页 / 共661页
ETSI TS 126 445-2017 Universal Mobile Telecommunications System (UMTS) LTE Codec for Enhanced Voice Services (EVS) Detailed algorithmic description (V14 1 0 3GPP TS 26 445 version .pdf_第2页
第2页 / 共661页
ETSI TS 126 445-2017 Universal Mobile Telecommunications System (UMTS) LTE Codec for Enhanced Voice Services (EVS) Detailed algorithmic description (V14 1 0 3GPP TS 26 445 version .pdf_第3页
第3页 / 共661页
ETSI TS 126 445-2017 Universal Mobile Telecommunications System (UMTS) LTE Codec for Enhanced Voice Services (EVS) Detailed algorithmic description (V14 1 0 3GPP TS 26 445 version .pdf_第4页
第4页 / 共661页
ETSI TS 126 445-2017 Universal Mobile Telecommunications System (UMTS) LTE Codec for Enhanced Voice Services (EVS) Detailed algorithmic description (V14 1 0 3GPP TS 26 445 version .pdf_第5页
第5页 / 共661页
点击查看更多>>
资源描述

1、 ETSI TS 126 445 V13.4.0 (2017-02) Universal Mobile Telecommunications System (UMTS); LTE; Codec for Enhanced Voice Services (EVS); Detailed algorithmic description (3GPP TS 26.445 version 13.4.0 Release 13) floppy3TECHNICAL SPECIFICATION ETSI ETSI TS 126 445 V13.4.0 (2017-02)13GPP TS 26.445 version

2、 13.4.0 Release 13Reference RTS/TSGS-0426445vd40 Keywords LTE,UMTS ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret N 348 623 562 00017 - NAF 742 C Association but non lucratif enregistre la Sous-Prfecture de Grasse (06) N 7803/

3、88 Important notice The present document can be downloaded from: http:/www.etsi.org/standards-search The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior

4、written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat. Users of the present doc

5、ument should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at https:/portal.etsi.org/TB/ETSIDeliverableStatus.aspx If you find errors in the present document, please send your comment to one

6、of the following services: https:/portal.etsi.org/People/CommiteeSupportStaff.aspx Copyright Notification No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content

7、of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media. European Telecommunications Standards Institute 2017. All rights reserved. DECTTM, PLUGTESTSTM, UMTSTMand the ETSI logo are Trade Marks

8、 of ETSI registered for the benefit of its Members. 3GPPTM and LTE are Trade Marks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. GSM and the GSM logo are Trade Marks registered and owned by the GSM Association. ETSI ETSI TS 126 445 V13.4.0 (2017-02)23GPP

9、TS 26.445 version 13.4.0 Release 13Intellectual Property Rights IPRs essential or potentially essential to the present document may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI

10、 SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards“, which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (https:/ipr.etsi.org/). Pursuant to the ETSI IPR Policy, no i

11、nvestigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become, essential to the present document. Foreword The present docum

12、ent may refer to technical specifications or reports using their 3GPP identities, UMTS identities or GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables. The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under http:/webapp

13、.etsi.org/key/queryform.asp. Modal verbs terminology In the present document “shall“, “shall not“, “should“, “should not“, “may“, “need not“, “will“, “will not“, “can“ and “cannot“ are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisi

14、ons). “must“ and “must not“ are NOT allowed in ETSI deliverables except when used in direct citation. ETSI ETSI TS 126 445 V13.4.0 (2017-02)33GPP TS 26.445 version 13.4.0 Release 13Contents Intellectual Property Rights 2g3Foreword . 2g3Modal verbs terminology 2g3Foreword . 14g31 Scope 14g32 Referenc

15、es 14g33 Definitions, abbreviations and mathematical expressions 16g33.1 Definitions 16g33.2 Abbreviations . 17g33.3 Mathematical Expressions 18g34 General description of the coder 19g34.1 Introduction 19g34.2 Input/output sampling rate 19g34.3 Codec delay 19g34.4 Coder overview 20g34.4.1 Encoder ov

16、erview . 20g34.4.1.1 Linear Prediction Based Operation . 21g34.4.1.2 Frequency Domain Operation . 21g34.4.1.3 Inactive Signal coding . 22g34.4.1.4 Source Controlled VBR Coding 22g34.4.2 Decoder overview . 22g34.4.2.1 Parametric Signal Representation Decoding (Bandwidth Extension) . 22g34.4.2.2 Frame

17、 loss concealment 23g34.4.3 DTX/CNG operation. 23g34.4.3.1 Inactive Signal coding . 23g34.4.4 AMR-WB-interoperable option 23g34.4.5 Channel-Aware Mode . 23g34.5 Organization of the rest of the Technical Standard 24g35 Functional description of the encoder 25g35.1 Common processing . 25g35.1.1 High-p

18、ass Filtering . 25g35.1.2 Complex low-delay filter bank analysis 25g35.1.2.1 Sub-band analysis . 25g35.1.2.2 Sub-band energy estimation 26g35.1.3 Sample rate conversion to 12.8 kHz . 27g35.1.3.1 Conversion of 16, 32 and 48 kHz signals to 12.8 kHz 27g35.1.3.2 Conversion of 8 kHz signals to 12.8 kHz 2

19、7g35.1.3.3 Conversion of input signals to 16, 25.6 and 32 kHz . 29g35.1.4 Pre-emphasis . 29g35.1.5 Spectral analysis . 30g35.1.5.1 Windowing and DFT. 30g35.1.5.2 Energy calculations . 31g35.1.6 Bandwidth detection . 32g35.1.6.1 Mean and maximum energy values per band 32g35.1.7 Bandwidth decision. 34

20、g35.1.8 Time-domain transient detection 37g35.1.9 Linear prediction analysis . 38g35.1.9.1 LP analysis window 38g35.1.9.2 Autocorrelation computation. 38g35.1.9.3 Adaptive lag windowing . 39g35.1.9.4 Levinson-Durbin algorithm . 39g35.1.9.5 Conversion of LP coefficients to LSP parameters 40g35.1.9.6

21、LSP interpolation 41g3ETSI ETSI TS 126 445 V13.4.0 (2017-02)43GPP TS 26.445 version 13.4.0 Release 135.1.9.7 Conversion of LSP parameters to LP coefficients 41g35.1.9.8 LP analysis at 16kHz . 42g35.1.10 Open-loop pitch analysis 43g35.1.10.1 Perceptual weighting . 43g35.1.10.2 Correlation function co

22、mputation . 44g35.1.10.3 Correlation reinforcement with past pitch values 45g35.1.10.4 Normalized correlation computation . 46g35.1.10.5 Correlation reinforcement with pitch lag multiples . 46g35.1.10.6 Initial pitch lag determination and reinforcement based on pitch coherence with other half-frames

23、 47g35.1.10.7 Pitch lag determination and parameter update 48g35.1.10.8 Correction of very short and stable open-loop pitch estimates . 49g35.1.10.9 Fractional open-loop pitch estimate for each subframe. 51g35.1.11 Background noise energy estimation 52g35.1.11.1 First stage of noise energy update .

24、52g35.1.11.2 Second stage of noise energy update . 54g35.1.11.2.1 Basic parameters for noise energy update . 54g35.1.11.2.2 Spectral diversity . 55g35.1.11.2.3 Complementary non-stationarity . 55g35.1.11.2.4 HF energy content . 56g35.1.11.2.5 Tonal stability 56g35.1.11.2.6 High frequency dynamic ran

25、ge 60g35.1.11.2.7 Combined decision for background noise energy update 60g35.1.11.3 Energy-based parameters for noise energy update 62g35.1.11.3.1 Closeness to current background estimate . 62g35.1.11.3.2 Features related to last correlation or harmonic event . 62g35.1.11.3.3 Energy-based pause dete

26、ction . 63g35.1.11.3.4 Long-term linear prediction efficiency 63g35.1.11.3.5 Additional long-term parameters used for noise estimation 64g35.1.11.4 Decision logic for noise energy update . 65g35.1.12 Signal activity detection 68g35.1.12.1 SAD1 module 69g35.1.12.1.1 SNR outlier filtering 71g35.1.12.2

27、 SAD2 module 72g35.1.12.3 Combined decision of SAD1 and SAD2 modules for WB and SWB signals . 75g35.1.12.4 Final decision of the SAD1 module for NB signals 75g35.1.12.5 Post-decision parameter update . 76g35.1.12.6 SAD3 module 77g35.1.12.6.1 Sub-band FFT 77g35.1.12.6.2 Computation of signal features

28、 78g35.1.12.6.3 Computation of SNR parameters . 81g35.1.12.6.4 Decision of background music 82g35.1.12.6.5 Decision of background update flag 83g35.1.12.6.6 SAD3 Pre-decision 84g35.1.12.6.7 SAD3 Hangover 86g35.1.12.7 Final SAD decision . 86g35.1.12.8 DTX hangover addition . 88g35.1.13 Coding mode de

29、termination 90g35.1.13.1 Unvoiced signal classification . 91g35.1.13.1.1 Voicing measure 92g35.1.13.1.2 Spectral tilt 92g35.1.13.1.3 Sudden energy increase from a low energy level 93g35.1.13.1.4 Total frame energy difference . 94g35.1.13.1.5 Energy decrease after spike . 94g35.1.13.1.6 Decision abou

30、t UC mode . 94g35.1.13.2 Stable voiced signal classification . 96g35.1.13.3 Signal classification for FEC. 96g35.1.13.3.1 Signal classes for FEC . 97g35.1.13.3.2 Signal classification parameters 97g35.1.13.3.3 Classification procedure 98g35.1.13.4 Transient signal classification . 99g35.1.13.5 Modif

31、ication of coding mode in special cases 100g3ETSI ETSI TS 126 445 V13.4.0 (2017-02)53GPP TS 26.445 version 13.4.0 Release 135.1.13.6 Speech/music classification. 101g35.1.13.6.1 First stage of the speech/music classifier . 101g35.1.13.6.2 Scaling of features in the first stage of the speech/music cl

32、assifier . 103g35.1.13.6.3 Log-probability and decision smoothing . 104g35.1.13.6.4 State machine and final speech/music decision . 105g35.1.13.6.5 Improvement of the classification for mixed and music content . 108g35.1.13.6.6 Second stage of the speech/music classifier 112g35.1.13.6.7 Context-base

33、d improvement of the classification for stable tonal signals . 114g35.1.13.6.8 Detection of sparse spectral content 118g35.1.13.6.9 Decision about AC mode . 120g35.1.13.6.10 Decision about IC mode 120g35.1.14 Coder technology selection . 120g35.1.14.1 ACELP/MDCT-based technology selection at 9.6kbps

34、, 16.4 and 24.4 kbps 121g35.1.14.1.1 Segmental SNR estimation of the MDCT-based technology 121g35.1.14.1.2 Segmental SNR estimation of the ACELP technology 127g35.1.14.1.3 Hysteresis and final decision . 128g35.1.14.2 TCX/HQ MDCT technology selection at 13.2 and 16.4 kbps . 129g35.1.14.3 TCX/HQ MDCT

35、 technology selection at 24.4 and 32 kbps 131g35.1.14.4 TD/Multi-mode FD BWE technology selection at 13.2 kbps and 32 kbps . 134g35.2 LP-based Coding 135g35.2.1 Perceptual weighting. 135g35.2.2 LP filter coding and interpolation . 136g35.2.2.1 LSF quantization . 136g35.2.2.1.1 LSF weighting function

36、 . 136g35.2.2.1.2 Bit allocation . 139g35.2.2.1.3 Predictor allocation 140g35.2.2.1.4 LSF quantizer structure . 140g35.2.2.1.5 LSFQ for voiced coding mode at 16 kHz internal sampling frequency : BC-TCVQ 145g35.2.2.1.6 Mid-frame LSF quantizer 152g35.2.3 Excitation coding 153g35.2.3.1 Excitation codin

37、g in the GC, VC and high rate IC/UC modes 153g35.2.3.1.1 Computation of the LP residual signal 154g35.2.3.1.2 Target signal computation . 154g35.2.3.1.3 Impulse response computation 155g35.2.3.1.4 Adaptive codebook 155g35.2.3.1.5 Algebraic codebook. 157g35.2.3.1.6 Combined algebraic codebook 167g35.

38、2.3.1.7 Gain quantization. 181g35.2.3.2 Excitation coding in TC mode 186g35.2.3.2.1 Glottal pulse codebook search . 186g35.2.3.2.2 TC frame configurations 190g35.2.3.2.4 Pitch period and gain coding in the TC mode . 192g35.2.3.2.5 Update of filter memories 195g35.2.3.3 Excitation coding in UC mode a

39、t low rates . 195g35.2.3.3.1 Structure of the Gaussian codebook 195g35.2.3.3.2 Correction of the Gaussian codebook spectral tilt . 196g35.2.3.3.3 Search of the Gaussian codebook 197g35.2.3.3.4 Quantization of the Gaussian codevector gain 198g35.2.3.3.5 Other parameters in UC mode . 199g35.2.3.4 Exci

40、tation coding in IC and UC modes at 9.6 kbps 199g35.2.3.4.2 Gaussian noise generation . 201g35.2.3.4.3 Gain coding . 201g35.2.3.4.4 Memory update 203g35.2.3.5 Excitation coding in GSC mode 203g35.2.3.5.1 Determining the subframe length 204g35.2.3.5.2 Computing time-domain excitation contribution . 2

41、04g35.2.3.5.3 Frequency transform of residual and time-domain excitation contribution . 205g35.2.3.5.4 Computing energy dynamics of transformed residual and quantization of noise level . 206g35.2.3.5.6 Find and encode the cut-off frequency 206g35.2.3.5.7 Band energy computation and quantization. 208

42、g35.2.3.5.8 PVQ Bit allocation 208g35.2.3.5.9 Quantization of difference signal. 209g3ETSI ETSI TS 126 445 V13.4.0 (2017-02)63GPP TS 26.445 version 13.4.0 Release 135.2.3.5.10 Spectral dynamic and noise filling 209g35.2.3.5.11 Quantized gain addition, temporal and frequency contributions combination

43、 209g35.2.3.5.12 Specifics for wideband 8kbps 210g35.2.3.5.13 Inverse DCT 211g35.2.3.5.14 Remove pre-echo in case of onset detection 211g35.2.4 Bass post-filter gain quantization 212g35.2.5 Source Controlled VBR Coding . 212g35.2.5.1 Principles of VBR Coding 212g35.2.5.2 EVS VBR Encoder Coding Modes

44、 and Bit-Rates 213g35.2.5.3 Prototype-Pitch-Period (PPP) Encoding . 213g35.2.5.3.1 PPP Algorithm . 213g35.2.5.3.2 Amplitude Quantization 214g35.2.5.3.3 Phase Quantization 215g35.2.5.4 Noise-Excited-Linear-Prediction (NELP) Encoding . 215g35.2.5.5 Average Data Rate (ADR) Control for the EVS VBR mode

45、215g35.2.6 Coding of upper band for LP-based Coding Modes . 219g35.2.6.1 Bandwidth extension in time domain 219g35.2.6.1.1 High band target signal generation 220g35.2.6.1.2 TBE LP analysis 221g35.2.6.1.3 Quantization of linear prediction parameters. 223g35.2.6.1.4 Interpolation of LSF coefficients .

46、 226g35.2.6.1.5 Target and residual energy calculation and quantization . 228g35.2.6.1.6 Generation of the upsampled version of the lowband excitation . 228g35.2.6.1.7 Non-Linear Excitation Generation 229g35.2.6.1.8 Spectral flip of non-linear excitation in time domain 230g35.2.6.1.9 Down-sample usi

47、ng all-pass filters 230g35.2.6.1.10 Adaptive spectral whitening 231g35.2.6.1.11 Envelope modulated noise mixing. 231g35.2.6.1.12 Spectral shaping of the noise added excitation 233g35.2.6.1.13 Post processing of the shaped excitation . 233g35.2.6.1.14 Estimation of temporal gain shape parameters 235g

48、35.2.6.1.15 Estimation of frame gain parameters . 238g35.2.6.1.16 Estimation of TEC/TFA envelope parameters. 240g35.2.6.1.17 Estimation of full-band frame energy parameters . 242g35.2.6.2 Multi-mode FD Bandwidth Extension Coding . 244g35.2.6.2.1 SWB/FB Multi-mode FD Bandwidth Extension . 245g35.2.6.

49、2.2 WB Multi-mode FD Bandwidth Extension . 256g35.2.6.3 Coding of upper band at 64 kb/s . 260g35.2.6.3.1 Coding in normal mode . 261g35.2.6.3.2 Coding in transient mode . 265g35.3 MDCT Coding Mode . 268g35.3.1 General description . 268g35.3.2 Time-to-frequency transformations 268g35.3.2.1 Transform sizes and MDCT configurations 268g35.3.2.2 Long block transformation (ALDO window) 268g35.3.2.2.1 Folding and on-the-fly window decimation. 270g35.3.2.2.2 eDCT . 272g35.3.2.3 Transient location dependent overlap and transform length . 274g35.3.2.4 Short block transformation 275g35.

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 标准规范 > 国际标准 > 其他

copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1