ETSI TS 126 094-2018 Digital cellular telecommunications system (Phase 2+) (GSM) Universal Mobile Telecommunications System (UMTS) LTE Mandatory speech codec speech processing func.pdf

Uploaded by: fatcommittee260 Document ID: 741742 Upload date: 2019-01-11 Format: PDF Pages: 26 Size: 281.29KB
Resource description

ETSI TS 126 094 V15.0.0 (2018-07)

Digital cellular telecommunications system (Phase 2+) (GSM); Universal Mobile Telecommunications System (UMTS); LTE; Mandatory speech codec speech processing functions; Adaptive Multi-Rate (AMR) speech codec; Voice Activity Detector (VAD) (3GPP TS 26.094 version 15.0.0 Release 15)

TECHNICAL SPECIFICATION

Reference
RTS/TSGS-0426094vf00

Keywords
GSM, LTE, UMTS

ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE
Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16
Siret N° 348 623 562 00017 - NAF 742 C
Association à but non lucratif enregistrée à la Sous-Préfecture de Grasse (06) N° 7803/88

Important notice

The present document can be downloaded from: http://www.etsi.org/standards-search

The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.

Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at https://portal.etsi.org/TB/ETSIDeliverableStatus.aspx

If you find errors in the present document, please send your comment to one of the following services: https://portal.etsi.org/People/CommiteeSupportStaff.aspx

Copyright Notification

No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media.

© ETSI 2018. All rights reserved.

DECT™, PLUGTESTS™, UMTS™ and the ETSI logo are trademarks of ETSI registered for the benefit of its Members. 3GPP™ and LTE™ are trademarks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. oneM2M logo is protected for the benefit of its Members. GSM® and the GSM logo are trademarks registered and owned by the GSM Association.

Intellectual Property Rights

Essential patents

IPRs essential or potentially essential to normative deliverables may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (https://ipr.etsi.org/).

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become, essential to the present document.

Trademarks

The present document may include trademarks and/or tradenames which are asserted and/or registered by their owners. ETSI claims no ownership of these except for any which are indicated as being the property of ETSI, and conveys no right to use or reproduce any trademark and/or tradename. Mention of those trademarks in the present document does not constitute an endorsement by ETSI of products, services or organizations associated with those trademarks.

Foreword

This Technical Specification (TS) has been produced by ETSI 3rd Generation Partnership Project (3GPP).

The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables.

The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under http://webapp.etsi.org/key/queryform.asp.

Modal verbs terminology

In the present document "shall", "shall not", "should", "should not", "may", "need not", "will", "will not", "can" and "cannot" are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions).

"must" and "must not" are NOT allowed in ETSI deliverables except when used in direct citation.

Contents

Intellectual Property Rights
Foreword
Modal verbs terminology
Foreword
1 Scope
2 References
3 Technical Description of VAD Option 1
3.1 Definitions, symbols and abbreviations
3.1.1 Definitions
3.1.2 Symbols
3.1.2.1 Variables
3.1.2.2 Constants
3.1.2.3 Functions
3.1.3 Abbreviations
3.2 General
3.3 Functional description
3.3.1 Filter bank and computation of sub-band levels
3.3.2 Pitch detection
3.3.3 Tone detection
3.3.4 Correlated Complex Signal Analysis (and detection)
3.3.5 VAD decision
3.3.5.1 Hangover addition
3.3.5.2 Background noise estimation
4 Technical Description of VAD Option 2
4.1 Definitions, symbols and abbreviations
4.1.1 Definitions
4.1.2 Symbols
4.1.2.1 Variables
4.1.2.2 Constants
4.1.2.3 Functions
4.1.3 Abbreviations
4.2 General
4.3 Functional description
4.3.1 Frequency Domain Conversion
4.3.2 Channel Energy Estimator
4.3.3 Channel SNR Estimator
4.3.4 Voice Metric Calculation
4.3.5 Frame SNR and Long-Term Peak SNR Calculation
4.3.6 Negative SNR Sensitivity Bias
4.3.7 VAD Decision
4.3.8 Spectral Deviation Estimator
4.3.9 Sinewave Detection
4.3.10 Background Noise Update Decision
4.3.10 Background Noise Estimate Update
5 Computational details
Annex A (informative): Change history
History

Foreword

This Technical Specification has been produced by the 3rd Generation Partnership Project (3GPP).

The contents of the present document are subject to continuing work within the TSG and may change following formal TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows:

Version x.y.z

where:

x    the first digit:
     1 presented to TSG for information;
     2 presented to TSG for approval;
     3 or greater indicates TSG approved document under change control.

y    the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc.

z    the third digit is incremented when editorial only changes have been incorporated in the document.
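The x.y.z scheme described in the Foreword can be illustrated with a short sketch. The names `Version`, `parse_version`, `approval_status` and `change_kind` are hypothetical helpers introduced only for this illustration, and the returned strings paraphrase the Foreword rather than quote any 3GPP tooling. The Foreword does not say how a change in the first digit should be classified, so the sketch simply reports it.

```python
from typing import NamedTuple


class Version(NamedTuple):
    """Hypothetical container for a 3GPP-style x.y.z version number."""
    x: int
    y: int
    z: int


def parse_version(text: str) -> Version:
    """Parse strings such as 'V15.0.0' or '15.0.0' into a Version."""
    parts = text.strip().lstrip("Vv").split(".")
    if len(parts) != 3:
        raise ValueError(f"expected x.y.z, got {text!r}")
    x, y, z = (int(p) for p in parts)
    return Version(x, y, z)


def approval_status(version: Version) -> str:
    """Map the first digit to the status wording used in the Foreword."""
    if version.x == 1:
        return "presented to TSG for information"
    if version.x == 2:
        return "presented to TSG for approval"
    if version.x >= 3:
        return "TSG approved, under change control"
    return "not covered by the scheme"


def change_kind(old: Version, new: Version) -> str:
    """Classify the difference between two versions by the digit that changed."""
    if new.x != old.x:
        return "first digit changed"
    if new.y != old.y:
        return "change of substance"
    if new.z != old.z:
        return "editorial only"
    return "no change"


if __name__ == "__main__":
    current = parse_version("V15.0.0")
    print(current, approval_status(current))
```

Under these assumptions, version 15.0.0 of the present document falls in the "3 or greater" branch, i.e. an approved document under change control.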

1 Scope

The present document specifies two alternatives for the Voice Activity Detector (VAD) to be used in the Discontinuous Transmission (DTX) as described in [3]. Implementors of mobile station and infrastructure equipment conforming to the AMR specifications can choose which of the two VAD options to implement. There are no interoperability factors associated with this choice. The requirements are mandatory on any VAD to be used either in User Equipment (UE) or Base Station Systems (BSSs) that utilize the AMR speech codec.

2 References

The following documents contain provisions which, through reference in this text, constitute provisions of the present document.

- References are either specific (identified by date of publication, edition number, version number, etc.) or non-specific.
- For a specific reference, subsequent revisions do not apply.
- For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document.

[1] 3GPP TS 26.073: "Adaptive Multi-Rate (AMR); ANSI C source code".
[2] 3GPP TS 26.090: "Transcoding functions".
[3] 3GPP TS 26.093: "Source Controlled Rate operation".
[4] ITU, The International Telecommunications Union, Blue Book, Vol. III, Telephone Transmission Quality, IXth Plenary Assembly, Melbourne, 14-25 November, 1988, Recommendation G.711, Pulse code modulation (PCM) of voice frequencies.

3 Technical Description of VAD Option 1

3.1 Definitions, symbols and abbreviations

3.1.1 Definitions

For the purposes of the present document, the following terms and definitions apply:

frame: time interval of 20 ms corresponding to the time segmentation of the speech transcoder
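As a worked illustration of the frame definition, the sketch below converts the 20 ms frame interval into a sample count and cuts a sample buffer into whole frames. The 8 kHz sampling rate is an assumption (it is the narrowband rate used by AMR but is not stated in this excerpt), and the helper names are hypothetical. With that assumption the result agrees with the L_FRAME value of 160 listed among the constants.

```python
from typing import List, Sequence

SAMPLE_RATE_HZ = 8000  # assumption: narrowband sampling rate, not given in this excerpt
FRAME_MS = 20          # frame interval from the definition above


def samples_per_frame(sample_rate_hz: int = SAMPLE_RATE_HZ,
                      frame_ms: int = FRAME_MS) -> int:
    """Number of samples covered by one frame interval."""
    return sample_rate_hz * frame_ms // 1000


def split_into_frames(samples: Sequence[int], frame_len: int) -> List[Sequence[int]]:
    """Cut a buffer into consecutive whole frames; a trailing partial frame is dropped."""
    last_start = len(samples) - frame_len
    return [samples[i:i + frame_len] for i in range(0, last_start + 1, frame_len)]


if __name__ == "__main__":
    frame_len = samples_per_frame()
    frames = split_into_frames(list(range(400)), frame_len)
    print(frame_len, len(frames))
```

A VAD of the kind specified here would produce one decision per such frame.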

3.1.2 Symbols

For the purposes of the present document, the following symbols apply.

3.1.2.1 Variables

bckr_est[n]    background noise estimate
burst_count    counts length of a speech burst, used by VAD hangover addition
hang_count    hangover counter, used by VAD hangover addition
complex_hang_count    hangover counter, used by CAD hangover addition
complex_hang_timer    hangover initiator, used for Complex Activity Estimation
lagcount    pitch detection counter
level[n]    signal level
new_speech    pointer of the speech encoder, points to a buffer containing last received samples of a speech frame [2]
noise_level    average level of the background noise estimate
oldlagcount    lagcount of the previous frame
pitch    flag indicating presence of a periodic signal
complex_warning    flag indicating the presence of a complex signal
best_corr_hp    normalized and limited value from maximum HP filtered correlation vector
corr_hp    filtered best_corr_hp values
pow_sum    power of the input frame
s(i)    samples of the input frame
snr_sum    measure between input frame and noise estimate
stat_count    stationarity counter
stat_rat    measure indicating stationarity
T_op[n]    open-loop lags [2]
t0    autocorrelation maxima calculated by the open-loop pitch analysis [2]
t1    signal power related to the autocorrelation maxima t0 [2]
tone    flag indicating the presence of a tone
vad_thr    VAD threshold
VAD_flag    Boolean VAD flag
vadreg    intermediate VAD decision
complex_low    intermediate complex signal decisions
complex_high    intermediate complex signal decisions

3.1.2.2 Constants

ALPHA_UP1    constant for updating noise estimate (see clause 3.3.5.2)
ALPHA_DOWN1    constant for updating noise estimate (see clause 3.3.5.2)
ALPHA_UP2    constant for updating noise estimate (see clause 3.3.5.2)
ALPHA_DOWN2    constant for updating noise estimate (see clause 3.3.5.2)
ALPHA3    constant for updating noise estimate (see clause 3.3.5.2)
ALPHA4    constant for updating average signal level (see clause 3.3.5.2)
ALPHA5    constant for updating average signal level (see clause 3.3.5.2)
BURST_LEN_HIGH_NOISE    constant for controlling VAD hangover addition (see clause 3.3.5.1)
BURST_LEN_LOW_NOISE    constant for controlling VAD hangover addition (see clause 3.3.5.1)
COEFF3    coefficient for the filter bank (see clause 3.3.1)
COEFF5_1    coefficient for the filter bank (see clause 3.3.1)
COEFF5_2    coefficient for the filter bank (see clause 3.3.1)
HANG_LEN_HIGH_NOISE    constant for controlling VAD hangover addition (see clause 3.3.5.1)
HANG_LEN_LOW_NOISE    constant for controlling VAD hangover addition (see clause 3.3.5.2)
HANG_NOISE_THR    constant for controlling VAD hangover addition (see clause 3.3.5.2)
L_FRAME    size of a speech frame, 160 samples
L_NEXT    length for the lookahead of the speech encoder, 40 samples
LTHRESH    threshold for pitch detection (see clause 3.3.2)
NOISE_MAX    maximum value for noise estimate (see clause 3.3.5.2)
NOISE_MIN    minimum value for noise estimate (see clause 3.3.5.2)
NTHRESH    threshold for pitch detection (see clause 3.3.2)
POW_PITCH_THR    threshold for pitch detection (see clause 3.3.5)
POW_COMPLEX_THR    threshold for complex detection (see clause 3.3.5)
STAT_COUNT    threshold for stationary detection (see clause 3.3.5.2)
CAD_MIN_STAT_COUNT    minimum threshold after complex warning
STAT_THR    threshold for stationary detection (see clause 3.3.5.2)
STAT_THR_LEVEL    threshold for stationary detection (see clause 3.3.5.2)
TONE_THR    threshold for tone detection (see clause 3.3.3)
VAD_P1    constant of computation for VAD threshold (see clause 3.3.5.2)
VAD_POW_LOW    constant for controlling VAD hangover addition (see clause 3.3.5.1)
VAD_SLOPE    constant of computation for VAD threshold (see clause 3.3.5)
VAD_THR_HIGH    constant of computation for VAD threshold (see clause 3.3.5)
CVAD_THRESH_ADAPT_HIGH    constant for updating complex_high
CVAD_THRESH_ADAPT_LOW    constant for updating complex_low
CVAD_THRESH_HANG    constant for updating complex_hang_timer
CVAD_HANG_LIMIT    constant for initiating complex_hang_count
CVAD_HANG_LENGTH    constant for resetting complex_hang_count

3.1.2.3 Functions

+    addition
-    subtraction
*    multiplication
/    division
| x |    absolute value of x
AND    Boolean AND
OR    Boolean OR

$\sum_{n=a}^{b} x(n) = x(a) + x(a+1) + \dots + x(b-1) + x(b)$

$\mathrm{MIN}(x,y) = \begin{cases} x, & x \le y \\ y, & y < x \end{cases}$

$\mathrm{MAX}(x,y) = \begin{cases} x, & x \ge y \\ y, & y > x \end{cases}$

3.1.3 Abbreviations

For the purposes of the present document, the following abbreviations apply:

ANSI    American National Standards Institute
DTX    Discontinuous Transmission
VAD    Voice Activity Detector
CAD    Complex Activity Detection
CNG    Comfort Noise Generation

3.2 General

The function of the VAD algorithm is to indicate whether each 20 ms frame contains signals that should be transmitted, i.e. speech, music or information tones. The output of the VAD algorithm is a Boolean flag (VAD_flag) indicating presence of such signals.

3.3 Functional description

The block diagram of the VAD algorithm is depicted in figure 1. The VAD algorithm uses parameters of the speech encoder to compute the Boolean VAD flag (VAD_flag). Samples of the input frame (s(i)) are divided into sub-bands and the level of the signal in each band (level[n]) is calculated. The inputs to the pitch detection function are the open-loop lags (T_op[n]), which are calculated by the open-loop pitch analysis of the speech encoder. The pitch detection function computes a flag (pitch) which indicates presence of pitch. The tone detection function calculates a flag (tone), which indicates presence of an information tone. Tones are detected based on the pitch gain of the open-loop pitch analysis. The pitch gain is estimated using autocorrelation values (t0 and t1) received from the pitch analysis. The Complex Signal Detection function calculates a flag (complex_warning), which indicates presence of a correlated complex signal such as music. Correlated complex signals are detected based on analysis of the correlation vector available in the open-loop pitch analysis. The VAD decision function estimates background noise leve
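The tone detection step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the normative procedure of clause 3.3.3 or the reference C code in [1]: the pitch gain is assumed to be the ratio of the autocorrelation maximum t0 to the related signal power t1, the tone flag is assumed to be set when that ratio exceeds TONE_THR, and the numeric value of TONE_THR used here is a placeholder because the excerpt names the constant without giving its value. The function names are hypothetical.

```python
TONE_THR = 0.65  # placeholder value for illustration; not given in this excerpt


def estimate_pitch_gain(t0: float, t1: float) -> float:
    """Assumed pitch-gain estimate: autocorrelation maximum over related signal power."""
    if t1 <= 0.0:
        # No usable signal power, so treat the gain as zero rather than divide.
        return 0.0
    return t0 / t1


def detect_tone(t0: float, t1: float, tone_thr: float = TONE_THR) -> bool:
    """Return True when the assumed pitch gain exceeds the tone threshold."""
    return estimate_pitch_gain(t0, t1) > tone_thr


if __name__ == "__main__":
    # A strongly periodic frame: the correlation maximum is close to the signal power.
    print(detect_tone(t0=9.0, t1=10.0))
    # A weakly correlated frame.
    print(detect_tone(t0=1.0, t1=10.0))
```

The guard on t1 keeps the sketch well defined for silent frames; how the specification itself handles that case is not visible in this excerpt.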
