ImageVerifierCode 换一换
格式:PDF , 页数:26 ,大小:281.29KB ,
资源ID:741742      下载积分:10000 积分
快捷下载
登录下载
邮箱/手机:
温馨提示:
如需开发票,请勿充值!快捷下载时,用户名和密码都是您填写的邮箱或者手机号,方便查询和重复下载(系统自动生成)。
如填写123,账号就是123,密码也是123。
特别说明:
请自助下载,系统不会自动发送文件的哦; 如果您已付费,想二次下载,请登录后访问:我的下载记录
支付方式: 支付宝扫码支付 微信扫码支付   
注意:如需开发票,请勿充值!
验证码:   换一换

加入VIP,免费下载
 

温馨提示:由于个人手机设置不同,如果发现不能下载,请复制以下地址【http://www.mydoc123.com/d-741742.html】到电脑端继续下载(重复下载不扣费)。

已注册用户请登录:
账号:
密码:
验证码:   换一换
  忘记密码?
三方登录: 微信登录  

下载须知

1: 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。
2: 试题试卷类文档,如果标题没有明确说明有答案则都视为没有答案,请知晓。
3: 文件的所有权益归上传用户所有。
4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
5. 本站仅提供交流平台,并不能对任何下载内容负责。
6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

版权提示 | 免责声明

本文(ETSI TS 126 094-2018 Digital cellular telecommunications system (Phase 2+) (GSM) Universal Mobile Telecommunications System (UMTS) LTE Mandatory speech codec speech processing func.pdf)为本站会员(fatcommittee260)主动上传,麦多课文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知麦多课文库(发送邮件至master@mydoc123.com或直接QQ联系客服),我们立即给予删除!

ETSI TS 126 094-2018 Digital cellular telecommunications system (Phase 2+) (GSM) Universal Mobile Telecommunications System (UMTS) LTE Mandatory speech codec speech processing func.pdf

1、 ETSI TS 126 094 V15.0.0 (2018-07) Digital cellular telecommunications system (Phase 2+) (GSM); Universal Mobile Telecommunications System (UMTS); LTE; Mandatory speech codec speech processing functions; Adaptive Multi-Rate (AMR) speech codec; Voice Activity Detector (VAD) (3GPP TS 26.094 version 15

2、.0.0 Release 15) TECHNICAL SPECIFICATION ETSI ETSI TS 126 094 V15.0.0 (2018-07)13GPP TS 26.094 version 15.0.0 Release 15Reference RTS/TSGS-0426094vf00 Keywords GSM,LTE,UMTS ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret N 348

3、623 562 00017 - NAF 742 C Association but non lucratif enregistre la Sous-Prfecture de Grasse (06) N 7803/88 Important notice The present document can be downloaded from: http:/www.etsi.org/standards-search The present document may be made available in electronic versions and/or in print. The conten

4、t of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Docume

5、nt Format (PDF) version kept on a specific network drive within ETSI Secretariat. Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at https:/portal.etsi.org/

6、TB/ETSIDeliverableStatus.aspx If you find errors in the present document, please send your comment to one of the following services: https:/portal.etsi.org/People/CommiteeSupportStaff.aspx Copyright Notification No part may be reproduced or utilized in any form or by any means, electronic or mechani

7、cal, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media. ETSI 2018. All rights reserve

8、d. DECTTM, PLUGTESTSTM, UMTSTMand the ETSI logo are trademarks of ETSI registered for the benefit of its Members. 3GPPTM and LTETMare trademarks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. oneM2M logo is protected for the benefit of its Members. GSMand

9、the GSM logo are trademarks registered and owned by the GSM Association. ETSI ETSI TS 126 094 V15.0.0 (2018-07)23GPP TS 26.094 version 15.0.0 Release 15Intellectual Property Rights Essential patents IPRs essential or potentially essential to normative deliverables may have been declared to ETSI. The

10、 information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards“, which is available fro

11、m the ETSI Secretariat. Latest updates are available on the ETSI Web server (https:/ipr.etsi.org/). Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (

12、or the updates on the ETSI Web server) which are, or may be, or may become, essential to the present document. Trademarks The present document may include trademarks and/or tradenames which are asserted and/or registered by their owners. ETSI claims no ownership of these except for any which are ind

13、icated as being the property of ETSI, and conveys no right to use or reproduce any trademark and/or tradename. Mention of those trademarks in the present document does not constitute an endorsement by ETSI of products, services or organizations associated with those trademarks. Foreword This Technic

14、al Specification (TS) has been produced by ETSI 3rd Generation Partnership Project (3GPP). The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or GSM identities. These should be interpreted as being references to the corresponding ETSI d

15、eliverables. The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under http:/webapp.etsi.org/key/queryform.asp. Modal verbs terminology In the present document “shall“, “shall not“, “should“, “should not“, “may“, “need not“, “will“, “will not“, “can“ and “cannot“ are to be i

16、nterpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions). “must“ and “must not“ are NOT allowed in ETSI deliverables except when used in direct citation. ETSI ETSI TS 126 094 V15.0.0 (2018-07)33GPP TS 26.094 version 15.0.0 Release 15Contents In

17、tellectual Property Rights 2g3Foreword . 2g3Modal verbs terminology 2g3Foreword . 4g31 Scope 5g32 References 5g33 Technical Description of VAD Option 1 5g33.1 Definitions, symbols and abbreviations 5g33.1.1 Definitions 5g33.1.2 Symbols 5g33.1.2.1 Variables . 5g33.1.2.2 Constants. 6g33.1.2.3 Function

18、s . 7g33.1.3 Abbreviations 7g33.2 General . 7g33.3 Functional description 7g33.3.1 Filter bank and computation of sub-band levels . 8g33.3.2 Pitch detection 10g33.3.3 Tone detection 10g33.3.4 Correlated Complex Signal Analysis (and detection) . 11g33.3.5 VAD decision . 11g33.3.5.1 Hangover addition

19、. 12g33.3.5.2 Background noise estimation 14g34 Technical Description of VAD Option 2 16g34.1 Definitions, symbols and abbreviations 16g34.1.1 Definitions 16g34.1.2 Symbols 16g34.1.2.1 Variables . 16g34.1.2.2 Constants. 17g34.1.2.3 Functions . 17g34.1.3 Abbreviations 18g34.2 General . 18g34.3 Functi

20、onal description 18g34.3.1 Frequency Domain Conversion 19g34.3.2 Channel Energy Estimator 19g34.3.3 Channel SNR Estimator 20g34.3.4 Voice Metric Calculation 20g34.3.5 Frame SNR and Long-Term Peak SNR Calculation . 20g34.3.6 Negative SNR Sensitivity Bias . 21g34.3.7 VAD Decision 21g34.3.8 Spectral De

21、viation Estimator 21g34.3.9 Sinewave Detection 22g34.3.10 Background Noise Update Decision . 22g34.3.10 Background Noise Estimate Update . 23g35 Computational details . 23g3Annex A (informative) : Change history . 24g3History 25g3ETSI ETSI TS 126 094 V15.0.0 (2018-07)43GPP TS 26.094 version 15.0.0 R

22、elease 15Foreword This Technical Specification has been produced by the 3rdGeneration Partnership Project (3GPP). The contents of the present document are subject to continuing work within the TSG and may change following formal TSG approval. Should the TSG modify the contents of the present documen

23、t, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows: Version x.y.z where: x the first digit: 1 presented to TSG for information; 2 presented to TSG for approval; 3 or greater indicates TSG approved document under change control

24、. y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc. z the third digit is incremented when editorial only changes have been incorporated in the document. ETSI ETSI TS 126 094 V15.0.0 (2018-07)53GPP TS 26.094 version 15.0.0 Release

25、151 Scope The present document specifies two alternatives for the Voice Activity Detector (VAD) to be used in the Discontinuous Transmission (DTX) as described in 3. Implementors of mobile station and infrastructure equipment conforming to the AMR specifications can choose which of the two VAD optio

26、ns to implement. There are no interoperability factors associated with this choice. The requirements are mandatory on any VAD to be used either in User Equipment (UE) or Base Station Systems (BSS)s that utilize the AMR speech codec. 2 References The following documents contain provisions which, thro

27、ugh reference in this text, constitute provisions of the present document. - References are either specific (identified by date of publication, edition number, version number, etc.) or non-specific. - For a specific reference, subsequent revisions do not apply. - For a non-specific reference, the la

28、test version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document. 1 3GPP TS 26.073: “Adaptive Multi-Rate (AMR); ANSI C source code“. 2 3GPP TS

29、26.090: “Transcoding functions“. 3 3 GPP TS 26.093: “Source Controlled Rate operation“. 4 ITU, The International Telecommunications Union, Blue Book, Vol. III, Telephone Transmission Quality, IXth Plenary Assembly, Melbourne, 14-25 November, 1988, Recommendation G.711, Pulse code modulation (PCM) of

30、 voice frequencies. 3 Technical Description of VAD Option 1 3.1 Definitions, symbols and abbreviations 3.1.1 Definitions For the purposes of the present document, the following terms and definitions apply: frame: time interval of 20 ms corresponding to the time segmentation of the speech transcoder

31、3.1.2 Symbols For the purposes of the present document, the following symbols apply. 3.1.2.1 Variables bckr_estn background noise estimate burst_count counts length of a speech burst, used by VAD hangover addition hang_count hangover counter, used by VAD hangover addition complex_hang_count hangover

32、 counter, used by CAD hangover addition complex_hang_timer hangover initator, used fo Complex Activity Estimation lagcount pitch detection counter leveln signal level new_speech pointer of the speech encoder, points a buffer containing last received samples of a speech frame 2 ETSI ETSI TS 126 094 V

33、15.0.0 (2018-07)63GPP TS 26.094 version 15.0.0 Release 15noise_level average level of the background noise estimate oldlagcount lagcount of the previous frame pitch flag indicating presence of a periodic signal complex_warning flag indicating the presence of a complex signal. best_corr_hp normalized

34、 and limited value from maximum HP filtered correlation vector corr_hp filtered best_corr_hp values pow_sum power of the input frame s(i) samples of the input framer snr_sum measure between input frame and noise estimate stat_count stationarity counter stat_rat measure indicating stationary T_opn op

35、en-loop lags 2 t0 autocorrelation maxima calculated by the open-loop pitch analysis 2 t1 signal power related to the autocorrelation maxima t0 2 tone flag indicating the presence of a tone vad_thr VAD threshold VAD_flag boolean VAD flag vadreg intermediate VAD decision complex_low intermediate compl

36、ex signal decisions complex_high intermediate complex signal decisions 3.1.2.2 Constants ALPHA_UP1 constant for updating noise estimate (see clause 3.3.5.2) ALPHA_DOWN1 constant for updating noise estimate (see clause 3.3.5.2) ALPHA_UP2 constant for updating noise estimate (see clause 3.3.5.2) ALPHA

37、_DOWN2 constant for updating noise estimate (see clause 3.3.5.2) ALPHA3 constant for updating noise estimate (see clause 3.3.5.2) ALPHA4 constant for updating average signal level (see clause 3.3.5.2) ALPHA5 constant for updating average signal level (see clause 3.3.5.2) BURST_LEN_HIGH_NOISE constan

38、t for controlling VAD hangover addition (see clause 3.3.5.1) BURST_LEN_LOW_NOISE constant for controlling VAD hangover addition (see clause 3.3.5.1) COEFF3 coefficient for the filter bank (see clause 3.3.1) COEFF5_1 coefficient for the filter bank (see clause 3.3.1) COEFF5_2 coefficient for the filt

39、er bank (see clause 3.3.1) HANG_LEN_HIGH_NOISE constant for controlling VAD hangover addition (see clause 3.3.5.1) HANG_LEN_LOW_NOISE constant for controlling VAD hangover addition (see clause 3.3.5.2) HANG_NOISE_THR constant for controlling VAD hangover addition (see clause 3.3.5.2) L_FRAME size of

40、 a speech frame, 160 L_NEXT length for the lookahead of the speech encoder, 40 LTHRESH threshold for pitch detection (see clause 3.3.2) NOISE_MAX maximum value for noise estimate (see clause 3.3.5.2) NOISE_MIN minimum value for noise estimate (see clause 3.3.5.2) NTHRESH threshold for pitch detectio

41、n (see clause 3.3.2) POW_PITCH_THR threshold for pitch detection (see clause 3.3.5) POW_COMPLEX_THR threshold for complex detection (see clause 3.3.5) STAT_COUNT threshold for stationary detection (see clause 3.3.5.2) CAD_MIN_STAT_COUNT minimum threshold after complex warning STAT_THR threshold for

42、stationary detection (see clause 3.3.5.2) STAT_THR_LEVEL threshold for stationary detection (see clause 3.3.5.2) TONE_THR threshold for tone detection (see clause 3.3.3) VAD_P1 constant of computation for VAD threshold (see clause 3.3.5.2) VAD_POW_LOW constant for controlling VAD hangover addition (

43、see clause 3.3.5.1) VAD_SLOPE constant of computation for VAD threshold (see clause 3.3.5) VAD_THR_HIGH constant of computation for VAD threshold (see clause 3.3.5) CVAD_THRESH_ADAPT_HIGH constant for updating complex_high CVAD_THRESH_ADAPT_LOW constant for updating complex_low CVAD_THRESH_HANG cons

44、tant for updating complex_hang_timer CVAD_HANG_LIMIT constant for initiating complex_hang_count CVAD_HANG_LENGTH constant for resetting complex_hang_count ETSI ETSI TS 126 094 V15.0.0 (2018-07)73GPP TS 26.094 version 15.0.0 Release 153.1.2.3 Functions + addition- subtraction * multiplication / divis

45、ion | x | absolute value of x AND Boolean ANDOR Boolean ORxnnab()=MIN(x,y) = MAX(x,y) = 3.1.3 Abbreviations For the purposes of the present document, the following abbreviations apply: ANSI American National Standards Institute DTX Discontinuous Transmission VAD Voice Activity Detector CAD Complex A

46、ctivity Detection CNG Comfort Noise Generation 3.2 General The function of the VAD algorithm is to indicate whether each 20 ms frame contains signals that should be transmitted, i.e. speech, music or information tones. The output of the VAD algorithm is a Boolean flag (VAD_flag) indicating presence

47、of such signals. 3.3 Functional description The block diagram of the VAD algorithm is depicted in figure 1. The VAD algorithm uses parameters of the speech encoder to compute the Boolean VAD flag (VAD_flag). Samples of the Input frame (s(i) are divided into sub-bands and level of the signal in each

48、band (leveln) is calculated. Input for the pitch detection function are open-loop lags (T_opn), which are calculated by open-loop pitch analysis of the speech encoder. The pitch detection function computes a flag (pitch) which indicates presence of pitch. Tone detection function calculates a flag (t

49、one), which indicates presence of an information tone. Tones are detected based on pitch gain of the open-loop pitch analysis The pitch gain is estimated using autocorrelation values (t0 and t1) received from the pitch analysis. Complex Signal Detection function calculates a flag (complex_warning), which indicates presence of a correlated complex signal such as music. Correlate complex signals are detected based on analysis of the correlation vector available in the open-loop pitch analysis.The VAD decision function estimates background noise leve

copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1