1、 ETSI TS 126 193 V14.0.0 (2017-04) Digital cellular telecommunications system (Phase 2+) (GSM); Universal Mobile Telecommunications System (UMTS); LTE; Speech codec speech processing functions; Adaptive Multi-Rate - Wideband (AMR-WB) speech codec; Source controlled rate operation (3GPP TS 26.193 ver
2、sion 14.0.0 Release 14) TECHNICAL SPECIFICATION ETSI ETSI TS 126 193 V14.0.0 (2017-04)13GPP TS 26.193 version 14.0.0 Release 14Reference RTS/TSGS-0426193ve00 Keywords GSM,LTE,UMTS ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret
3、 N 348 623 562 00017 - NAF 742 C Association but non lucratif enregistre la Sous-Prfecture de Grasse (06) N 7803/88 Important notice The present document can be downloaded from: http:/www.etsi.org/standards-search The present document may be made available in electronic versions and/or in print. The
4、 content of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable
5、 Document Format (PDF) version kept on a specific network drive within ETSI Secretariat. Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at https:/portal.et
6、si.org/TB/ETSIDeliverableStatus.aspx If you find errors in the present document, please send your comment to one of the following services: https:/portal.etsi.org/People/CommiteeSupportStaff.aspx Copyright Notification No part may be reproduced or utilized in any form or by any means, electronic or
7、mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media. European Telecommunica
8、tions Standards Institute 2017. All rights reserved. DECTTM, PLUGTESTSTM, UMTSTMand the ETSI logo are Trade Marks of ETSI registered for the benefit of its Members. 3GPPTM and LTE are Trade Marks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. GSM and the G
9、SM logo are Trade Marks registered and owned by the GSM Association. ETSI ETSI TS 126 193 V14.0.0 (2017-04)23GPP TS 26.193 version 14.0.0 Release 14Intellectual Property Rights IPRs essential or potentially essential to the present document may have been declared to ETSI. The information pertaining
10、to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards“, which is available from the ETSI Secretariat.
11、Latest updates are available on the ETSI Web server (https:/ipr.etsi.org/). Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ET
12、SI Web server) which are, or may be, or may become, essential to the present document. Foreword This Technical Specification (TS) has been produced by ETSI 3rd Generation Partnership Project (3GPP). The present document may refer to technical specifications or reports using their 3GPP identities, UM
13、TS identities or GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables. The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under http:/webapp.etsi.org/key/queryform.asp. Modal verbs terminology In the present document “shall“
14、, “shall not“, “should“, “should not“, “may“, “need not“, “will“, “will not“, “can“ and “cannot“ are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions). “must“ and “must not“ are NOT allowed in ETSI deliverables except when used in
15、 direct citation. ETSI ETSI TS 126 193 V14.0.0 (2017-04)33GPP TS 26.193 version 14.0.0 Release 14Contents Intellectual Property Rights 2g3Foreword . 2g3Modal verbs terminology 2g3Foreword . 4g31 Scope 5g32 Normative references . 5g33 Definitions, symbols and abbreviations . 5g33.1 Definitions 5g33.2
16、 Symbols 6g33.3 Abbreviations . 6g34 General . 6g34.1 General organisation. 6g35 AMR-WB SCR operation 7g35.1 Transmit (TX) side . 7g35.1.1 General operation 7g35.1.2 Functions of the TX SCR handler . 8g35.1.2.1 AMR-WB SCR Timing procedures . 8g35.1.3 The TX part of the AN 9g35.2 Receive (RX) side 10
17、g35.2.1 General operation 10g35.2.3 Demands on the RX SCR handler 11g35.3 AMR-WB SID Information format 11g3Annex A (normative): AMR-WB DTX handler for the GSM system . 12g3A.1 Scope 12g3A.2 References 12g3A.3 Definitions, symbols and abbreviations . 13g3A.3.1 Definitions 13g3A.3.2 Symbols 13g3A.3.3
18、 Abbreviations . 13g3A.4 General . 14g3A.4.1 General organisation. 14g3A.5 Transmit (TX) side . 14g3A.5.1 General operation . 15g3A.5.1.1 Functions of the TX DTX handler 16g3A.5.1.2 Functions of the TX Radio Subsystem . 17g3A.5.1.2.1 Functions of the TX Radio Subsystem for TCH/WFS 17g3A.5.1.2.2 Func
19、tions of the Downlink TX Radio Subsystem for TFO . 17g3A.5.1.2.4 Functions of the TX Radio Subsystem for RATSCCH . 18g3A.6 Receive (RX) side 18g3A.6.1 General operation . 18g3A.6.1.1 Functions of the RX radio subsystem . 19g3A.6.1.2 Functions of the RX DTX handler 19g3Annex B: (informative) Change h
20、istory . 21g3History 22g3ETSI ETSI TS 126 193 V14.0.0 (2017-04)43GPP TS 26.193 version 14.0.0 Release 14Foreword This Technical Specification has been produced by the 3GPP. The present document describes the operation of the Adaptive Multi Rate Wideband speech codec during Source Controlled Rate (SC
21、R) operation within the 3GPP system. The contents of the present document are subject to continuing work within the TSG and may change following formal TSG approval. Should the TSG modify the contents of this TS, it will be re-released by the TSG with an identifying change of release date and an inc
22、rease in version number as follows: Version x.y.z where: x the first digit: 1 presented to TSG for information; 2 presented to TSG for approval; 3 or greater indicates TSG approved document under change control. y the second digit is incremented for all changes of substance, i.e. technical enhanceme
23、nts, corrections, updates, etc. z the third digit is incremented when editorial only changes have been incorporated in the specification; ETSI ETSI TS 126 193 V14.0.0 (2017-04)53GPP TS 26.193 version 14.0.0 Release 141 Scope This document describes the Source Controlled Rate (SCR) operation of the A
24、daptive Multi-Rate Wideband speech Codec. The implementation of this SCR operation is mandatory in all UMTS equipment implementing the Adaptive Multi-Rate Wideband speech Codec. The description is structured according to the block diagram in Figure 1. This structure of distributing the various funct
25、ions between system entities is not mandatory for implementation, as long as the operation on the speech decoder output remains the same. Annex A describes the Discontinuous Transmission (DTX) operation of the Adaptive Multi-Rate Wideband speech Codec in Codec Type FR_AMR-WB for the GSM system. 2 No
26、rmative references This document incorporates by dated and undated reference, provisions from other publications. These normative references are cited at the appropriate places in the text and the publications are listed hereafter. For dated references, subsequent amendments to or revisions of any o
27、f these publications apply to this document only when incorporated in it by amendment or revision. For undated references, the latest edition of the publication referred to applies. 1 3GPP TS 26.171 : “AMR Wideband Speech Codec; General description“. 2 3GPP TS 26.173 : “AMR Wideband Speech Codec; AN
28、SI-C code“. 3 3GPP TS 26.174 : “AMR Wideband Speech Codec; Test sequences“. 4 3GPP TS 26.190 : “AMR Wideband Speech Codec; Transcoding functions“. 5 3GPP TS 26.191 : “AMR Wideband Speech Codec; Error concealment of lost frames“. 6 3GPP TS 26.192 : “AMR Wideband Speech Codec; Comfort noise aspects“.
29、7 3GPP TS 26.194 : “AMR Wideband Speech Codec; Voice Activity Detector (VAD)“. 8 3GPP TS 26.201 : “AMR Wideband Speech Codec; Frame structure“. 3 Definitions, symbols and abbreviations 3.1 Definitions For the purpose of this document, the following definitions apply. frame: Time interval of 20 ms, c
30、orresponding to the time segmentation of the Adaptive Multi-Rate Wideband speech Codec, also used as a short term for a traffic frame. traffic frame: Block of 132477 information bits transmitted on the speech traffic channels. SID frame: Frame that conveys information about the acoustic background n
31、oise. speech frame: Traffic frame that has been classified as SPEECH_GOOD, SPEECH_BAD or SPEECH_LOST frame. VAD flag: Boolean flag, generated by the VAD algorithm indicating the presence (“1“) or the absence (“0“) of a speech frame. RX_TYPE: classifies the received frame. TX_TYPE: classifies the fra
32、me to be transmitted. hangover period: A period of frames added at the end of a speech burst in which VAD flag =“0“ and TX_TYPE is =“SPEECH_GOOD“, this period provides the encoder with an extra window to derive the Comfort Noise parameters . ETSI ETSI TS 126 193 V14.0.0 (2017-04)63GPP TS 26.193 vers
33、ion 14.0.0 Release 143.2 Symbols For the purpose of this document, the following symbols apply. NelapsedNumber of elapsed frames since the last updated SID frame. 3.3 Abbreviations For the purpose of this document , the following abbreviations apply. AN Access Network SCR Source Controlled Rate oper
34、ation TS Telecommunication Standard, Technical Specification GSM Global System for Mobile Telecommunication UE User EquipmentRAN Radio Access Network RX Receive SID Silence Descriptor TX Transmit VAD Voice Activity Detector 4 General Source Controlled Rate operation (SCR) is a mechanism for the AMR
35、Wideband Speech Codec, which allows to encode the input signal at a lower average rate by taking speech inactivity into account. The SCR scheme may be used for the following purposes: -to save power in the User Equipment; -to reduce the overall interference and load in the networks. SCR in the trans
36、mitting path (uplink) shall be in operation in UEs, if commanded so by the network. The UE shall handle SCR in the receiving path (downlink) at any time, regardless, whether SCR in the transmitting path is commanded or not. 4.1 General organisation The default SCR mechanism described in this documen
37、t requires the following functions: -a Voice Activity Detector (VAD) on the transmit (TX) side; -evaluation of the background acoustic noise on the transmit (TX) side, in order to transmit characteristic parameters to the receive (RX) side; -generation on the receive (RX) side of a similar noise, ca
38、lled comfort noise, during periods where the transmission is switched off. The Voice Activity Detector (VAD) is defined in 7 and the comfort noise functions in 6. Both are based partly on the speech Codec and its internal variables, defined in 4. In addition to these functions, if the parameters arr
39、iving at the RX side are detected to be seriously corrupted by errors, the speech or comfort noise must be generated from substituted data in order to avoid seriously annoying effects for the listener. These functions are defined in 5. An overall description of the speech processing parts can be fou
40、nd in 1. An overview of one link SCR operation is shown in Figure 1. ETSI ETSI TS 126 193 V14.0.0 (2017-04)73GPP TS 26.193 version 14.0.0 Release 14Figure 1: Block diagram of one link SCR operation 5 AMR-WB SCR operation 5.1 Transmit (TX) side A block diagram of the transmit side SCR functions is sh
41、own in Figure 2. Figure 2: Block diagram of SCR functions at the TX side 5.1.1 General operation The TX SCR handler passes traffic frames, individually marked by TX_TYPE, to the Framing unit. Each frame consists of bit fields containing the information bits, the codec mode indication, and the TX_TYP
42、E. TX_TYPE shall be used to specify the contents of the frame. The table below provides an overview of the different TX_TYPEs used and explains the required contents in the information bit and the mode indication bit fields. RX_TYPEMode IndicationInformation bitsTX SCR handlerSpeechEncoderVoiceActiv
43、ityDetectorComfort NoiseParameterComputation“Network”Informationpacketing,transport andclassificationInformation bitsMode IndicationTX_TYPERX SCR handlerSpeechDecoderErrorConcealmentComfortNoiseGenerationTX SCR handlerSpeechEncoderVoiceActivityDetectorComfort NoiseParameterComputationTX Access Netwo
44、rkTX ofinformationInformation bitsMode IndicationTX_TYPETS26.201Information bitsFrame TypeFQIFramingETSI ETSI TS 126 193 V14.0.0 (2017-04)83GPP TS 26.193 version 14.0.0 Release 14Table 1: SCR TX_TYPE identifiers for UMTS_AMR-WB and FR_AMR-WB TX_TYPE Information Bits Mode Indication SPEECH_GOOD Speec
45、h frame, size 132477 bits, depending on codec mode Current codec mode SPEECH_BAD Corrupt speech frame (bad CRC), size 132477 bits, depending on codec mode Current codec mode SPEECH_LOST No useful information. (Note: If implementation does not support the SPEECH_LOST, SPEECH_BAD shall be used instead
46、) No useful information SID_FIRST Marker for the end of talkspurt, no further information, all 35 comfort noise bits set to “0” The codec mode that would have been used if TX_TYPE had been “SPEECH_GOOD” SID_UPDATE 35 comfort noise bits The codec mode that would have been used if TX_TYPE had been “SP
47、EECH_GOOD” SID_BAD Corrupt SID update frame (bad CRC) The codec mode that would have been used if TX_TYPE had been “SPEECH_GOOD” NO_DATA No useful information, nothing to be transmitted No useful information TX_TYPE = “SPEECH_LOST” indicates that the Information Bit and Codec Mode fields do not cont
48、ain any useful data (but still should be transmitted over AN). The purpose of this TX_TYPE is indicate that the frame was transmitted but lost on some previous phase. This TX_TYPEs may occur only in TFO and TrFO situations. Note, that it is possible to replace SPEECH_LOST with SPEECH_BAD but this ma
49、y degrade the quality of the error concealment in the receiving end because concealment may try to use part of the received parameters from the frame which do not contain any useful information. TX_TYPE = “NO_DATA” indicates that the Information Bit and Codec Mode fields do not contain any useful data (and should not be transmitted over AN). The purpose of this TX_TYPE is to provide the option to save network transmission between the transcoder and AN. Note, the TX_TYPEs “SPEECH_BAD”, “SPEECH_LOST” and “SID_BAD” may occur in TFO and TrFO situations.