ETSI TS 126 442 V13.1.0 (2016-04)

TECHNICAL SPECIFICATION

Universal Mobile Telecommunications System (UMTS);
LTE;
Codec for Enhanced Voice Services (EVS);
ANSI C code (fixed-point)
(3GPP TS 26.442 version 13.1.0 Release 13)

Reference
RTS/TSGS-0426442vd10

Keywords
LTE, UMTS

ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE
Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16
Siret N° 348 623 562 00017 - NAF 742 C
Association à but non lucratif enregistrée à la Sous-Préfecture de Grasse (06) N° 7803/88

Important notice

The present document can be downloaded from:
http://www.etsi.org/standards-search

The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.

Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at
http://portal.etsi.org/tb/status/status.asp

If you find errors in the present document, please send your comment to one of the following services:
https://portal.etsi.org/People/CommiteeSupportStaff.aspx

Copyright Notification

No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI.
The content of the PDF version shall not be modified without the written authorization of ETSI.
The copyright and the foregoing restriction extend to reproduction in all media.

© European Telecommunications Standards Institute 2016.
All rights reserved.

DECT™, PLUGTESTS™, UMTS™ and the ETSI logo are Trade Marks
of ETSI registered for the benefit of its Members.
3GPP™ and LTE are Trade Marks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners.
GSM and the GSM logo are Trade Marks registered and owned by the GSM Association.

Intellectual Property Rights

IPRs essential or potentially essential to the present document may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI
SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards”, which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (https://ipr.etsi.org/).

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become, essential to the present document.

Foreword

This Technical Specification (TS) has been produced by ETSI 3rd Generation Partnership Project (3GPP).

The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables.

The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under http://webapp.etsi.org/key/queryform.asp.

Modal verbs terminology

In the present document “shall”, “shall not”, “should”, “should not”, “may”, “need not”, “will”, “will not”, “can” and “cannot” are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions).

“must” and “must not” are NOT allowed in ETSI deliverables except when used in direct citation.

Contents

Intellectual Property Rights
Foreword
Modal verbs terminology
Foreword
1 Scope
2 References
3 Definitions and abbreviations
3.1 Definitions
3.2 Abbreviations
4 C code structure
4.1 Contents of the C source code
4.2 Program execution
5 File formats
5.1 Speech file (encoder input / decoder output)
5.2 Rate switching profile (encoder input)
5.3 Parameter bitstream file (encoder output / decoder input)
5.3.1 ITU-T G.192 compliant format
5.3.2 Compact storage format file
5.4 VoIP parameter bitstream file (decoder input)
5.5 Bandwidth switching profile (encoder input)
5.6 Channel-aware configuration file (encoder input and decoder output)
5.7 JBM trace file (decoder output)
Annex A (informative): Change history
History

Foreword

This Technical Specification has been produced by the 3rd Generation Partnership Project (3GPP).

The contents of the present document are subject to continuing work within the TSG and may change following formal TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows:

Version x.y.z

where:
x  the first digit:
   1  presented to TSG for information;
   2  presented to TSG for approval;
   3  or greater indicates TSG approved document under change control.
y  the second digit
is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc.
z  the third digit is incremented when editorial only changes have been incorporated in the document.

1 Scope

The present
document contains an electronic copy of the ANSI-C code for the Enhanced Voice Services (EVS) Codec. The ANSI-C code is necessary for a bit exact implementation of the EVS Codec (3GPP TS 26.445 [2]), Voice Activity Detection (VAD) (3GPP TS 26.451 [3]), Comfort Noise Generation (CNG) (3GPP TS 26.449 [4]), Discontinuous Transmission (DTX) (3GPP TS 26.450 [5]), Packet Loss Concealment (PLC) (3GPP TS 26.447 [6]), Jitter Buffer Management (JBM) (3GPP TS 26.448 [7]), and AMR-WB Interoperable Function (3GPP TS 26.446 [8]).

Requirements for any implementation of the EVS codec to be standard compliant are specified in 3GPP TS 26.444 [9] (Test sequences).

2 References

The following documents contain provisions which, through reference in this text, constitute provisions of the present document.

- References are either specific (identified by date of publication, edition number, version number, etc.) or non-specific.
- For a specific reference, subsequent revisions do not apply.
- For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document.

[1]  3GPP TR 21.905: “Vocabulary for 3GPP Specifications”.
[2]  3GPP TS 26.445: “Codec for Enhanced Voice Services (EVS); Detailed Algorithmic Description”.
[3]  3GPP TS 26.451: “Codec for Enhanced Voice Services (EVS); Voice Activity Detection (VAD)”.
[4]  3GPP TS 26.449: “Codec for Enhanced Voice Services (EVS); Comfort Noise Generation (CNG) Aspects”.
[5]  3GPP TS 26.450: “Codec for Enhanced Voice Services (EVS); Discontinuous Transmission (DTX)”.
[6]  3GPP TS 26.447: “Codec for Enhanced Voice Services (EVS); Error Concealment of Lost Packets”.
[7]  3GPP TS 26.448: “Codec for Enhanced Voice Services (EVS); Jitter Buffer Management”.
[8]  3GPP TS 26.446: “Codec for Enhanced Voice Services (EVS); AMR-WB Backward Compatible Functions”.
[9]  3GPP TS 26.444: “Codec for Enhanced Voice Services (EVS); Test Sequences”.
[10] IETF RFC 3550: “RTP: A Transport Protocol for Real-Time Applications”.
[11] Recommendation ITU-T G.191 (03/10): “Software tools for speech and audio coding standardization”.
[12] Recommendation ITU-T G.192: “A common digital parallel interface for speech standardization activities”.

3 Definitions and abbreviations

3.1 Definitions

Definitions of terms used in the present document can be found in 3GPP TS 26.445 [2], 3GPP TS 26.451 [3], 3GPP TS 26.449 [4], 3GPP TS 26.450 [5], 3GPP TS 26.447 [6], 3GPP TS 26.448 [7] and 3GPP TS 26.446 [8].

3.2 Abbreviations

For the purposes of the present document, the abbreviations given in TR 21.905 [1] and the following apply. An abbreviation defined in the present document takes precedence over the definition of the same abbreviation, if any, in TR 21.905 [1].

ACELP    Algebraic Code-Excited Linear Prediction
AMR-WB   Adaptive Multi Rate Wideband (codec)
CNG      Comfort Noise Generator
DTX      Discontinuous Transmission
EVS      Enhanced Voice Services
FB       Fullband
FEC      Frame Erasure Concealment
IP       Internet Protocol
JBM      Jitter Buffer Management
MSB      Most Significant Bit
MTSI     Multimedia Telephony Service for IMS
NB       Narrowband
PS       Packet Switched
PSTN     Public Switched Telephone Network
SAD      Sound Activity Detection
SC-VBR   Source Controlled - Variable Bit Rate
SID      Silence Insertion Descriptor
SWB      Super Wideband
VAD      Voice Activity Detection
WB       Wideband
WMOPS    Weighted Millions of Operations Per Second

4 C code structure

This clause gives an overview of the structure of the bit-exact
C code and provides an overview of the contents and organization of the C code attached to the present document.

The C code has been verified on the following systems:

- IBM PC compatible computers with Windows 7 or 8 operating system and Microsoft Visual C++ 2010 compiler, 32 bit builds.
- IBM PC compatible computers with Linux operating system and GNU gcc compiler version 4.3.x, 32 bit builds.

ANSI-C was selected as the programming language because portability was desirable.

4.1 Contents of the C source code

The C code distribution is organized as follows:

Table 1: Source code directory structure

Directory         Description
README.txt        Information on how to compile
Makefile          UNIX style encoder Makefile
Workspace_msvc/   Directory for the MSVC 2010 project files
basic_op/         Source code files containing all ITU-T fixed-point basic operators
basic_math/       Source code files containing mathematical fixed-point functions
lib_com/          Source code files used in encoder and decoder
lib_dec/          Source code files used solely in the decoder
lib_enc/          Source code files used solely in the encoder

The distributed files with suffix “c” contain the source code and the files with suffix “h” are the header files. The ROM data is contained in files named “rom_xxx” with suffix “c”.

Makefiles are provided for the platforms in which the C code has been verified (listed above). Once the software is installed, this directory will have a compiled version of the encoder (named EVS_cod) and the decoder (named EVS_dec).

4.2 Program execution

The codec for Enhanced Voice Services is implemented in two programs:

- EVS_cod: speech/audio encoder;
- EVS_dec: speech/audio decoder.

The programs should be called like:

- EVS_cod encoder options;
- EVS_dec decoder options.

The speech/audio files contain 16-bit linear encoded PCM speech/audio samples and the parameter files contain encoded speech/audio data. The encoder and decoder options are printed when the applications are run without input arguments. See the file readme.txt for more information on how to run the encoder and decoder programs.

5 File formats

This clause describes the file formats used by the encoder and decoder programs. The test sequences defined in [9] also use the file formats described here.

5.1 Speech file (encoder input / decoder output)

Speech files read by the
encoder and written by the decoder consist of 16-bit words, one word per speech/audio sample. The byte order depends on the host architecture (e.g. LSByte first on PCs, etc.). Both the encoder and the decoder program process complete frames only (a frame corresponds to 20 ms, for example 640 samples at 32 kHz sampling frequency).

The encoder will pad the last frame to an integer multiple of 20 ms frames, i.e. n speech frames will be produced from an input file with a length in the range [(n-1)*20 ms + 1 sample; n*20 ms]. The files produced by the decoder will always have a length of n*20 ms.

5.2 Rate switching profile (encoder input)

The encoder program can optionally read in a rate switching profile which specifies the encoding bitrate for each frame of speech processed. The file is a binary file, generated by gen-rate-profile, which is part
of STL 2009, as contained in ITU-T G.191 [11]. The rate switching profile can contain EVS primary mode bitrates and AMR-WB IO mode bitrates in arbitrary order, i.e. switching between the two modes can be specified by the rate switching profile.

5.3 Parameter bitstream file (encoder output / decoder input)

The files produced by the speech/audio encoder and expected by the speech/audio decoder contain an arbitrary number of frames in one of the following formats.

5.3.1 ITU-T G.192 compliant format

| SYNC_WORD | DATA_LENGTH | B1 | B2 | ... | Bnn |

Each box corresponds to one Word16 value in the bitstream file, for a total of 2+nn words or 4+2nn bytes per frame, where nn is the number of encoded bits in the frame. Each encoded bit is represented as follows: Bit 0 = 0x007f, Bit 1 = 0x0081.

The fields have the following meaning:

- SYNC_WORD: Word to ensure correct frame synchronization between the encoder and the decoder. It
is also used to indicate the occurrences of bad frames.
  In the encoder output: (0x6b21)
  In the decoder input:  Good frames (0x6b21)
                         Bad frames (0x6b20)
- DATA_LENGTH: Length of the speech data. Codec mode and frame type are extracted in the decoder using this parameter.

5.3.2 Compact storage format file

The encoder and decoder programs can optionally write and read a file in the octet-based compact storage format. The compact storage format is specified in Annex A.2.6 of [2].

5.4 VoIP parameter bitstream file (decoder input)

| Packet size | Arrival time | RTP header | G.192 format (see 5.3.1) |

The fields have the following size and meaning:

- Packet size: 32 bit unsigned integer (= 12 + 2 + DATA_LENGTH).
- Arrival time: 32 bit unsigned integer, in ms.
- RTP header: 96 bits (see RFC 3550 [10]), including RTP timestamp and SSRC.

5.5 Bandwidth switching profile (encoder input)

The encoder program can optionally read in a bandwidth switching profile, which specifies the encoding bandwidth for each frame of speech processed. The file is a text file where each line contains nb_frames B. B specifies the signal
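
NOTE: The following C fragment is an informal sketch and is not part of the attached ANSI-C code. It illustrates the frame count rule of clause 5.1 and the ITU-T G.192 compliant frame layout of clause 5.3.1, using only the constants stated there (0x6b21, 0x6b20, 0x007f, 0x0081). The function names frames_for_samples, g192_pack_frame and g192_unpack_frame, the use of uint16_t to hold one Word16 value, the one-bit-per-byte payload representation and the error handling are assumptions made for illustration.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Constants as given in clause 5.3.1. */
#define G192_SYNC_GOOD 0x6b21u
#define G192_SYNC_BAD  0x6b20u
#define G192_BIT_0     0x007fu
#define G192_BIT_1     0x0081u

/* Clause 5.1: number of 20 ms frames produced for num_samples input samples.
 * A partial last frame is padded, so the division rounds up.
 * (Hypothetical helper; fs_hz is the sampling frequency in Hz.) */
static long frames_for_samples(long num_samples, long fs_hz)
{
    long frame_len = fs_hz / 50; /* samples in 20 ms */
    return (num_samples + frame_len - 1) / frame_len;
}

/* Write one good frame: SYNC_WORD, DATA_LENGTH, then one word per bit.
 * bits[] holds one payload bit (0 or 1) per element; out[] must have room
 * for 2 + n_bits words. Returns the number of words written. */
static size_t g192_pack_frame(const uint8_t *bits, uint16_t n_bits,
                              uint16_t *out)
{
    size_t i;

    assert(bits != NULL && out != NULL);
    out[0] = G192_SYNC_GOOD;
    out[1] = n_bits;
    for (i = 0; i < n_bits; i++) {
        out[2 + i] = bits[i] ? G192_BIT_1 : G192_BIT_0;
    }
    return 2u + n_bits;
}

/* Read one frame from in[] (n_words available). On success the payload
 * bits are stored in bits[] (capacity max_bits), *bad_frame is set to 1
 * when SYNC_WORD is 0x6b20 and to 0 otherwise, and the number of payload
 * bits is returned. Returns -1 for an unknown sync word, an unknown bit
 * word, or a buffer that is too short. */
static int g192_unpack_frame(const uint16_t *in, size_t n_words,
                             uint8_t *bits, size_t max_bits, int *bad_frame)
{
    size_t i, n_bits;

    if (n_words < 2) {
        return -1;
    }
    if (in[0] != G192_SYNC_GOOD && in[0] != G192_SYNC_BAD) {
        return -1;
    }
    n_bits = in[1];
    if (n_bits > max_bits || n_words < 2 + n_bits) {
        return -1;
    }
    for (i = 0; i < n_bits; i++) {
        if (in[2 + i] == G192_BIT_1) {
            bits[i] = 1;
        } else if (in[2 + i] == G192_BIT_0) {
            bits[i] = 0;
        } else {
            return -1;
        }
    }
    *bad_frame = (in[0] == G192_SYNC_BAD);
    return (int)n_bits;
}
```

In this sketch a frame of nn bits occupies 2+nn words, in line with clause 5.3.1. A reader of a real bitstream file would additionally have to take the host byte order into account, which the fragment does not address.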