1、 INTERNATIONAL TELECOMMUNICATION UNION ITU-T G.722.2TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU Annex D(07/2003) SERIES G: TRANSMISSION SYSTEMS AND MEDIA, DIGITAL SYSTEMS AND NETWORKS Digital terminal equipments Coding of analogue signals by methods other than PCM Wideband coding of speech at ar
2、ound 16 kbit/s using Adaptive Multi-Rate Wideband (AMR-WB) Annex D: Digital test sequences ITU-T Recommendation G.722.2 Annex D ITU-T G-SERIES RECOMMENDATIONS TRANSMISSION SYSTEMS AND MEDIA, DIGITAL SYSTEMS AND NETWORKS INTERNATIONAL TELEPHONE CONNECTIONS AND CIRCUITS G.100G.199 GENERAL CHARACTERIST
3、ICS COMMON TO ALL ANALOGUE CARRIER-TRANSMISSION SYSTEMS G.200G.299 INDIVIDUAL CHARACTERISTICS OF INTERNATIONAL CARRIER TELEPHONE SYSTEMS ON METALLIC LINES G.300G.399 GENERAL CHARACTERISTICS OF INTERNATIONAL CARRIER TELEPHONE SYSTEMS ON RADIO-RELAY OR SATELLITE LINKS AND INTERCONNECTION WITH METALLIC
4、 LINES G.400G.449 COORDINATION OF RADIOTELEPHONY AND LINE TELEPHONY G.450G.499 TESTING EQUIPMENTS G.500G.599 TRANSMISSION MEDIA CHARACTERISTICS G.600G.699 DIGITAL TERMINAL EQUIPMENTS G.700G.799 General G.700G.709 Coding of analogue signals by pulse code modulation G.710G.719 Coding of analogue signa
5、ls by methods other than PCM G.720G.729 Principal characteristics of primary multiplex equipment G.730G.739 Principal characteristics of second order multiplex equipment G.740G.749 Principal characteristics of higher order multiplex equipment G.750G.759 Principal characteristics of transcoder and di
6、gital multiplication equipment G.760G.769 Operations, administration and maintenance features of transmission equipment G.770G.779 Principal characteristics of multiplexing equipment for the synchronous digital hierarchy G.780G.789 Other terminal equipment G.790G.799 DIGITAL NETWORKS G.800G.899 DIGI
7、TAL SECTIONS AND DIGITAL LINE SYSTEM G.900G.999 QUALITY OF SERVICE AND PERFORMANCE - GENERIC AND USER-RELATED ASPECTS G.1000G.1999 TRANSMISSION MEDIA CHARACTERISTICS G.6000G.6999 DIGITAL TERMINAL EQUIPMENTS G.7000G.7999 DIGITAL NETWORKS G.8000G.8999 For further details, please refer to the list of I
8、TU-T Recommendations. ITU-T Rec. G.722.2/Annex D (07/2003) i ITU-T Recommendation G.722.2 Wideband coding of speech at around 16 kbit/s using Adaptive Multi-rate Wideband (AMR-WB) Annex D Digital test sequences Summary This annex specifies version 5.4.0 of the bit-exact test sequences for the verifi
9、cation of the implementation of G.722.2 AMR-WB codec, voice activity detection, comfort noise generation and source controlled rate operation. The test sequences specified in this annex were also adopted by 3GPP in 3GPP specification TS 26.174. These test sequences are freely available on the ITU-T
10、website. They are also available for a fee on a CD-ROM from the ITU sales department at salesitu.int. Source Annex D to ITU-T Recommendation G.722.2 was approved on 29 July 2003 by ITU-T Study Group 16 (2001-2004) under the ITU-T Recommendation A.8 procedure. ii ITU-T Rec. G.722.2/Annex D (07/2003)
11、FOREWORD The International Telecommunication Union (ITU) is the United Nations specialized agency in the field of telecommunications. The ITU Telecommunication Standardization Sector (ITU-T) is a permanent organ of ITU. ITU-T is responsible for studying technical, operating and tariff questions and
12、issuing Recommendations on them with a view to standardizing telecommunications on a worldwide basis. The World Telecommunication Standardization Assembly (WTSA), which meets every four years, establishes the topics for study by the ITU-T study groups which, in turn, produce Recommendations on these
13、 topics. The approval of ITU-T Recommendations is covered by the procedure laid down in WTSA Resolution 1. In some areas of information technology which fall within ITU-Ts purview, the necessary standards are prepared on a collaborative basis with ISO and IEC. NOTE In this Recommendation, the expres
14、sion “Administration“ is used for conciseness to indicate both a telecommunication administration and a recognized operating agency. Compliance with this Recommendation is voluntary. However, the Recommendation may contain certain mandatory provisions (to ensure e.g. interoperability or applicabilit
15、y) and compliance with the Recommendation is achieved when all of these mandatory provisions are met. The words “shall“ or some other obligatory language such as “must“ and the negative equivalents are used to express requirements. The use of such words does not suggest that compliance with the Reco
16、mmendation is required of any party. INTELLECTUAL PROPERTY RIGHTS ITU draws attention to the possibility that the practice or implementation of this Recommendation may involve the use of a claimed Intellectual Property Right. ITU takes no position concerning the evidence, validity or applicability o
17、f claimed Intellectual Property Rights, whether asserted by ITU members or others outside of the Recommendation development process. As of the date of approval of this Recommendation, ITU had/had not received notice of intellectual property, protected by patents, which may be required to implement t
18、his Recommendation. However, implementors are cautioned that this may not represent the latest information and are therefore strongly urged to consult the TSB patent database. ITU 2004 All rights reserved. No part of this publication may be reproduced, by any means whatsoever, without the prior writ
19、ten permission of ITU. ITU-T Rec. G.722.2/Annex D (07/2003) iii CONTENTS Page Annex D Digital test sequences 1 D.1 Scope 1 D.2 General . 1 D.3 Test sequence format 1 D.4 Speech codec test sequences 2 D.5 Test sequences for source controlled rate operation. 4 D.6 Sequences for finding the 20 ms frami
20、ng of the adaptive multi-rate speech encoder 5 Electronic attachment SyncVectors TestVectors DTX_TestVectors ITU-T Rec. G.722.2/Annex D (07/2003) 1 ITU-T Recommendation G.722.2 Wideband coding of speech at around 16 kbit/s using Adaptive Multi-Rate Wideband (AMR-WB) Annex D Digital test sequences D.
21、1 Scope This annex specifies the digital test sequences for the adaptive multi-rate wideband (AMR-WB) speech codec specified in ITU-T Rec. G.722.2, its Annexes A and B, and its Appendix I. These sequences test for a bit-exact implementation of the adaptive multi-rate wideband speech transcoder, voic
22、e activity detection, comfort noise, and source controlled rate operation. D.2 General Digital test sequences are necessary to test for a bit exact implementation of the adaptive multi-rate wideband (AMR-WB) speech transcoder, voice activity detection, comfort noise generation, and source controlled
23、 rate operation. The test sequences may also be used to verify installations of the ANSI C code. Clause D.3 describes the format of the files which contain the digital test sequences. Clause D.4 describes the test sequences for the speech transcoder. Clause D.5 describes the test sequences for the V
24、AD, comfort noise and source controlled rate operation. Clause D.6 describes the method by which synchronisation is obtained between the test sequences and the speech codec under test. D.3 Test sequence format This clause provides information on the format of the digital test sequences for the adapt
25、ive multi-rate wideband (AMR-WB) speech, voice activity detection, comfort noise generation, and source controlled rate operation. D.3.1 File format The test sequence files in PC (little-endian) byte order are provided in archive files (ZIP format) which accompany this annex. Following decompression
26、, three types of file are provided: Files for input to the speech encoder: *.INP Files for comparison with the encoder output and for input to the speech decoder: *.COD Files for comparison with the decoder output: *.OUT One mode control file for the mode switching test T22.MOD All file formats are
27、described in Annex C/G.722.2. D.3.2 Codec homing Each *.INP file includes two homing frames (see Annex C/G.722.2) at the start of the test sequence. The function of these frames is to reset the speech encoder state variables to their initial value. In 2 ITU-T Rec. G.722.2/Annex D (07/2003) the case
28、of a correct installation of the ANSI-C simulation, all speech encoder output frames shall be identical to the corresponding frame in the *.COD file. In the case of a correct hardware implementation undergoing testing, the first speech encoder output frame is undefined and need not be identical to t
29、he first frame in the *.COD file, but all remaining speech encoder output frames shall be identical to the corresponding frames in the *.COD file. The function of the two homing frames in the *.COD files is to reset the speech decoder state variables to their initial value. In the case of a correct
30、installation of the ANSI-C simulation, all speech decoder output frames shall be identical to the corresponding frame in the *.OUT file. In the case of a correct hardware implementation undergoing testing, the first speech decoder output frame is undefined and need not be identical to first frame in
31、 the *.OUT file, but all remaining speech decoder output frames shall be identical to the corresponding frames in the *.OUT file. D.4 Speech codec test sequences This clause describes the test sequences designed to exercise the adaptive multi-rate wideband (AMR-WB) speech transcoder. D.4.1 Codec con
32、figuration The speech encoder shall be configured not to operate in the source controlled rate mode. D.4.2 Speech codec test sequences D.4.2.1 Speech encoder test sequences Twenty-three encoder input sequences are provided. Note that for the input sequences T00.INP to T03.INP, the amplitude figures
33、are given in 14-bit precision. The active speech levels are given in dBov. T00.INP Synthetic harmonic signal. The pitch delay varies slowly from 34 to 231 samples. The minimum and maximum amplitudes are 1475 and +5952. T01.INP Synthetic harmonic signal. The pitch delay varies slowly from 231 down to
34、 34 samples. Amplitudes at saturation point 5386 and +21 707. T02.INP Square sweep varying from 50 Hz to 7000 Hz. Amplitudes 32767. T03.INP Sinusoidal sweep varying from 50 Hz to 7000 Hz. Amplitudes 6217. T04.INP Female speech, ambient noise, active speech level: 22.5 dBov, P.341 filtered. T05.INP M
35、ale speech, ambient noise, active speech level: 29.9 dBov, P.341 filtered. T06.INP Female and male speech, ambient noise, active speech level: 36.1 dBov, P.341 filtered. T07.INP Female and male speech, ambient noise, active speech level: 45.8 dBov, P.341 filtered. T08.INP Female and male speech, amb
36、ient noise, active speech level: 7.7 dBov, P.341 filtered. T09.INP Female and male speech, Hoth noise, active speech level: 37.4 dBov, P.341 filtered. T10.INP Female and male speech, Hoth noise, active speech level: 27.3 dBov, P.341 filtered. T11.INP Female and male speech, Hoth noise, active speech
37、 level: 16.9 dBov, P.341 filtered. T12.INP Female and male speech, ambient noise, active speech level: 46.0 dBov, P.341 filtered. T13.INP Speech, very high and low car noise, P.341 filtered. ITU-T Rec. G.722.2/Annex D (07/2003) 3 T14.INP Female and male speech, ambient noise, active speech level: 26
38、.0 dBov, P.341 filtered. T15.INP Female and male speech, rain noise, active speech level: 37.2 dBov, P.341 filtered. T16.INP Female and male speech, rain noise, active speech level: 26.5 dBov, P.341 filtered. T17.INP Female and male speech, rain noise, active speech level: 16.4 dBov, P.341 filtered.
39、 This file includes homing frame test. T18.INP Male speech, active speech level: 29.7 dBov, P.341 filtered, with many zero frames. T19.INP Child speech, ambient noise, active speech level: 34.7 dBov, P.341 filtered. T20.INP Sequence for exercising the LPC vector quantisation codebooks and ROM tables
40、 of the codec. T21.INP Zero signal sequence. T22.INP Speech sequence for mode switching test. The output using these input sequences will be different depending on the tested adaptive multi-rate mode. In the notation used below should be changed to the number of the tested mode, i.e. one of 2385, 23
41、05, 1985, 1825, 1585, 1425, 1265, 885 or 660. The T00.INP and T01.INP sequences were designed to test the pitch lag of the adaptive multi-rate wideband speech encoder. In a correct implementation, the resulting speech encoder output parameters shall be identical to those specified in the T00_.COD an
42、d T01_.COD sequences, respectively. The T02.INP and T03.INP sequences are particularly suited for testing the LPC analysis, as well as for finding saturation problems. In a correct implementation, the resulting speech encoder output parameters shall be identical to those specified in the T02_.COD an
43、d T03_.COD sequences, respectively. The T04.INP and T05.INP sequences contain a lot of low-frequency components. In a correct implementation, the resulting speech encoder output parameters shall be identical to those specified in the T04_.COD and T05_.COD sequences, respectively. The T18.INP and T21
44、.INP sequences contain “all zeros“ frames (silence) in between segments of speech. In a correct implementation, the resulting speech encoder output parameters shall be identical to those specified in the T18_.COD and T21_.COD sequences, respectively. The T20.INP sequence was designed to exercise the
45、 LPC code indices and the ROM table indices of the codec. The sequences T06.INP to T17.INP and T19.INP were selected on the basis of bringing various input characteristics (background noise) and levels to the test sequence set. Homing frame test is also included in T17.INP. T17.INP has homing frames
46、 with length 320 smp, 640 smp and 960 smp starting from 32 000 smp, 16 000 smp and 48 000 smp in a respective order. In a correct implementation, the resulting speech encoder output parameters shall be identical to those specified in the T06_.COD to T17_.COD sequences, respectively. The T22.INP sequ
47、ence was designed to test mode switching in the encoder. For testing mode switching this sequence is used together with the mode control file T22.MOD. See Annex C/G.722.2 for the format of the mode control file. In a correct implementation, the resulting speech encoder output parameters shall be ide
48、ntical to those specified in the sequence T22.COD. Note that T22.COD contains parameter frames in different codec modes. 4 ITU-T Rec. G.722.2/Annex D (07/2003) D.4.2.2 Speech decoder test sequences Twenty-two times nine speech decoder input sequences TXX_.COD (XX = 0021, = 2385, 2305, 1985, 1825, 15
49、85, 1425, 1265, 885 or 660) are provided for the static mode tests. These are the output of the corresponding TXX.INP sequences, one set per mode. In a correct implementation, the resulting speech decoder output shall be identical to the corresponding TXX_.OUT sequences. The switching test decoder input T22.COD shall result in decoder output identical to the T22.OUT sequence. For the decoder switching test no special mode control