1、 ETSI TR 146 055 V14.0.0 (2017-04) Digital cellular telecommunications system (Phase 2+) (GSM); Performance characterization of the GSM Enhanced Full Rate (EFR) speech codec (3GPP TR 46.055 version 14.0.0 Release 14) TECHNICAL REPORT GLOBAL SYSTEM FOR MOBILE COMMUNICATIONSRETSI ETSI TR 146 055 V14.0
2、.0 (2017-04)13GPP TR 46.055 version 14.0.0 Release 14Reference RTR/TSGS-0446055ve00 Keywords GSM ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret N 348 623 562 00017 - NAF 742 C Association but non lucratif enregistre la Sous-Pr
3、fecture de Grasse (06) N 7803/88 Important notice The present document can be downloaded from: http:/www.etsi.org/standards-search The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not
4、be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secreta
5、riat. Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at https:/portal.etsi.org/TB/ETSIDeliverableStatus.aspx If you find errors in the present document, pl
6、ease send your comment to one of the following services: https:/portal.etsi.org/People/CommiteeSupportStaff.aspx Copyright Notification No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written pe
7、rmission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media. European Telecommunications Standards Institute 2017. All rights reserved. DECTTM, PLUGTESTSTM, UMTSTMand
8、 the ETSI logo are Trade Marks of ETSI registered for the benefit of its Members. 3GPPTM and LTE are Trade Marks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. GSM and the GSM logo are Trade Marks registered and owned by the GSM Association. ETSI ETSI TR 1
9、46 055 V14.0.0 (2017-04)23GPP TR 46.055 version 14.0.0 Release 14Intellectual Property Rights IPRs essential or potentially essential to the present document may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-mem
10、bers, and can be found in ETSI SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards“, which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (https:/ipr.etsi.org/). Pursuan
11、t to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become, essential to the present docume
12、nt. Foreword This Technical Report (TR) has been produced by ETSI 3rd Generation Partnership Project (3GPP). The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or GSM identities. These should be interpreted as being references to the co
13、rresponding ETSI deliverables. The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under http:/webapp.etsi.org/key/queryform.asp. Modal verbs terminology In the present document “should“, “should not“, “may“, “need not“, “will“, “will not“, “can“ and “cannot“ are to be inter
14、preted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions). “must“ and “must not“ are NOT allowed in ETSI deliverables except when used in direct citation. ETSI ETSI TR 146 055 V14.0.0 (2017-04)33GPP TR 46.055 version 14.0.0 Release 14Contents Intell
15、ectual Property Rights 2g3Foreword . 2g3Modal verbs terminology 2g3Foreword . 5g3Introduction 6g31 Scope 8g32 References 8g33 Abbreviations . 8g34 Quality under error (EP0 EP3) and tandeming conditions (Exp Number 1 and Exp Number 5) 9g35 Quality under background noise conditions (Exp Number 2 and E
16、xp Number 3) . 10g36 Talker dependency (Exp Number 4) 10g37 DTX system 10g37.1 Channel activity in DTX mode . 10g37.1.1 Test procedure 10g37.1.2 Speech channel activity 10g37.1.3 Level compensation 10g37.1.4 Interleaving compensation 11g37.1.5 Estimated mean TDMA channel activity 11g37.2 DTX/CNI Inf
17、ormal Expert Listening tests . 11g37.2.1 Introduction. 11g37.2.2 Test environment 11g37.2.3 Results 11g38 Performance with DTMF tones 11g38.1 Introduction 11g38.2 Test environment 12g38.3 Results 12g39 Network information tones . 13g310 Performance with special input signals 13g310.1 Music signals 1
18、3g310.2 Noise signals 14g311 Performance with different languages 14g312 Delay 15g313 Frequency response 18g313.1 Introduction 18g313.2 Test environment 18g313.3 Results 18g314 Complexity . 19g315 Summary of the results from the subjective testing . 20g3Annex A: Summary of results (lab by lab) 22g3A
19、.1 Quality under Error and tandeming conditions 22g3A.2 Quality under Background noise conditions 24g3A.3 Quality for Talker Dependency (DMOS and SD) 25g3Annex B: Change history 26g3ETSI ETSI TR 146 055 V14.0.0 (2017-04)43GPP TR 46.055 version 14.0.0 Release 14History 27g3ETSI ETSI TR 146 055 V14.0.
20、0 (2017-04)53GPP TR 46.055 version 14.0.0 Release 14Foreword This Technical Specification has been produced by the 3rdGeneration Partnership Project (3GPP). The contents of the present document are subject to continuing work within the TSG and may change following formal TSG approval. Should the TSG
21、 modify the contents of the present document, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows: Version x.y.z where: x the first digit: 1 presented to TSG for information; 2 presented to TSG for approval; 3 or greater indicates
22、 TSG approved document under change control. y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc. z the third digit is incremented when editorial only changes have been incorporated in the document. ETSI ETSI TR 146 055 V14.0.0 (2017
23、-04)63GPP TR 46.055 version 14.0.0 Release 14Introduction The SMG2-Speech experts Group (SEG) started its activity early in 1995 for the standardization of an Enhanced Full Rate speech codec. The Group produced a test plan for the first phase of testing (pre-selection phase) which is described in pe
24、rmanent document SEG-4 (ETSI SMG2 SEG: SEG-4 (v 1.0) “A Subjective Pre-Selection Test Plan for the Enhanced Full Rate Speech Coding Algorithm“) to assess the performance of the submitted candidates. This test plan is based on the general knowledge coming from past ITU-T and ETSI activities on codec
25、evaluation (GSM half rate and ITU-T 8 kbit/s recent exercises for instance). At the end of this Pre-selection Phase, SMG decided to standardize the PCS 1 900 codec, known as the US-1 codec and no formal characterisation testing has been performed for the selected codec. The present document therefor
26、e reports the results from the Pre-selection and Verification Phase of testing only. Consequently, the results reported here are less detailed, and the confidence intervals for them are wider, than those obtained for the GSM half rate standardization (GSM 06.08, 3) where specific and detailed charac
27、terisation testing was performed. In addition, not all laboratories followed the same pre-selection test plan, further complicating the interpretation of the results. The following experiments included in SEG-4 were carried out by several laboratories in the Pre-selection Phase: - Experiment 1: Qual
28、ity under error and tandeming conditions (A-law, Modified IRS); - Experiment 2: Quality under background noise conditions (Vehicular noise, UPCM, NoIRS); - Experiment 3: Quality under background noise conditions (Background music, UPCM, NoIRS); - Experiment 4: Talker Dependency (UPCM, NoIRS); - Expe
29、riment 5: Quality under high error conditions EP3 (A-law, Modified IRS). A practical indirect method of performance comparison between different results was adopted utilising the Modulated Noise Reference Unit (MNRU) (see note) as a reference degradation. The MNRU provides the additional function of
30、 allowing normalisation of results across different laboratories carrying out the same experiment, through the conversion of MOS scores to Equivalent Q (dB). The Q (dB) values introduced in a test normally range from 0 to 50 dB. In SEG-4, both Experiment#1 and Experiment#5 on error conditions covers
31、 this range, the other experiments do not. NOTE: The MNRU is a device designed for producing speech correlated noise that sounds subjectively like the quantising noise produced by log-companded PCM codecs. The device is subjectively calibrated for Mean Opinion Scores (MOS) against Q dB (where Q is t
32、he ratio of the speech to speech-correlated noise power). The Equivalent Q of the codecs under test can be found from the corresponding MOS on the calibration curve of the MNRU (S-shaped curve). Only four laboratories ran tests which followed the Pre-selection Test Plan described in SEG-4 (BT/lab1,
33、CNET/lab2, Tele Denmark/lab3, NEC/lab4). MOTOROLA/lab5 participated in the Pre-selection Phase but their experiments did not comply with SEG-4. TI/lab8 ran one experiment only from SEG-4. Results produced by COMSAT/lab6 following a NOKIA-designed test plan are part of standardization of the codec in
34、 North America and NOKIA/lab7 performed complementary experiments during the ETSI Pre-selection Phase. As no further analysis have been undertaken to allow the averaging of scores across the different laboratories, results are reported in the annex on a laboratory-by-laboratory basis. For error and
35、tandeming conditions, results are reported in terms of Equivalent Q (dB) values. For background noise conditions and talker dependency, results are reported in terms of DMOS values with either Confidence Interval (CI) or Standard Deviation (SD) as there is insufficient data available to normalise ac
36、ross laboratories via MNRU conditions. The quality performance of the EFR codec is compared to High and Low references introduced in permanent documents SEG-3 (ETSI SMG2 SEG: SEG-3 “Selection Criteria for the Enhanced Full Rate Speech Coding Algorithm Speech Quality Requirements“) and SEG-4 (ETSI SM
37、G2 SEG: SEG-4 (v 1.0) “A Subjective Pre-Selection Test Plan for the Enhanced Full Rate Speech Coding Algorithm“, Section 7). These references were chosen as representative of the “minimum“ and “objective“ performance targets respectively, and are reported in table 1. ETSI ETSI TR 146 055 V14.0.0 (20
38、17-04)73GPP TR 46.055 version 14.0.0 Release 14Table 1: References per condition: High Ref., Low Ref. And G.728 EXPERIMENTS (SEG-4) Conditions High Ref Low Ref EXP#1 EP0 G.728 G.728 EXP#1 EP1 MNRU 24 dB TCH-FS (EP1) EXP#1 EP2 TCH-FS (EP1) TCH-FS (EP2) EXP#5 EP3 TCH-FS (EP2) TCH-FS (EP3) EXP#1 EP0 (t
39、andem) G.728 G.728 EXP#1 EP1 (tandem) TCH-FS (EP1) TCH-FS (EP1 tandem) EXP#2 Vehicle 10 G.728 G.728EXP#3 Music 20 G.728 G.728 EXP#4 ale Talkers G.728 G.728EXP#4 Female Talkers G.728 G.728 EXP#4 Children G.728 G.728 A figure showing the general trend of the EFR behaviour for error conditions in noise
40、-free environment, compared to the high (G.728) and low (TCH-FS) references is added to individual laboratories quantitative results (figure 15). The general quality performance of the EFR codec is summarised in table 15. In the Verification Phase, the behaviour of the EFR codec under the following
41、test conditions was tested: - behaviour of the DTX System; - performance with DTMF tones; - performance with network information tones; - performance with special input signals; - performance with music signals; - performance with noise signals; - performance with different languages; - delay of the
42、 TCH-EFR; - frequency response; - complexity. The results of these tests are also included in this report under the respective clauses. Furthermore, the EFR codec was checked for correct functioning for the following items: - test of overload point; - SID frame encoding; - muting behaviour; - idle c
43、hannel behaviour. No artefact or malfunctioning was detected for these items. ETSI ETSI TR 146 055 V14.0.0 (2017-04)83GPP TR 46.055 version 14.0.0 Release 141 Scope The present document gives background information on the performance of the GSM enhanced full rate speech codec. Experimental results f
44、rom the Pre-selection and Verification tests carried out during the standardization process by the SEG (Speech Expert Group) are reported to give a more detailed picture of the behaviour of the GSM enhanced full rate speech codec under different conditions of operation. 2 References The following do
45、cuments contain provisions which, through reference in this text, constitute provisions of the present document. References are either specific (identified by date of publication, edition number, version number, etc.) or non-specific. For a specific reference, subsequent revisions do not apply. For
46、a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document. 1 GSM 03.05: “Digital cellular telecommunicat
47、ions system (Phase 2+); Technical performance objectives“. 2 GSM 03.50: “Digital cellular telecommunications system (Phase 2+); Transmission planning aspects of the speech service in the GSM Public Land Mobile Network (PLMN) system“. 3 GSM 06.08: “Digital cellular telecommunications system (Phase 2+
48、); Half rate speech; Performance of the GSM half rate speech codec“. 4 GSM 06.10: “Digital cellular telecommunications system (Phase 2+); Full rate speech transcoding“. 5 GSM 06.20: “Digital cellular telecommunications system (Phase 2+); Half rate speech transcoding“. 3 Abbreviations For the purpose
49、s of the present document, the following abbreviations apply: A/D Analogue to Digital ADPCM Adaptive Differential Pulse Code Modulation ACR Absolute Category Rating BSC Base Station Controller BTS Base Transceiver Station C/I Carrier-to-Interferer ratio CI Confidence Interval CNI Comfort Noise Insertion CRC Cyclic Redundancy Check D/A Digital to Analogue DAT Digital Audio TapeDCR Degradation Category Rating DSP Digital Signal ProcessorDTMF Dual Tone Multi Frequency DTX Discontinuous Transmission for power consumption and interference reduction EFR En