ETSI TR 146 055 V15.0.0 (2018-07)

TECHNICAL REPORT

Digital cellular telecommunications system (Phase 2+) (GSM);
Performance characterization of the GSM Enhanced Full Rate (EFR) speech codec
(3GPP TR 46.055 version 15.0.0 Release 15)

Reference
RTR/TSGS-0446055vf00

Keywords
GSM

ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE
Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16
Siret N° 348 623 562 00017 - NAF 742 C
Non-profit association registered at the Sous-Préfecture de Grasse (06) N° 7803/88

Important notice

The present document can be downloaded from: http://www.etsi.org/standards-search

The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not
be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.

Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at https://portal.etsi.org/TB/ETSIDeliverableStatus.aspx

If you find errors in the present document, please send your comment to one of the following services: https://portal.etsi.org/People/CommiteeSupportStaff.aspx

Copyright Notification

No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media.

© ETSI 2018. All rights reserved.

DECT™, PLUGTESTS™, UMTS™ and the ETSI logo are trademarks of ETSI registered for the benefit of its Members. 3GPP™ and LTE™ are trademarks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. oneM2M logo is protected for the benefit of its Members. GSM® and the GSM logo are trademarks registered and owned by the GSM Association.

Intellectual Property Rights

Essential patents

IPRs essential or potentially essential to normative deliverables may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards”, which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server
(https://ipr.etsi.org/).

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become,
essential to the present document.

Trademarks

The present document may include trademarks and/or tradenames which are asserted and/or registered by their owners. ETSI claims no ownership of these except for any which are indicated as being the property of ETSI, and conveys no right to use or reproduce any trademark and/or tradename. Mention of those trademarks in the present document does not constitute an endorsement by ETSI of products, services or organizations associated with those trademarks.

Foreword

This Technical Report (TR) has been produced by ETSI 3rd Generation Partnership Project
(3GPP).

The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables. The cross reference between GSM, UMTS, 3GPP and ETSI identities can be
found under http://webapp.etsi.org/key/queryform.asp.

Modal verbs terminology

In the present document “should”, “should not”, “may”, “need not”, “will”, “will not”, “can” and “cannot” are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions).

“must” and “must not” are NOT allowed in ETSI deliverables except when used in direct citation.

Contents

Intellectual Property Rights
Foreword
Modal verbs terminology
Foreword
Introduction
1 Scope
2 References
3 Abbreviations
4 Quality under error (EP0 to EP3) and tandeming conditions (Exp Number 1 and Exp Number 5)
5 Quality under background noise conditions (Exp Number 2 and Exp Number 3)
6 Talker dependency (Exp Number 4)
7 DTX system
  7.1 Channel activity in DTX mode
    7.1.1 Test procedure
    7.1.2 Speech channel activity
    7.1.3 Level compensation
    7.1.4 Interleaving compensation
    7.1.5 Estimated mean TDMA channel activity
  7.2 DTX/CNI Informal Expert Listening tests
    7.2.1 Introduction
    7.2.2 Test environment
    7.2.3 Results
8 Performance with DTMF tones
  8.1 Introduction
  8.2 Test environment
  8.3 Results
9 Network information tones
10 Performance with special input signals
  10.1 Music signals
  10.2 Noise signals
11 Performance with different languages
12 Delay
13 Frequency response
  13.1 Introduction
  13.2 Test environment
  13.3 Results
14 Complexity
15 Summary of the results from the subjective testing
Annex A: Summary of results (lab by lab)
  A.1 Quality under Error and tandeming conditions
  A.2 Quality under Background noise conditions
  A.3 Quality for Talker Dependency (DMOS and SD)
Annex B: Change history
History

Foreword

This Technical Specification has been produced by the 3rd Generation Partnership Project (3GPP).

The contents of the present document are subject to continuing work within the TSG and may change following formal TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows:

Version x.y.z

where:

x the first digit:
  1 presented to TSG for information;
  2 presented to TSG for approval;
  3 or greater indicates TSG approved document under change control.
y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc.
z the third digit is incremented when editorial only changes have been incorporated in the document.

Introduction

The SMG2-Speech experts Group (SEG) started its activity early in 1995 for the standardization of an Enhanced Full Rate speech codec. The Group produced a test plan for the first phase of testing (pre-selection phase) which is described in permanent document SEG-4 (ETSI SMG2 SEG: SEG-4 (v 1.0) “A Subjective Pre-Selection Test Plan for the Enhanced Full Rate Speech Coding Algorithm”) to assess the performance of the submitted candidates. This test plan is based on the general knowledge coming from past ITU-T and ETSI activities on codec evaluation (GSM half rate and ITU-T 8 kbit/s recent exercises for instance).

At the end of this Pre-selection Phase, SMG decided to standardize the PCS 1 900 codec, known as the US-1 codec
and no formal characterisation testing has been performed for the selected codec. The present document therefore reports the results from the Pre-selection and Verification Phases of testing only. Consequently, the results reported here are less detailed, and the confidence intervals for them are wider, than those obtained for the GSM half rate standardization (GSM 06.08 [3]) where specific and detailed characterisation testing was performed. In addition, not all laboratories followed the same pre-selection test plan, further complicating the interpretation of the results.

The following experiments included in SEG-4 were carried out by several laboratories in the Pre-selection Phase:

- Experiment 1: Quality under error and tandeming conditions (A-law, Modified IRS);
- Experiment 2: Quality under background noise conditions (Vehicular noise, UPCM, NoIRS);
- Experiment 3: Quality under background noise conditions (Background music, UPCM, NoIRS);
- Experiment 4: Talker Dependency (UPCM, NoIRS);
- Experiment 5: Quality under high error conditions EP3 (A-law, Modified IRS).

A practical indirect method of performance comparison between different results was adopted utilising the Modulated
Noise Reference Unit (MNRU) (see note) as a reference degradation. The MNRU provides the additional function of allowing normalisation of results across different laboratories carrying out the same experiment, through the conversion of MOS scores to Equivalent Q (dB). The Q (dB) values introduced in a test normally range from 0 to 50 dB. In SEG-4, both Experiment 1 and Experiment 5 on error conditions cover this range; the other experiments do not.

NOTE: The MNRU is a device designed for producing speech-correlated noise that sounds subjectively like the quantising noise produced by log-companded PCM codecs. The device is subjectively calibrated for Mean Opinion Scores (MOS) against Q dB (where Q is the ratio of the speech to speech-correlated noise power). The Equivalent Q of the codecs under test can be found from the corresponding MOS on the calibration curve of the MNRU (S-shaped curve).

Only four laboratories ran tests which followed the Pre-selection Test Plan described in SEG-4 (BT/lab1, CNET/lab2, Tele Denmark/lab3, NEC/lab4). MOTOROLA/lab5 participated in the Pre-selection Phase but their experiments did not comply with SEG-4. TI/lab8 ran one experiment only from SEG-4.
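
As an illustration of the MOS to Equivalent Q conversion described in the note on the MNRU, the following is a minimal sketch in Python. It assumes that a laboratory's MNRU calibration curve is available as a short table of (Q, MOS) points and that linear interpolation between neighbouring points is an acceptable approximation of the S-shaped curve. The calibration values, the name MNRU_CALIBRATION and the function equivalent_q are hypothetical and are not taken from SEG-4 or from any result in the present document.

```python
from bisect import bisect_left

# HYPOTHETICAL calibration points for one laboratory: (Q in dB, MOS).
# Illustrative numbers only; they are not measured data from any test.
MNRU_CALIBRATION = [
    (0.0, 1.1),
    (10.0, 1.6),
    (20.0, 2.7),
    (30.0, 3.7),
    (40.0, 4.2),
    (50.0, 4.4),
]


def equivalent_q(mos, calibration=MNRU_CALIBRATION):
    """Return the Equivalent Q (dB) for a MOS by inverse linear interpolation.

    The calibration table must be strictly increasing in both Q and MOS.
    MOS values outside the calibrated range are clamped to its end points.
    """
    qs = [q for q, _ in calibration]
    scores = [m for _, m in calibration]
    if mos <= scores[0]:
        return qs[0]
    if mos >= scores[-1]:
        return qs[-1]
    i = bisect_left(scores, mos)  # first index with scores[i] >= mos
    q_lo, q_hi = qs[i - 1], qs[i]
    m_lo, m_hi = scores[i - 1], scores[i]
    # Interpolate linearly between the two neighbouring calibration points.
    return q_lo + (q_hi - q_lo) * (mos - m_lo) / (m_hi - m_lo)
```

Because each laboratory would use its own calibration table, two laboratories whose listeners score on different absolute MOS scales can still be compared on the common Q (dB) axis. Clamping at the end points is a design choice of this sketch; a real analysis might instead flag MOS values that fall outside the calibrated range.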
Results produced by COMSAT/lab6 following a NOKIA-designed test plan are part of the standardization of the codec in North America, and NOKIA/lab7 performed complementary experiments during the ETSI Pre-selection Phase.

As no further analysis has been undertaken to allow the averaging of scores across the different laboratories, results are reported in the annex on a laboratory-by-laboratory basis. For error and tandeming conditions, results are reported in terms of Equivalent Q (dB) values. For background noise conditions and talker dependency, results are reported in terms of DMOS values with either Confidence Interval (CI) or Standard Deviation (SD) as there is insufficient data available to normalise across laboratories via MNRU conditions.

The quality performance of the EFR codec is compared to High and Low references introduced in permanent documents SEG-3 (ETSI SMG2 SEG: SEG-3 “Selection Criteria for the Enhanced Full Rate Speech Coding Algorithm Speech Quality Requirements”) and SEG-4 (ETSI SMG2 SEG: SEG-4 (v 1.0) “A Subjective Pre-Selection Test Plan for the Enhanced Full Rate Speech Coding Algorithm”, Section 7). These references were chosen as representative of the “minimum”
and “objective” performance targets respectively, and are reported in table 1.

Table 1: References per condition: High Ref., Low Ref. and G.728

Experiment (SEG-4)  Conditions       High Ref       Low Ref
EXP#1               EP0              G.728          G.728
EXP#1               EP1              MNRU 24 dB     TCH-FS (EP1)
EXP#1               EP2              TCH-FS (EP1)   TCH-FS (EP2)
EXP#5               EP3              TCH-FS (EP2)   TCH-FS (EP3)
EXP#1               EP0 (tandem)     G.728          G.728
EXP#1               EP1 (tandem)     TCH-FS (EP1)   TCH-FS (EP1 tandem)
EXP#2               Vehicle 10       G.728          G.728
EXP#3               Music 20         G.728          G.728
EXP#4               Male Talkers     G.728          G.728
EXP#4               Female Talkers   G.728          G.728
EXP#4               Children         G.728          G.728

A figure showing the general trend of the EFR behaviour for error conditions in a noise-free environment, compared to the high (G.728) and low (TCH-FS) references, is added to the individual laboratories' quantitative results (figure 15). The general quality performance of the EFR
codec is summarised in table 15.

In the Verification Phase, the behaviour of the EFR codec under the following test conditions was tested:

- behaviour of the DTX System;
- performance with DTMF tones;
- performance with network information tones;
- performance with special input signals;
- performance with music signals;
- performance with noise signals;
- performance with different languages;
- delay of the TCH-EFR;
- frequency response;
- complexity.

The results of these tests are also included in this report under the respective clauses.

Furthermore, the EFR codec was checked for correct functioning for the following items:

- test of overload point;
- SID frame encoding;
- muting behaviour;
- idle channel behaviour.

No artefact or malfunctioning was detected for these items.

1 Scope

The present document gives background information on the performance of the GSM enhanced full rate speech codec. Experimental results from the Pre-selection and Verification tests carried out during the standardization process by the SEG (Speech Expert Group) are reported to give a more detailed picture of the behaviour of the GSM enhanced full rate speech codec under different conditions of operation.

2 References

The following documents contain provisions which, through reference in this text, constitute provisions of the present document.

References are either specific (identified by date of publication, edition
number, version number, etc.) or non-specific. For a specific reference, subsequent revisions do not apply. For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document.

[1] GSM 03.05: “Digital cellular telecommunications system (Phase 2+); Technical performance objectives”.
[2] GSM 03.50: “Digital cellular telecommunications system (Phase 2+); Transmission planning aspects of the speech service in the GSM Public Land Mobile Network (PLMN) system”.
[3] GSM 06.08: “Digital cellular telecommunications system (Phase 2+); Half rate speech; Performance of the GSM half rate speech codec”.
[4] GSM 06.10: “Digital cellular telecommunications system (Phase 2+); Full rate speech transcoding”.
[5] GSM 06.20: “Digital cellular telecommunications system (Phase 2+); Half rate speech transcoding”.

3 Abbreviations

For the purposes of the present document, the following abbreviations apply:

A/D     Analogue to Digital
ADPCM   Adaptive Differential Pulse Code Modulation
ACR     Absol