ETSI EG 202 396-3 V1.5.1 (2015-10)

Speech and multimedia Transmission Quality (STQ);
Speech Quality performance in the presence of background noise;
Part 3: Background noise transmission - Objective test methods

ETSI GUIDE

Reference
REG/STQ-229

Keywords
noise, QoS, quality, speech

ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE
Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16
Siret N° 348 623 562 00017 - NAF 742 C
Association à but non lucratif enregistrée à la Sous-Préfecture de Grasse (06) N° 7803/88

Important notice
The present document can be downloaded from: http://www.etsi.org/standards-search

The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.

Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at http://portal.etsi.org/tb/status/status.asp

If you find errors in the present document, please send your comment to one of the following services: https://portal.etsi.org/People/CommiteeSupportStaff.aspx

Copyright Notification
No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media.

© European Telecommunications Standards Institute 2015. All rights reserved.

DECT™, PLUGTESTS™, UMTS™ and the ETSI logo are Trade Marks of ETSI registered for the benefit of its Members.
3GPP™ and LTE are Trade Marks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. GSM and the GSM logo are Trade Marks registered and owned by the GSM Association.

Contents
Intellectual Property Rights
Foreword
Modal verbs terminology
1 Scope
2 References
2.1 Normative references
2.2 Informative references
3 Symbols and abbreviations
3.1 Symbols
3.2 Abbreviations
4 Speech signals to be used
5 Selection of the data within the scope of the wideband objective model: Experts evaluation
5.1 Selection process
5.2 Results
5.3 French database
6 Description of the wideband objective test method
6.1 Introduction
6.2 Speech sample preparation and nomenclature
6.2.1 Speech sample preparation
6.2.2 Nomenclature
6.3 Additional Training data
6.4 Principles of Relative Approach and Relative Approach
6.5 Objective N-MOS
6.5.1 Introduction
6.5.2 Description of N-MOS algorithm
6.5.3 Comparing subjective and objective N-MOS results
6.6 Objective S-MOS
6.6.1 Introduction
6.6.2 Description of S-MOS Algorithm
6.6.3 Comparing Subjective and Objective S-MOS Results
6.7 Objective G-MOS
6.7.1 Description of G-MOS Algorithm
6.7.2 Comparing subjective and objective G-MOS results
7 Validation of the Wideband Objective Test Method
7.1 Introduction
7.2 ETSI EG 202 396-2 Database Results Analysis
7.2.1 Comparing subjective and objective N-MOS results
7.2.2 Comparing subjective and objective S-MOS results
7.2.3 Comparing Subjective and Objective G-MOS Results
7.3 Orange Validation Database results Analysed
7.3.0 Introduction
7.3.1 Comparing subjective and objective N-MOS results
7.3.2 Comparing subjective and objective S-MOS results
7.3.3 Comparing Subjective and Objective G-MOS Results
8 Objective Model for Narrowband Applications
8.0 Introduction
8.1 File pre-processing
8.2 Adaptation of the Calculations
8.3 Prediction results
Annex A: Detailed post evaluation of listening test results
Annex B: Results of PESQ and TOSQA2001 - Analysis of ETSI EG 202 396-2 database
Annex C: Comparison of objective MOS versus auditory MOS for the complete STF 294 database
Annex D: Comparison of objective MOS versus auditory MOS for rejected conditions
Annex E: Void
Annex F: Detailed STF 294 subjective and objective validation test results
Annex G: Void
Annex H: Extension of the Speech Quality Test Method to Narrowband: Adaptation, Training and Validation
Annex I: Void
Annex J: Summary of Czech samples not used for model training
J.0 Introduction
J.1 Selection process - Czech database
J.2 General differences between the databases
J.3 Comparison of the objective method results for Czech and French samples
J.4 Czech conditions results analysis
J.4.1 Comparing subjective and objective N-MOS results
J.4.2 Comparing subjective and objective S-MOS results
J.4.3 Comparing Subjective and Objective G-MOS Results
J.5 Language Dependent Robustness of G-MOS
J.6 Regression Coefficients for Czech data
J.7 Post selection
Annex K: Relative Approach Non-Linear Transformation
Annex L: Bibliography
History

Intellectual Property Rights
IPRs essential or potentially essential to the present document may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (http://ipr.etsi.org).

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced
in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become, essential to the present document.

Foreword
This ETSI Guide (EG) has been produced by ETSI Technical Committee Speech and multimedia Transmission Quality (STQ).

The present document is a deliverable of ETSI Specialized Task Force (STF) 294 entitled: "Improving the quality of eEurope wideband speech applications by developing a performance testing and evaluation methodology for background noise transmission".

The present document is part 3 of a multi-part deliverable covering Speech and multimedia Transmission Quality (STQ); Speech Quality performance in the presence of background noise, as identified below:
Part 1: "Background noise simulation technique and background noise database";
Part 2: "Background noise transmission - Network simulation - Subjective test database and results";
Part 3: "Background noise transmission - Objective test methods".

Modal verbs terminology
In the present document "shall", "shall not", "should", "should not", "may", "need not", "will", "will not", "can" and "cannot" are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions).

"must" and "must not" are NOT allowed in ETSI deliverables except when used in direct citation.

1 Scope
The present document aims to identify and define testing methodologies which can be used to objectively evaluate the performance of narrowband and wideband terminals and systems for speech communication in the presence of background noise. Background noise is a problem in almost all situations and conditions and needs to be taken into account in both terminals and networks. The present document provides information
about the test methods that can be used to objectively evaluate speech quality in the presence of background noise.

The present document includes:
- The description of the expert post-evaluation process used to select the subjective test data that fall within the scope of the objective methods.
- The results of the performance evaluation of the currently existing methods described in Recommendations ITU-T P.862 [i.16] and P.862.1 [i.17] and in TOSQA2001 [i.19], which is chosen for the evaluation of terminals in the framework of ETSI VoIP speech quality test events [i.8], [i.9], [i.10] and [i.11].
- The method which is applicable to objectively determine the different parameters influencing the speech quality in the presence of background noise, taking into account:
  - the speech quality;
  - the background noise transmission quality;
  - the overall quality.

The present document is to be used in conjunction with:
- ETSI ES 202 396-1 [i.1], which describes a recording and reproduction setup for realistic simulation of background noise scenarios in lab-type environments for the performance evaluation of terminals and communication systems.
- ETSI EG 202 396-2 [i.2], which describes the simulation of network impairments and how to simulate realistic transmission network scenarios, and which contains the methodology and results of the subjective scoring for the data forming the basis of the present document.
- French speech sentences as defined in Recommendation ITU-T P.501 [i.13] for wideband and English speech sentences as defined in Recommendation ITU-T P.501 [i.13] for narrowband.

2 References
2.1 Normative references
References are either specific (identified by date of publication and/or edition number or version number) or non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the reference document (including any amendments) applies.

Referenced documents which are not found to be publicly available in the expected location might be found at http://docbox.etsi.org/Reference.

NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee their long term validity.

The following referenced documents are necessary for the application of the present document.

Not applicable.

2.2 Informative references
References are either specific (identified by date of publication and/or edition number or version number) or non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the reference document (including any amendments) applies.

NOTE: While any hyperlinks included in this clause
were valid at the time of publication, ETSI cannot guarantee their long term validity.

The following referenced documents are not necessary for the application of the present document but they assist the user with regard to a particular subject area.

[i.1] ETSI ES 202 396-1: "Speech and multimedia Transmission Quality (STQ); Speech quality performance in the presence of background noise; Part 1: Background noise simulation technique and background noise database".
[i.2] ETSI EG 202 396-2: "Speech Processing, Transmission and Quality Aspects (STQ); Speech Quality performance in the presence of background noise; Part 2: Background Noise Transmission - Network Simulation - Subjective Test Database and Results".
[i.3] Recommendation ITU-T P.835: "Subjective test methodology for evaluating speech communication systems that include noise suppression algorithm".
[i.4] Recommendation ITU-T P.800: "Methods for subjective determination of transmission quality".
[i.5] Recommendation ITU-T P.831: "Subjective performance evaluation of network echo cancellers".
[i.6] Genuit, K.: "Objective Evaluation of Acoustic Quality Based on a Relative Approach", InterNoise 96, Liverpool, UK.
[i.7] Recommendation ITU-T SG 12 Contribution 34: "Evaluation of the quality of background noise transmission using the "Relative Approach"".
[i.8] ETSI 2nd Speech Quality Test Event: "Anonymized Test Report", ETSI Plugtests, HEAD acoustics, T-Systems Nova.
NOTE: Available at: http://www.etsi.org/WebSite/OurServices/Plugtests/History.aspx. Also available as ETSI TR 102 648-3.
[i.9] ETSI 3rd Speech Quality Test Event: "Anonymized Test Report "IP Gateways"".
NOTE: Available at: http://www.etsi.org/WebSite/OurServices/Plugtests/History.aspx.
[i.10] ETSI 3rd Speech Quality Test Event: "Anonymized Test Report "IP Phones"".
[i.11] ETSI 4th Speech Quality Test Event: "Anonymized Test Report "IP Gateways and IP Phones"".
NOTE: Available at: http://www.etsi.org/WebSite/OurServices/Plugtests/History.aspx.
[i.12] F. Kettler, H.W. Gierlich, F. Rosenberger: "Application of the Relative Approach to Optimize Packet Loss Concealment Implementations", DAGA, March 2003, Aachen, Germany.
[i.13] Recommendation ITU-T P.501: "Test Signals for Use in Telephonometry".
[i.14] R. Sottek, K. Genuit: "Models of Signal Processing in human hearing", International Journal of Electronics and Communications (AEÜ) volume 59, 2005, p. 157-165.
NOTE: Available at: http://www.elsevier.de/aeue.
[i.15] SAE International - Document 2005-01-2513: "Tools and Methods for Product Sound Design of Vehicles" R. Sottek, W. Krebber, G. Stanley.
[i.16] Recommendation ITU-T P.862: "Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs".
[i.17] Recommendation ITU-T P.862.1: "Mapping function for transforming P.862 raw result scores to MOS-LQO".
[i.18] Recommendation ITU-T P.862.2: "Wideband extension to Recommendation P.862 for the assessment of wideband telephone networks and speech codecs".
[i.19] Recommendation ITU-T SG 12 Contribution 19: "Results of objective speech quality assessment of wideband speech using the Advanced TOSQA2001".
[i.20] Recommendation ITU-T G.722: "7 kHz audio-coding within 64 kbit/s".
[i.21] Recommendation ITU-T G.722.2: "Wideband coding of speech at around 16 kbit/s using Adaptive Multi-Rate Wideband (AMR-WB)".
[i.22] Recommendation ITU-T P.56: "Objective measurement of active speech level".
[i.23] Recommendation ITU-T P.57: "Artificial ears".
[i.24] M. Spiegel: "Theory and problems of statistics", McGraw Hill,
1998.
[i.25] Void.
[i.26] M. Kendall: "Rank correlation methods", Charles Griffin [...] Speech quality performance in the presence of background noise: Background noise transmission for mobile terminals-objective test methods".
[i.33] Hastie, T.; Tibshirani, R.; Friedman, J.: "The Elements of Statistical Learning: Data Mining, Inference, and Prediction", New York: Springer-Verlag, 2001.

3 Symbols and abbreviations
3.1 Symbols
For the purposes of the present document, the following symbols apply: