ETSI TR 102 526 V1.1.1 (2006-06)
Technical Report

Speech Processing, Transmission and Quality Aspects (STQ);
Wideband telephony considerations

Reference
DTR/STQ-00057

Keywords
speech, telephony

ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE
Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16

Siret N° 348 623 562 00017 - NAF 742 C
Association à but non lucratif enregistrée à la
Sous-Préfecture de Grasse (06) N° 7803/88

Important notice

Individual copies of the present document can be downloaded from:
http://www.etsi.org

The present document may be made available in more than one electronic version or in print. In any case of existing or perceived difference in contents between such versions, the reference version is the Portable Document Format (PDF). In case of dispute, the reference shall be the printing on ETSI printers of the PDF version kept on a specific network drive within ETSI Secretariat.

Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at http://portal.etsi.org/tb/status/status.asp

If you
find errors in the present document, please send your comment to one of the following services:
http://portal.etsi.org/chaircor/ETSI_support.asp

Copyright Notification

No part may be reproduced except as authorized by written permission. The copyright and the foregoing restriction extend to reproduction in all media.

© European Telecommunications Standards Institute 2006.
All rights reserved.

DECT™, PLUGTESTS™ and UMTS™ are Trade Marks of ETSI registered for the benefit of its Members. TIPHON™ and the TIPHON logo are Trade Marks currently being registered by ETSI for the benefit of its Members. 3GPP™ is a Trade Mark of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners.

Contents

Intellectual Property Rights
Foreword
1 Scope
2 References
3 Abbreviations
4 Overview about work in different areas
4.1 Subjective speech quality assessment
4.1.1 Conversational Tests: Comparison of narrowband and wideband speech codecs in noisy environment
4.1.2 Third party listening tests
4.2 Wideband codecs and mixed narrowband/wideband scenarios
4.3 Objective speech quality assessment
4.3.1 Radiation directivity of the artificial mouth
4.3.2 Limitations for wideband introduced by the terminal
4.4 Quality prediction and modelling
4.4.1 Extension of objective speech quality measures (P.862) to wideband
4.4.2 Extension of the E-model
4.4.3 The definition of QoS in conjunction with wideband
4.4.3.1 General
4.4.3.2 Parameters in relation to QoS
4.4.3.3 Speech Quality
History

Intellectual Property Rights

IPRs essential or potentially essential to the present document may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (http://webapp.etsi.org/IPR/home.asp).

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates
on the ETSI Web server) which are, or may be, or may become, essential to the present document.

Foreword

This Technical Report (TR) has been produced by ETSI Technical Committee Speech Processing, Transmission and Quality Aspects (STQ).

1 Scope

The present document describes the state of the art of the research, tools and standards which are relevant for specifying, assessing and predicting wideband speech quality.

The present document gives a summary of:
- Existing methods and specifications applicable for wideband telephony.
- The state-of-the-art subjective testing procedures for wideband applications.
- The ongoing work relevant to define and assess the wideband terminal (and network) characteristics.
- The ongoing work on objective models for wideband speech quality assessment and prediction.

The present document furthermore gives an overview
of the work needed to create a wideband transmission rating model that is independent of the speech coder used. The present document focuses on wideband telephony (100 Hz to 8 kHz) but is not limited to this frequency range.

2 References

For the purposes of this Technical Report (TR), the following references apply:

[1] Barriac V.; Le Saout, J.-Y.; Lockwood, C.: "Discussion on unified methodologies for the comparison of voice quality of narrowband and wideband scenarios"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[2] Drascher, T.: "A Subjective Sound Quality Assessment of Mobile Phones for Production Support"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[3] Gierlich, H.W.; Völl, S.; Jax, P.; Kettler, F.: "Speech quality assessment for wideband communication scenarios"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[4] Gros, L.; Monfort, J.-Y.; Quinquis C.: "Comparison of Narrow band and Wideband Speech Codecs in noisy environment"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[5] Halkosaari, T.; Vaalgamaa, M.: "Radiation Directivity of Human and Artificial Speech"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[6] Kitawaki, N.: "Perspectives on Multimedia Quality Prediction Methodologies for Advanced Mobile and IP-based Telephony"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[7] Mahieux, Y.; Derval, G.; Delam.D.: "Wide Band Speech introduction into VOIP solutions"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[8] Möller, S.; Raake, A.; Barriac, V.; Quinquis, C.: "Deriving Equipment Impairment Factors for Wideband Speech Codecs"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[9] Raake, A.: "How much better can wideband telephony be? - Estimating the necessary R-scale extension"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[10] Rix, A.: "Subjective wideband speech quality and modelling issues"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[11] Rix, A.: "Perceptual wideband speech and audio quality measurement"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[12] Sydow, C.: "Practical Limitations of Wideband Terminals"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[13] Ulseth, T: "A path towards common quality assessment of narrowband and wideband voice"; ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 8-9 2004.
[14] ITU-T Recommendation H.245: "Control protocol for multimedia communication".
[15] Möller, S., Raake, A., Kitawaki, N., Takahashi, A., Wältermann, M. (2006): "Impairment Factor Framework for Wideband Speech Codecs", submitted to IEEE Trans. Audio, Speech and Language Processing.
[16] Völl, S.; Gierlich, H.W.: "High Quality Background Noise Simulation: the ETSI STF 273 Project"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[17] Monfort, J.-Y.: "New STF 294: Improving the quality of eEurope wideband speech applications by developing a standardised performance testing and evaluation methodology for background noise transmission"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[18] Beaugeant, C.; Varga, I.; Jax, P.: "Noise Reduction Preprocessing for the Adaptive Multi-Rate Wideband (AMR-WB) Speech Codec"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[19] Varga, I.: "Standardization of the Adaptive Multi-Rate Wideband (AMR-WB) Speech Codec and its Applications"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[20]
Quinquis, C.: "A new codec within ITU-T: new bandwidth"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[21] Jax, P.; Vary, P.: "On the use of artificial bandwidth Extension Techniques in Wideband Speech Communications"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[22] Pankaj K. R.; Ajit V. Rao: "A Scalable Wideband Speech Codec"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[23] Iser, B.: "Bandwidth extension of telephone band-limited Speech Signals"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[24] Meier, F.: "A wideband capable VoIP Chipset called INCA-IP"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[25] Diethorn, E.: "Aspects of Wideband Speech in Enterprise Telephony"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[26] Wältermann, M.; Möller, S.; Raake, A.: "Quality Dimensions of Narrow-Band and Wideband Telephone Connections"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[27] Schmidtmer, C.: "Perceptual Wideband Audio Quality Assessments Using PEAQ"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[28] Berger, J.: "Speech Quality in Cellular Networks - More or Less than GSM EFR"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[29] Völl, S.; Gierlich, H.W.; Kettler, F.; Jax, P.: "Background Noise Transmission Quality for Wideband Systems"; 2nd ETSI-Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction; Mainz, June 22-23 2005.
[30] ETSI EG 202 396-1: "Speech Processing, Transmission and Quality Aspects (STQ); Speech quality performance in the presence of background noise; Part 1: Background noise simulation technique and background noise database".
[31] ETSI EG 202 396-2: "Speech Processing, Transmission and Quality Aspects (STQ); Speech Quality performance in the presence of background noise; Part 2: Background noise transmission - Network simulation".
[32] ITU-T Recommendation E.800: "Terms and definitions related to quality of service and network performance including dependability".
[33] ITU-T Recommendation P.581: "Use of head and torso simulator (HATS) for hands-free terminal testing".
[34] ITU-T Recommendation P.800: "Methods for subjective determination of transmission quality".
[35] ITU-T Recommendation P.831: "Subjective performance evaluation of network echo cancellers".
[36] ITU-T Recommendation P.832: "Subjective performance evaluation of hands-free terminals".
[37] ITU-T Recommendation P.833: "Methodology for derivation of equipment impairment factors from subjective listening-only tests".
[38] ITU-T Recommendation P.834: "Methodology for the derivation of equipment impairment factors from instrumental models".
[39] ITU-T Recommendation P.862: "Perceptual evaluation of speech quality (PESQ): An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs".
[40] ITU-T Recommendation P.862.2: "Wideband extension to Recommendation P.862 for the assessment of wideband telephone networks and speech codecs".
[41] ITU-T Recommendation P.50: "Artificial Voices".
[42] ISO/DIS 10845: "Acoustics - Frequency weighting "A" for noise measurement".
[43] ETSI ETR 250: "Transmission and Multiplexing (TM); Speech communication quality from mouth to ear for 3,1 kHz handset telephony across networks".
[44] ITU-R Recommendation BS.1116: "Methods for the subjective assessment of small impairments in audio systems including multichannel sound systems".
[45] ITU-R Recommendation BS.1534-1: "Method for the subjective assessment of intermediate quality levels of coding systems".
[46] ITU-T Recommendation G.722: "7 kHz audio-coding within 64 kbit/s".
[47] ITU-T Recommendation G.711: "Pulse code modulation (PCM) of voice frequencies".
[48] ITU-T Recommendation G.722.1: "Low-complexity coding at 24 and 32 kbit/s for hands-free operation in systems with low frame loss".

3 Abbreviations

For the purposes of the present document, the following abbreviations apply:

ACR     Absolute Category Rating
AMR     Adaptive Multi Rate
Ie      equipment Impairment
Ie,wb   wideband Ie
MOS     Mean Opinion Score
NB      Narrowband
NP      Network Performance
PCM     Pulse-Code Modulation
PESQ    Perceptual Evaluation of Speech Quality
QoS     Quality of Service
WB      Wideband

4 Overview about work in different areas

4.1 Subjective speech quality assessment

Generally the subjective testing procedures which are found in the relevant ITU-T Recommendations (P.800 [34], P.831 [35], P.832 [36], etc.) can be applied. The most important questions to be answered are:
- Is the overall speech quality improved when using wideband transmission systems?
- How is the perception of speech sound quality influenced by wideband systems?
- Can speech intelligibility and the interactivity be improved by wideband transmission systems?
- How does the wideband system perform in the presence of noise?
- What performance requirements have to be set for the other parameters known to influence the speech quality (delay, loudness, echo performance, etc.)?
- Are there new quality parameters influencing the speech quality for wideband systems?

Ongoing work on wideband concentrates on the relationship between the single talk speech quality of narrowband systems and that of wideband systems.
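As an illustration of how the first question above is typically quantified, the following minimal sketch computes a Mean Opinion Score (MOS) from Absolute Category Rating (ACR) votes on the five-point listening-quality scale and attaches a 95 % confidence half-width based on a normal approximation. It is not taken from the present document: the function name mos_with_ci, the choice of confidence interval and the two rating lists are assumptions made purely for illustration, and the ratings are hypothetical, not measured data.

```python
import math
from statistics import mean, stdev

# Five-point ACR listening-quality scale.
ACR_SCALE = {5: "Excellent", 4: "Good", 3: "Fair", 2: "Poor", 1: "Bad"}


def mos_with_ci(ratings):
    """Return (MOS, 95 % confidence half-width) for a list of ACR votes.

    The half-width uses the normal approximation 1.96 * s / sqrt(n),
    which is an assumption of this sketch, not a requirement of the text.
    """
    if len(ratings) < 2:
        raise ValueError("need at least two ratings")
    if any(r not in ACR_SCALE for r in ratings):
        raise ValueError("ACR ratings must be integers from 1 to 5")
    mos = mean(ratings)
    half_width = 1.96 * stdev(ratings) / math.sqrt(len(ratings))
    return mos, half_width


# Hypothetical votes for illustration only (not measured data).
narrowband = [3, 4, 3, 3, 4, 2, 3, 4]
wideband = [4, 5, 4, 4, 5, 3, 4, 5]

if __name__ == "__main__":
    for label, votes in (("NB", narrowband), ("WB", wideband)):
        mos, hw = mos_with_ci(votes)
        print(f"{label}: MOS = {mos:.2f} +/- {hw:.2f}")
```

Whether a MOS difference between a narrowband and a wideband condition is meaningful depends on the test design (same listeners, same context, same scale anchoring), so the two values should only be compared when they come from the same experiment.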