1、 ETSI ES 202 396-1 V1.6.1 (2015-06) Speech and multimedia Transmission Quality (STQ); Speech quality performance in the presence of background noise; Part 1: Background noise simulation technique and background noise database ETSI STANDARD ETSI ETSI ES 202 396-1 V1.6.1 (2015-06)2 Reference RES/STQ-2
2、27-1 Keywords noise, performance, quality, speech ETSI 650 Route des Lucioles F-06921 Sophia Antipolis Cedex - FRANCE Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 Siret N 348 623 562 00017 - NAF 742 C Association but non lucratif enregistre la Sous-Prfecture de Grasse (06) N 7803/88 Important noti
3、ce The present document can be downloaded from: http:/www.etsi.org/standards-search The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior written authoriza
4、tion of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat. Users of the present document should be a
5、ware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at http:/portal.etsi.org/tb/status/status.asp If you find errors in the present document, please send your comment to one of the following services:
6、https:/portal.etsi.org/People/CommiteeSupportStaff.aspx Copyright Notification No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall no
7、t be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media. European Telecommunications Standards Institute 2015. All rights reserved. DECTTM, PLUGTESTSTM, UMTSTMand the ETSI logo are Trade Marks of ETSI registered for the
8、 benefit of its Members. 3GPPTM and LTE are Trade Marks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. GSM and the GSM logo are Trade Marks registered and owned by the GSM Association. ETSI ETSI ES 202 396-1 V1.6.1 (2015-06)3 Contents Intellectual Property
9、 Rights 5g3Foreword . 5g3Modal verbs terminology 5g3Introduction 5g31 Scope 6g32 References 6g32.1 Normative references . 6g32.2 Informative references 6g33 Definitions and abbreviations . 8g33.1 Definitions 8g33.2 Abbreviations . 8g34 Overview of existing methods for realistic sound reproduction. 8
10、g34.1 Introduction 8g34.2 Surround Sound Techniques. 9g34.3 IOSONO9g34.4 Eidophonie . 10g34.5 Four-loudspeaker arrangement for playback of binaurally recorded signals 11g34.6 NTT Background-Noise Database . 11g34.7 General conclusions . 12g35 Recording arrangement 12g35.1 Binaural equalization 12g35
11、.2 The equalization procedure 13g36 Loudspeaker Setup for Background Noise Simulation 15g36.1 Test Room Requirements . 15g36.2 Loudspeaker Positioning 15g36.3 Equalization and Calibration 16g36.4 Accuracy of the reproduction arrangement 22g36.4.1 Comparison between original sound field and simulated
12、 sound field . 22g36.4.2 Displacement of the test arrangement in the simulated sound field 23g36.4.3 Transmission of background noise: Comparison of terminal performance in the original sound field and the simulated sound field . 25g36.5 Simulation of additional acoustic conditions 29g37 Background
13、Noise Simulation in cars 30g37.1 General setup 30g37.2 Recording arrangement 31g37.2.1 Recording setup with the terminals microphone 31g37.2.2 Recording setup with a pair of cardioid microphones. 31g37.3 Equalization and Calibration with the terminals microphone 31g37.4 Equalization and Calibration
14、with a pair of cardioid microphones 36g37.5 Accuracy of the reproduction arrangement 41g37.5.1 Comparison between original sound field and simulated sound field . 41g37.5.2 Transmission of background noise: Comparison of terminal performance in the original sound field and the simulated sound field
15、. 42g38 Background Noise Database 45g38.1 Binaural signals 45g38.2 Binaural signals identical to the background noise recordings provided in ETSI TS 103 224 i.19 . 48g38.3 Stereophonic signals . 49g3Annex A (informative): Comparison of Tests in Sending Direction and D-Values Conducted in Different R
16、ooms . 50g3A.1 Test Setup . 50g3ETSI ETSI ES 202 396-1 V1.6.1 (2015-06)4 A.2 Results of the Tests 50g3A.2.1 Sending Frequency Response Characteristics and SLR . 51g3A.2.2 D-Value with Pink Noise . 51g3A.2.3 D-Value with Cafeteria Noise 51g3A.3 Conclusions 52g3Annex B (informative): Graphs 53g3Histor
17、y 61g3ETSI ETSI ES 202 396-1 V1.6.1 (2015-06)5 Intellectual Property Rights IPRs essential or potentially essential to the present document may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be f
18、ound in ETSI SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards“, which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (http:/ipr.etsi.org). Pursuant to the ETSI IPR Po
19、licy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become, essential to the present document. Foreword This ET
20、SI Standard (ES) has been produced by ETSI Technical Committee Speech and multimedia Transmission Quality (STQ). The present document is part 1 of a multi-part deliverable covering Speech and multimedia Transmission Quality (STQ); Speech quality performance in the presence of background noise, as id
21、entified below: ETSI ES 202 396-1: “Background noise simulation technique and background noise database“; ETSI EG 202 396-2: “Background noise transmission - Network simulation - Subjective test database and results“; ETSI EG 202 396-3: “Background noise transmission - Objective test methods“. Modal
22、 verbs terminology In the present document “shall“, “shall not“, “should“, “should not“, “may“, “need not“, “will“, “will not“, “can“ and “cannot“ are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions). “must“ and “must not“ are NO
23、T allowed in ETSI deliverables except when used in direct citation. Introduction Background noise is present in most of the conversations today. Background noise may impact the speech communication performance to terminal and network equipment significantly. Therefore testing and optimization of suc
24、h equipment is necessary using realistic background noises. Furthermore reproducible conditions for the tests are required which can be guaranteed only under lab type condition. The present document addresses this issue by describing a methodology for recording and playback of background noises unde
25、r well-defined and calibratable conditions in a lab-type environment. Furthermore a database with real background noises is included. ETSI ETSI ES 202 396-1 V1.6.1 (2015-06)6 1 Scope The quality of background noise transmission is an important factor, which significantly contributes to the perceived
26、 overall quality of speech. Existing and even more the new generation of terminals, networks and system configurations including broadband services can be greatly improved with a proper design of terminals and systems in the presence of background noise. The present document: describes a noise simul
27、ation environment using realistic background noise scenarios for laboratory use; contains a database including the relevant background noise samples for subjective and objective evaluation. The present document provides information about the recording techniques needed for background noise recording
28、s and discusses the advantages and drawbacks of existing methods. The present document describes the requirements for laboratory conditions. The loudspeaker setup and the loudspeaker calibration and equalization procedure are described. The simulation environment specified can be used for the evalua
29、tion and optimization of terminals and of complex configurations including terminals, networks and other configurations. The main application areas should be: office, home and car environment. The setup and database as described in the present document are applicable for: Objective performance evalu
30、ation of terminals in different (simulated) background noise environments. Speech processing evaluation by using the pre-processed speech signal in the presence of background noise, recorded by a terminal. Subjective evaluation of terminals by performing conversational tests, specific double talk te
31、sts or talking and listening tests in the presence of background noise. Subjective evaluation in third party listening tests by recording the speech samples of terminals in the presence of background noise. 2 References 2.1 Normative references References are either specific (identified by date of p
32、ublication and/or edition number or version number) or non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the reference document (including any amendments) applies. Referenced documents which are not found to be publicly availabl
33、e in the expected location might be found at http:/docbox.etsi.org/Reference. NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee their long term validity. The following referenced documents are necessary for the application of the present
34、document. Not applicable. 2.2 Informative references References are either specific (identified by date of publication and/or edition number or version number) or non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the reference d
35、ocument (including any amendments) applies. NOTE: While any hyperlinks included in this clause were valid at the time of publication, ETSI cannot guarantee their long term validity. The following referenced documents are not necessary for the application of the present document but they assist the u
36、ser with regard to a particular subject area. ETSI ETSI ES 202 396-1 V1.6.1 (2015-06)7 i.1 Surround Sound Past, Present, and Future: “A history of multichannel audio from mag stripe to Dolby Digital“, Joseph Hull - Dolby Laboratories Inc. i.2 AES preprint 3332 (1992): “Improved Possibilities of Bina
37、ural Recording and Playback Techniques“, K. Genuit, H.W. Gierlich; U. Knzli. NOTE: See at http:/www.aes.org/e-lib/browse.cfm?elib=6801. i.3 AES preprint 3732 (1993): “A System for the Reproduction Technique for Playback of Binaural Recordings“, N. Xiang, K. Genuit, H.W. Gierlich. NOTE: See at http:/
38、www.aes.org/e-lib/browse.cfm?elib=6501. i.4 NTTAT Database: “Ambient Noise Database CD-ROM“. NOTE: See at http:/www.ntt- i.5 ISO 11904-1: “Acoustics - Determination of sound immission from sound sources placed close to the ear - Part 1: Technique using a microphone in a real ear (MIRE technique)“. i
39、.6 Spatial Hearing: “The psychophysics of human sound localization“, J. Blauert. i.7 Recommendation ITU-T P.57: “Artificial ears“. i.8 Recommendation ITU-T P.58: “Head and torso simulator for telephonometry“. i.9 Recommendation ITU-T P.340: “Transmission characteristics and speech quality parameters
40、 of hands-free terminals“. i.10 Recommendation ITU-T P.64: “Determination of sensitivity/frequency characteristics of local telephone systems“. i.11 Recommendation ITU-T G.722: “7 kHz audio-coding within 64 kbit/s“. i.12 Genuit, K.: “A Description of the Human Outer Ear Transfer Function by Elements
41、 of Communication Theory (No. B6-8)“. NOTE: Proceedings of the 12thInternational Congress on Acoustics. Toronto published on behalf of the Technical Program Committee by the Executive Committee of the 12thInternational Congress on Acoustics. i.13 IEC 60050-722: “International Electrotechnical Vocabu
42、lary - Chapter 722: Telephony“. i.14 “Wellenfeldsynthese - Eine neue Dimension der 3D-Audiowiedergabe“; Fernseh- und Kino-Technik, Nr. 11/2002, pp. 735-738. i.15 “The Iosono Sound Difference“. NOTE: See at http:/www.iosono-sound.de. i.16 “Ein neues Verfahren der raumbezogenen Stereophonie mit verbes
43、serter bertragung der Rauminformation“; P. Scherer, Rundfunktechnische Mitteilungen, 1977, pp. 196-204. i.17 ETSI EG 202 396-1 (V1.1.2): “Speech Processing, Transmission and Quality Aspects (STQ); Speech quality performance in the presence of background noise; Part 1: Background noise simulation tec
44、hnique and background noise database“. i.18 ETSI TS 151 010-1: “Digital cellular telecommunications system (Phase 2+); Mobile Station (MS) conformance specification; Part 1: Conformance specification (3GPP TS 51.010-1)“. i.19 ETSI TS 103 224: “Speech and multimedia Transmission Quality (STQ); A soun
45、d field reproduction method for terminal testing including a background noise database“. ETSI ETSI ES 202 396-1 V1.6.1 (2015-06)8 3 Definitions and abbreviations 3.1 Definitions For the purposes of the present document, the following terms and definitions apply: crosstalk: appearance of undesired en
46、ergy in a channel, owing to the presence of a signal in another channel, caused by, for example induction, conduction or non-linearity NOTE: See IEC 60050-722 i.13. 3.2 Abbreviations For the purposes of the present document, the following abbreviations apply: CD Compact Disc EQ Equalization FFT Fast
47、 Fourier Transform FIR Finite Impulse Response HATS Head And Torso Simulator IIR Infinite Impulse Response MIRE Microphone In Real Ear MRP Mouth Reference PointNTT Nippon Telegraph and Telephone corporation SLR Send Loudness Rating VHF Very High Frequency 4 Overview of existing methods for realistic
48、 sound reproduction 4.1 Introduction In general the existing methods for close to original sound recording and reproduction aimed for different applications: Techniques intending to reproduce the actual sound field. Techniques providing hearing adequate (ear related) signals in the human ear canal.
49、Techniques generating artificial acoustical environments. Within this clause the different methods are briefly described and their applicability for close to original sound-filed reproduction is discussed. A variety of methods have been studied, in the following a summary of the most important ones relevant to the present document is given. The different methods were analyzed on the basis of the following requirements: The background noise recording technique should be: - easy to us