ETSI TR 126 950 V13.0.0 (2016-01)

Universal Mobile Telecommunications System (UMTS);
LTE;
Study on Surround Sound codec extension for Packet Switched Streaming (PSS) and Multimedia Broadcast/Multicast Service (MBMS)
(3GPP TR 26.950 version 13.0.0 Release 13)

TECHNICAL REPORT

Reference
RTR/TSGS-0426950vd00

Keywords
LTE, UMTS

ETSI
650 Route des Lucioles
F-06921 Sophia Antipolis Cedex - FRANCE
Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16
Siret No. 348 623 562 00017 - NAF 742 C
Non-profit association registered at the Sous-Préfecture de Grasse (06) No. 7803/88

Important notice

The present document can be downloaded from:
http://www.etsi.org/standards-search

The present document may be made available in electronic versions and/or in print. The content of any electronic and/or print versions of the present document shall not be modified without the prior written authorization of ETSI. In case of any existing or perceived difference in contents between such versions and/or in print, the only prevailing document is the print of the Portable Document Format (PDF) version kept on a specific network drive within ETSI Secretariat.

Users of the present document should be aware that the document may be subject to revision or change of status. Information on the current status of this and other ETSI documents is available at http://portal.etsi.org/tb/status/status.asp

If you find errors in the present
document, please send your comment to one of the following services:
https://portal.etsi.org/People/CommiteeSupportStaff.aspx

Copyright Notification

No part may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm except as authorized by written permission of ETSI. The content of the PDF version shall not be modified without the written authorization of ETSI. The copyright and the foregoing restriction extend to reproduction in all media.

© European Telecommunications Standards Institute 2016. All rights reserved.

DECT™, PLUGTESTS™, UMTS™ and the ETSI logo are Trade Marks of ETSI registered for the benefit of its Members. 3GPP™ and LTE are Trade Marks of ETSI registered for the benefit of its Members and of the 3GPP Organizational Partners. GSM and the GSM logo are Trade Marks registered and owned by the GSM Association.

Intellectual Property Rights

IPRs essential or potentially essential to the present document may have been declared to ETSI. The information pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found in ETSI SR 000 314: “Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in respect of ETSI standards”, which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web server (https://ipr.etsi.org/).

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web server) which are, or may be, or may become, essential to the present document.

Foreword

This Technical Report (TR) has been produced by ETSI 3rd Generation Partnership Project (3GPP).

The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables.

The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under http://webapp.etsi.org/key/queryform.asp.

Modal verbs terminology

In the present document “shall”, “shall not”, “should”, “should not”, “may”, “need not”, “will”, “will not”,
“can” and “cannot” are to be interpreted as described in clause 3.2 of the ETSI Drafting Rules (Verbal forms for the expression of provisions).

“must” and “must not” are NOT allowed in ETSI deliverables except when used in direct citation.

Contents

Intellectual Property Rights
Foreword
Modal verbs terminology
Foreword
1 Scope
2 References
3 Definitions and abbreviations
  3.1 Definitions
  3.2 Abbreviations
4 Use cases
  4.1 Surround sound over headphones
  4.2 Surround sound over loudspeakers
    4.2.1 Decoding and rendering on a UE
    4.2.2 Decoding and rendering on a non-3GPP device connected to a UE
    4.2.3 Decoding on a UE and rendering on a non-3GPP device connected to a UE
    4.2.4 PSS/MBMS delivery methods
5 Design constraints
  5.1 Mono/Stereo Backwards compatibility
  5.2 Number of audio channels
    5.2.1 Number of audio input channels
    5.2.2 Number of audio output channels
  5.3 Sampling frequency
  5.4 Bit rates
  5.5 Computational complexity
  5.6 Other design constraints
6 Test item selection criteria
7 Performance requirements
  7.1 General requirements
  7.2 Loudspeaker requirements
  7.3 Binaural test
  7.4 Backward compatibility test
  7.5 Error test
  7.6 Listening test on HRTF
8 Validation of the user benefits and feasibility through evaluation of at least one example of surround sound
  8.1 Listening test over loudspeakers
  8.2 Listening test over headphones
  8.3 Backward compatibility
  8.4 Test under errors conditions
    8.4.1 Results with interleaver
    8.4.2 Results without interleaver
  8.5 Test on HRTFs
9 Conclusion
Annex A: Test plans and global analysis reports
Annex B: Change history
History

Foreword

This Technical Report has been produced by the 3rd Generation Partnership Project (3GPP).

The contents of the present document are subject to
continuing work within the TSG and may change following formal TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an identifying change of release date and an increase in version number as follows:

Version x.y.z

where:

x  the first digit:
   1  presented to TSG for information;
   2  presented to TSG for approval;
   3  or greater indicates TSG approved document under change control.
y  the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, updates, etc.
z  the third digit is incremented when editorial only changes have been incorporated in the document.

1 Scope

The present document investigates the potential user experience benefits of surround audio in 3GPP services. The investigation will be performed as follows:
- Identify and document relevant use cases for surround sound in 3GPP.
- Define design constraints that would need to be met by a surround audio codec extension method for adoption by 3GPP.
- Identify suitable testing methodology for surround sound in relevant use cases of the PSS and MBMS services.
- Define subjective minimum performance criteria that would need to be met in order to motivate the consideration of a surround audio coding extension for adoption by 3GPP.
- Validate the user benefits and the feasibility of the deployment of surround sound for the PSS and MBMS services according to the defined minimum performance criteria, bit rate and design constraints for all the use cases (such as surround sound speaker set-up and headphone decoding mode) through evaluation of at least one example of surround sound coding methods which may be MPS.

2 References

The following documents contain provisions which, through reference in this text, constitute provisions of the present document.

- References are either specific (identified by date of publication, edition number, version number, etc.) or non-specific.
- For a specific reference, subsequent revisions do not apply.
- For a non-specific
reference, the latest version applies. In the case of a reference to a 3GPP document (including a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same Release as the present document.

[1] 3GPP TR 21.905: “Vocabulary for 3GPP Specifications”.
[2] 3GPP TS 26.346: “Multimedia Broadcast/Multicast Service (MBMS); Protocols and codecs”.
[3] 3GPP TS 26.234: “Transparent end-to-end Packet-switched Streaming Service (PSS); Protocols and codecs”.
[4] ITU-R Recommendation BS.775-2: “Multichannel stereophonic sound system with and without accompanying picture”, Jul. 2006.
[5] ITU-R Recommendation BS.1534-1: “Method for the subjective assessment of intermediate quality level of coding systems”, Geneva, 2003.
[6] ISO/IEC JTC1/SC29/WG11 N2006: “Report on the MPEG-2 AAC Stereo Verification Tests”, Feb. 1998, http://www.chiariglione.org/mpeg/working_documents/mpeg-02/audio/AAC_results.zip.
[7] 3GPP TR 26.936: “Performance characterization of 3GPP audio codecs”.

3 Definitions and abbreviations

3.1 Definitions

For the purposes of the present document, the terms and definitions given in TR 21.905 [1] and the following apply. A term defined in the present document takes precedence over the definition of the same term, if any, in TR 21.905 [1].

HRTF: A Head-Related Transfer Function (HRTF) represents a pair of filters that are obtained by measurement or modelling. It represents the acoustic transmission from a point in space to the entrance of a listener's ear canal. It depends on the relative positions of the source and the listener's head.

3.2 Abbreviations

For the purposes of the present document, the abbreviations given in TR 21.905 [1] and the following apply. An abbreviation defined in the present document takes precedence over the definition of the same abbreviation, if any, in TR 21.905 [1].

5.1ch   Loudspeaker set-up with 2 front channels, 2 rear channels, 1 center channel and 1 subwoofer
HRTF    Head-Related Transfer Function
MPS     MPEG Surround
MUSHRA  MUlti Stimulus test with
Hidden Reference and Anchor

4 Use cases

The relevant use cases considered in this study are applications in the context of MBMS and/or PSS services. In the home entertainment industry the de facto standard for surround sound content is the 5.1 channel format. Such a surround signal can be reproduced in various ways, using a number of channels that is not necessarily equal to the number of channels in the content at the service provider side, which results in different listening modes. The general characteristics of MBMS and PSS services apply and will be considered to derive design constraints and performance requirements for the study item. The following use cases have been identified for consideration.

Table 1: List of use cases considered in the study

Use case #   Reproduction   Description
1 a          Headphones     Surround decoding with binaural post-processing
1 b          Headphones     Surround decoding with binaural processing being part of the decoding process
2.1 a        Loudspeakers   Surround decoding followed by rendering on the UE
2.1 b        Loudspeakers   Surround decoding with rendering being part of the decoding process on the UE
2.2          Loudspeakers   Surround bit-stream is transported via the UE. Decoding and rendering are performed in a non-3GPP device connected to the UE.
2.3          Loudspeakers   Surround decoding on the UE. Decoded surround audio data are transported to a non-3GPP device connected to the UE for rendering.

NOTE: In the following use cases it is assumed that the surround sound content provided to the server comprises multiple channels, typically 6 channels in the 5.1 format.

Alternatively, the surround sound content may be presented to the server as a binauralized stereo signal. In this case, the server would encode the surround sound as an artistic downmix (which is also referred to as Binaural Virtual Surround effect). No additional processing would be required when listening over headphones. However, this alternative format would have several implications:

- When playing over stereo or multichannel loudspeakers, the decoder would have to remove the binauralization effect. Some signalling would be needed to indicate that the downmix is a binauralized stereo signal.
- This alternative format would not offer mono/stereo backward compatibility to existing 3GPP audio codecs, especially when listening over loudspeakers.

In the following use cases it is assumed that the surround bit-stream contains spatial information to control the behaviour of the surround decoder. The surround decoder produces surround sound based on this side information. However, a possible additional function of the surround capable UE is that the
surround decoder may be able to upmix stereo signals encoded by legacy 3GPP audio codecs, which can then be binauralized for listening over headphones.

4.1 Surround sound over headphones

Binaural/Stereo post-processing may or may not be part of the surround sound decoder (see Figures 1 and 2).

Figure 1 illustrates a block diagram where the binaural or stereo post-processing is not part of the surround decoder. A server transmits surround sound bit-streams via PSS or MBMS protocols/services. The UE first decodes the received surround bit-stream to a surround signal. The resulting surround signal is processed by binaural or stereo downmix post-processing to produce a stereo signal. The resulting signal can be reproduced over headphones.

NOTE: The surround bit-stream is decoded inside the UE to a surround signal. This surround signal is input to a binaural or stereo downmix post-processor that produces a representation of the surround signal for headphone reproduction.

Figure 1: Signal flow for use case 1 a where binaural and stereo downmix post-processing is not part of the surround sound decoder

Figure 2 provides a block diagram where binaural post-processing is part of, i.e. integrated into, the surround decoder. The only difference with regard to Figure 1 is that the surround bit-stream is not first decoded to a full surround signal prior to binaural post-processing. Instead, the steps of surround decoding and binaural decoding are integrated into a single binaural surround decoder.

NOTE: The surround bit-stream is decoded inside the UE directly to produce a representation of the surround signal for headphone reproduction.

Figure 2: Signal flow for use case 1 b where binaural processing is part of the surround sound decoder

4.2 Surround sound over loudspeakers

If the surround sound is to be played back over loudspeakers, a number of scenarios can be considered. These scenarios are outlined below.

4.2.1 Decoding and rendering