ITU-T G.722 (09/2012): 7 kHz audio-coding within 64 kbit/s (Study Group 16)

Uploaded by: testyield361 | Document ID: 796376 | Uploaded: 2019-02-02 | Format: PDF | Pages: 274 | Size: 2.10 MB

International Telecommunication Union

ITU-T G.722
TELECOMMUNICATION STANDARDIZATION SECTOR OF ITU
(09/2012)

SERIES G: TRANSMISSION SYSTEMS AND MEDIA, DIGITAL SYSTEMS AND NETWORKS
Digital terminal equipments – Coding of voice and audio signals

7 kHz audio-coding within 64 kbit/s

Recommendation ITU-T G.722

ITU-T G-SERIES RECOMMENDATIONS
TRANSMISSION SYSTEMS AND MEDIA, DIGITAL SYSTEMS AND NETWORKS

G.100–G.199      INTERNATIONAL TELEPHONE CONNECTIONS AND CIRCUITS
G.200–G.299      GENERAL CHARACTERISTICS COMMON TO ALL ANALOGUE CARRIER-TRANSMISSION SYSTEMS
G.300–G.399      INDIVIDUAL CHARACTERISTICS OF INTERNATIONAL CARRIER TELEPHONE SYSTEMS ON METALLIC LINES
G.400–G.449      GENERAL CHARACTERISTICS OF INTERNATIONAL CARRIER TELEPHONE SYSTEMS ON RADIO-RELAY OR SATELLITE LINKS AND INTERCONNECTION WITH METALLIC LINES
G.450–G.499      COORDINATION OF RADIOTELEPHONY AND LINE TELEPHONY
G.600–G.699      TRANSMISSION MEDIA AND OPTICAL SYSTEMS CHARACTERISTICS
G.700–G.799      DIGITAL TERMINAL EQUIPMENTS
  G.700–G.709      General
  G.710–G.729      Coding of voice and audio signals
  G.730–G.739      Principal characteristics of primary multiplex equipment
  G.740–G.749      Principal characteristics of second order multiplex equipment
  G.750–G.759      Principal characteristics of higher order multiplex equipment
  G.760–G.769      Principal characteristics of transcoder and digital multiplication equipment
  G.770–G.779      Operations, administration and maintenance features of transmission equipment
  G.780–G.789      Principal characteristics of multiplexing equipment for the synchronous digital hierarchy
  G.790–G.799      Other terminal equipment
G.800–G.899      DIGITAL NETWORKS
G.900–G.999      DIGITAL SECTIONS AND DIGITAL LINE SYSTEM
G.1000–G.1999    MULTIMEDIA QUALITY OF SERVICE AND PERFORMANCE – GENERIC AND USER-RELATED ASPECTS
G.6000–G.6999    TRANSMISSION MEDIA CHARACTERISTICS
G.7000–G.7999    DATA OVER TRANSPORT – GENERIC ASPECTS
G.8000–G.8999    PACKET OVER TRANSPORT ASPECTS
G.9000–G.9999    ACCESS NETWORKS

For further details, please refer to the list of ITU-T Recommendations.

Rec. ITU-T G.722 (09/2012)

Recommendation ITU-T G.722

7 kHz audio-coding within 64 kbit/s

Summary

Recommendation ITU-T G.722 describes the characteristics of an audio wideband (WB, 50 to 7 000 Hz) coding system which may be used for a variety of higher quality speech applications. The coding system uses sub-band adaptive differential pulse code modulation (SB-ADPCM) within a bit rate of 64 kbit/s. The system is henceforth referred to as 64 kbit/s (7 kHz) audio-coding. In the SB-ADPCM technique used, the frequency band is split into two sub-bands (higher and lower) and the signals in each sub-band are encoded using ADPCM. The system has three basic modes of operation corresponding to the bit rates used for 7 kHz audio-coding: 64, 56 and 48 kbit/s. The latter two modes allow an auxiliary data channel of 8 and 16 kbit/s, respectively, to be provided within 64 kbit/s by making use of bits from the lower sub-band.

Erratum 1 was incorporated in this new edition, as well as corrections to some additional typos identified within the main body of ITU-T G.722.

Annex A provides three frequency masks that can be used to simplify evaluation of mass-produced equipment using ITU-T G.722 codecs, and to make checks carried out during installation easier. The masks therein are specifically not intended to supplant any requirements of this Recommendation, but rather to suggest the needs of acceptance testing for production quantities of equipment using ITU-T G.722 codecs. They concern the measurement of the signal-to-total distortion ratio in a loop with SB-ADPCM. Thus, these specifications do not aim at taking the place of the digital test sequences of the ITU-T G.722 algorithm, but rather to ensure, once these sequences have been checked on a first model, that the quality of the equipment using these codecs is maintained.

Annex B describes a scalable superwideband (SWB, 50 to 14 000 Hz) speech and audio-coding algorithm operating at 64, 80 and 96 kbit/s. The ITU-T G.722 superwideband extension codec is interoperable with ITU-T G.722. The output of the ITU-T G.722 SWB coder has a bandwidth of 50 to 14 000 Hz. The coder operates with 5 ms frames, has an algorithmic delay of 12.3125 ms and a worst-case complexity of 22.76 WMOPS. By default, the encoder input and decoder output are sampled at 32 kHz. The superwideband encoder for the improved ITU-T G.722 64 kbit/s core produces an embedded bitstream structured in two layers corresponding to two available bit rates, 80 and 96 kbit/s. The superwideband encoder for the improved ITU-T G.722 56 kbit/s core produces an embedded bitstream structured in one layer corresponding to one available bit rate of 64 kbit/s. This 64 kbit/s mode is also scalable with the 80 kbit/s and 96 kbit/s modes. The bitstream can be truncated at the decoder side or by any component of the communication system to instantaneously adjust the bit rate to the desired value (96 kbit/s → 80 kbit/s → 64 kbit/s) with no need for out-of-band signalling. The underlying algorithm includes three main parts: higher band enhancements, bandwidth extension (BWE) and transform coding in the modified discrete cosine transform (MDCT) domain based on algebraic vector quantization (AVQ). In this revised version, the test vectors of Annex B were updated so that they can better assist in checking compliance of implementations.

Annex C describes an alternative implementation of ITU-T G.722 Annex B based on floating-point arithmetic. While Annex B provides a bit-exact, fixed-point specification with the fixed-point C source code available from the ITU-T, an alternative floating-point implementation is useful for platforms equipped with floating-point processors. This alternative floating-point implementation was found to be fully interoperable with Annex B in all configurations, including the cross configurations.

Annex D describes a stereo extension of the wideband codec ITU-T G.722 and its superwideband extension, ITU-T G.722 Annex B. It is optimized for the transmission of stereo signals with limited additional bitrate, while keeping full compatibility with both codecs. Annex D operates from 64 to 128 kbit/s with four superwideband stereo bitrates at 80, 96, 112 and 128 kbit/s and two wideband stereo bitrates at 64 and 80 kbit/s. The wideband stereo modes are backward compatible with legacy ITU-T G.722, while the superwideband modes offer backward compatibility with both mono wideband ITU-T G.722 and superwideband ITU-T G.722 Annex B. The stereo codec operates on 5 ms frames with an algorithmic delay of 13.625 ms for wideband stereo and 15.9375 ms for superwideband stereo. The encoder input and decoder output are sampled at 16 kHz and 32 kHz for the wideband and superwideband operating modes, respectively. The underlying algorithm includes three main parts: stereo parameter analysis and down-mix at the encoder, and stereo synthesis at the decoder. The first stereo extension layer is an 8 kbit/s layer comprising the basic stereo parameters: wideband inter-channel time difference/inter-channel phase difference/inter-channel coherence and sub-band inter-channel level differences. The second stereo layer, also an 8 kbit/s layer, enhances the stereo image by encoding low-frequency sub-band inter-channel phase differences. Finally, the third stereo layer is a 16 kbit/s layer. In this last layer, the inter-channel phase differences of a larger bandwidth are transmitted, which further improves the stereo image. The bitstream can be truncated by the decoder, or by any component of the communication system, to instantaneously adjust the bitrate to the desired value, including wideband ITU-T G.722 and superwideband ITU-T G.722 Annex B bitrates, with no need for out-of-band signalling.

Networking aspects and test sequences for the main body algorithm are addressed in Appendices I and II, respectively, to this Recommendation. In this new edition, Appendix II was updated to reflect a restructuring of the test sequences for the ITU-T G.722 main body.

Packet loss concealment (PLC) algorithms, also known as frame erasure concealment algorithms, hide transmission losses in audio systems where the input signal is encoded and packetized, sent over a network, received and decoded before play out. PLC algorithms can be found in most recent standard speech coders. ITU-T G.722 was initially designed without such a feature. Therefore, Appendices III and IV provide two PLC mechanisms for ITU-T G.722. The algorithms in both appendices were verified to have high quality performance with alternative quality/complexity trade-offs. At an additional complexity of 2.8 WMOPS worst-case and 2 WMOPS average compared with the ITU-T G.722 decoder without PLC, the ITU-T G.722 PLC algorithm described in Appendix III provides better speech quality, whereas the ITU-T G.722 PLC specified in ITU-T G.722 Appendix IV provides lower complexity, adding almost no additional complexity to that of the main body ITU-T G.722 decoding (worst-case additional complexity is 0.07 WMOPS).

The algorithm in Appendix III performs the packet loss concealment in the 16 kHz output domain of the ITU-T G.722 decoder. Periodic waveform extrapolation is used to fill in the waveform of lost packets, mixing with filtered noise according to signal characteristics prior to the loss. The extrapolated 16 kHz signal is passed through the QMF analysis filter bank, and the sub-band signals are passed to partial sub-band ADPCM encoders to update the states of the sub-band ADPCM decoders. Additional processing takes place for each packet loss in order to provide a smooth transition from the extrapolated waveform to the waveform decoded from the received packets. Among other things, the states of the sub-band ADPCM decoders are phase aligned with the first received packet after a packet loss, and the decoded waveform is time-warped in order to align with the extrapolated waveform before the two are overlap-added to smooth the transition. For protracted packet loss, the algorithm gradually mutes the output. The algorithm operates on an intrinsic 10 ms frame size. It can operate on any packet or frame size that is a multiple of 10 ms. The longer input frame becomes a super frame, for which the packet loss concealment is called an appropriate number of times at its intrinsic frame size of 10 ms. It results in no additional delay when compared with regular ITU-T G.722 decoding using the same frame size.

In Appendix IV, the decoder comprises three stages: lower sub-band decoding, higher sub-band decoding and quadrature mirror filter (QMF) synthesis. In the absence of frame erasures, the decoder structure is identical to ITU-T G.722, except for the storage of the two decoded signals of the higher and lower sub-bands. In case of frame erasures, the decoder is informed by the bad frame indication (BFI) signalling. It then performs an analysis of the past lower-band reconstructed signal and extrapolates the missing signal using linear-predictive coding (LPC), pitch-synchronous period repetition and adaptive muting. Once a good frame is received, the decoded signal is cross-faded with the extrapolated signal. In the higher sub-band, the decoder repeats the previous frame pitch-synchronously, with adaptive muting and high-pass post-processing. The adaptive differential pulse code modulation (ADPCM) states are updated after each frame erasure.

Appendix V defines a coding scheme for mid-side (MS) stereo using the superwideband extension defined in Annex B of ITU-T G.722. By introducing mid-side stereo coding into stereo terminals, interoperability with monaural devices can be obtained at very low complexity. The basic coding scheme is as follows: the two channels of the left-right (LR) stereo signal are converted to mid-side channels, and each channel is then independently encoded using ITU-T G.722 Annex B; at the decoder side, the mid and side channels of the bitstream are each decoded, and the decoded mid-side signals are converted back to the LR channels. The LR-MS conversion and its inverse are conducted in a conventional way. On the encoder side, two additional arithmetic operations per sample are required for the LR-MS conversion, and one operation for the MS-LR conversion in the decoder. In an STL2009 (see ITU-T G.191) basic operator implementation, the conversion complexity amounts to about 0.2 WMOPS in total. The coding algorithm for each channel is identical to the one in Recommendation ITU-T G.722 Annex B.

Annexes B, C and D contain an electronic attachment provided with the ANSI C source code, which is an integral part of these annexes. ANSI C source code is also provided as an integral part of Appendices III and IV.

NOTE – An ANSI C code reference implementation for the algorithm in the main body of ITU-T G.722 is found in the ITU-T G722 module of the ITU-T G.191 Software Tools Library.

Test sequences are provided for compliance testing of the ITU-T G.722 algorithm in the main body of this Recommendation. Test vectors are provided to assist in checking the correct operation of Annexes B, C and D and Appendices III and IV.

History

Edition  Recommendation                 Approval     Study Group
1.0      ITU-T G.722                    1987-02-28   XVIII
2.0      ITU-T G.722                    1988-11-25
2.1      ITU-T G.722 (1988) App. II     1988-11-25
2.2      ITU-T G.722 (1988) Annex A     1993-03-12   XV
2.3      ITU-T G.722 (1988) App. III    2006-11-24   16
2.4      ITU-T G.722 (1988) App. IV     2006-11-24   16
2.5      ITU-T G.722 (1988) App. IV     2007-07-06   16
2.6      ITU-T G.722 (1988) App. IV     2009-11-06   16
2.7      ITU-T G.722 (1988) Amd. 1      2010-11-13   16
2.8      ITU-T G.722 (1988) Amd. 2      2011-03-25   16
3.0      ITU-T G.722                    2012-09-13   16

Keywords

ADPCM, ITU-T G.722, ITU-T G.722 Annex B, packet loss concealment, PLC, stereo coding, sub-band coding, superwideband, wideband.

FOREWORD

The International Telecommunication Union (ITU) is the United Nations specialized agency in the field of telecommunications, information and communication technologies (ICTs). The ITU Telecommunication Standardization Sector (ITU-T) is a permanent organ of ITU. ITU-T is responsible for studying technical, operating and tariff questions and issuing Recommendations on them with a view to standardizing telecommunications on a worldwide basis.

The World Telecommunication Standardization Assembly (WTSA), which meets every four years, establishes the topics for study by the ITU-T study groups which, in turn, produce Recommendations on these topics.

The approval of ITU-T Recommendations is covered by the procedure laid down in WTSA Resolution 1.

In some areas of information technology which fall within ITU-T's purview, the necessary standards are prepared on a collaborative basis with ISO and IEC.

NOTE – In this Recommendation, the expression "Administration" is used for conciseness to indicate both a telecommunication administration and a recognized operating agency.

Compliance with this Reco
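
The relationship the Summary states between the three main-body modes (64, 56 and 48 kbit/s) and the auxiliary data channel (0, 8 and 16 kbit/s) is plain arithmetic. The sketch below works it through under assumptions that the Summary itself does not spell out: an 8 kHz sample rate per sub-band, 2 bits per higher sub-band sample in every mode, and 6, 5 or 4 bits per lower sub-band sample. The names `Mode` and `MODES` are illustrative only.

```python
from dataclasses import dataclass

SUBBAND_RATE_HZ = 8_000  # assumed: 16 kHz input split into two 8 kHz sub-bands
HIGH_BAND_BITS = 2       # assumed: bits per higher sub-band sample, all modes
CHANNEL_KBITS = 64       # from the Summary: everything fits within 64 kbit/s


@dataclass(frozen=True)
class Mode:
    number: int
    low_band_bits: int  # assumed: 6, 5 or 4 bits per lower sub-band sample

    @property
    def audio_kbits(self) -> int:
        """Audio bit rate: bits per (low, high) sample pair times pairs per second."""
        bits_per_pair = self.low_band_bits + HIGH_BAND_BITS
        return bits_per_pair * SUBBAND_RATE_HZ // 1000

    @property
    def aux_kbits(self) -> int:
        """Capacity left over for the auxiliary data channel."""
        return CHANNEL_KBITS - self.audio_kbits


MODES = (Mode(1, 6), Mode(2, 5), Mode(3, 4))

if __name__ == "__main__":
    for m in MODES:
        print(f"mode {m.number}: audio {m.audio_kbits} kbit/s, "
              f"auxiliary {m.aux_kbits} kbit/s")
```

Each bit removed from the lower sub-band frees 8 kbit/s, which is why the auxiliary channel grows in steps of 8 kbit/s.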
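
The Summary says that an Annex B bitstream can be truncated by any network element to step the rate down (96 kbit/s → 80 kbit/s → 64 kbit/s) without out-of-band signalling. A minimal sketch of that idea follows. It assumes that each 5 ms frame is stored core layer first with enhancement layers appended, so that dropping trailing bytes removes the highest layers; the actual bit ordering is defined in Annex B and is not reproduced here. `frame_bytes` and `truncate_frame` are hypothetical helper names.

```python
FRAME_MS = 5                     # from the Summary: the coder uses 5 ms frames
LAYER_RATES_KBITS = (64, 80, 96)  # from the Summary: Annex B operating rates


def frame_bytes(rate_kbits: int) -> int:
    """Size of one 5 ms frame at the given rate (kbit/s x ms = bits)."""
    bits = rate_kbits * FRAME_MS
    if bits % 8:
        raise ValueError("frame is not a whole number of bytes")
    return bits // 8


def truncate_frame(frame: bytes, target_kbits: int) -> bytes:
    """Drop trailing enhancement-layer bytes to reach a lower rate.

    Assumes a core-first embedded layout (illustrative, see lead-in).
    """
    if target_kbits not in LAYER_RATES_KBITS:
        raise ValueError(f"unsupported rate: {target_kbits} kbit/s")
    needed = frame_bytes(target_kbits)
    if len(frame) < needed:
        raise ValueError("truncation can only lower the bit rate")
    return frame[:needed]


if __name__ == "__main__":
    full = bytes(frame_bytes(96))  # placeholder payload for a 96 kbit/s frame
    for rate in (96, 80, 64):
        print(rate, "kbit/s ->", len(truncate_frame(full, rate)), "bytes")
```

Because the decision needs only the frame length, a relay can apply it per frame without parsing the payload, which is consistent with the Summary's remark that no signalling is required.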
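
Appendix V is described as converting left-right (LR) stereo to mid-side (MS) "in a conventional way", coding each channel independently, and inverting the conversion at the decoder. The sketch below shows one conventional floating-point form of that conversion. The halving in the forward direction is an assumption; the fixed-point scaling and rounding actually used in Appendix V are not reproduced here, and the function names are illustrative.

```python
from typing import List, Sequence, Tuple


def lr_to_ms(left: Sequence[float], right: Sequence[float]) -> Tuple[List[float], List[float]]:
    """Encoder side: mid = (L + R) / 2, side = (L - R) / 2 (assumed scaling)."""
    if len(left) != len(right):
        raise ValueError("channels must have the same length")
    mid = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    return mid, side


def ms_to_lr(mid: Sequence[float], side: Sequence[float]) -> Tuple[List[float], List[float]]:
    """Decoder side: L = mid + side, R = mid - side."""
    if len(mid) != len(side):
        raise ValueError("channels must have the same length")
    left = [m + s for m, s in zip(mid, side)]
    right = [m - s for m, s in zip(mid, side)]
    return left, right


if __name__ == "__main__":
    left = [0.5, -0.25, 1.0]
    right = [0.25, 0.75, -1.0]
    mid, side = lr_to_ms(left, right)
    print("mid :", mid)
    print("side:", side)
    print("back:", ms_to_lr(mid, side))
```

With this form the mid channel alone is an ordinary mono down-mix, which illustrates why a monaural device that decodes only that channel can still interoperate.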
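
For Appendix IV, the Summary states that once a good frame arrives after an erasure, the decoded signal is cross-faded with the extrapolated signal. The following sketch illustrates a cross-fade in general terms only. The linear weights and the fade length are assumptions chosen for illustration; the window actually specified in Appendix IV is not reproduced here, and `crossfade` is a hypothetical name.

```python
from typing import List, Sequence


def crossfade(extrapolated: Sequence[float], decoded: Sequence[float]) -> List[float]:
    """Fade out the concealment signal while fading in the decoded signal.

    Uses linear weights w = (n + 1) / (N + 1) for n = 0..N-1, so the output
    starts close to the extrapolated signal and ends close to the decoded one.
    """
    if len(extrapolated) != len(decoded):
        raise ValueError("both segments must have the same length")
    n_samples = len(decoded)
    out = []
    for n, (old, new) in enumerate(zip(extrapolated, decoded)):
        w = (n + 1) / (n_samples + 1)  # weight given to the decoded signal
        out.append((1.0 - w) * old + w * new)
    return out


if __name__ == "__main__":
    # Concealment signal held at 1.0, first good frame decodes to 0.0.
    print(crossfade([1.0, 1.0, 1.0], [0.0, 0.0, 0.0]))
```

The two weights always sum to one, so a segment where both signals agree passes through unchanged.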
