Recommendation ITU-R BT.1620-1 (03/2010)
Data structure for DV-based audio, data and compressed video at a data rate of 100 Mbit/s
BT Series
Broadcasting service (television)

Foreword
The role of the Radiocommunication Sector is to ensure the rational, equitable, efficient and economical use of the radio-frequency spectrum by all radiocommunication services, including satellite services, and carry out studies without limit of frequency range on the basis of which Recommendations are adopted.
The regulatory and policy functions of the Radiocommunication Sector are performed by
World and Regional Radiocommunication Conferences and Radiocommunication Assemblies supported by Study Groups.

Policy on Intellectual Property Right (IPR)
ITU-R policy on IPR is described in the Common Patent Policy for ITU-T/ITU-R/ISO/IEC referenced in Annex 1 of Resolution ITU-R 1. Forms to be used for the submission of patent statements and licensing declarations by patent holders are available from http://www.itu.int/ITU-R/go/patents/en where the Guidelines for Implementation of the Common Patent Policy for ITU-T/ITU-R/ISO/IEC and the ITU-R patent information database can also be found.

Series of ITU-R Recommendations
(Also available online at http://www.itu.int/publ/R-REC/en)

Series  Title
BO      Satellite delivery
BR      Recording for production, archival and play-out; film for television
BS      Broadcasting service (sound)
BT      Broadcasting service (television)
F       Fixed service
M       Mobile, radiodetermination, amateur and related satellite services
P       Radiowave propagation
RA      Radio astronomy
RS      Remote sensing systems
S       Fixed-satellite service
SA      Space applications and meteorology
SF      Frequency sharing and coordination between fixed-satellite and fixed service systems
SM      Spectrum management
SNG     Satellite news gathering
TF      Time signals and frequency standards emissions
V       Vocabulary and related subjects

Note: This ITU-R Recommendation was approved in English under the procedure detailed in Resolution ITU-R 1.

Electronic Publication
Geneva, 2010
© ITU 2010
All rights reserved. No part of this publication may be reproduced, by any means whatsoever, without written permission of ITU.

RECOMMENDATION ITU-R BT.1620-1
Data structure for DV-based audio, data and compressed video at a data rate of 100 Mbit/s
(Question ITU-R 12/6)
(2003-2010)

Scope
This Recommendation defines the data structure for the interface of DV-based digital audio, subcode data, and compressed video at 100 Mbit/s. The standard defines the processes required to decode the DV-based data structure into eight channels of AES digital audio at 48 kHz, subcode data, and high-definition video at 1 920 × 1 080/60/I, 1 920 × 1 080/50/I, 1 280 × 720/60/P and 1 280 × 720/50/P.

The ITU Radiocommunication Assembly,
considering
a) that applications within professional television production and post-production have been identified where DV-based video compression can offer operational and economic advantages;
b) that three data rates have been proposed within the same compression family to serve different applications (25 Mbit/s, 50 Mbit/s and 100 Mbit/s);
c) that the sampling rasters for each of the three applications are different;
d) that for the international exchange of high-definition programme material the
ITU-R recommends the application of Recommendation ITU-R BT.709;
e) that audio, auxiliary data and metadata elements form an integral part of these applications;
f) that these elements are multiplexed into a single data stream for transport and further processing;
g) that the compression quality and functional characteristics must be identical and reproducible in complex production chains;
h) that for this purpose all details of parameters used for coding and multiplexing must be defined,
recommends
1 that for applications in professional television production and post-production using DV-based compression at 100 Mbit/s, the parameters given in Annexes 1 and 2 should be used;
2 that compliance with this Recommendation is voluntary. However, the Recommendation may contain certain mandatory provisions (to ensure e.g. interoperability or applicability) and compliance with the Recommendation is achieved when all of these mandatory provisions are met. The words “shall” or some other obligatory language such as “must” and the negative equivalents are used to express requirements. The use of such words shall in no way be construed to imply partial or total compliance with this Recommendation.

Annex 1

1 Overview
This Recommendation defines how DIF packets and other data such as audio and time code data are formatted for recording on a DV-based recorder, specified elsewhere. As shown in Fig. 1, the processed audio, video and subcode data are output for the recording on a Type D-12 recorder. Additionally these data are multiplexed in the DIF (digital interface) format data to output for different applications through a digital interface port. Details of the process shown in Fig. 1 are described in § 3 and 4.

2 Abbreviations and acronyms as used in this Recommendation
AAUX    Audio auxiliary data
AP1     Audio application ID
AP2     Video application ID
AP3     Subcode application ID
APT     Track application ID
Arb     Arbitrary
AS      AAUX source pack
ASC     AAUX source control pack
CGMS    Copy generation management system
CM      Compressed macro block
DBN     DIF block number
DCT     Discrete cosine transform
DIF     Digital interface
DRF     Direction flag
Dseq    DIF sequence number
DSF     DIF sequence flag
EFC     Emphasis audio channel flag
EOB     End of block
LF      Locked mode flag
QNO     Quantization number
QU      Quantization
Res     Reserved for future use
SCT     Section type
SMP     Sampling frequency
SSYB    Subcode sync block
STA     Status of the compressed macro block
STYPE   Signal type
Syb     Subcode sync block number
TF      Transmitting flag
VAUX    Video auxiliary data
VLC     Variable length coding
VS      VAUX source pack
VSC     VAUX source control pack

References
Recommendation ITU-R BS.647: A digital audio interface for broadcasting studios.
Recommendation ITU-R BR.780: Time and control code standards, for production applications in order to facilitate the international exchange of television programmes on magnetic tapes.
Recommendation ITU-R BT.1847: 1 280 × 720, 16:9 progressively-captured image format for production and international programme exchange in the 50 Hz environment.
Recommendation ITU-R BT.709: Parameter values for the HDTV standards for production and international programme exchange.
Recommendation ITU-R BT.1543: 1 280 × 720, 16 × 9 progressively-captured image format for production and international programme exchange in the 60 Hz environment.
Recommendation ITU-R BT.1616: Data stream format for the exchange of DV-based audio, data and compressed video over interfaces complying with Recommendation ITU-R BT.1381.

3 Data processing

3.1.1 Video encoding parameter
The source component signal to be processed shall comply with the video parameters as defined by Recommendations ITU-R BT.709, ITU-R BT.1543 and ITU-R BT.1847. Not all formats may be supported by all manufacturers.

3.1.2 Audio encoding parameter
The audio signal shall be sampled at 48 kHz, with 16-bit quantization defined by
Recommendation ITU-R BS.647.

3.1.3 Subcode data
The time code format in the subcode area shall be the LTC codeword and comply with Recommendation ITU-R BR.780. Each frame of time code shows a frame number that corresponds to each video frame in the 1 920 × 1 080-line interlaced system, and two video
frames each in the 1 280 × 720-line progressive system.

3.1.4 Frame structure
In the 1 920 × 1 080-line system, video data, audio data, and subcode data in one video frame shall be processed in each frame. In the 1 280 × 720-line system, these data in two video frames shall be processed within one frame duration of the 1 920 × 1 080-line system. Consequently, audio data and subcode data in the 1 280 × 720-line system are processed in the same way as the 1 920 × 1 080-line system. The audio data corresponding to one video frame in the 1 920 × 1 080-line system and two video frames in the 1 280 × 720-line system is defined as an audio-processing unit.

FIGURE 1
Data processing block diagram
[Figure not reproduced. Recoverable labels: Sampling conversion, Blocking, Shuffling, DCT, Weighting, Quantization, VLC, Formatter, Rate control, Shuffling, DIF formatter; Video, Audio (AES3), Subcode; DIF, Recording, SDTI; 8 bits or more, 8 / 10 bits, 16 bits.]

3.2 Data structure
The data structure of the compressed stream at the digital interface is shown in Fig. 2. The data of each frame shall be divided into four DIF channels. Each DIF channel shall be divided into 10 DIF sequences for the 60-Hz system (1) and 12 DIF sequences for the 50-Hz system. Each DIF sequence shall consist of a header section, subcode section, VAUX section, audio section, and video section with the following DIF blocks respectively:
Header section:    1 DIF block
Subcode section:   2 DIF blocks
VAUX section:      3 DIF blocks
Audio section:     9 DIF blocks
Video section:     135 DIF blocks.

(1) The 60-Hz system also includes 60/1.001 Hz.

As shown in Fig. 2, each DIF block shall consist of a 3-byte ID and 77 bytes of data. The DIF data bytes are numbered 0 to 79. Figure 3 shows the data structure of a DIF sequence.

FIGURE 2
Data structure
[Figure not reproduced. Recoverable labels: data in one frame divided into first, second, third and fourth channels; DIF sequences 0,0 to n-1,3 (DIF sequence number, DIF block number); structure of a DIF sequence (header section, subcode section, VAUX section, audio section); structure of a DIF block; byte position number.]

[...]

for (h = 0; h < 4; h++)
  for (s = 0; s < 2; s++)
    for (k = 0; k < 27; k++)
      for (t = 0; t < 5; t++)
        a = (4h + s + 2t + 2) mod 10;
        b = (4h + s + 2t + 6) mod 10;
        c = (4h + s + 2t + 8) mod 10;
        d = (4h + s + 2t + 0) mod 10;
        e = (4h + s + 2t + 4) mod 10;
        DBNq = (5t + 25k) mod 135;
        DSNp = INT((5t + 25k + 675s) / 135);
        V DBNq, h of DSNp = CM h,a,2,k
        V (DBNq + 1), h of DSNp = CM h,b,1,k
        V (DBNq + 2), h of DSNp = CM h,c,3,k
        V (DBNq + 3), h of DSNp = CM h,d,0,k
        V (DBNq + 4), h of DSNp = CM h,e,4,k
where:
  DBNq: DIF block number
  DSNp: DIF sequence number
  h: Divided block
  s, t: Vertical order of super block
  k: Macro block order in super block

1 920 × 1 080/50/I system

for (h = 0; h < 4; h++)
  for (k = 0; k < 27; k++)
    for (i = 0; i < 11; i++)
      a = (4h + i + 2) mod 11;
      b = (4h + i + 6) mod 11;
      c = (4h + i + 8) mod 11;
      d = (4h + i + 0) mod 11;
      e = (4h + i + 4) mod 11;
      DBNq = (5i + 55k) mod 135;
      DSNp = INT((5i + 55k) / 135);
      V DBNq, h of DSNp = CM h,a,2,k
      V (DBNq + 1), h of DSNp = CM h,b,1,k
      V (DBNq + 2), h of DSNp = CM h,c,3,k
      V (DBNq + 3), h of DSNp = CM h,d,0,k
      V (DBNq + 4), h of DSNp = CM h,e,4,k
for (k = 0; k < 27; k++)
  DBNq = 5k;
  DSNp = 11;
  V DBNq, 0 of DSNp = CM 0,11,0,k
  V (DBNq + 1), 0 of DSNp = CM 0,11,1,k
  V (DBNq + 2), 0 of DSNp = CM 0,11,2,k
  V (DBNq + 3), 0 of DSNp = CM 0,11,3,k
  V (DBNq + 4), 0 of DSNp = CM 0,11,4,k
where:
  DBNq: DIF block number
  DSNp: DIF sequence number
  h: Divided block
  i: Vertical order of super block
  k: Macro block order in super block

TABLE 22
Video DIF blocks and compressed macro blocks for the 60-Hz system

DIF channel   DIF sequence   DIF block   Compressed
number        number                     macro block
0             0              V 0,0       CM 0,2,2,0
                             V 1,0       CM 0,6,1,0
                             V 2,0       CM 0,8,3,0
                             V 3,0       CM 0,0,0,0
                             V 4,0       CM 0,4,4,0
                             :           :
              :              :           :
              9              :           :
                             V 134,0     CM 0,3,4,26
1             0              V 0,1       CM 1,6,2,0
                             V 1,1       CM 1,0,1,0
                             V 2,1       CM 1,2,3,0
                             V 3,1       CM 1,4,0,0
                             V 4,1       CM 1,8,4,0
                             :           :
              :              :           :
              9              :           :
                             V 134,1     CM 1,7,4,26
:             :              :           :
3             0              V 0,3       CM 3,4,2,0
                             V 1,3       CM 3,8,1,0
                             V 2,3       CM 3,0,3,0
                             V 3,3       CM 3,2,0,0
                             V 4,3       CM 3,6,4,0
                             :           :
              :              :           :
              9              :           :
                             V 134,3     CM 3,5,4,26

TABLE 23
Video DIF blocks and compressed macro blocks for the 1 920 × 1 080/50/I system

DIF channel   DIF sequence   DIF block   Compressed
number        number                     macro block
0             0              V 0,0       CM 0,2,2,0
                             V 1,0       CM 0,6,1,0
                             V 2,0       CM 0,8,3,0
                             V 3,0       CM 0,0,0,0
                             V 4,0       CM 0,4,4,0
                             :           :
              :              :           :
              10             :           :
                             V 134,0     CM 0,3,4,26
              11             V 0,0       CM 0,11,0,0
                             V 1,0       CM 0,11,1,0
                             :           :
                             V 134,0     CM 0,11,4,26
1             0              V 0,1       CM 1,6,2,0
                             V 1,1       CM 1,10,1,0
                             V 2,1       CM 1,1,3,0
                             V 3,1       CM 1,4,0,0
                             V 4,1       CM 1,8,4,0
                             :           :
              :              :           :
              10             :           :
                             V 134,1     CM 1,7,4,26
              11             V 0,1
                             :           :
                             V 134,1
:             :              :           :
3             0              V 0,3       CM 3,3,2,0
                             V 1,3       CM 3,7,1,0
                             V 2,3       CM 3,9,3,0
                             V 3,3       CM 3,1,0,0
                             V 4,3       CM 3,5,4,0
                             :           :
              :              :           :
              10             :           :
                             V 134,3     CM 3,4,4,26
              11             V 0,3
                             :           :
                             V 134,3

TABLE 24
Video DIF blocks and compressed macro blocks for the 1 280 × 720/50/P system

DIF channel   DIF sequence   DIF block   Compressed
number        number                     macro block
0             0              V 0,0       CM 0,2,2,0
                             V 1,0       CM 0,6,1,0
                             V 2,0       CM 0,8,3,0
                             V 3,0       CM 0,0,0,0
                             V 4,0       CM 0,4,4,0
                             :           :
              :              :           :
              9              :           :
                             V 134,0     CM 0,3,4,26
              10             V 0,0
                             :           :
                             V 134,0
              11             V 0,0
                             :           :
                             V 134,0
1             0              V 0,1       CM 1,6,2,0
                             V 1,1       CM 1,0,1,0
                             V 2,1       CM 1,2,3,0
                             V 3,1       CM 1,4,0,0
                             V 4,1       CM 1,8,4,0
                             :           :
              :              :           :
              9              :           :
                             V 134,1     CM 1,7,4,26
              10             V 0,1
                             :           :
                             V 134,1
              11             V 0,1
                             :           :
                             V 134,1
:             :              :           :
3             0              V 0,3       CM 3,4,2,0
                             V 1,3       CM 3,8,1,0
                             V 2,3       CM 3,0,3,0
                             V 3,3       CM 3,2,0,0
                             V 4,3       CM 3,6,4,0
                             :           :
              :              :           :
              9              :           :
                             V 134,3     CM 3,5,4,26
              10             V 0,3
                             :           :
                             V 134,3
              11             V 0,3
                             :           :
                             V 134,3

4 Video compression
This section includes the video compression processing for the 1 920 × 1 080/60/I system, the 1 920 × 1 080/50/I system, the 1 280 × 720/60/P system and the 1 280 × 720/50/P system.

4.1 Video structure

4.1.1 Video sampling structure
The video sampling
structure shall comply with Recommendation ITU-R BT.709 for 1 920 × 1 080-line systems, and Recommendations ITU-R BT.1543 and ITU-R BT.1847 for the 1 280 × 720-line systems. The construction of luminance (Y) and two colour-difference signals (CR, CB) is described in Table 25. A sample conversion from 10-bit input video to 8 bits or more is provided by the resampling process (the first processing block of Fig. 1).

4.1.1.1 Video frame pixel structure

1 920 × 1 080/60/I system
1 920 pixels of luminance and 960 pixels of each colour-difference signal per line shall be transmitted as shown in Fig. 10. The sampling starting point in the active period of CR and CB signals shall be the same as the sampling starting point in the active period of the Y signal. Each pixel shall be converted to the code of twos complement (−508 to 507) by inverting the MSB of the input video signal.

1 920 × 1 080/50/I system
1 920 pixels of luminance and 960 pixels of each colour-difference signal per line shall be transmitted as shown in Fig. 11. The sampling starting point in the active period of CR and CB signals shall be the same as the sampling starting point in the active period of the Y signal. Each pixel shall be converted to the code of twos complement (−508 to 507) by inverting the MSB of the input video signal.

1 280 × 720/60/P system
1 280 pixels of luminance and 640 pixels of each colour-difference signal per line shall be transmitted as shown in Fig. 12. The sampling starting point in the active period of CR and CB signals shall be the same as the sampling starting point in the active period of the Y signal. Each pixel shall be converted to the code of twos c