1、 Copyright 2008 by THE SOCIETY OF MOTION PICTURE AND TELEVISION ENGINEERS 595 W. Hartsdale Ave., White Plains, NY 10607 (914) 761-1100 Approved April 18, 2008 Table of Contents Page Foreword . 2 1 Scope . 3 2 Conformance Notation . 3 3 Normative References . 3 4 Classes of Video Index Data 4 5 Forma
2、tting 4 5.1 Classes 1 and 2 . 5 5.2 Class 3 5 5.3 CRCC Polynomial 5 6 Data Formats Classes 1 and 2 . 5 6.1 Class 1.1, Information Required to Display the Signal 5 6.2 Classes 1.2 and 1.3, Pan and Scan Data . 8 6.3 Class 2.1, Field Rate Technical Heritage Information . 10 6.4 Class 2.2, Slow Rate Tec
3、hnical Heritage Information 12 6.5 Class 2.3, Other Technical Heritage Information (Reserved) 14 Annex A Bibliography (Informative) 15 Annex B Pan and Scan Examples (Informative). 16 Page 1 of 17 pages RP 186-2008Revision of RP 186-1995 SMPTE RECOMMENDED PRACTICE Video Index Information Coding for 5
4、25- and 625-Line Television Systems RP 186-2008 Page 2 of 17 pages Foreword SMPTE (the Society of Motion Picture and Television Engineers) is an internationally-recognized standards developing organization. Headquartered and incorporated in the United States of America, SMPTE has members in over 80
5、countries on six continents. SMPTEs Engineering Documents, including Standards, Recommended Practices and Engineering Guidelines, are prepared by SMPTEs Technology Committees. Participation in these Committees is open to all with a bona fide interest in their work. SMPTE cooperates closely with othe
6、r standards-developing organizations, including ISO, IEC and ITU. SMPTE Engineering Documents are drafted in accordance with the rules given in Part XIII of its Administrative Practices. SMPTE Recommended Practice RP 186 was prepared by Technology Committee S22. RP 186-2008 Page 3 of 17 pages 1 Scop
7、e This Recommended Practice is intended to provide a method of coding the Video Index Information data structure in 525-line and 625-line component digital video signals so that various picture and program related source data can be carried in conjunction with a video signal. Specific details of tra
8、nsporting the data structure through various video interconnection systems are not included in this document. These may be found in SMPTE 125M. The specifications of Active Format Description (AFD) and Pan and Scan data in this Recommended Practice have been superseded by SMPTE 2016-1 through SMPTE
9、2016-4. Designers and users are urged to consult these documents. The information conveyed in the AFD and Pan and Scan sections of this document can be accurately represented by the mechanisms defined in the SMPTE 2016 suite; however the converse is not necessarily the case. Use of this Recommended
10、Practice to convey AFD and Pan and Scan data is deprecated. 2 Conformance Notation Normative text is text that describes elements of the design that are indispensable or contains the conformance language keywords: “shall“, “should“, or “may“. Informative text is text that is potentially helpful to t
11、he user, but not indispensable, and can be removed, changed, or added editorially without affecting interoperability. Informative text does not contain any conformance keywords. All text in this document is, by default, normative, except: the Introduction, any section explicitly labeled as “Informat
12、ive“ or individual paragraphs that start with “Note:” The keywords “shall“ and “shall not“ indicate requirements strictly to be followed in order to conform to the document and from which no deviation is permitted. The keywords, “should“ and “should not“ indicate that, among several possibilities, o
13、ne is recommended as particularly suitable, without mentioning or excluding others; or that a certain course of action is preferred but not necessarily required; or that (in the negative form) a certain possibility or course of action is deprecated but not prohibited. The keywords “may“ and “need no
14、t“ indicate courses of action permissible within the limits of the document. The keyword “reserved” indicates a provision that is not defined at this time, shall not be used, and may be defined in the future. The keyword “forbidden” indicates “reserved” and in addition indicates that the provision w
15、ill never be defined in the future. A conformant implementation according to this document is one that includes all mandatory provisions (“shall“) and, if implemented, all recommended provisions (“should“) as described. A conformant implementation need not implement optional provisions (“may“) and n
16、eed not implement them as described. Unless otherwise specified the order of precedence of the types of normative information in this document shall be as follows. Normative prose shall be the authoritative definition. Tables shall be next, followed by formal languages, then figures, and then any ot
17、her language forms. 3 Normative References The following standard contains provisions which, through reference in this text, constitute provisions of this standard. At the time of publication, the edition indicated was valid. All standards are subject to revision, and parties to agreements based on
18、this standard are encouraged to investigate the possibility of applying the most recent edition of the standard indicated below. ETSI TR 101 154 V1.4.1, Digital Video Broadcasting (DVB): Implementation Guidelines for the Use of MPEG-2 Systems, Video and Audio in Satellite, Cable and Terrestrial Broa
19、dcasting Applications, Annex B, July 2000. RP 186-2008 Page 4 of 17 pages 4 Classes of Video Index Data The Video Index Information data structure organizes video source data into three groups based on their types and usage. The Class 1 data group provides information that is required for display or
20、 realtime processing of the video signal. The Class 2 data group provides heritage information; that is, information about how the video originated or was previously processed, which if known would facilitate optimization of subsequent processing or display. The Class 3 data group provides for the a
21、ssociation with the video signal of other information not required for the display or realtime processing of that signal. The three classes of data are further divided into sub classes as follows: Class 1.1 Information required to display the signal, not including pan and scan. Class 1.2 Pan and sca
22、n data, first 3 octets. Class 1.3 Pan and scan data, last 3 octets. Class 2.1 Field rate technical heritage information useful for further signal processing. Class 2.2 Slow rate technical heritage information useful for further signal processing. Class 2.3 Reserved for other technical heritage infor
23、mation. Class 3.x User information, undefined at this time, which could include such things as: Program ID. Station or production company. Tape; length, type, purchase date, serial number. Date of production. Studio light temperature. Lens type. Class 1.1 data is mandatory and shall always be provid
24、ed. For other data classes, where specific information is unavailable, data fields should be set to the No Information state (all bits set to 0). 5 Formatting Video Index data, as specified in this Recommended Practice, shall be formatted for insertion into video signals as specified in SMPTE 125M.
25、The Video Index data shall be left-justified within the available data space; that is, it shall immediately follow the SAV on the line specified for it. The data may be identified by its position along the line; Class 1 is followed by Class 2, which is followed by Class 3. See Figure 1. RP 186-2008
26、Page 5 of 17 pages EAVClass1.1CRCC CRCC CRCC CRCC CRCC CRCCClass1.2Class1.3Class2.1Class2.2Class2.3SAVClass 1 Class 2 Class 3 Unused12 octets 24 octets12 octets3 octets 3 octets 3 octets 3 octets 3 octets 3 octets111111Figure 1 Data Formatting 5.1 Classes 1 and 2 Each sub class in Class 1 and Class
27、2 shall contain a block of four 8 bit octets, three data octets followed by one CRCC octet. This totals 24 octets for the two classes. Octets may be further divided into upper and lower 4-bit quartets for coding purposes. 5.2 Class 3 Formatting of Class 3 will be determined at a later date. The tota
28、l number of octets including data and CCRC check values shall not exceed 24 octets. 5.3 CRCC Polynomial CRCC check values for each data block shall be calculated using the data octets for that block. All bits starting from B0 through B7 of the first data octet through the last data octet shall be pa
29、ssed through the CRCC check bit generator whose polynomial is G(x) =X8+ X4+ X3+ X2+ 1 All registers constituting the generator polynomial shall be preset to logical 1 at the beginning of each data block. 6 Data Formats Classes 1 and 2 Formats for each data octet are shown in the following charts. 6.
30、1 Class 1.1, Information Required to Display the Signal This data sub class codes data fields which define the video scanning system and the video signal form. 6.1.1 Class 1.1, Data Octet 1 Scanning System Data Field The scanning system data field provides information about the scanning system used,
31、 including total number of lines, field rate and aspect ratio and active format description. RP 186-2008 Page 6 of 17 pages 6.1.1.1 Scanning System Bits B0B2 identify the scanning system and shall be set as shown below: SCANNING SYSTEM B2 B1 B0 No information 0 0 0 525/59.94/4x3 0 0 1 625/50/4x3 0 1
32、 0 Reserved 0 1 1 Reserved 1 0 0 525/59.94/16x9 1 0 1 625/50/16x9 1 1 0 Reserved, from 1 1 1 through 1 1 1 6.1.1.2 Active Format Description Note: The representation of AFD defined here has been superseded by SMPTE 2016-1. Transport of SMPTE 2016-1 data in the vertical ancillary space is defined in
33、SMPTE 2016-3. Users are urged to consult these documents. Bits B3-B6 comprise the Active Format Description, or AFD, and shall be set as defined in ETSI TR 101 154 V1.4.1. The settings are reproduced for convenience in the table below. The Active Format is the area of interest in terms of its aspect
34、 ratio within the active area of the scanning system, as illustrated in ETSI TR 101 154 V1.4.1. For further information, including additional illustrations for some of the AFD states, see the Recommended Receiver Reaction to Aspect Ratio Signaling in Digital Video Broadcasting from the Digital Telev
35、ision Group (listed in annex A). AFD B6 B5 B4 B3 Reserved, see below 0 0 0 0 Reserved (see below for exception) 0 0 0 1 Box 16:9 (top) 0 0 1 0 Box 14:9 (top) 0 0 1 1 Box 16:9 (center) 0 1 0 0 Reserved 0 1 0 1 Reserved 0 1 1 0 Reserved 0 1 1 1 Active format is the same as coded frame 1 0 0 0 4:3 (cen
36、ter) 1 0 0 1 16:9 (center) 1 0 1 0 14:9 (center) 1 0 1 1 Reserved 1 1 0 0 4:3 (with shoot and protect 14:9 center) 1 1 0 1 16:9 (with shoot and protect 14:9 center) 1 1 1 0 16:9 (with shoot and protect 4:3 center) 1 1 1 1 RP 186-2008 Page 7 of 17 pages Designers should note that AFD is largely redun
37、dant with the pan and scan data defined in section 5.2. If it is desired to use the pan and scan data, AFD shall be set to 0000. If valid AFD data is present, it shall override the pan and scan data. Note: Both ETSI TS 101 154 V1.8.1 and ATSC A/53 Part 4:2007 use the SMPTE 2016-1 semantics for AFD r
38、ather than those set forth above. The differences are minor, however, and a receiving device designed in accordance with SMPTE 2016-1 should provide acceptable results with the AFD coding given here and vice versa. 6.1.1.3 Bit B7 is reserved and unused; it shall be set to 0. 6.1.2 Class 1.1, Data Oc
39、tet 2, Lower Quartet Signal Form Data Field The signal form data field identifies which component set is used. The bits in this field shall be set as defined in the table below, except that where the signal is in Y, Cb, Cr form, the color encoding heritage of the signal, if any, may be indicated her
40、e. This ensures compatibility with equipment in the field which was designed in accordance with earlier documents. See section 6.3.5. SIGNAL FORM B3 B2 B1 B0 No information 0 0 0 0 R, G, B 0 0 0 1 Y, Cb, Cr 0 0 1 0 Y, U, V 0 0 1 1 Monochrome 0 1 0 0 NTSC 0 1 PAL 0 1 0 PAL-M 0 1 SECAM 1 0 0 Reserved,
41、 from 1 0 0 1 through 1 1 1 1 6.1.3 Class 1.1, Data Octet 2, Upper Quartet Reserved Data Field This data field is reserved. Until defined, this field shall be set to 0000 . RESERVED B7 B6 B5 B4 No information 0 0 0 0 Reserved, from 0 0 0 1 through 1 1 1 1 6.1.4 Class 1.1, Data Octet 3, Lower Quartet
42、 Sampling Structure Data Field This data field describes the sample structure of the component signal, and also identifies the main and sub channels, where applicable. The bits in this field shall be set as shown in the table below. The fourth component, where shown, is an auxiliary or key channel.
43、Samples in 4:4:4 and 4:4:4:4 systems are distributed between main and sub channels as described in SMPTE RP 175. RP 186-2008 Page 8 of 17 pages SAMPLING STRUCTURE B3 B2 B1 B0 No information 0 0 0 0 4:0:0 0 0 0 1 4:2:2 0 0 1 0 4:2:2:4, main 0 0 1 1 4:2:2:4, sub 0 1 0 0 4:4:4, main 0 1 0 1 4:4:4, sub
44、0 1 1 0 4:4:4:4, main 0 1 1 1 4:4:4:4, sub 1 0 0 0 4:2:2 progressive, main 1 0 0 1 4:2:2 progressive, sub 1 0 1 0 4:2:0 progressive 1 0 1 1 8:4:4, main 1 1 0 0 8:4:4, sub 1 1 0 1 4:2:2:4, single link 1 1 1 0 Reserved 1 1 1 1 6.1.5 Class 1.1, Data Octet 3, Upper Quartet Reserved Data Field This data
45、field is reserved. Until defined, the bits in this field shall be set to 0000. RESERVED B7 B6 B5 B4 No information 0 0 0 0 Reserved, from 0 0 0 1 through 1 1 1 1 6.2 Classes 1.2 and 1.3, Pan and Scan Data Note: The representation of Pan and Scan data defined here has been superseded by SMPTE 2016-2.
46、 Transport of SMPTE 2016-2 data in the vertical ancillary space is defined in SMPTE 2016-4. Users are urged to consult these documents. There are 6 octets of pan and scan data, which shall be formatted into 2 3-octet classes as defined below. The 6 octets are used to represent 3 15-bit numbers and 3
47、 flags, which define a viewport (target) aperture relative to the source image. Bit 0 shall be the LSB. Pan X Tilt Y Zoom Z 0 14 15 0 7 8 14 15 0 14 15 Clas 1.2 Clas 1.3 Pan Location of left edge of viewport with respect to center of source image, in source pixels Tilt Location of top edge of viewpo
48、rt with respect to center of source image, in source lines Zoom Height of viewport, in lines. X, Y, Z Flags RP 186-2008 Page 9 of 17 pages See Figure 2. Pan, Tilt and Zoom are 2s complement numbers, representing source image pixels or lines, with 4 bits subpixel resolution. Figure 2 Pan and Scan Par
49、ameters As only two aspect ratios, 16:9 and 4:3, are in the scope of this document, the aspect ratio of the target image shall be assumed to be whichever the source is not; e.g., if the source is 16:9, the target viewport shall be assumed to be 4:3. The width of the viewport shall be calculated by the receiving device from the height, as given by the Zoom value, taking into account the aspect ratio of the pixels. The X and Y flags provide a mechanism for altering this, as foll