1、 Copyright 2012 by THE SOCIETY OF MOTION PICTURE AND TELEVISION ENGINEERS 3 Barker Avenue, White Plains, NY 10601 (914) 761-1100 Approved June 5, 2012 Table of Contents Page Foreword . 2 Intellectual Property 2 Introduction 2 1 Scope . 3 2 Conformance Notation . 3 3 Normative References . 3 4 Defini
2、tions 4 4.1 Stereoscopic Image Pair 4 5 Source Image Format 4 6 20-Bit Virtual Interface Data Structure . 5 6.1 Level A Direct Mapping Mode . 6 6.2 Level B-DL SMPTE ST 372 Dual-Link Mapping Mode . 10 7 Payload Identifier . 13 7.1 Byte 1 Digital Interface and Payload Identification . 13 7.2 Byte 2 Pi
3、cture Rate . 13 7.3 Byte 3 Sampling Structure, Aspect Ratio and Horizontal Size . 14 7.4 Byte 4 Extended Aspects 14 8 Left/Right Eye Image Interface Timing . 14 9 Levels of Operation (Informative) 15 Annex A Bibliography (Informative) . 16 Annex B SMPTE ST 425-4 Document Road Map (Informative) 17 Pa
4、ge 1 of 17 pages SMPTE ST 425-4:2012 SMPTE STANDARD Dual 3 Gb/s Serial Digital Interface for Stereoscopic Image Transport SMPTE ST 425-4:2012 Page 2 of 17 pages Foreword SMPTE (the Society of Motion Picture and Television Engineers) is an internationally-recognized standards developing organization.
5、 Headquartered and incorporated in the United States of America, SMPTE has members in over 80 countries on six continents. SMPTEs Engineering Documents, including Standards, Recommended Practices, and Engineering Guidelines, are prepared by SMPTEs Technology Committees. Participation in these Commit
6、tees is open to all with a bona fide interest in their work. SMPTE cooperates closely with other standards-developing organizations, including ISO, IEC and ITU. SMPTE Engineering Documents are drafted in accordance with the rules given in Part XIII of its Operations Manual. SMPTE ST 425-4 was prepar
7、ed by Technology Committee 32NF. Intellectual Property At the time of publication no notice had been received by SMPTE claiming patent rights essential to the implementation of this Standard. However, attention is drawn to the possibility that some of the elements of this document may be the subject
8、 of patent rights. SMPTE shall not be held responsible for identifying any or all such patent rights. Introduction This section is entirely informative and does not form an integral part of this Standard. There is a need in the industry to have an interface for transporting stereoscopic images compl
9、ying with 4:2:2 and 4:4:4, 10-bit and 12-bit image formats which can each be transported by a single SMPTE ST 424 serial interface. This standard also defines the payload identifier that will identify the Left/Right (L/R) eye images, audio and other associated ancillary data. SMPTE ST 425-4:2012 Pag
10、e 3 of 17 pages 1 Scope This standard defines a means of transporting stereoscopic images (Left eye and Right eye images) using an interface consisting of two streams based on the SMPTE ST 425-1 data structures. The Left eye images are carried on one stream of the interface and the Right eye images
11、are carried on the other stream. The stereoscopic image formats to be transported using this standard are the 4:2:2 and 4:4:4 image formats enumerated in Table 1, where each image format is defined by SMPTE ST 274, SMPTE ST 296, SMPTE ST 2048-2, SMPTE ST 428-9 or SMPTE ST 428-19, has a payload capac
12、ity of nominally 3 Gb/s and can be transported by a single SMPTE ST 424 serial interface. Audio and other associated ancillary data may also be transported. This standard also defines a payload identifier. Mapping structures for the video essence and ancillary data shall be as defined in SMPTE ST 42
13、5-1 Level A or Level B-DL. It is not necessary for implementations to include support for all formats defined in Table 1, nor is it a requirement to support both mapping modes to conform to this standard. Implementers should indicate supported formats and supported mapping modes in commercial public
14、ations. 2 Conformance Notation Normative text is text that describes elements of the design that are indispensable or contains the conformance language keywords: “shall“, “should“, or “may“. Informative text is text that is potentially helpful to the user, but not indispensable, and can be removed,
15、changed, or added editorially without affecting interoperability. Informative text does not contain any conformance keywords. All text in this document is, by default, normative, except: the Introduction, any section explicitly labeled as “Informative“ or individual paragraphs that start with “Note:
16、” The keywords “shall“ and “shall not“ indicate requirements strictly to be followed in order to conform to the document and from which no deviation is permitted. The keywords, “should“ and “should not“ indicate that, among several possibilities, one is recommended as particularly suitable, without
17、mentioning or excluding others; or that a certain course of action is preferred but not necessarily required; or that (in the negative form) a certain possibility or course of action is deprecated but not prohibited. The keywords “may“ and “need not“ indicate courses of action permissible within the
18、 limits of the document. The keyword “reserved” indicates a provision that is not defined at this time, shall not be used, and may be defined in the future. The keyword “forbidden” indicates “reserved” and in addition indicates that the provision will never be defined in the future. Unless otherwise
19、 specified, the order of precedence of the types of normative information in this document shall be as follows: Normative prose shall be the authoritative definition; Tables shall be next; followed by formal languages; then figures; and then any other language forms. 3 Normative References Note: All
20、 references in this document to other SMPTE documents use the current numbering style (e.g. SMPTE ST 12-2:2008) although, during a transitional phase, the document as published (printed or PDF) may bear an older designation (such as SMPTE 12M-2-2008). Documents with the same root number (e.g. 12-2)
21、and publication year (e.g. 2008) are functionally identical. SMPTE ST 425-4:2012 Page 4 of 17 pages The following standards contain provisions which, through reference in this text, constitute provisions of this recommended practice. At the time of publication, the editions indicated were valid. All
22、 standards are subject to revision, and parties to agreements based on this recommended practice are encouraged to investigate the possibility of applying the most recent edition of the standards indicated below. SMPTE ST 12-2:2008, Television Transmission of Time Code in the Ancillary Data Space SM
23、PTE ST 299-1:2009, 24-Bit Digital Audio Format for SMPTE ST 292 Bit-Serial Interface SMPTE ST 299-2:2010, Extension of the 24-Bit Digital Audio Format to 32 Channels for 3 Gb/s Bit-Serial Interfaces SMPTE ST 352:2011, Payload Identification Codes for Serial Digital Interfaces SMPTE ST 425-1:2011, So
24、urce Image Format and Ancillary Data Mapping for the 3 Gb/s Serial Interface 4 Definitions 4.1 Stereoscopic Image Pair Two uncompressed images representing a left eye (Le) image and a right eye (Re) image which are coincident in time having identical sampling structure, raster resolution, pixel dept
25、h and colorimetry. 5 Source Image Format The source image formats shall be those 4:2:2 and 4:4:4, 10-bit or 12-bit image formats referenced by SMPTE ST 425-1, which can be transported by a single SMPTE ST 424 serial interface as shown in Table 1. The Left eye image and the Right eye image shall have
26、 the identical image pixel format structure and they shall be a stereoscopic image pair. For the carriage of stereo production image formats, the source image horizontal and vertical data structure for each Left Eye (Le) and Right Eye (Re), shall be as defined for those formats referenced by SMPTE S
27、T 425-1. Table 1, repeated from SMPTE ST 425-1 for convenience, shows the source formats so referenced. Table 1 Source Image Formats SMPTE ST 425-1 (Informative) Reference SMPTE Standard Image Format Signal Format Sampling Structure/Pixel Depth Frame / Field Rates (Frames per second / Fields per sec
28、ond) Mapping mode ST 274 1920 1080 4:2:2 (YCBCR)/10-bit 60, 60/1.001 and 50 Frames Progressive Level A Level B-DL ST 2048-2 2048 x 1080(5)4:2:2 (YCBCR)/10-bit 60, 60/1.001, 50, 48 and 48/1.001 Frames Progressive Level A Level B-DL ST 296 1280 x 720 4:4:4 (RGB), 4:4:4:4 (RGB +A)/10-bit(6)4:4:4 (YCBCR
29、), 4:4:4:4 (YCBCR+A)/10-bit(6)60, 60/1.001 and 50 Frames Progressive Level A - 30, 30/1.001, 25, 24 and 24/1.001 Frames Progressive Level A - ST 274 1920 x 1080 4:4:4 (RGB), 4:4:4:4 (RGB +A)/10-bit(6)4:4:4 (YCBCR), 4:4:4:4 (YCBCR+A)/10-bit(6)60, 60/1.001 and 50 Fields Interlaced Level A Level B-DL 3
30、0, 30/1.001, 25, 24 and 24/1.001 Frames Progressive Level A Level B-DL 30, 30/1.001, 25, 24 and 24/1.001 PsF(1)Level A Level B-DL SMPTE ST 425-4:2012 Page 5 of 17 pages ST 2048-2 2048 x 1080(5)4:4:4 (RGB)(7), 4:4:4:4 (RGB +A)/10-bit(6)4:4:4 (YCBCR), 4:4:4:4 (YCBCR+A)/10-bit(6)30, 30/1.001, 25, 24 an
31、d 24/1.001 Frames Progressive Level A Level B-DL 30, 30/1.001, 25, 24 and 24/1.001 PsF(2)Level A Level B-DL ST 274 1920 x 1080 4:4:4 (RGB)/12-bit 4:4:4 (YCBCR)/12-bit 60, 60/1.001 and 50 Fields Interlaced Level A Level B-DL 30, 30/1.001, 25, 24 and 24/1.001 Frames Progressive Level A Level B-DL ST 2
32、048-2 2048 x 1080(5)4:4:4 (RGB)/12-bit(7)4:4:4 (YCBCR)/12-bit 30, 30/1.001, 25, 24 and 24/1.001 Frames Progressive Level A Level B-DL 30, 30/1.001, 25, 24 and 24/1.001 Frames PsF(2)Level A Level B-DL ST 428-9 2048 x 1080(5)4:4:4 (XYZ)/12-bit 24 Frames Progressive Level A Level B-DL 24 Frames PsF(3)L
33、evel A Level B-DL ST 428-19 2048 x 1080(5)4:4:4 (XYZ)/12-bit 25 and 30Frames Progressive Level A Level B-DL 25 and 30Frames PsF(4)Level A Level B-DL ST 274 1920 x 1080 4:2:2 (YCBCR)/12-bit 60, 60/1.001 and 50 Fields Interlaced Level A Level B-DL 30, 30/1.001, 25, 24 and 24/1.001 Frames Progressive L
34、evel A Level B-DL 30, 30/1.001, 25, 24 and 24/1.001 Frames PsF(1)Level A Level B-DL ST 2048-2 2048 x 1080(5)4:2:2 (YCBCR)/12-bit 4:2:2:4 (YCBCR+A)/12-bit(6)30, 30/1.001, 25, 24 and 24/1.001 Frames Progressive Level A Level B-DL 30, 30/1.001, 25, 24 and 24/1.001 Frames PsF(2)Level A Level B-DL Notes:
35、 (1) PsF structure as defined in SMPTE ST 274 (2) PsF structure as defined in SMPTE ST 2048-2 (3) PsF structure as defined in SMPTE ST 429-9 (4) PsF structure as defined in SMPTE ST 428-19 (5) This is the maximum pixel array, the active image may not fill the maximum array. (6) Definition of the Alp
36、ha channel mapping is application dependant. Refer to SMPTE ST 425-1 for further details. (7) In this image format RGB indicates either RGB or RFSGFSBFS. 6 20-Bit Virtual Interface Data Structure Each Le and Re image of the stereoscopic image pair shall be constructed as an individual 20-bit virtual
37、 interface consisting of two times 10-bit data structures, Data Stream 1 and Data Stream 2, as defined in SMPTE ST 425-1. The 20-bit virtual interface for each Le and Re image shall be constructed in accordance with either the Level A mapping mode, or the Level B-DL mapping mode defined in SMPTE ST
38、425-1. In addition to the different mapping modes, SMPTE ST 425-1 also defines different mapping structures for each image format. Refer to SMPTE ST 425-1 for details of the mapping structures and mapping modes for each format in Table 1. Each virtual interface shall contain timing reference code wo
39、rds (SAV/EAV), line numbers and line based CRCs as defined in SMPTE ST 425-1 and/or the source image format document. Each virtual interface shall be frame, line and word aligned, having an interface frequency of 148.5 MHz or 148.5/1.001 MHz. SMPTE ST 425-4:2012 Page 6 of 17 pages An example of the
40、20-bit virtual interface for both Le and Re is shown in Figure 1. By way of example, Figure 1 illustrates mapping structure 1 of the Level A direct mapping mode for YCBCR 4:2:2 / 10-Bit signals at 60, 60/1.001, 50, 48 and 48/1.001 Progressive Frames/Sec. Y 0Y 1Y 2Y 1919EAV(3FFh)EAV(000h)EAV(000h)Y 1
41、918EAV(XYZh)Y 3CB0CR0CB1CR1CB959CR959EAV(3FFh)EAV(000h)EAV(000h)EAV(XYZh)SAV(3FFh)SAV(000h)SAV(000h)SAV(XYZh)SAV(3FFh)SAV(000h)SAV(000h)SAV(XYZh)Interface clock frequency = 148.5 MHz or 148.5 / 1.001 MHzInterface clock frequency = 148.5 MHz or 148.5 / 1.001 MHzLN0LN1LN0LN1CR0CR1CR0CR1Ancillary data
42、spaceAncillary data spaceData stream one of the Re (Right Eye) virtual interfaceData stream two of the Re (Right Eye) virtual interfaceRe (Right Eye)20-Bit Virtual InterfaceY 0Y 1Y 2Y 1919EAV(3FFh)EAV(000h)EAV(000h)Y 1918EAV(XYZh)Y 3CB0CR0CB1CR1CB959CR959EAV(3FFh)EAV(000h)EAV(000h)EAV(XYZh)SAV(3FFh)
43、SAV(000h)SAV(000h)SAV(XYZh)SAV(3FFh)SAV(000h)SAV(000h)SAV(XYZh)Interface clock frequency = 148.5 MHz or 148.5 / 1.001 MHzInterface clock frequency = 148.5 MHz or 148.5 / 1.001 MHzLN0LN1LN0LN1CR0CR1CR0CR1Ancillary data spaceAncillary data spaceData stream one of the Le (Left Eye) virtual interfaceDat
44、a stream two of the Le (Left Eye)virtual interfaceLe (Left Eye)20-Bit Virtual InterfaceFigure 1 Example Mapping Structure 1 from SMPTE ST 425-1 4:2:2 (YCBCR)/10-Bit Signals at 60, 60/1.001, 50, 48 and 48/1.001 Progressive Frames/Sec 6.1 Level A Direct Mapping Mode 6.1.1 Image Mapping Structures R, G
45、, B,/ Y, CB, CR,/ X, Y, Z and A components shall be mapped into the virtual interface of each Le and Re image according to the Level A mapping structures defined in SMPTE ST 425-1. 6.1.2 Level A Direct Mapping Mode Audio Data When present, audio data shall be mapped into the ancillary data space of
46、the Le and Re virtual interface, according to the Level A audio mapping rules defined in SMPTE ST 425-1. The audio data shall be mapped onto the Le virtual interface first and any remaining data shall then be mapped onto the Re virtual interface. In some applications audio data of the Le interface m
47、ay be duplicated in the Re interface. Audio channel usage shall be further signaled using the payload identifier as defined in Section 7 of this document. 6.1.2.1 Number of Audio Channels Up to 32 audio channels sampled at 32 kHz, 44.1 kHz or 48 kHz may be mapped into each Le and Re virtual interfac
48、e for a maximum of up to 64 channels. At 96-kHz sampling, up to 16 audio channels may be mapped into each Le and Re virtual interface for a maximum of up to 32 channels. SMPTE ST 425-4:2012 Page 7 of 17 pages The maximum number of audio channels that can be mapped into the available ancillary data s
49、pace of the Le and Re virtual interface varies in accordance with the video format and the video frame rate as shown in SMPTE ST 425-1, repeated here in Table 2 for convenience. Table 2 Number of audio channels supported for each image format frame rate and audio sampling rate (Informative) Image Format Frame/Field Rates Maximum number of audio channels at 32-kH