Copyright 2014 by THE SOCIETY OF MOTION PICTURE AND TELEVISION ENGINEERS
3 Barker Avenue, White Plains, NY 10601
(914) 761-1100

Approved May 13, 2014

Table of Contents

Foreword
Intellectual Property
Introduction
1 Scope
2 Conformance Notation
3 Normative References
4 Definition of Acronyms, Terms and Data Types
5 Metadata Structure
6 Generic Container Structure
Annex A Bibliography (Informative)

Page 1 of 7 pages

SMPTE ST 2070-2:2014

SMPTE STANDARD

Stereoscopic 3D in MXF for Operations OP1a Mapping

Foreword

SMPTE (the
Society of Motion Picture and Television Engineers) is an internationally-recognized standards developing organization. Headquartered and incorporated in the United States of America, SMPTE has members in over 80 countries on six continents. SMPTE's Engineering Documents, including Standards, Recommended Practices, and Engineering Guidelines, are prepared by SMPTE's Technology Committees. Participation in these Committees is open to all with a bona fide interest in their work. SMPTE cooperates closely with other standards-developing organizations, including ISO, IEC and ITU.

SMPTE Engineering Documents are drafted in accordance with the rules given in its Standards Operations Manual.

SMPTE ST 2070-2 was prepared by Technology Committee 31FS.

Intellectual Property

At the time of publication no notice had been received by SMPTE claiming patent rights essential to the implementation of this
Engineering Document. However, attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. SMPTE shall not be held responsible for identifying any or all such patent rights.

Introduction

There are requirements for both interleaved and separated stereoscopic 3D content. While the timing relationship between each pair of images is especially important during stereoscopic acquisition and display, during post-production and distribution image pairs may be dealt with in non-real time as outlined in this suite of documents.

When content is created for distribution, left eye and right eye pictures might be interleaved. When content needs to be edited or manipulated in post-production, it might be stored as separated files. In stereoscopic acquisition, either an interleaved or separated structure can be used.

As a stereoscopic 3D asset moves from production, through post-production to distribution (whether broadcast, web, on-demand or other), it can be transcoded and rewrapped several times. Upon playout of a file compliant with this suite of standards, the timing relationships between left-eye images and right-eye images must be
recreated. SMPTE ST 2063 discusses this requirement (and others germane to high quality stereoscopic 3D production) in detail, and may be helpful to implementers.

The SMPTE ST 2070 document suite therefore addresses such use cases based on the common provisions defined in the part 1 document. The part 2 document specifies the interleaved wrapping using OP-1a and the part 3 document provides the single track wrapping using OP-Atom or OP-1a.

Dual image stereoscopic 3D imaging systems deliver two images (left eye and right eye) that are arranged to be seen simultaneously, or near simultaneously,
by the left and right eyes. Viewers then perceive increased depth in the picture, which becomes more like the natural binocular viewing experience. Regardless of how the pair of images is created, it is assumed that they are properly aligned in space and time at the input of the MXF file system. This standard therefore describes means by which the MXF file system can be as transparent as possible to the time and space alignment of the image pair.

The SMPTE D-Cinema family of standards describes an MXF mapping for stereoscopic images within that application space. The ST 2070 suite of documents is not intended to be used in the D-Cinema application space.

1 Scope

This document defines a standard way to wrap an interleaved stereoscopic 3D video stream in MXF using OP1a per SMPTE ST 378. It follows the common provisions defined in part 1 of this multipart document and defines the interleaving structure to be used by a Generic Container implementation conforming to this suite of documents. This standard currently supports two-stream stereoscopic images.

2 Conformance Notation

Normative text is text that describes elements of the design that are indispensable or contains the conformance language keywords: “shall”, “should”, or “may”. Informative text is text that is potentially helpful to the user, but not indispensable, and can be removed, changed, or added editorially without affecting interoperability. Informative text does not contain any conformance keywords.

All text in this document is, by default, normative, except: the Introduction, any section explicitly labeled as “Informative” or individual paragraphs that start with “Note:”

The keywords “shall” and “shall not” indicate requirements strictly to be followed in order to conform to the document and from which no deviation is permitted.

The keywords, “should” and “should not” indicate that, among several possibilities, one is recommended as particularly suitable, without mentioning or excluding others; or that a certain course of action is preferred but not necessarily required; or that (in the negative form) a certain possibility or course of action is deprecated but not prohibited.

The keywords “may” and “need not” indicate courses of action permissible within the limits of the document.

The keyword “reserved” indicates a provision that is not defined at this time, shall not be used, and may be defined in the future. The keyword “forbidden” indicates “reserved” and in addition indicates that the provision will never be defined in the future.

A conformant implementation according to this document is one that includes all mandatory provisions (“shall”) and, if
implemented, all recommended provisions (“should”) as described. A conformant implementation need not implement optional provisions (“may”) and need not implement them as described.

Unless otherwise specified, the order of precedence of the types of normative information in this document shall be as follows: Normative prose shall be the authoritative definition; Tables shall be next; followed by formal languages; then figures; and then any other language forms.

3 Normative References

Note: All references in this document to other SMPTE documents use the current numbering style (e.g. SMPTE ST 378:2004) although, during a transitional phase, the document as published (printed or PDF) may bear an older designation (such as SMPTE 378M-2004). Documents with the same root number (e.g. 378) and publication year (e.g. 2004) are functionally identical.

The following standards contain provisions
that, through reference in this text, constitute provisions of this standard. At the time of publication, the editions indicated were valid. All standards are subject to revision, and parties to agreements based on this standard are encouraged to investigate the possibility of applying the most recent edition of the standards indicated below.

SMPTE ST 377-1:2011, Material Exchange Format (MXF) File Format Specification
Amendment 1:2012 to SMPTE ST 377-1:2011
Amendment 2:2012 to SMPTE ST 377-1:2011
SMPTE ST 378:2004, Material Exchange Format (MXF) Operational Pattern 1A (Single Item, Single Package)
SMPTE ST 379-2:2010, Television Material Exchange Format (MXF) MXF Constrained Generic Container
SMPTE ST 2070-1:2014, Stereoscopic 3D in MXF for Operations Common Provisions

4 Definition of Acronyms, Terms and Data Types

For the purposes of this document, the terms and definitions given in SMPTE ST 377-1 apply.

5 Metadata Structure

Inside the MXF Header Metadata, the Content Storage shall reference one Material Package mapping the tracks of a single Source Package as an OP1a file per SMPTE ST 378, and shall contain two picture essence tracks, one for the left and one for the right eye. Each picture essence track shall identify the respective eye by the OperationsStereoscopic SubDescriptor, defined in SMPTE ST 2070-1.

The file may also contain data tracks. Where a data track defines data essence related to one of the picture essence streams, this shall be indicated by the use of the OperationsStereoscopic SubDescriptor, as defined in SMPTE ST 2070-1, to identify the respective eye. The file may also contain optional data tracks, one for each video stream, matching each of the eyes, left and right respectively.

As in regular OP1a files, the file may also contain an optional system item, optional audio streams and an optional generic data stream.

The scheme in Figure 1 illustrates how the Content Storage shall be structured.

Figure 1 Content Storage illustrative example

Note: Figure
1 omits significant details in the interest of simplicity.

6 Generic Container Structure

The Generic Container shall be frame wrapped as described in SMPTE ST 379-2. Clip wrapping is not possible since there are at least two tracks of video essence.

Any Picture Essence Descriptor shall have a strong
reference to the OperationsStereoscopic SubDescriptor, defined in SMPTE ST 2070-1. Any Data Essence Descriptor used to describe stereoscopic data essence wrapped according to this standard shall have a strong reference to the OperationsStereoscopic SubDescriptor, as defined in SMPTE ST 2070-1.

[Figure 1 content: a Content Storage holding a Material Package and a Source Package. Track labels shown for the packages: Timecode Track; Video Track (left eye); Video Track (right eye); Audio Track(s) (optional); Data Track (left eye related, optional); Data Track (right eye related, optional); Data Track (generic, optional).]

[Figure 2 content: Track:EditRate = frame rate. Labels shown: Index Table Entry (twice) and two groups of elements, each containing System, Video (L), Video (R), Audio 1, Audio 2, Data (L), Data (R) and Data (Generic).]

Figure 2 Example Generic Container structure

Note: The above diagram is simply an illustration, as the actual stream might not contain VANC and might contain either System Items or separate data tracks.

Annex A Bibliography (Informative)

SMPTE ST 2063:2012, Stereoscopic 3D Full Resolution Contribution Link Based on MPEG-2 TS
SMPTE ST 2070-3:2014, Stereoscopic 3D in MXF for Operations OP-Atom or Single Track OP-1a Mapping
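
Informative example: the track requirements of Section 5 and the frame wrapping of Section 6 can be sketched in Python as shown here. This is a minimal illustration under stated assumptions and is not part of the standard. The names `Eye`, `Track`, `check_source_package` and `frame_wrap` are hypothetical. The `eye` field merely stands in for the OperationsStereoscopic SubDescriptor of SMPTE ST 2070-1, whose actual encoding is not modelled. The sketch also assumes that each frame-wrapped content package carries one left-eye and one right-eye picture element.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class Eye(Enum):
    LEFT = "left"
    RIGHT = "right"


@dataclass
class Track:
    kind: str                  # "picture", "sound", "data" or "timecode"
    eye: Optional[Eye] = None  # hypothetical stand-in for the OperationsStereoscopic SubDescriptor


def check_source_package(tracks: List[Track]) -> List[str]:
    """Return a list of problems; an empty list means the layout passes this sketch."""
    problems = []
    pictures = [t for t in tracks if t.kind == "picture"]
    # Section 5: exactly two picture essence tracks are required.
    if len(pictures) != 2:
        problems.append(f"expected 2 picture tracks, found {len(pictures)}")
    # Section 5: each picture track identifies its eye, one left and one right.
    eyes = sorted(t.eye.value for t in pictures if t.eye is not None)
    if len(pictures) == 2 and eyes != ["left", "right"]:
        problems.append("picture tracks must identify one left and one right eye")
    return problems


def frame_wrap(left_frames: list, right_frames: list) -> List[dict]:
    """Pair frames into per-edit-unit packages (assumed layout, one pair per package)."""
    if len(left_frames) != len(right_frames):
        raise ValueError("left and right streams must have the same number of frames")
    return [{"video_left": l, "video_right": r}
            for l, r in zip(left_frames, right_frames)]


if __name__ == "__main__":
    layout = [
        Track("timecode"),
        Track("picture", Eye.LEFT),
        Track("picture", Eye.RIGHT),
        Track("sound"),
        Track("data", Eye.LEFT),   # data related to the left-eye picture stream
        Track("data"),             # generic data, no eye association
    ]
    print(check_source_package(layout))
    print(frame_wrap(["L0", "L1"], ["R0", "R1"]))
```

A data track with no `eye` value represents generic data in this sketch. A data track related to one picture stream carries the matching eye, mirroring the SubDescriptor rule in Section 5.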