1、 Copyright 2009 by THE SOCIETY OF MOTION PICTURE AND TELEVISION ENGINEERS 3 Barker Avenue, White Plains, NY 10601 (914) 761-1100 Approved February 11, 2009 Table of Contents Page Foreword 2 Intellectual Property .2 Introduction 2 1 Scope 3 2 Conformance Notation 3 3 Normative References 4 4 Definiti
2、ons .5 5 Stream Composition9 5.1 Overall Stream Sequence .9 5.2 Sequence of MAP Packets and Active Material Media Packets.11 5.3 Decoder Pre-charge Media Packets .11 5.4 Sequence of Media Packets for Compound Clips 11 5.5 Mixed Media Tracks in Compound Clips (Informative) 12 6 Packet Headers 12 7 Pa
3、cket Type Descriptions. 13 7.1 Map Packets .13 7.2 Field Locator Table Packet .25 7.3 Unified Material Format Packets.27 7.4 Media Packets.39 7.5 End-of-Stream Packet.49 Annex A Transfer over IP using FTP (Informative) 50 A.1 Supported FTP Commands50 A.2 Initiating a Transfer 50 A.3 Transfer Options
4、 52 Annex B Synchronization of Media Elements (Informative). 54 B.1 An Example 54 Annex C Bibliography (Informative) . 56 Page 1 of 56 pages SMPTE 360-2009Revision of SMPTE 360M-2004 SMPTE STANDARD General Exchange Format SMPTE 360-2009 Page 2 of 56 pages Foreword SMPTE (the Society of Motion Pictur
5、e and Television Engineers) is an internationally-recognized standards developing organization. Headquartered and incorporated in the United States of America, SMPTE has members in over 80 countries on six continents. SMPTEs Engineering Documents, including Standards, Recommended Practices, and Engi
6、neering Guidelines, are prepared by SMPTEs Technology Committees. Participation in these Committees is open to all with a bona fide interest in their work. SMPTE cooperates closely with other standards-developing organizations, including ISO, IEC and ITU. SMPTE Engineering Documents are drafted in a
7、ccordance with the rules given in Part XIII of its Administrative Practices. This SMPTE Engineering Document was prepared by Technology Committee N26. Intellectual Property At the time of publication no notice had been received by SMPTE claiming patent rights essential to the implementation of this
8、Standard. However, attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. SMPTE shall not be held responsible for identifying any or all such patent rights. Introduction This Standard describes one file format that can be used to move si
9、mple clips or compound clips. It also describes how the file transfer protocol (FTP) can be used for the transfers. This includes file name syntax to support an implementation of partial stream transfers and other extensions. The byte streams described in this Standard can be moved using any reliabl
10、e transfer protocol and a network that supports that protocol. Annex A describes one protocol stack and a network technology that are in use today. There is no reason other protocols such as XTP could not be used. XTP supports features such as reliable multicast which are not part of FTP. In many ap
11、plications, real-time broadcast schemes are used. The byte stream for simple clips described in this Standard can be used under those conditions. However, this practice is not recommended for compound clips. The EBU/SMPTE Joint Task Force report also describes objectives for rich file formats and ri
12、ch metadata capabilities in future designs. The file format described in this Standard was in place before the task force report was issued and therefore does not meet all of the reports stated objectives. This type of transfer was initially implemented on a Fibre Channel network using FTP with tran
13、smission control protocol (TCP) and Internet protocol (IP). This transfer technique is well suited to implementations on IP networks such as Fibre Channel, Ethernet, or ATM, although it can be used on any reliable protocol stack and data network technology. An example of transfers over an IP link us
14、ing the IETF standard FTP is provided in Annex A. SMPTE 360M-2009 Page 3 of 56 pages 1 Scope This Standard specifies a file exchange format for the transfer of simple clips and compound clips between television program storage systems. This Standard provides documentation of formats and contents, in
15、cluding specification of packet types, contents, and order. Informative information on a control scheme, a protocol stack, and an interface is included. Physical links (data network technology) are not specified in this Standard. Any appropriate physical link may be used. Simple clips and compound c
16、lips, as defined in this Standard, are specified as alternatives to traditional approaches to transferring television programming between storage devices. This Standard defines two simple constructs for transfer of material: 1) A simple clip format in which all audio, video and time code tracks star
17、t and stop coincidentally and where the audio and video tracks are contiguous, and; 2) A compound clip format, which is a collection of audio, video, and time code tracks, any of which may contain several segments, with cut transitions between the segments. Transitions may occur on each track indepe
18、ndently of the other tracks. This Standard is independent of material format, whether compressed or not, and allows material to be transferred across a network before recording has finished. The transferred material specified in this Standard includes Motion JPEG (MJPEG), MPEG-1, MPEG-2, and DV-base
19、d video compression standards, with associated audio, time code, and user data that may include user-defined metadata. The video, audio, and time code material is time multiplexed to provide immediate availability at the receiver. 2 Conformance Notation Normative text is text that describes elements
20、 of the design that are indispensable or contains the conformance language keywords: “shall“, “should“, or “may“. Informative text is text that is potentially helpful to the user, but not indispensable, and can be removed, changed, or added editorially without affecting interoperability. Informative
21、 text does not contain any conformance keywords. All text in this document is, by default, normative, except: the Introduction, any section explicitly labeled as “Informative“ or individual paragraphs that start with “Note:” The keywords “shall“ and “shall not“ indicate requirements strictly to be f
22、ollowed in order to conform to the document and from which no deviation is permitted. The keywords, “should“ and “should not“ indicate that, among several possibilities, one is recommended as particularly suitable, without mentioning or excluding others; or that a certain course of action is preferr
23、ed but not necessarily required; or that (in the negative form) a certain possibility or course of action is deprecated but not prohibited. The keywords “may“ and “need not“ indicate courses of action permissible within the limits of the document. The keyword “reserved” indicates a provision that is
24、 not defined at this time, shall not be used, and may be defined in the future. The keyword “forbidden” indicates “reserved” and in addition indicates that the provision will never be defined in the future. Unless otherwise specified, the order of precedence of the types of normative information in
25、this document shall be as follows: Normative prose shall be the authoritative definition; Tables shall be next; followed by formal languages; then figures; and then any other language forms. SMPTE 360-2009 Page 4 of 56 pages 3 Normative References The following standards contain provisions which, th
26、rough reference in this text, constitute provisions of this registered disclosure document. At the time of publication, the editions indicated were valid. All standards are subject to revision, and parties to agreements based on this registered disclosure document are encouraged to investigate the p
27、ossibility of applying the most recent edition of the standards indicated below. SMPTE 12M-1-2008, Television Time and Control Code SMPTE 292-2008, 1.5 Gb/s Signal/Data Interface SMPTE 314M-2005, Television Data Structure for DV-Based Audio, Data and Compressed Video 25 and 50 Mb/s SMPTE 337-2008, F
28、ormat for Non-PCM Audio and Data in an AES3 Serial Digital Audio Interface SMPTE 338-2008, Format for Non-PCM Audio and Data in AES3 Data Types SMPTE 340-2008, Format for Non-PCM Audio and Data in AES3 ATSC A/52B Digital Audio Compression Standard for AC-3 and Enhanced AC-3 Data Types SMPTE RP 202-2
29、008, Video Alignment for Compression Coding AES3-2003, AES Standard for Digital Audio Digital Input-Output Interfacing Serial Transmission Format for Two-Channel Linearly Represented Digital Audio Data ANSI INCITS 4-1986 (R2002), Information Systems Coded Character Sets 7-Bit American National Stand
30、ard Code for Information Interchange (7-Bit ASCII) formerly ANSI X3.4-1986 (R1997) ATSC A/52:2005, Digital Audio Compression (AC-3) Standard IEEE 754-1985 (R1990), Standard for Binary Floating-Point Arithmetic ISO/IEC 11172-2:1993, Information Technology Coding of Moving Pictures and Associated Audi
31、o for Digital Storage Media at Up to About 1.5 Mb/s Part 2: Video ISO/IEC 11172-2:1993/Cor 1:1996 ISO/IEC 11172-2:1993/Cor 2:1999 ISO/IEC 13818-1:2007, Information Technology Generic Coding of Moving Pictures and Associated Audio Information: Systems ISO/IEC 13818-1:2007/Cor 1:2008 ISO/IEC 13818-2:2
32、000, Information Technology Generic Coding of Moving Pictures and Associated Audio Information: Video ISO/IEC 13818-2:2000/Cor 1:2007 ISO/IEC 13818-2:2000/Cor 2:2007 ISO/IEC 13818-2:2000/Amd 1:2001 Support for Colour Spaces SMPTE 360M-2009 Page 5 of 56 pages ISO/IEC-61834-2 (1998-08), Recording Heli
33、cal-Scan Digital Video Cassette Recording System Using 6.35mm Magnetic Tape for Consumer Use (525-60, 625-50 and 1250-50 Systems), Part 2: SD Format for 525-60 and 625-50 Systems ISO/IEC 10918-1:1994, Information Technology Digital Compression and Coding of Continuous-Tone Still Images (JPEG) ISO/IE
34、C 10918-1:1994/Cor 1:2005 ITU-R BT.601-5 (10/95), Studio Encoding Parameters of Digital Television for Standard 4:3 and Wide-Screen 16:9 Aspect Ratios ITU-R BS.1196 (1995) (Annex 2), Audio Coding for Digital Terrestrial Television Broadcasting ITU-T H.261 (03/93), Video Codec for Audiovisual Service
35、s at p x 64 kb/s ITU-T T.81 (1992) Information Technology Digital Compression and Coding of Continuous-Tone Still Images: Requirements and Guidelines (JPEG) 4 Definitions The following terms are defined for use in this Standard. Notes: 1) For standard definition material, the traditional style of to
36、tal number of lines per frame (525 and 625) has been used. For high-definition material, the new convention of active lines per frame (720 and 1080) has been used 2) The terminology for MPEG video streams includes “MPEG” when describing an arbitrary MPEG video type. The terms “MPEG-1” and “MPEG-2” a
37、re used when a specific version of MPEG is required. MPEG-1 video is always standard definition. When a MPEG-2 stream is standard definition video in either the 525 or 625 format, the term ”MPEG-2 SD” is used. Finally “MPEG-2 HD” is used when a MPEG-2 stream is a high definition video stream. GXF MP
38、EG video streams are always encoded as single frames of MPEG Elementary Stream video elements. 4.1 clip: Clips shall consist of collections of time code, audio, and compressed video tracks. The contents of the tracks determine whether the material is a simple clip or a compound clip. See also simple
39、 clip and compound clip. 4.2 compound clip: In a compound clip, each track may be comprised of one or more segments arranged in a specified sequence as shown in Figure 1. Each segment of material on a track of a compound clip shall originate from a single source media file. Segments may comprise all
40、 or part of the material from a source media file. If only part of the material is used, it shall be a single contiguous segment delimited by a mark in and a mark out point. Transitions between segments shall be cut transitions. Video track 1 Media segment V1 Media segment V2 Audio track 1 Media seg
41、ment A1 Media segment A3 Audio track 2 Media segment A2 Media segment A4 Time code Media segment T1 Figure 1 Example of a compound clip SMPTE 360-2009 Page 6 of 56 pages 4.3 data types: Table 1 describes the data types used in this Standard. Numerical values in this Standard that are preceded by 0x
42、are hexadecimal values. All other numerical values are decimal numbers. Table 1 Data type descriptions Data type Description BINARY 8-bit binary data without an implicit length INT32 32-bit signed integer UINT64 64-bit unsigned integer UINT32 32-bit unsigned integer UINT16 16-bit unsigned integer UI
43、NT8 8-bit unsigned integer CHARACTER A single character code using ANSI INCITS 4-1986 STRING An ordered vector of ANSI INCITS 4-1986 coded characters that is 0x00 (NULL) terminated. NOTE: Unicode is not allowed. FLOAT64 64-bit signed floating point (see IEEE 754) The byte order for multibyte values
44、defined in this Standard shall be the least significant byte (LSB) first, followed by progressively more significant bytes to the most significant byte (MSB) unless otherwise specified. Multibyte values in predefined items such as MPEG and DV compressed video streams shall be as stored/transported i
45、n their native order. 4.4 field identifier: The field identifier indicates the field number within the associated video frame: 1 = Field 1 2 = Field 2 For interlaced video, the field identifier alternates between 1 and 2. For progressive video, the field identifier is 1 on every field/frame. Note: T
46、he field 1/2 nomenclature is always used, even when traditional standards would use field 0/1. 4.5 field locator table: The optional field locator table (FLT) provides a method to locate the desired material in the stream without processing the entire stream. The FLT shall contain up to 1000 offsets
47、 into the stream, which can be used to locate a point in the stream near a specified field. If an FLT is not present, unneeded fields or frames may have to be processed in order to locate the desired field or frame. The FLT facilitates partial file transfers from devices such as data tape archives b
48、y providing the ability to locate the desired portion of the streamed material without reading the entire stream. 4.6 field number: An unsigned integer that designates the position of a video field on a playout time line. The field number time line shall be zero-based and increment with each field.
49、Frame based compression systems shall represent frames numbers with even field numbers. 4.7 field and frame rates: The field rate for 625-line material is exactly 50 fields per second. The field rate for 525-line material is documented as 59.94 fields per second. The actual field rate is 60/1.001 fields per second. The field rate for 1080-line interlaced material can be 60 fields/sec, 59.94 (the exact rate is 60/1.001) SMPTE 360M-2009 Page 7 of 56 pages fields/sec, or 50 f