1、 Copyright 2016 by THE SOCIETY OF MOTION PICTURE AND TELEVISION ENGINEERS 3 Barker Avenue, White Plains, NY 10601 (914) 761-1100 Approved July 29, 2016 Table of Contents Page Foreword . 2 Introduction 2 1 Scope . 3 2 Conformance Notation . 3 3 Glossary . 3 3.1 Basic Concepts . 3 3.2 Image Format Rep
2、resentations 8 3.3 Display Types 9 Page 1 of 9 pages SMPTE EG 2061:2016 SMPTE ENGINEERING GUIDELINE Stereoscopic Distribution Master Glossary SMPTE EG 2061:2016 Page 2 of 9 pages Foreword SMPTE (the Society of Motion Picture and Television Engineers) is an internationally-recognized standards develo
3、ping organization. Headquartered and incorporated in the United States of America, SMPTE has members in over 80 countries on six continents. SMPTEs Engineering Documents, including Standards, Recommended Practices, and Engineering Guidelines, are prepared by SMPTEs Technology Committees. Participati
4、on in these Committees is open to all with a bona fide interest in their work. SMPTE cooperates closely with other standards-developing organizations, including ISO, IEC and ITU. SMPTE Engineering Documents are drafted in accordance with the rules given in its Standards Operations Manual. SMPTE EG 2
5、061 was prepared by Technology Committee 30MR. Introduction This section is entirely informative and does not form an integral part of this Engineering Document. SMPTE created a Task Force on 3D to the Home which began its work in August 2008 and published its recommendations in March 2009. SMPTE co
6、mmittees performed further work on specific outputs of this Task Force, ultimately resulting in creation of a Glossary of Terms applying to Stereoscopic television. These committees requested creation of a standalone Glossary document for the benefit of the SMPTE community. SMPTE EG 2061-1:2016 Page
7、 3 of 9 pages 1 Scope This document provides a centralized Glossary of Terms related to the distribution and display of Stereoscopic content for the home. 2 Conformance Notation This Engineering Guideline is purely informative and meant to provide tutorial information to the industry. It does not im
8、pose Conformance Requirements and avoids the use of Conformance Notation. Engineering Guidelines frequently provide tutorial information about a Standard or Recommended Practice and when this is the case, the user should rely on the Standards and Recommended Practices referenced for interoperability
9、 information. 3 Glossary To assist with comprehension and consistency of documents relating to Stereoscopic television, the following definitions are provided. Some of the terms have been adopted from previously published glossaries and documents. Other terms have been generated in order to be speci
10、fically unambiguous, especially in cases where ambiguous terms have been popularly used with multiple definitions. 3.1 Basic Concepts 2D Extraction: A process of deriving a 2D image from a Stereoscopic source. 2D Overlay: A single instance of a two-dimensional closed caption, subtitle, or graphics o
11、verlay that contains no information embedded in it regarding depth. See also: 3D Overlay, Overlay Essence Track, Overlay Offset Value. 2D Overlay With Offset: A 2D Overlay with Overlay Offset information. See also: 2D Overlay, Overlay Essence Track, Overlay Offset Value. 3D-Compatible or 3D-Complian
12、t Device: A device (such as a receiver or set-top box (STB) that can identify Stereoscopic 3D content and pass-through that Stereoscopic 3D content data stream (without change) to downstream devices. 3D-Enabled System: A system that can decode standardized 3D distribution formats and properly displa
13、y them in 3D. This system can include a display along with additional devices (receiver, STB, and/or IR transmitter), or be integrated into a single display. A 3D-Compliant Device, a 3D-Ready Display, and additional devices (such as a Stereoscopic selection device) can combine to form a 3D-Enabled S
14、ystem. 3D-Exclusive Content: 3D TV content designed or intended only to be viewed in 3D, and not designed or intended to be converted to 2D for 2D display. Examples of 3D-Exclusive content might include movies, concerts or other productions that are produced specifically for 3D viewing that are eith
15、er not made available for 2D or that require separate mastering for a 2D viewing version. 3D-Ready Display: A 3D display that requires the addition of a decoder before it can properly display Stereoscopic 3D content encoded using a standardized Stereoscopic distribution format. 3DHM: An acronym for
16、3D Home Master. 3D Distribution Data: Compressed and possibly encrypted data file or stream derived from the Stereoscopic Distribution Master used for actual distribution to the home. SMPTE EG 2061:2016 Page 4 of 9 pages 3D Home Master: An uncompressed and unencrypted image format or file package de
17、rived from a 3D Source Master 3D Overlay: A single instance of a closed caption, subtitle, or graphics overlay that includes either: 3D Overlay Image Pair: A Stereoscopic 3D left eye/right eye image pair; or 3D Image Content: Image content which results in two or more simultaneous views of a scene t
18、o create the perception of depth at the display. 3D Source Master: The digital master of a Stereoscopic feature. Absolute Parallax Transition Limit: The maximum rate of parallax change (dZ/dt) that allows a viewer to fuse views into a Stereoscopic 3D image. Accommodation: The ability of the eyes len
19、ses to change shape in order to focus. Accommodation/Vergence Relationship: The relationship between the focusing of the eyes and verging of the eyes when looking at a particular object point in the visual world. When accommodation and vergence match, the time required to fuse Stereoscopic stimulus
20、is reduced, distortions in perceived depth are reduced, and viewer fatigue and discomfort are reduced. Also called the accommodation convergence relationship, or the convergence accommodation relationship. Anaglyph: A Stereoscopic moving or still image where each view has been superimposed with a co
21、mplimentary color that produces a 3D effect when viewed through similarly colored glasses. Binocular Symmetries: Identical visual parameters between a stereo pair of left and right eye images with the sole exception of horizontal disparity (parallax). The symmetrical parameters consist of illuminati
22、on, focus, geometry, color, registration and temporal symmetry. Blur Gradient: In real scenes, the retinal image is sharpest for objects in focus and blurred for objects not in focus. In many Stereoscopic 3D display systems with flat screens, the entire image on the screen is in focus at all times,
23、reducing the Blur Gradient depth cue. Camera: As defined here for this document includes the lens and all associated equipment required to create digital representations of motion picture images. The camera is generally capable of stand alone operation. Camera Parameters: These parameters completely
24、 determine the transformation between 3D coordinates in the world reference frame of an object being imaged by a camera and the 2D image plane coordinates of the resulting image. Intrinsic Camera Parameters: A set of parameters that relate the geometry of ideal perspective projection to the physics
25、of the camera. These parameters consist of the camera constant f (the distance between the image plane and the lens), the principal point which is the intersection point of the optic axis with the image plane in the measurement reference plane located on the image plane, the geometric distortion cha
26、racteristics of the lens, and the horizontal and vertical scale factors, i.e., distances between rows and between columns. In the field of Analytic Photogrammetry, this set of parameters is also called the Interior Orientation or the Inner Orientation of the camera. Extrinsic Camera Parameters: A se
27、t of parameters that determine the pose of the camera in the 3D world reference frame. The parameters determine the rigid body transformation between the world coordinate frame and camera-centered 3D coordinate frame. In the field of Analytic Photogrammetry, this set of parameters is also called the
28、 Exterior Orientation of the camera. SMPTE EG 2061-1:2016 Page 5 of 9 pages Camera Systems: As defined here for this document consists of all cameras and associated equipment to create S3D content or multi-view content. In the context of this document a camera system is also considered an imaging sy
29、stem. Coincident In Time: With respect to dual video signals for Stereoscopic television, this means that not only are the two video signals genlocked, but that they represent the same moment in time for the scene displayed. Color Rivalry: A condition where the color information of homologous points
30、 creates a depth cue that is inconsistent with other depth cues, notably parallax. Note that this does not imply that the color information of multiple points must be the same. Compatible 3D Content: Stereoscopic 3D TV content which is designed to include or enable conversion to and viewing of the c
31、ontent in 2D. Users with 3D-Enabled equipment supporting the distribution format will display compatible Stereoscopic 3D Content in 3D, while those without such equipment will still enjoy the same program as a 2D representation. Compression: A lossless or lossy process on Stereoscopic 3D content to
32、reduce the size of the image data in order to be able to store or transmit data in an efficient manner. Compression Encoding: The process of converting data into a specialized format for efficient transmission or storage. Convergence: The inward rotation of the eyes, in the horizontal direction, pro
33、ducing fusion. The more general term is vergence, which includes inward and outward rotation. Convergence is also sometimes used to also refer to the toe-in of stereo cameras. Corresponding Points, Conjugate Points, or Homologous Points: The points imaged in the left and right eyes (or the points in
34、 the left and right Stereoscopic image channels) referring to the same point on the depicted object. Cue Conflict: A condition that occurs when multiple depth cues indicate different depth ordering (e.g., when monocular depth cues such as size difference conflict with binocular depth cues such as pa
35、rallax). Depth Map: A Depth Map is an array containing multiple Depth Values. A Depth Map corresponds to an image from a particular viewpoint and each Depth Value corresponds to a pixel in that image. A Depth Map shall have one Depth Value for each pixel in the corresponding image. No additional met
36、adata is required for interpretation of a Depth Map. Depth Range: The range of distances in camera space from the background point producing maximum acceptable positive parallax to the foreground point producing maximum acceptable negative parallax. This is also known as the depth budget, Maximum Al
37、lowable On-Film Deviation (MAOFD), or Fusible Range. It can be given as a ratio of screen width. When referring to the related parallax, the term parallax budget is used. Depth Value: Depth Value shall represent the Physical Depth, Z, in the IEEE 754-2008 single-precision (32-bit) binary floating-po
38、int format. Diplopia: also called Double vision, which in stereo viewing is a condition where the left and right homologues in a stereogram remain separate instead of being fused into a single image. Disparity: The distance between conjugate points on a stereo imaging device or on recorded images. O
39、n human retinae, this is known as retinal disparity. The corresponding term for the display screen is parallax. Disparity Map: An array representing multiple Disparity Values corresponding to a particular one of the images in the stereoscopic image pair. SMPTE EG 2061:2016 Page 6 of 9 pages Disparit
40、y Units (DU): An image-width referenced measure used to represent disparities in a stereoscopic image pair. Disparity Value: the horizontal displacement between corresponding points in a stereoscopic image pair, measured in Disparity Units, from a point in the left-eye image plane to the correspondi
41、ng point in the right-eye image plane. Display Parallax Transition Limit: The maximum rate of parallax change for a particular display system that allows a viewer to fuse views into a 3D image. Note that this takes into account display characteristics, such a refresh rate, update frequency, image di
42、splay multipliers, etc. Edge Violation: When objects in front of the screen plane (negative parallax) intersect the edge of frame. This can lead to contradictory depth cues for the viewer. Causing an edge violation is sometimes called “Breaking the Frame”. This problem can be reduced through the use
43、 of a Floating Window. Exterior Orientation: In the field of Analytic Photogrammetry, Exterior Orientation refers to an images position and orientation related to an exterior coordinate system. Floating Window: A black or partially transparent mask that can be placed on the left of the left image an
44、d/or on the right of the right image in order to reduce the depth cue confusion due to edge violations. Floating Windows create the illusion of moving the physical borders of the screen surround forward or backward. A floating window can be a Dynamic Window by changing window depth or tilting the wi
45、ndow at different times during playout, and can either be pre-rendered into the image or generated by the display using metadata. Focal Volume: The volumetric space in a Stereoscopic 3D image that is in focus. This is the focal plane multiplied by the depth of field. Formatting: A process for conver
46、ting Stereoscopic content into another format/representation. It could involve spatial and/or temporal decimation, interpretation, extraction, etc., to create a new image structure with quality that is equal to or less than the original Stereoscopic content. Fusion: The combination, by the mind, of
47、the images seen by the left and right eyes into a single image. Genlock: Abbreviation of “sync Generator Lock.” Genlock is a technique for locking a devices internal sync structures (and thus image structures) to a common external reference (a “sync generator”). This is especially important for S3D
48、imaging (camera) systems where both eye images need to be aligned precisely both vertically and horizontally in respect to each other. Ghosting: Crosstalk between Stereoscopic image channels. This can impair Stereoscopic depth cues. Sometimes this is due to imperfect selection devices. Horizontal Im
49、age Translation (HIT): Moving left and right eye images with respect to each other to adjust the zero parallax position for image points. Horopter: The locus in space of zero binocular retinal disparity. The theoretical horopter in the horizontal plane (if eye fixation is directly ahead) is known as the Vieth-Mller Circle. Image Formatting: A Generic term for the processing or transforming of image data from one format to another. Interaxial: The distance between the lens axes of stereo camera heads, also denoted as tc. Interocular or Interpupillary: The distance betw