ImageVerifierCode 换一换
格式:PDF , 页数:15 ,大小:641.23KB ,
资源ID:591123      下载积分:10000 积分
快捷下载
登录下载
邮箱/手机:
温馨提示:
如需开发票,请勿充值!快捷下载时,用户名和密码都是您填写的邮箱或者手机号,方便查询和重复下载(系统自动生成)。
如填写123,账号就是123,密码也是123。
特别说明:
请自助下载,系统不会自动发送文件的哦; 如果您已付费,想二次下载,请登录后访问:我的下载记录
支付方式: 支付宝扫码支付 微信扫码支付   
注意:如需开发票,请勿充值!
验证码:   换一换

加入VIP,免费下载
 

温馨提示:由于个人手机设置不同,如果发现不能下载,请复制以下地址【http://www.mydoc123.com/d-591123.html】到电脑端继续下载(重复下载不扣费)。

已注册用户请登录:
账号:
密码:
验证码:   换一换
  忘记密码?
三方登录: 微信登录  

下载须知

1: 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。
2: 试题试卷类文档,如果标题没有明确说明有答案则都视为没有答案,请知晓。
3: 文件的所有权益归上传用户所有。
4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
5. 本站仅提供交流平台,并不能对任何下载内容负责。
6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

版权提示 | 免责声明

本文(CAN CSA-ISO IEC TR 15938-8F-2013 Information technology - Multimedia content description interface - Part 8 Extraction and use of MPEG-7 descriptions AMENDMENT 3 Technologies for d.pdf)为本站会员(figureissue185)主动上传,麦多课文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知麦多课文库(发送邮件至master@mydoc123.com或直接QQ联系客服),我们立即给予删除!

CAN CSA-ISO IEC TR 15938-8F-2013 Information technology - Multimedia content description interface - Part 8 Extraction and use of MPEG-7 descriptions AMENDMENT 3 Technologies for d.pdf

1、Information technology Multimedia content description interface Part 8: Extraction and use of MPEG-7 descriptions AMENDMENT 6: Extraction and matching of video signature toolsAmendment 6:2013 (IDT) toNational Standard of CanadaCAN/CSA-ISO/IEC TR 15938-8-04(ISO/IEC TR 15938-8:2002, IDT)NOT FOR RESALE

2、 / PUBLICATION NON DESTINE LA REVENTE.Standards Update ServiceAmendment 6:2013 toCAN/CSA-ISO/IEC TR 15938-8-04January 2013Title:Information technology Multimedia content description interface Part 8: Extraction and use of MPEG-7 descriptions AMENDMENT 6: Extraction and matching of video signature t

3、oolsPagination:7 pages (iii preliminary and 4 text)To register for e-mail notification about any updates to this publicationgo to shop.csa.caclick on CSA Update ServiceThe List ID that you will need to register for updates to this publication is 2422261.If you require assistance, please e-mail techs

4、upportcsagroup.org or call 416-747-2233.Visit CSA Groups policy on privacy at csagroup.org/legal to find out how we protect your personal information.Reference numberISO/IEC TR 15938-8:2002/Amd.6:2011(E)ISO/IEC 2011TECHNICAL REPORT ISO/IECTR15938-8First edition2002-12-15AMENDMENT 62011-11-01Informat

5、ion technology Multimedia content description interface Part 8: Extraction and use of MPEG-7 descriptions AMENDMENT 6: Extraction and matching of video signature tools Technologies de linformation Interface de description du contenu multimdia Partie 8: Extraction et utilisation des descriptions MPEG

6、7 AMENDEMENT 6: Extraction et correspondance des outils de signature vido ISO/IEC TR 15938-8:2002/Amd.6:2011(E) COPYRIGHT PROTECTED DOCUMENT ISO/IEC 2011 All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electroni

7、c or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISOs member body in the country of the requester. ISO copyright office Case postale 56 CH-1211 Geneva 20 Tel. + 41 22 749 01 11 Fax + 41 22 749 09 47 E-mail copyrightiso.org W

8、eb www.iso.org ii ISO/IEC 2011 All rights reservedAmendment 6:2013 toCAN/CSA-ISO/IEC TR 15938-8-04ISO/IEC TR 15938-8:2002/Amd.6:2011(E) ISO/IEC 2011 All rights reserved iiiForeword ISO (the International Organization for Standardization) and IEC (the International Electrotechnical Commission) form t

9、he specialized system for worldwide standardization. National bodies that are members of ISO or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity. ISO and IEC tech

10、nical committees collaborate in fields of mutual interest. Other international organizations, governmental and non-governmental, in liaison with ISO and IEC, also take part in the work. In the field of information technology, ISO and IEC have established a joint technical committee, ISO/IEC JTC 1. I

11、nternational Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2. The main task of the joint technical committee is to prepare International Standards. Draft International Standards adopted by the joint technical committee are circulated to national bodies for

12、voting. Publication as an International Standard requires approval by at least 75 % of the national bodies casting a vote. In exceptional circumstances, when the joint technical committee has collected data of a different kind from that which is normally published as an International Standard (“stat

13、e of the art”, for example), it may decide to publish a Technical Report. A Technical Report is entirely informative in nature and shall be subject to review every five years in the same manner as an International Standard. Attention is drawn to the possibility that some of the elements of this docu

14、ment may be the subject of patent rights. ISO and IEC shall not be held responsible for identifying any or all such patent rights. Amendment 6 to ISO/IEC TR 15938-8:2002 was prepared jointly by Joint Technical Committee ISO/IEC JTC 1, Information technology, Subcommittee SC 29, Coding of audio, pict

15、ure, multimedia and hypermedia information. Amendment 6:2013 toCAN/CSA-ISO/IEC TR 15938-8-04Amendment 6:2013 toCAN/CSA-ISO/IEC TR 15938-8-04ISO/IEC TR 15938-8:2002/Amd.6:2011(E) ISO/IEC 2011 All rights reserved 1Information technology Multimedia content description interface Part 8: Extraction and u

16、se of MPEG-7 descriptions AMENDMENT 6: Extraction and matching of video signature tools After 4.9.1, add: 4.9.2 Video Signature The visual content descriptors in Clauses 6-9 of ISO/IEC 15938-3:2002 are very useful when trying to find videos with similar content. These descriptors are intended to be

17、general and were found to be unsuitable for the task of finding duplicate content. The video signature descriptor is designed to identify duplicate video content. This descriptor is robust to a wide range of common video editing operations, but is sufficiently different for every original content to

18、 identify it reliably. The video signature is composed of three main elements: a frame signature, a set of compact summary frame signatures, referred to as words, and a group-of-frames representation for a temporal segment, referred to as a bag of words. 4.9.2.1 Extraction 11.4.5 to 11.4.8 of ISO/IE

19、C 15938-3:2002 describe the extraction of the video signature. 4.9.2.2 Matching A Video Signature is composed of multiple temporal segments, each represented by a BagOfWords element, and multiple frames, each represented by a FrameSignature element and a FrameConfidence element. The matching between

20、 two Video Signatures and is carried out in three stages, designed to maximize matching speed and true positives and to minimize false positives. The first stage uses the BagOfWords element to identify candidate matching segments. The second stage uses the FrameSignature element to identify candidat

21、es of frame rate ratio and temporal offset between the candidate matching segments. The third stage performs frame-by-frame matching to determine candidate matching intervals using the FrameSignature and FrameConfidence elements, and then determines the best match between the Video Signatures and .

22、These matching stages are explained in more detail below. 1v2v1v2vStage 1 (Segment matching with BagOfWords) All of the temporal segments of Video Signature are compared with all of the temporal segments of Video Signature . 1v2vAmendment 6:2013 toCAN/CSA-ISO/IEC TR 15938-8-04ISO/IEC TR 15938-8:2002

23、/Amd.6:2011(E) 2 ISO/IEC 2011 All rights reservedFor two segments and , their similarity is assessed by comparing the bag-of-words representation for each vocabulary j and merging the results to reach a decision. More specifically, for 1f2f1BagOfWords j and 2BagOfWords j , their distance is measured

24、 by the Jaccard distance metric given by 1212#( ), 1#( )jJBagOfWords j BagOfWords jD BagOfWords j BagOfWords jBagOfWords j BagOfWords jwhere # denotes the number of elements in a set. This measures the distance of the segments 1f and 2f in a given vocabulary as a function of the distinct words they

25、have in common and all the distinct words that they contain jointly. For vocabularies, we have Jaccard distances , , ,. . These distances are fused to give the composite distance as 5QJD0JD1JQD1JD10QkJkJDD Then a decision on the similarity of the segments is reached by thresholding each of Jaccard d

26、istances , , ,. , and the composite distance . That is, the segments and are passed to stage 2 of matching if more than half of the Q Jaccard distances , , ,. are less than a threshold T and the composite distance is less than another threshold , otherwise they are declared not matching. JD01JD1JQD1

27、JD1ifJQD12jfJD02TJD1JDStage 2 (Frame rate ratio & time shift estimation using Hough transform) For the segment pairs passed to this stage, a Hough transform is used to estimate the temporal parameter differences, i.e. frame rate ratio and time shift, between the segments. These are linear properties

28、 and can therefore be estimated using two strongly corresponding frame pairs. First, the L1 distance between the FrameSignature elements of the frame pairs between the segments are calculated and the pairs whose distance is smaller than a threshold are selected as strongly corresponding frame pairs.

29、 Then, two strongly corresponding frame pairs are selected to calculate the time shift and frame rate ratio, and the bin corresponding to the calculated parameters in the Hough space is incremented. This is done for all possible combinations of two strongly corresponding frame pairs. Finally, multip

30、le temporal parameters with high response in the Hough space are selected as candidate parameters, and are passed to stage 3 of matching. If the highest response in the Hough space is below a certain threshold, the segment pairs are declared not matching. Stage 3 (Frame-by-frame matching on frame si

31、gnature) The matching interval (the start and end position of the match) is determined by temporal interval growing based on a frame-by-frame matching on the full frame signature. The candidate temporal parameters between two sequences are used in this frame-by-frame matching. Amendment 6:2013 toCAN

32、/CSA-ISO/IEC TR 15938-8-04ISO/IEC TR 15938-8:2002/Amd.6:2011(E) ISO/IEC 2011 All rights reserved 3First, the estimated time shift is used to determine the initial temporal matching position. Then, using the estimated frame rate ratio, the temporal interval is extended frame-by-frame towards both dir

33、ections by calculating the L1 distance between the FrameSignature elements of corresponding frames. The temporal extension stops when the distance exceeds a certain threshold in order to determine the matching interval. If the length of the matching interval is shorter than a given minimum duration,

34、 the matching interval is eliminated as a non-match. Otherwise, the FrameConfidence element associated with each frame in the matching interval is checked to verify the match. The overall confidence of the matching interval is calculated as the ratio of the number of frames which has a FrameConfiden

35、ce that is higher than a certain threshold. If the overall confidence is below a certain level, the matching interval is eliminated as a false match caused by frames with low information content. This process is carried out for all of the candidate temporal parameters, thus generating multiple candi

36、dates of matching intervals. Then, one candidate interval is selected as the best matching result, based on the mean L1 distance of FrameSignatures and the length of the interval. The best interval is selected by first selecting multiple intervals with mean L1 distances below a threshold and then se

37、lecting amongst them the one which has the longest length. Clause 7 of ISO/IEC 15938-6:2003 contains an exemplary implementation and source code for this matching technique, including default threshold values. 4.9.2.3 Fast Matching using Index The process of fast matching using index tables may be u

38、sed as a pre-filtering step to quickly determine whether two Video Signatures (a query video) and (a reference video) have possible matching intervals, in which case the matching process described in 4.9.2.2 follows. 1v2vIndex tables using the word elements of the Video Signature are utilized in the

39、 fast matching process. For each video, index tables for each word ( 5Q ) are built, which maps the values of the word (0-242) to the frame numbers of which they appear. By using the index tables, we can quickly locate frame pairs between two videos which have the same word values. The index tables

40、are built before matching is carried out. For the reference video, all of the frames are used to build the index tables, referred to as all-frame index tables. For the query video, only selected keyframes are used to build the index tables, referred to as keyframe index tables. The keyframe selectio

41、n of a query video is carried out using a keyframe detection algorithm. The proposed algorithm uses the FrameSignature element, and proceeds as follows. Keyframe selection for a query video 1. Calculate L1 distance between FrameSignature elements of each frame and its previous frame. 2. Set a slidin

42、g window and find the maximum L1 distance within the window. The frame which the distance with its previous frame gives the maximum value is a keyframe candidate. A recommended size of the sliding window is 4 seconds. In case the expected matching duration is less than 4 seconds, the recommended win

43、dow size is half of the expected matching duration. The keyframe candidates for the whole video should be selected by sliding the window by one frame. 3. If consecutive frames are selected as keyframe candidates, remove all such consecutive keyframe candidates except for the last one. 4. If the Fram

44、eConfidence element of the keyframe candidate is smaller than a predefined threshold, discard that keyframe candidate and select as a keyframe candidate the first posterior frame of which the FrameConfidence element is larger than or equal to the threshold. 5. The remaining keyframe candidates are s

45、elected as keyframes. Amendment 6:2013 toCAN/CSA-ISO/IEC TR 15938-8-04ISO/IEC TR 15938-8:2002/Amd.6:2011(E) 4 ISO/IEC 2011 All rights reservedThe matching process between a query video and a reference video uses the keyframe index tables of the query video and all-frame index tables of the reference

46、 video to identify matching frame pairs between the query and the reference videos. The detailed matching procedure is as follows. Matching procedure using index tables 1. For each word ( ), keyframe index table of the query video and all-frame index table of the reference video are compared to iden

47、tify frame pairs with the same word value between the query and the reference video. Frame pairs having the same value for multiple of the 5Q5Q words are selected as candidate of matching frame pairs to be passed to the next step. 2. Each candidate frame pair is further verified by calculating the L

48、1 distance between the FrameSignature elements. This can be done in two steps, 1) first by calculating the L1 distance using only the 25 out of 380 dimensions used to construct the words, 2) then by calculating the L1 distance for the whole 380 FrameSignature dimensions. If the L1 distance is smalle

49、r than a predefined threshold, the frame pairs are identified as matching frame pairs. 3. If the number of the matching frame pairs between the query and reference video identified in the previous step is larger than a predefined threshold, the query and the reference videos are passed to the matching process described in 4.9.2.2 for identifying the matching intervals. Clause 7 of ISO/IEC 15938-6:2003 contains an exemplary implementation and source code for this matching technique, including de

copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1