Internet Taxonomies and Metadata- Creating Them, Using .ppt

上传人:priceawful190 文档编号:376576 上传时间:2018-10-08 格式:PPT 页数:344 大小:592.50KB
下载 相关 举报
Internet Taxonomies and Metadata- Creating Them, Using .ppt_第1页
第1页 / 共344页
Internet Taxonomies and Metadata- Creating Them, Using .ppt_第2页
第2页 / 共344页
Internet Taxonomies and Metadata- Creating Them, Using .ppt_第3页
第3页 / 共344页
Internet Taxonomies and Metadata- Creating Them, Using .ppt_第4页
第4页 / 共344页
Internet Taxonomies and Metadata- Creating Them, Using .ppt_第5页
第5页 / 共344页
亲,该文档总共344页,到这儿已超出免费预览范围,如果喜欢就下载吧!
资源描述

1、Copyright 2000 Access Innovations, Inc.,1,Internet Taxonomies and Metadata: Creating Them, Using Them National Online Meeting May 18, 2001,Marjorie M.K. Hlava Access Innovations, Inc.,Copyright 2000 Access Innovations, Inc.,2,Schedule for the day,Standards Markup methods Meta data Search Engines Tax

2、onomies,Copyright 2000 Access Innovations, Inc.,3,Logistics,Ask lots of questions If you wonder so do others. Break Lunch (at about slide #162) Afternoon Break Workshop ends at 5:00 PM,Copyright 2000 Access Innovations, Inc.,4,What we will cover todaydetails,Standards The MLs Search Engines Meta-dat

3、a Thesaurus and Taxonomy Construction Theory of knowledge Outlines of knowledge Types of vocabularies Methodologies for creation Details of construction,Copyright 2000 Access Innovations, Inc.,5,New Names,Taxonomy = Thesaurus Catalog = A&I Journal or Directory or Library Secondary = Distributor B2B

4、Efrastructure Market Forces = Stock Market Portal = Host,Copyright 2000 Access Innovations, Inc.,6,New Technologies,XML RDF Multilingual (Unicode) Multinational Geographically Distributed Processing,Copyright 2000 Access Innovations, Inc.,7,What does it take to make the digital information model wor

5、k?,Copyright 2000 Access Innovations, Inc.,8,together?,What makes it work,Copyright 2000 Access Innovations, Inc.,9,Standards,Copyright 2000 Access Innovations, Inc.,10,Standards,What are standards? Mutually agreed and consensus voted processes and measures The standards process Identify need Draft

6、standard Comments and resolution Vote approval by members,Copyright 2000 Access Innovations, Inc.,11,Official Standards Organizations,ISO - International Standards Organization National Bodies -ANSI - American National Standards Institute Subject areas NISO - National Information Standards Organizat

7、ion Voting Members - SLA,Copyright 2000 Access Innovations, Inc.,12,Power Quasi standards,Company proprietary - PDF - portable document format (PDF/X) W3C - World Wide Web Consortium TCP/IP - “standard” via accepted practice (RFC) Agreed practice Dialog Format b,Copyright 2000 Access Innovations, In

8、c.,13,The work horses of the net - all W3C standards,Physical Network Protocols Network Applications,Copyright 2000 Access Innovations, Inc.,14,Physical Network,Cabling a number of levels Desktop computer Host or server Router Telephone / leased lines,Copyright 2000 Access Innovations, Inc.,15,Proto

9、cols,Mutually agreed format or set of conventions “Standards” World Wide Web Consortium - W3C Internet messages travel in packets packets up to 1500 bytes (characters) each 7 - 8 bits per byte,Copyright 2000 Access Innovations, Inc.,16,Network Protocols,Controlled by Request for Comments (RFC) Few a

10、re standards - informal review process RFC 822 - electronic mail STD 5, RFC 791 - IP STD 7, RFC 793 - TCP RFC 2068 - HTTP,Copyright 2000 Access Innovations, Inc.,17,Application protocols,Turns the transmission into something we can recognize Mail, telnet, ftp, archie, gopher, WAIS MOSAIC WWW, Netsca

11、pe, Internet Explorer,Copyright 2000 Access Innovations, Inc.,18,Network Applications,Client server architecture Share the work load of the implementation system Uses several protocols at the same time email telnet ftp WWW,Copyright 2000 Access Innovations, Inc.,19,Key Content Standards - all NISO/I

12、SO standards,Organizational and Access Control Storage, Retrieval, and Preservation Field Formatting and Tagging Classification Basis for Content / Publishing Subject indexing (taxonomies) Abstracting,Copyright 2000 Access Innovations, Inc.,20,Copyright 2000 Access Innovations, Inc.,21,Lets get spec

13、ific: mark up languages,SGML - NISO/ISO HTML - W3C XML - W3C,Copyright 2000 Access Innovations, Inc.,22,The Nature of Markup,Structure - beginning, end; chapter above section; title before body Content - phone number, author, title, legal Added-value information - subject term indexing, document typ

14、e, version Format - BOLD, Italics, CENTER,Copyright 2000 Access Innovations, Inc.,23,Standard Generalized Markup Language (SGML),Meta-language Published, supported standard - ISO 8879:1988 Application and platform independence Sharing and re-packaging of information Portable,Copyright 2000 Access In

15、novations, Inc.,24,Standard Generalized Markup Language (SGML),Complex and challenging to maintain Not WEB friendly Lack of supported style sheet Complicated software No mainstream browser support,Copyright 2000 Access Innovations, Inc.,25,Basic Parts of SGML,1. SGML declaration 2. Document type def

16、inition (DTD) - HTML DTD EAD DTD 3. Document instance -Marked up Title Page,Copyright 2000 Access Innovations, Inc.,26,Basic Components of of SGML Markup - 1 of 2,Elements - the parts of the document, expressed as tags- Author- Title- Paragraph- Title page,Copyright 2000 Access Innovations, Inc.,27,

17、Basic Components of SGML Markup - 2 of 2,2. Attributes - information about elements Marjorie M.K. Hlava Jay Ven Eman SGML Short Course,Copyright 2000 Access Innovations, Inc.,28,Title Page - Coarse Markup,Copyright 2000 Access Innovations, Inc.,29,Title Page - Fine Markup,Copyright 2000 Access Innov

18、ations, Inc.,30,Journal of the National Cancer Institute, Vol. 91, No. 11, 899, June 2, 1999: Oxford University Press IN THIS ISSUE Significant Tax Consequences Human T-cell leukemia virus (HTLV) and bovine leukemia virus (BLV) are retroviruses that cause hematopoietic cancers and encode a unique pr

19、otein, Tax, which is involved in the transformation of infected cells. Philpott and Buehring (p. 933) have investigated the mechanism by which Tax proteins may induce cell transformation. They observed chromosomal damage and diminished DNA integrity in both virus-infected and tax gene-transfected ce

20、lls. To ascertain which pathways of DNA repair might . . . . . . . .,SGML example,Copyright 2000 Access Innovations, Inc.,31,Highwire sample,Journal of the National Cancer Institute, Vol. 91, No. 11, 899, June 2, 1999 1999 Oxford University Press IN THIS ISSUE Significant Tax Consequences Human T-ce

21、ll leukemia virus (HTLV) and bovine leukemia virus (BLV) are retroviruses that cause hematopoietic cancers and encode a unique protein, Tax, which is involved in the transformation of infected cells. Philpott and Buehring (p. 933) have investigated the mechanism by which Tax proteins may induce cell

22、 transformation. They observed chromosomal damage and diminished DNA integrity in both virus-infected and tax gene-transfected cells. To ascertain which pathways of DNA repair might be inhibited, the investigators evaluated the repair of selective DNA lesions introduced by specific agents. HTLV-or B

23、LV-infected or tax gene-transfected cells showed normal ability to repair DNA damage induced by deoxyribonuclease I or psoralen but markedly decreased ability to repair damage induced by UV light, quercetin, or hydrogen peroxide. These results suggest that base-excision repair of oxidative damage is

24、 the pathway most inhibited by Tax proteins. This inhibition may contribute to the virus-initiated mechanism(s) of cell transformation.,Link c:cgicontentfull9111933,Copyright 2000 Access Innovations, Inc.,32,Photocomposition input,-#1-899.RTF-BEGIN- rtf1ansideff0infodoccomm generated by an Adobe app

25、licationfonttblf0froman Times New Roman;f1froman Times New Roman;f2froman WP MultinationalA Roman;colortbl;red0blue0green0 ;stylesheet s2sbasedon1snext1 keepsb100sa60sl270keepnb ; s1 keepfi240qjsl230keepncf1fs20 ; sectdpardplainhyphhotz720 keep s1sb240sa60qcsl500keepnbrdrtbrdrhairbrdrbtwbrdrhairbrsp

26、325brdrbbrdrhairbrdrbtwbr drhairbrsp230fs42 IN THIS ISSUEplainfs40 plainfs40 par pardplain s2sb100sa60sl270keepnb Significant Tax Consequences plainb par pardplain s1fi240qjsl230keepnfs20 Human T-cell leukemia virus (HTLV) and bovine leukemia virus (BLV) are retroviruses that cause hematopoietic can

27、cers and encode a unique protein, Tax, which is involved in the transformation of infected cells. Philpott and Buehring (p. 933) have investigated the mechanism by which Tax proteins may induce cell transformation.,Copyright 2000 Access Innovations, Inc.,33,HTML,An SGML DTD - a specific application

28、Limited set of elements Has Metadata at source Most elements are for format and display Little information about content, context, structure, added-value Too many problems with SGML WEB publishing,Copyright 2000 Access Innovations, Inc.,34,Copyright 2000 Access Innovations, Inc.,35,digression,The me

29、ta data in the HTML header is what the spider is set to capture and bring back.,Copyright 2000 Access Innovations, Inc.,36,HTML Presentation format,GEOPHYSICAL RESEARCH LETTERS, VOL. 26, NO. 10, PAGES 1349-1352, MAY 15, 1999Next: 1. Introduction Streamer disconnection events observed with the LASCO

30、coronagraph Y.-M. Wang, N. R. Sheeley, Jr., R. A. Howard, and N. B. Rich1 E. O. Hulburt Center for Space Research, Naval Research Laboratory, Washington, DC P. L. Lamy Laboratoire dAstronomie Spatiale, Marseille, France Received January 7, 1999, accepted February 3, 1999 Abstract: We present Large A

31、ngle Spectrometric Coronagraph (LASCO) observations of two events that suggest magnetic disconnection in coronal streamers. During the 1-2 days preceding each event, successions of narrow looptops are seen rising slowly through the 2-6 field of view, forming a bright streamer stalk which continues t

32、o elongate with time. As the streamer becomes ever more constricted, it eventually severs at a heliocentric distance of 4 . The lower part of the stalk collapses back to form a cusplike structure extending to 3 , while the disconnected segment is observed as a kink or density enhancement that propag

33、ates outward with a speed of order 200 km s. We interpret these non-CME events as transient openings and closings of magnetic flux rooted at the.,Copyright 2000 Access Innovations, Inc.,37,HTML Source - 1 of 2,Streamer disconnection events observed with the LASCO coronagraphGEOPHYSICAL RESEARCH LETT

34、ERS, VOL. 26, NO. 10, PAGES 1349-1352, MAY 15, 1999,Copyright 2000 Access Innovations, Inc.,38,HTML Source - 2 of 2,Next: 1. IntroductionStreamer disconnection events observed with the LASCO coronagraph Y.-M. Wang, N. R. Sheeley, Jr., R. A. Howard, and N. B. Rich1E. O. Hulburt Center for Space Resea

35、rch, Naval Research Laboratory, Washington, DCP. L. LamyLaboratoire dAstronomie Spatiale, Marseille, FranceReceived January 7, 1999, accepted February 3, 1999Abstract: We present Large Angle Spectrometric Coronagraph (LASCO) observations of two events that suggest magnetic disconnection in coronal s

36、treamers. During the 1-2 days preceding each event, successions of narrow looptops are seen rising slowly through the 2-6 field of view, forming a bright streamer stalk which continues to elongate with time. As the streamer becomes ever more constricted,Copyright 2000 Access Innovations, Inc.,39,eXt

37、ensible Markup Language (XML),Meta language No DTD required Content and context tags possible Use of style sheets Not all the features of SGML www.oasis-open.org/cover,Copyright 2000 Access Innovations, Inc.,40,Related eXtensible languages,XSL - style language XLL - linking language,Copyright 2000 A

38、ccess Innovations, Inc.,41,Meta language SGML XML,Relationships,Document WEB page WEB app. Instance Finding aid Industry app.,Language HTML DTD CDFEAD DTD OFX,Copyright 2000 Access Innovations, Inc.,42,Copyright 2000 Access Innovations, Inc.,43,Meta-data,Definition The Past - History of Meta-data Th

39、e Present - Current Initiatives including a discussion of standards The Future - Adopting Meta-data,Copyright 2000 Access Innovations, Inc.,44,Definition of meta-data,Data about data Information about information,Copyright 2000 Access Innovations, Inc.,45,One definition of Meta-data,“Definitional da

40、ta that provides information about or documentation of other data managed within an application or environment.”,Copyright 2000 Access Innovations, Inc.,46,Another definition,Data that characterizes other data in a reflexive way may include descriptive information about the context, quality and cond

41、ition, or characteristics of the data,Copyright 2000 Access Innovations, Inc.,47,Data about data - like what?,Author name Date of creation Language used in the creation Title of the creation Subject of the creation Keywords.,Copyright 2000 Access Innovations, Inc.,48,Narrowing the focus,Keywords (ak

42、a subject headings, index terms, identifiers, etc.) are one type of meta data. The afternoon part of this workshop will focus entirely on that one type of meta data.,Copyright 2000 Access Innovations, Inc.,49,For example.,A bibliographic database record usually includes information such as author, t

43、itle, language, date of creation, and subject area. So does a traditional library card catalog,Copyright 2000 Access Innovations, Inc.,50,But did you think about,The legend on a street map? The yellow pages in a telephone book? The aisle signs in a supermarket?,Copyright 2000 Access Innovations, Inc

44、.,51,Meaning of meta-data,Meta-data is information that points to an answer or a solution,Copyright 2000 Access Innovations, Inc.,52,Meta-data vs. metadata,Metadata is “a word coined by Jack E. Myers to represent current and future lines of products implementing the concepts of his MetaModel, and al

45、so to designate his company The Metadata Company that would develop and market those products.”,Copyright 2000 Access Innovations, Inc.,53,Metadata,A term not used prior to 1969 Used first in 1973 Registered U.S. Trademark (in 1986), owned by Jack Myers,Copyright 2000 Access Innovations, Inc.,54,Met

46、adata - continued,Metadata granted Incontestable status in 1991 designed to be a term with no particular meaning,Copyright 2000 Access Innovations, Inc.,55,Confused?,An HTML header can include some meta data, and is a standard web page tag, but not all HTML headers include meta data,Copyright 2000 A

47、ccess Innovations, Inc.,56,Copyright 2000 Access Innovations, Inc.,57,“Historic” meta data initiatives,MARC - Machine Readable Cataloging a description of the item main entry and added entries subject headings the classification or call number,AACR2 - Anglo American Cataloging Rule, 2nd edition (198

48、8) - the “style sheet” for MARC records,Copyright 2000 Access Innovations, Inc.,58,Current Initiatives,Dublin Core Indecs/EPICS/BISAC RDF TEI ROADS ONIX and many others, especially for non-text information .,Copyright 2000 Access Innovations, Inc.,59,The Dublin Core,March 1995 in Dublin, Ohio April

49、1996 in Warwick, United Kingdom NISO passed this as a standard with comments - now being resolved currently 13 elements Scheme and Type qualifiers,Copyright 2000 Access Innovations, Inc.,60,Dublin Core elements - version 1.1,Title Creator (a.k.a. Author) Subject Description Publisher Contributor (a.k.a.OtherAgent) Type (a.k.a. ObjectType),

展开阅读全文
相关资源
猜你喜欢
相关搜索

当前位置:首页 > 教学课件 > 大学教育

copyright@ 2008-2019 麦多课文库(www.mydoc123.com)网站版权所有
备案/许可证编号:苏ICP备17064731号-1