Internet Resources Discovery Metadata (2) 1 H. Weinberger - A. Frank Contents of Metadata Lecture 1. Introduction to Metadata 2. Interoperability of Metadata 3. Evolution of Descriptive Metadata 4. Web Descriptive Metadata 5. Structured Metadata 6. Semantic Web 2 H. Weinberger - A. Frank Content Metadata in PL – – methods & tools (standards) classification systems Metadata in DL – 3 methods & tools Metadata in AL (Automated Library) – (Paper Library) (Digital Library) Web DL examples: classification, SH (Subject Heading), virtual subject catalog & on-line reference catalog. H. Weinberger - A. Frank Metadata in PL: methods & tools (1) Classification method – Predefined organizing scheme: – – – – 4 Evolutionary Created outside the library & used by means of assigning out of resource attributes Used for item tracking & shelf arrangement Used by librarian & user H. Weinberger - A. Frank Metadata in PL: methods & tools (2) 5 Catalog (Index) - Detailed inventory of specific collection or library: – Holds information that is in the item: title, author, publisher, classification. – Place: library centers. – Format: cards, printed, compiled books. – Used for finding information and managing collection. – Used by Librarian, users. H. Weinberger - A. Frank Metadata in PL: methods & tools (3) 6 Bibliography: List of subject oriented Thesaurus: Controlled vocabulary for publications that can be locally compiled according to needs. indexing & classification; contains “word list”, descriptors & identifiers. Each term is described by scope note, narrower term, broader term & used for. H. Weinberger - A. Frank Metadata in AL: methods & tools (1) 7 Classification: Resource-subject organization – Information external to item. – Used to indicate location, manage collection & searching. – Used by librarians. Catalog: online catalog (ALEPH) – Used for librarians & users. – Used by locating, circulation, managing, sharing information. H. Weinberger - A. Frank Metadata in AL: methods & tools (2) Bibliography: compiled on request Thesaurus: controlled vocabulary – Used for indexing, retrieval, and monitoring literature. – Used by librarians & users Example: “Information Technology” vs. “Educational Computing” in Eric. 8 H. Weinberger - A. Frank Metadata in AL: methods & tools (3) Standards: Enable information exchange between library systems: – – 9 Used for delivering collection related data. Used by librarians & specialists. H. Weinberger - A. Frank PL-AL Classification Systems Numeric: – – – Alphanumeric: – 10 Mathematics Subject Classification http://www.ams.org/mathweb/mi-mathbyclass.html Dewey Decimal Classification (DDC) Comprised originally of 10 classes (generalities, philosophy, religion, social science, language, NT, tech, art literature and history). Universal Decimal Classification (UDC) http://www.niss.ac.uk/subject/index.html – ACM computing classification system Library of Congress classification (LC) H. Weinberger - A. Frank From PL to DL: towards standardization פתיחת המרחב הדיגיטלי קוראת לפיתוח כלים לשיתופיות במידע שראשיתה בשיתופיות בכלים ושיטות לארגון ,ייצוג ותאור משאבים. שתי מגמות: – – פיתוח כלים חדשים. ניסיון התאמה של כלים קיימים לסביבה חדשה. H. Weinberger - A. Frank 11 Library of Congress Standards LC 12 Classification system (LCC) MARC (MAchine-Readable Cataloging): standard for the representation and communication of bibliographic and related information in machine-readable form. Web Example: Signpost H. Weinberger - A. Frank What is Z39.50? Z39.50 is a protocol which specifies data structures and interchange rules that allow a client machine (called an “origin" in the standard) to search databases on a server machine (called a "target" in the standard) and retrieve records that are identified as a result of such a search. 13 H. Weinberger - A. Frank DL Metadata Examples Implementing PL-AL methods to DL: 14 Dewey Decimal Classification: used in CyberDewey LC Classification: used in Cyberstacks, Signpost IPL Visual Index Interface www.ipl.org/ MeSH Subject Heading www.nlm.nih.gov/mesh/ On-line catalogs (Weblogs) H. Weinberger - A. Frank Dewey Decimal Classification on the Web Used for Information Retrieval Used by users & librarians as a reference scheme. Example: CyberDewey – http://ivory.lm.com/~mundie/CyberDewey/CyberDewey.html Status: HNM 15 H. Weinberger - A. Frank Unified Medical Language System מערכת שמתייחסת לתחום מסוים. נועדה להקל על אחזור מידע ואינטגרציה של מקורות. כולל הירארכיה של מושגים ברשת סמנטית שמייצגת גם יחסים בין קטגוריות (קשור ב ..גורם ל.)... המושגים כוללים :ישויות פיזיות ואירועים. H. Weinberger - A. Frank 16 Medical Subject Headings MeSH When searching for subjects it is inefficient unless there is consistency in the vocabulary. Controlled vocabulary. Mesh includes: – – 17 – Subject headings of 400,000 papers. Thesaurus of subject headings, hierarchical. Thesaurus of chemical terms 18,000. H. Weinberger - A. Frank Summing up תפיסת הספריה ,קלאסית ) ,(PLאנלוגית ) (ALאו דיגיטלית ( ,)DLכאוסף מאורגן של שירותים, מחייבת שימוש בשיטות וכלים לארגון האוסף ,למתן גישה ושיתוף ברמת משאבים ,שירותים ונתונים. – – מחד :אנו רואים ניסיון המשכיות – בדיקה והתאמה של כלים קלאסיים לסביבה משתנה. מאידך :צרכים חדשים מחייבים פיתוח כלים מותאמים. H. Weinberger - A. Frank 18 References Clifford A. Lynch, “The Z39.50 Information Retrieval Standard”, http://www.dlib.org/dlib/april97/04lynch.html Sandra D. Payette, Oya Y. Rieger, “Z39.50The User's Perspective”, http://www.dlib.org/dlib/april97/cornell/04payette.html 19 H. Weinberger - A. Frank Contents of Metadata Lecture 1. Introduction to Metadata 2. Interoperability of Metadata 3. Evolution of Descriptive Metadata 4. Web Descriptive Metadata 5. Structured Metadata 6. Semantic Web 20 H. Weinberger - A. Frank Content Descriptive metadata Historical key points Advantages of Web Metadata Examples - Metadata initiatives: Metadata advantages 21 1. MARC 2. Dublin Core (DC) H. Weinberger - A. Frank Descriptive Metadata (1) DM is usually kept separated form the data. DM must be associated with the content it describes. DM describes data elements organized in fields and sub-fields: e.g., 22 field: author’s name sub-field: author’s surname H. Weinberger - A. Frank Descriptive Metadata (2) 23 Examples methods are: catalogs, indexes, Abstracting and indexing services, thesaurus other summery information. Usually created by professionals Have a consistent structure Examples applications are: , MARC http://www.loc.gov/marc/umb/ Dublin Core, ROADS, Medical Subject Headings (MESH), … H. Weinberger - A. Frank 1.The MARC Initiative 24 The MARC formats are standards for the representation and communication of bibliographic and related information in machine-readable form. The development of MARC 21 formats an international effort. The Library of Congress and the National Library of Canada serve as the maintenance agency for the MARC 21 formats for bibliographic, authority, holdings, classification, and community information data for the MARC 21 user community. H. Weinberger - A. Frank Library automation and the role of MARC 25 WHAT: A MARC record is a MAchineReadable Cataloging record. "Cataloging record" means a bibliographic record , or the information traditionally shown on a catalog card. WHY: Using the MARC standard prevents duplication of work and allows libraries to better share bibliographic resources. H. Weinberger - A. Frank Marc Cataloging Record The record includes (not necessarily in this order): 1) a description of the item, 2) main entry and added entries, 3) subject headings, and 4) the classification or call number. (MARC records often contain much additional information.) 26 H. Weinberger - A. Frank MARC SGML & XML An implementation of the Standard Generalized Markup Language (SGML) for use with MARC data. An implementation of the eXtensible Markup Language (XML) for use with MARC data. XML and MARC: A Choice or a Replacement? http://www.ala.org/alcts/organization/div/marbi/marbi-ccda.html 27 H. Weinberger - A. Frank 2. Dublin Core (DC) Benefits: – – Drawbacks: – – 28 simple anyone can use not yet standardized may not map well with existing standards Example: self-cataloging Web pages Meta content? H. Weinberger - A. Frank History of the Dublin Core Created in Dublin, Ohio, 1995. Set of simple metadata elements that can be applied to a wide variety of DL materials. Idea: simplicity Problems: simple is not accurate: – – 29 e.g., if a subject field taken from Dewey, it is useful to note it in the metadata. Indexers need lists of possible values and types for fields. H. Weinberger - A. Frank Dublin Core Element Set (1) Descriptive rules: – 15 elements intended to convey a common semantic understanding. – Provides controlled vocabulary for document description. – Use of label expression enables clear syntactic operation. 30 http://purl.oclc.org/metadata/dublin_core H. Weinberger - A. Frank Dublin Core Element Set (2) 31 1. TITLE (Creator or Publisher) 2. CREATOR (person or organization responsible) 3. SUBJECT Subject and Keywords (topic expressed as keywords or phrases The use of controlled vocabularies and formal classification schemas is encouraged). 4. DESCRIPTION (textual description, abstracts) 5. PUBLISHER 6. Other CONTRIBUTOR 7. DATE 8. TYPE category of the resource 9. FORMAT (software and possibly hardware needed to display) 10. IDENTIFIER (String or number.. URLs and URNs ISBN) 11. SOURCE (A string or number used to uniquely identify the work from which this resource was derived PDF ISBN). 12. LANGUAGE RELATION 14. COVERAGE Resource characteristics. Currently under development. 15. RIGHTS A link to a copyright notice, to a rights-management statement, or to a service that would provide information about terms of access to the resource. H. Weinberger - A. Frank Metadata Advantages Separating the structural information of a hypertext document from the specification of the document actual presentation. 32 Applying one style to multiple documents. A single document may have multiple styles for different presentation forms. Applicable to text & hypertext modes. H. Weinberger - A. Frank References 33 http://metadata.net/ Metadata E-Paper: http://dis.lib.muohio.edu/documents/metadata.html IFLA site: http://www.IFLA.org/II/metadata.htm ROADS: http://ukoln.bath.ac.uk/roads/ Stuart Weibel, Juha Hakala, “DC-5: The Helsinki Metadata Workshop A Report on the Workshop and Subsequent Developments”, http://www.dlib.org/dlib/february98/02weibel.html H. Weinberger - A. Frank