Lifecycle Metadata for Digital Objects (INF 389K) September 18, 2006 The Big Metadata Picture, Web Access, and the W3C Context Anne Gilliland-Swetland, “Setting the Stage” I: Generalities Content: “library” focus on access Context: “archives” focus on context of creation and use Structure: relationships among objects, aggregations of objects, and versions of objects Note however: these three categories can mix levels--how? Setting the Stage II: Functional types Administrative Descriptive Preservation Technical Use Can you think of other types? Is this a good breakdown? When does each come into play? Who sees them? Setting the Stage III: Attributes Source of metadata (internal or external) Method of metadata creation (auto or manual) Nature of metadata (lay or expert) Status (static/ long-term or dynamic/shortterm) Structure (structured or unstructured) Semantics (controlled or uncontrolled) Level (item or collection or repository) Setting the stage IV: Life Cycle Creation and [multi-]versioning (administration and description) Organization (registration, cataloging, indexing) Searching and retrieval (transactions, system records) Utilization (rights, version control, annotations) Preservation and disposition (refresh, test integrity, migrate, expunge…) Gail Hodge, “Understanding Metadata” Definition: “Metadata is structured information that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource.” Purposes of metadata Resource discovery (surrogates, cataloging) Organizing resources (statically or dynamically) Interoperability (e.g., Z39.50 vs OAI) Digital Identification (URL, PURL, DOI, Handle) Archiving and preservation (e.g., PREMIS) Understanding Metadata II: Levels FRBR distinctions: Work (a “creation of the mind”) Expression (one of the possible ways that a creation may be expressed) Manifestation (one of the possible embodiments of an expression) Item (an individual exemplar of a manifestation) Metadata can apply to objects at each of these levels, although all levels may not apply to all possible objects Note this leaves out aggregates Understanding Metadata III: Sets Dublin Core (and extensions and profiles) Text Encoding Initiative (TEI) header Metadata Encoding and Transmission Standard (METS) Metadata Object Description Schema (MODS) Encoded Archival Description (EAD) Learning Object Metadata (LOM) Visual Resources Association Core (and CDWA) MPEG multimedia INDECS and ONIX Understanding Metadata IV: Creation and Interoperability Templates, tools, harvesters Metadata Registries PRONOM GDFR Crosswalks and frameworks (RDF, Semantic Web) Berners-Lee, “Web architecture from 50,000 feet” Universal information utility: how Universal addresses: where (URIs) Universal language: saywhat? (XML) Action repositories: dowhat? (web services) Description repositories: who/what (DTDs, schemas, namespaces) Logical structure for documents (RDF) makes them operable-upon (at a high logical level: B-L wants proofs, not just informal gatherings) XML in 10 Points XML is for structuring data XML looks like HTML (tags and attributes) XML is text for computers XML is purposely verbose XML is a family XML is only partly new (SGML, HTML) XHTML->XML XML is modular (namespaces) XML is base for RDF, Semantic Web XML is free, universal, supported Eric Miller, “Introduction to RDF” XML provides syntax for RDF RDF provides structural constraints and a data model for metadata semantics Means of publishing metadata semantics Means of combining multiple metadata sets developed for specific purposes Goal: to enrich metadata available on the Web and make search more precise RDF Data Model Resources (nodes) Property-types (relations) Values (assigned to nodes) Triples in a directed graph Can be iterative: each value can in turn be a resource with property-type relations to values To ensure precision, each reference to a specific semantic set is identified by namespace RDF and metadata vocabularies (namespaces) RDF/XML schema used for declaring vocabularies (allowed “elements”) and their allowed values (where restricted) Each metadata set is maintained and made available online via a URI Metadata registries can gather the metadata namespace definitions or point to them Tim O'Reilly "What is Web 2.0?" So what is it? Web as platform/service Harnessing collective intelligence (of users) Data is the next “Intel Inside”: information as value End of the software release cycle: perpetual beta, user collaboration Lightweight programming models: cut and paste, mashups Software above the single device Rich user experiences Metadata and the Web (2.0?) Metadata inside vs outside (but will we seek help with preservation?) Metadata as value-added (but who adds the value?) What metadata must be authoritative? What metadata can never be authoritative? How important is metadata?