The Future of Technical Services: Metadata: Standards and New Developments Vicki Toy Smith

advertisement
The Future of Technical Services:
Metadata: Standards and New
Developments
NLA Annual Conference, October, 2005
Reno, Nevada
Vicki Toy Smith
University of Nevada, Reno
Machine Readable Cataloging
(MARC)



Standard in the 1960s
Revolutionary advancement at the
time
Development of MARC set the stage
for creation of centralized library
databases
40 Years Later (2005)


Now is the time for recreation of
bibliographic standards
Need for new infrastructure =
Existing (MARC) + new standards
Need for a new set of
standards to be developed

Some Issues:



Versatility
Extensibility
Cooperative management
What is Metadata?


“Data about data”
5 types of metadata:
 Descriptive – Title, author, topic, etc.
 Administrative – Record number, record date,
etc.
 Technical – File size, software needed, etc.
 Rights – Copyright ownership, etc.
 Management – typically by/for owning agency
(Price paid, circulation restrictions, etc.)
Examples of metadata
schemas












Dublin Core (Cross disciplinary)
DDI (Social sciences)
EAD (Archives)
FGDC (Geographic)
IMS (Education)
LOM (Learning Object Metadata)
MARC (Libraries)
METS (Structural metadata)
ONIX (Publishers & booksellers)
TEI (Text encoding-Humanities)
VRA (Visual resources)
Etc. …
Why is Metadata Important?






Increased accessibility
Ability for different systems to talk to one
another
Expanding use
Multi-versioning
Preservation
Cost considerations
Examples of Metadata
Standards Web Sites:


LC standards include:
MARC: Machine-Encoded Cataloging:
http://www.loc.gov/marc/
 MARCXML
http://www.loc.gov/standards/marcxml/
 MODS: Metadata Object Description Schema:
http://www.loc.gov/standards/mods/
 EAD: Encoded Archival Description (LC & SAA):
http://www.loc.gov/ead/

Another standard—Developed by Dublin Core Metadata
Initiative:
 Dublin Core:
http://dublincore.org
Library related standards:
Considerations


MARC21 Formats--Representation and communication of
descriptive metadata about information items
 International standard (maintained by LC)
 Field/record size limitations
 No ability to embed related objects (e.g. book cover GIF)
 Limited ability to convey hierarchical/complex
relationships
 Narrow focus on cataloging
 Cannot be directly processed by widely-used web
applications
MARCXML – MARC21 data in an XML structure
 Especially useful for description of complex digital
library objects
 Allows libraries to use common computer language
 Ignores/simplifies some MARC21 data
 Not an international standard yet (maintained by LC)
Library related standards:
Considerations (cont’d.)

MODS (Metadata Object Description Standard)—XML
markup language for selected data from MARC21 records as
well as original resource description
 Richer than Dublin Core; library-oriented XML metadata
schema
 Can accommodate AACR2 standards
 Maintained by same agency (LC) that maintains MARC21
 Well-suited as a metadata format for OAI harvesting
 Used by some publishers now (e.g. UC Press (California
Digital Library) & University of Chicago Press)
 MARC21 readily converts to MODS; however, cannot
readily do a reverse conversion of MODS to MARC21
 Community best practice guidelines would enhance
usefulness
Archival related standard:
Considerations

EAD (Encoded Archival Description)—XML markup
design for encoded finding aids using Standard
Generalized Markup Language
 EAD header carries metadata for finding aid
 Standard maintained by LC along with SAA
 Best suited for archival description
 Provides simple or complex mark-up design to
support varying levels of indexing
 Needs help at the item level
Dublin Core standard:
Considerations

Dublin Core – Metadata element set (for description of
digital objects)
 Maintenance agency: Dublin Core Metadata Initiative
(DCMI) hosted by OCLC Research
 Intended for use by both non-catalogers and specialists
 No MARC tags used with Dublin Core metadata
 No consistency across different projects using Dublin
Core
 Documentation for Dublin Core not well-defined
 Western States Best Practices (version 2.0) useful
http://www.cdpheritage.org/resource/metadata/wsdc
mbp/
 UALC Cataloging/Metadata Committee working on
handbook
Some other types of Digital
Library Standards

Library of Congress also working on:
 METS (Metadata Encoding & Transmission Standard)—
Structure for encoding descriptive, administrative & structural
metadata (www.loc.gov/mets)
 OCLC DPR, RLG, Harvard, National Library of Wales
exploring or using for various projects


MIX (NISO Metadata for Images in XML)—XML schema for
encoding technical data elements required to manage digital
image collections
 MIX still in developmental stages
PREMIS (Preservation Metadata)—Data dictionary and
supporting XML schemas for core preservation metadata
needed to support the long-term preservation of digital
materials
Standards necessary when
planning metadata projects





Attention to rules/guidelines when implementing
a specific type of metadata
Community best practice guidelines need to be
developed for each schema
When standards are revised or extended, it will
be difficult to transform the metadata from one
set to another
When metadata standards are extended to
represent local information, librarian needs to
work with the vendor to accommodate the
change
Examine use of crosswalks
Why use Crosswalks?







Allow for different systems to talk to one another
Break down data transfer barriers (different
systems can share)
Combine metadata catalogs (e.g. Union catalogs)
Provide cross search between unlike databases
(e.g. Federated search tools)
Make data/metadata maintenance more efficient
Enable conversion between record types
(MARCXML, MODS, DC)
Reduce cost in technical services processing
Crosswalking challenges






Schema granularity: One to many matches; many
to one matches
Crosswalking schemas need different granularity
levels
Specificity lacking to describe library resources
Handling object relationships or hierarchies
(e.g. EAD=>MARC, etc.)
Crosswalking is rarely a one to one mapping
Process results in some “un-mappable” data
Crosswalking model: MarcEdit allows catalogers to translate
data from one schema to another
(From Terry Reese’s metadata presentation at ALA Summer
Program 2005)
Future of technical services +
metadata?

Some issues to consider in planning new metadata projects


Choose a metadata schema that can make web searching more
meaningful
Choose a metadata schema that will help get your information
to the right audience

LC and others developing metadata tools

LC providing tools for translating MARC records into MODS


OCLC working on crosswalk (Metadata Switch Project)
http://www.oclc.org/research/mswitch/default.htm
OCLC working on FRBR algorithm
http://www.oclc.org/research/projects/frbr/algorithm.htm
Future of technical services + metadata? (Cont’d.)
Individuals also creating metadata tools—
Terry Reese’s Marc Edit tool
Future of technical services +
metadata? (cont’d.)

Examination of standards and protocols being
developed for the harvesting of metadata
 e.g. Open Archives Initiative for Metadata
Harvesting
 http://www.openarchives.org/
New Developments


System migration considerations for all
parties concerned
More publishers starting to provide XML
records


e.g. UC Press records
Federated Search Tools being developed

e.g. Innovative Interfaces: Millennium Access
Plus (MAP)
New Developments (cont’d.)


Vendors now providing XML tools to manage
digital collections
Innovative Interfaces products:
 Millennium Media Management
 Creates/stores media objects (e.g. images,
sound files, audio files)
 Also includes a Copyright/Access component
to handle licensing/copyright issues of
digital collections
 XML Harvester
 Gathers XML records from any server
 Parses/creates record on Innovative system
 Metadata Builder—Supports EAD, DC & XML
 Stores XML in metadata schema of choice
What can libraries do today?




Libraries can provide metadata services (e.g. MIT):
 MIT’s Metadata services includes:
 Data & conceptual models
 XML binding
 Best practices
 Applications
 Training
http://libraries.mit.edu/metadata/services.html
Libraries can develop new pilot projects to investigate uses of
metadata
Libraries can request that vendors/publishers develop new
products to facilitate change
UNR has begun several pilot projects using:
 Dublin Core
 EAD
 Example of EAD Finding Aid (with style sheet)
http://www.library.unr.edu/specoll/9716.xml
 MARC Edit toolkit
Current Metadata projects at
UNR

Digital collections (using CONTENTdm) with metadata coded in
Dublin Core:
http://imageserver.library.unr.edu/

Metadata harvesting by:

UALC’s Mountain West Digital Library
http://www.lib.utah.edu/digital/mwdl/

University of Michigan’s OAIster
http://oaister.umdl.umich.edu/o/oaister/
Useful Web Sites






Standards at the Library of Congress
 http://www.loc.gov/standards
XML Standards at the Library of Congress LC Network
Development and MARC Standards ( framework for
working with MARC data in a XML environment)
 http://www.loc.gov/standards/marcxml/
Library of Congress Portals Applications Issues Group
 http://www.loc.gov/catdir/lcpaig/
NISO Web Site identifies, develops, maintains, publishes
technical standards to manage information in digital
environment
 http://www.niso.org/
NISO article on Understanding Metadata
 http://www.niso.org/standards/resources/Understandi
ngMetadata.pdf
Terry Reese’s Marc Editing Utility
 http://oregonstate.edu/~reeset/marcedit/html/
Thank you!
Vicki Toy Smith, Principal Cataloger
University of Nevada, Reno
vicki@unr.edu
Download