Digital Curation: Round One by Gretchen Gueguen is licensed under a Creative Commons AttributionNonCommercial-ShareAlike 3.0 Unported License. DIGITAL CURATION: ROUND ONE Gretchen Gueguen University of Virginia Formerly of East Carolina University Agenda • What is Digital Curation • The Digital Curation Lifecycle Model • Case Study: The Eastern North Carolina Digital Library, 2003-2011 What is Digital Curation? Preservation Collection Archiving Maintenance Selection Key elements of the DCC Curation Lifecycle Model Key elements of the DCC Curation Lifecycle Model • Data • any information in binary digital form, is at the centre of the Curation Lifecycle. Key elements of the DCC Curation Lifecycle Model • Full Lifecycle Actions – Description and Representation Information – Preservation Planning – Community Watch and Participation – Curate and Preserve Key elements of the DCC Curation Lifecycle Model • Sequential Actions – Conceptualise – Create or Receive – Appraise and Select – Ingest – Preservation Action – Store – Access, Use and Reuse – Transform Key elements of the DCC Curation Lifecycle Model • Occasional Actions – Dispose – Reappraise – Migrate Case Study The Eastern North Carolina Digital Library The Eastern North Carolina Digital Library: the 4 W’s Who: • Joyner Library at East Carolina University+ What: • A digital library of books+ about eastern North Carolina When: • 2003-2004, initial project • 2004-2007, partnership project The Eastern North Carolina Digital Library: the 4 W’s Where: You are here. The Eastern North Carolina Digital Library: the 4 W’s Why: • ECU is the largest university in the eastern region, serving some of the poorest and most under-served counties in the state. • Material on eastern NC not widely available • The expertise and interest existed in the library to create a great digital project. The Eastern North Carolina Digital Library: and 1 H How: ASP.net interface Digitization Lesson Activities Transcription Metadata Case Study: The Eastern North Carolina Digital Library The End. …until Case Study: The Eastern North Carolina Digital Library • 2008-2009, creation of Joyner Library Digital Collections, a sister repository more broad in scope • 2010-2011, migration of ENCDL into JLDC Comparing ENCDL JLDC • TextML / ASP.net • Non-standard metadata (aside from TEI transcriptions) • Two basic material types • Non-standard filenaming • Significant supplementary documentation for each object • Text and Image/artifact in different search and browse • Extensive web-presence with educational activities • TextML / ASP.net • Metadata standards (METS, MODS, MIX, TEI) • Variety of materials • Each object has Persistent Identifier (PID) and consistent filenaming • Full-repository search • Basic web-presence, but robust searching tools Digital Curation Round One… Applying the lifecycle model • Community Watch and Participation – What are the common standards endorsed by our community? • • • • • • JPEG2000 EPUB PREMIS NC ECHO’s PMDO Flash HTML 5 • Curate and Preserve – What are the standards that will best fit our curation and preservation needs? Applying the lifecycle model • Preservation Planning – What actions are in the best long-term interest of the ENCDL? JLDC? • Meetings with stakeholders • Web analytics • Reproduction requests • Review of infrastructure • Migration – Digital objects, metadata, web application Applying the lifecycle model • Create a new collection in the repository • Create a “PID” for each digital object into the repository • Create METS/MODS/MIX/TEI/PREMIS record for each – Incorporate supplemental metadata – Create new PREMIS records for each • Create new hybrid object type for image + video • JPEG2000 for all images • pdf and epub for books Applying the lifecycle model • JPEG2000 – Complicated algorithms – Inadequate software – Web application development with Kakadu – Presentation copies only at this time. • Metadata – PREMIS – Mapping and scripting multiple times • Repository structure – Modeling new object types – Functional requirements for UI and metadata – Use case scenarios in ENCDL mapped to JLDC Applying the lifecycle model • Access, Use and Reuse – Recreate the book viewer using JPEG2000 – Create subject and map browse for the entire repository – Recreate ENCDL pages with repository’s stylesheet The End • Curation • Preservation • Community Watch and participation What Have We Learned? • Many of us will eventually need to migrate not just data, but collections and “experiences” into other repositories. • Digital Curation Lifecycle Model can help us think through Curation activities and evaluate them. • The Lifecycle Model is not linear, nor will our activities be. • The Lifecycle Model is not finite, but iterative. Thanks! • East Carolina University – – – – – – – – Michael Reece Joe Barricella Justin Tew Mark Custer Maury York John Lawrence Linda Teel Hazel Walker • At-Large – Emily Gore – Justin Vaughn – Amy Chiles • In Spirit… – Chuck Jones Contacts Gretchen Gueguen Email: gmg2n@virginia.edu Web: http://www.gretchengueguen.com Eastern North Carolina Digital Library http://digital.lib.ecu.edu/historyfiction Joyner Library Digital Collections http://digital.lib.ecu.edu