BHL-E Meta Data Harmonisation Wolfgang Koller & Heimo Rainer NHM Vienna BHL-E Meta Data • Corner Stones EU cofunded eContentplus ( ECP 518001 ) – lead @ Museum f. Naturkunde, Berlin, GE 2009-05-01 / 2012-04-30 Consortium of 28 Partners (AT, BE, CZ, FI, GB, GE, IT, NL, PL, SP, US – SIL & MO ) 9 Technology Providers, incl. ATOS / AIT 21 Content Providers • Major Goals digitized literature content from european institutions for BHL-family WebSite incl. Search Portal – www.bhl-europe.eu Multilinguality Contribution to European Cultural Portal – www.europeana.eu BHL-E Meta Data www.bhl-europe.eu BHL-E Meta Data www.europeana.eu BHL-E Meta Data www.europeana.eu BHL-E Meta Data Open Literature Exchange Format www.bhl-europe.eu/bhl-schema/v0.3/OLEF_v0.3.xsd http://www.bhl-europe.eu/bhl-schema/v0.3/ OLEF OLEF Specification XML-Schema for exchange of literature data list of required metadata information https://docs.google.com/spreadsheet/ccc?key=0Ak_9CQQdVjCidERlRmRhOHZDUGJONC1FMkw1VFByVUE&hl=en_US#gid=0 • Imports – – – – • bibliographic data – MODS Metadata Object Description Standard – http://www.loc.gov/standards/mods/ policy expressions IPR – ODRL Open Digital Rights Language - http://odrl.net/ still image data – MIX Metadata for Images in XML – http://www.loc.gov/standards/mix/ scientific names – DwC Taxon Terms - http://code.google.com/p/darwincore/wiki/Taxon RDF-S representation for Linked Open Data (in progress) OLEF OLEF Structure (simplified) Monograph / Serial TOC Figure Article Image Chapter Figure Index Image BHL-E Meta Data Metadatastandard To be [according to uploaded Preingest test [volumes/pag Content in BHL- Content in Ingest data] es] E Portal Europeana Spring 2011 Institution NHM [Natural History Museum] NMP MARC21 [Narodni muzeum] Update from Richard on 22.04.2011: April 2011: ~3000 pages April 2012: ~5000 pages LANDOE ? [Land Oberösterreich] 3400 volumes ~600.000 pages HNHM ? [Hungarian Natural History Museum] ~ 35 volumes ~ 3000 pages Comment on FTP/ detailed Comment on information content over BHL-US Herbarz :...(882 pages) 2568 2568 planned Comment on workflow asked for upload and estimation of pages/items 03.03.11 - problems with ftp client 04.03.11 metadata files missing in folders &jpeg files in main directory- asked to check upload 14.07.2011 additional content : provide metadata OK from AIT 800 volumes in over oai-pmh using 19.05.2011 spring 2011 ready OLEF OK from NHMW additional scanning - 24.05.2011 of 150.000 pages green light for during this year LANDOE 11.01.2011 will send detailed information until 17.12.2010 BHL-E Meta Data • Schema Mapping Tool – slim / easy to use / cross platform / standalone application (JAVA) https://github.com/bhle/bhle/tree/master/pre-ingest/schema-mapping-tool http://bhl.nhm-wien.ac.at/smt/launch.html • built in schemas ESE 3.2 & 3.3 MARC21 MODS 3.4 OLEF 0.3 • JDBC connection • built in conversions MARC21 – MARCXML MARC21 – MODS MARC21 – OLEF MARCXML – MODS MARCXML – MOLEF MODS – OLEF RefNum – OLEF Schema Mapping Tool Mapping to OLEF BHL-E Meta Data BHL-Europe Global Architecture Diagram BHL-E Meta Data • Integration of Components into Ingest System BHL-E Meta Data Current work Person Names – VIAF www.viaf.org Taxonomic repositories – Catalogue of Life www.catalogueoflife.org / PESI www.eu-nomen.eu/pesi/