Koller_BHLE_metadata_harmonisation

advertisement
BHL-E Meta Data Harmonisation
Wolfgang Koller & Heimo Rainer
NHM Vienna
BHL-E Meta Data
• Corner Stones
EU cofunded eContentplus ( ECP 518001 ) – lead @ Museum f. Naturkunde, Berlin, GE
2009-05-01 / 2012-04-30
Consortium of 28 Partners (AT, BE, CZ, FI, GB, GE, IT, NL, PL, SP, US – SIL & MO )
9 Technology Providers, incl. ATOS / AIT
21 Content Providers
• Major Goals
digitized literature content from european institutions for BHL-family
WebSite incl. Search Portal – www.bhl-europe.eu
Multilinguality
Contribution to European Cultural Portal – www.europeana.eu
BHL-E Meta Data
www.bhl-europe.eu
BHL-E Meta Data
www.europeana.eu
BHL-E Meta Data
www.europeana.eu
BHL-E Meta Data
Open Literature Exchange Format
www.bhl-europe.eu/bhl-schema/v0.3/OLEF_v0.3.xsd
http://www.bhl-europe.eu/bhl-schema/v0.3/
OLEF
OLEF Specification
XML-Schema for exchange of literature data
list of required metadata information
https://docs.google.com/spreadsheet/ccc?key=0Ak_9CQQdVjCidERlRmRhOHZDUGJONC1FMkw1VFByVUE&hl=en_US#gid=0
•
Imports
–
–
–
–
•
bibliographic data – MODS Metadata Object Description Standard – http://www.loc.gov/standards/mods/
policy expressions IPR – ODRL Open Digital Rights Language - http://odrl.net/
still image data – MIX Metadata for Images in XML – http://www.loc.gov/standards/mix/
scientific names – DwC Taxon Terms - http://code.google.com/p/darwincore/wiki/Taxon
RDF-S representation for Linked Open Data (in progress)
OLEF
OLEF Structure (simplified)
Monograph
/ Serial
TOC
Figure
Article
Image
Chapter
Figure
Index
Image
BHL-E Meta Data
Metadatastandard To be
[according to
uploaded
Preingest test
[volumes/pag Content in BHL- Content in Ingest
data]
es]
E Portal
Europeana Spring 2011
Institution
NHM
[Natural History
Museum]
NMP
MARC21
[Narodni
muzeum]
Update from
Richard on
22.04.2011:
April 2011:
~3000 pages
April 2012:
~5000 pages
LANDOE
?
[Land
Oberösterreich]
3400 volumes
~600.000
pages
HNHM
?
[Hungarian
Natural History
Museum]
~ 35 volumes
~ 3000 pages
Comment on FTP/
detailed
Comment on
information
content
over BHL-US
Herbarz :...(882
pages)
2568
2568 planned
Comment on
workflow
asked for upload
and estimation of
pages/items 03.03.11
- problems with ftp
client 04.03.11
metadata files
missing in folders
&jpeg files in main
directory- asked to
check upload 14.07.2011
additional content : provide metadata OK from AIT 800 volumes in
over oai-pmh using 19.05.2011
spring 2011 ready OLEF
OK from NHMW
additional scanning
- 24.05.2011
of 150.000 pages
green light for
during this year LANDOE
11.01.2011
will send detailed
information
until 17.12.2010
BHL-E Meta Data
•
Schema Mapping Tool – slim / easy to use / cross platform / standalone application (JAVA)
https://github.com/bhle/bhle/tree/master/pre-ingest/schema-mapping-tool
http://bhl.nhm-wien.ac.at/smt/launch.html
•
built in schemas
ESE 3.2 & 3.3
MARC21
MODS 3.4
OLEF 0.3
•
JDBC connection
•
built in conversions
MARC21 – MARCXML
MARC21 – MODS
MARC21 – OLEF
MARCXML – MODS
MARCXML – MOLEF
MODS – OLEF
RefNum – OLEF
Schema Mapping Tool
Mapping to OLEF
BHL-E Meta Data
BHL-Europe Global Architecture Diagram
BHL-E Meta Data
•
Integration of Components into Ingest System
BHL-E Meta Data
Current work
Person Names – VIAF www.viaf.org
Taxonomic repositories – Catalogue of Life www.catalogueoflife.org /
PESI www.eu-nomen.eu/pesi/
Download