Open & Linked Data and the British Library

advertisement
Forging Links & Breaking
Shackles
The Linked Open Data BNB
Brenda Young
Metadata Systems Manager
Linked Data
& Ex Libris

2011 IGeLU conference at University
of Haifa included a linked data
session (See:
http://igelu.org/conferences/haifa2011/archive-of-presentations )

The Linked Open Data SIWG was
created after the event “…to achieve
essential linked open data features in all
Ex Libris products where appropriate,
both from the data publishing, the data
consuming and the data integration
perspective”

Contact Lukas Koster, University of
Amsterdam Library for more
information
2
British Library Metadata Services
Background
Has its roots in The British
National Bibliography Ltd prior
to the BL’s foundation
The BL supplies metadata to:

Increase visibility of holdings &
connect users to BL content, e.g.
via OCLC, COPAC etc

Participate in collaborative
national & international
cataloguing initiatives, e.g. ISDS,
SUNCAT

Support free and priced
bibliographic services
3
A Changing Bibliographic Environment…
Library Sector Relevance
Declining?
“I did my PhD with only 12 visits to a
library. That was 5 years ago;
things have improved since then,
now you don’t need to use a library
at all!”
Increasing?
“The release of library data offers the
opportunity for it to be used in ways
un-thought of by the library &
information community…”
4
New External Drivers
Putting Public Sector Data To Work

The web accelerated
development of a collaboration
culture & fostered expectation
that information should be freely
available

2009 saw an increasing
Government commitment to the
principle of opening up public
data for wider reuse

Public data will be released
under open licences which
enable free reuse, including
commercial reuse
New External Drivers
Linked Open Data
Government proposes a 5 star
rating for open public data:
 Available on the web (any
format), with an open licence
 As 1 star + available as
machine-readable structured
data (e.g. Excel)
 As 2 star + but nonproprietary format
 All the above + open
standards from the World Wide
Web Consortium
 All the above + link
your data to other people’s
data to provide context
So What Are We Doing?
Rising expectations & technical
developments make it essential the
BL responds
We are meeting the challenge of the new
environment by:
 Developing an open metadata
strategy
 Freely offering foundational
metadata
 Collaborating with the
community on innovative new
services (e.g. linked data) to
advance understanding
7
Open Metadata Strategy
Objectives
 Adopt a multi-threaded
approach addressing the needs
of:
 Traditional libraries
 Researchers using new
metadata processing
techniques
 Linked data developers
 Remove barriers & enable
innovation without unnecessary
restrictions
 Migrate from library to crossdomain standards, developing
solutions with users
8
What Have We Achieved?

Signed over 600 organisations
in 80 countries to z39.50
service

Supplied catalogue metadata
in new formats under CC0
licenses e.g. the Open Library, BBC
& Wikimedia Commons

Worked with Government, W3C
& developers on technical,
standards & licensing issues

Created a linked data version
of the British National
Bibliography
9
Why Should We Be Interested In Linked Data?
10
See: http://vimeo.com/36752317
Our Linked Data Journey
What to Offer?
We wanted to:
 Advance debate from theory to
practice via release of a ‘critical mass’
of data
 Show commitment by using a
large, core dataset: niche examples
are not as compelling
Why BNB?
 A reusable dataset of published
output: not a unique institutional
catalogue
 Uniform format over 60 years & 3
million records in many languages
11
The BNB & Linked Data
Selecting Sites For Linking
To put BNB data in a wider
context

We blended general linked
resources:
 GeoNames
 Lexvo
 RDF Book Mashup

With key linked library
resources:
 LCSH
 VIAF
 Dewey.info
12
Collaboration
BL Metadata Services
Talis
 BNB Data
 Catalogue Bridge
 Training
• Utilities and tools for
manipulating MARC21 data
 TMQ MARC Global Tools
 Analysts
• Bibliographic Data Analysis
• MARC to RDF mapping
• XSLT Conversion scripts
• RDF
• SPARQL
 Technical Infrastructure
 Data Modelling
Assistance
 Production team
• Match & merge
• Bulk processing
13
The BNB & Linked Data
Remodelling
14
MARC21 to RDF Conversion
Workflow
• Selection
• Selection
• Pre-processing
• Pre-processing
• Character
• Characterset
setconversion
conversion
• URI
• URIgeneration
generation
•Data
Data transformation
• Create & load triples
MARC to RDF conversion
Consists of multiple automated steps
15
The BNB & Linked Data
Vocabularies used












Bibliographic Ontology
Bio: a Vocabulary for Biographical Information
British Library Terms
Dublin Core
Event Ontology
FOAF: Friend of a Friend
ISBD
Org: an Organisation Ontology
OWL
SKOS
RDF Schema
WGS84 Geo Positioning
Sample data can be downloaded from:
http://www.bl.uk/bibliographic/datasamples.html
16
The BNB & Linked Data
Format Conversion - MARC
17
The BNB & Linked Data
Format Conversion – RDF/XML
18
The BNB & Linked Data
Format Conversion – Triples
19
Where Did We Get To?
Multiple Access Routes
• thedatahub.org/dataset/bluk-bnb-basic
• thedatahub.org/dataset/bluk-bnb
BNB Books 1950-2012
.
3 Million Records
90 Million Unique Triples
• bnb.data.bl.uk/sparql
• bnb.data.bl.uk/describe
• bnb.data.bl.uk/search
20
Achievements
 Presence & visibility
 New library data model - being
utilised by wider groups
 New opportunities for
collaboration - with public & private
sector organisations
 Confirmation that valuable data
will be used – e.g. up to 8 million
monthly transactions
21
Lessons Learned
Its a New Way of Thinking…
 Give thought to data modelling &
sustainability: Data wasn’t
originally designed for this
 Everyone is learning: you may be
the best judge
 There may be tools or expertise
out there: don’t reinvent the wheel
 Conversion inevitably identifies
hidden data issues: & creates new
ones!
 Offer sample data for feedback: &
continually improve…
22
Library Linked Data Wish List?
We Need More…
 Tools to link library data to
other resources
 LMS integration of linked data
options
 Navigation & visualisation
applications
 Feedback on usage
 Collaboration on shared
approaches
23
Next Steps?
Linked Open BNB See:
http://www.bl.uk/bibliographic/datafree.html
Next steps:
 Complete staged release
 Offer monthly updates once
complete
 Documentation & further
refinement of data model
 Identify what could be
offered or linked to next?
Questions?
Images from
24
Download