BGS Linked Data Pilot – aims & objectives

DNF Expert Group Meeting
London, 18/11/10
John Laxton
© NERC All rights reserved
We are not experts on Linked Data – the aim of the pilot is to learn!
The pilot is about to start and will run through this winter
Outline of talk
• Background – where we are now with web dissemination
• What are the outstanding problems?
• How we think Linked Data might help
• What we hope to do in the pilot project
Background – where are we now with web dissemination?
We have developed an international standard for geoscience data interoperability: GeoSciML
• based on a conceptual model of geoscientific information drawing on existing data models implemented in UML
• implemented an XML/GML encoding of the model subset
• identified areas that require standardised classifications in order to enable interchange
Who is involved?
GA (Australia)
SGU (Sweden)
CSIRO (Australia)
USGS (USA)
VGS (Australia)
AZGS (USA)
BRGM (France)
BGS (UK)
GSC (Canada)
GSJ (Japan)
APAT (Italy)
Data Model Packages
• Geologic Feature (inc Mapped Feature)
• Geologic Unit
• Earth Material (lithology)
• Geologic Structure
• Fossil
• Geologic Age
• Boreholes & Observations
• Geologic Relation
• CGI Values
• Vocabulary
• Metadata
• Collection
Geologic Unit
Vocabularies are being developed to enable semantic interoperability
Vocabulary topics
• Composition part proportion terms
• Compound material genesis terms
• Constituent part role terms
• Composition Category terms
• Consolidation degree terms
• Contact character terms
• Convention codes for planar orientation specification
• Description purpose terms
• Determination method terms for orientation measurements
• Earth Material colour terms
• Fabric terms
• Fault movement sense terms
• Fault movement type terms
• Feature observation method terms
• Geologic contact type terms
• Geologic event environment category terms
• Geologic event process category terms
• Geologic relationship role type terms
• Geologic unit body morphology terms
• Geologic unit exposure colour terms
• Geologic unit type
• Geologic unit outcrop character
• Lithology categories for Geologic unit composition
• Mapped feature observation method terms
• Metamorphic grade terms
• Metamorphic facies terms
• Named time ordinal era terms
• Particle aspect ratio terms
• Particle size terms
• Particle sorting terms
• Particle shape terms
• Particle type terms
• Quantity value qualifier
• SimpleLithology
• Stratified unit bedding pattern terms
• Stratified unit bedding style terms
• Stratified unit bedding thickness terms
• Stratigraphic rank terms
• Vocabulary relationship role terms
But this is still only semantic interoperability within the geoscientific community…
But the vocabularies are in SKOS, which enables…
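As an illustration of the kind of thing a SKOS encoding makes possible (not necessarily what the talk went on to show), here is a minimal Python sketch. It models a few concepts with `skos:prefLabel` values in two languages and `skos:broader` links, then expands a search for a broad concept to everything beneath it. The URIs, the terms, and the helper names `narrower_transitive` and `label` are all hypothetical; they are not taken from the CGI/GeoSciML vocabularies.

```python
# Hypothetical SKOS-style vocabulary fragment. The base URI and the concepts
# are illustrative placeholders, not real CGI/GeoSciML identifiers.
BASE = "http://example.org/vocab/lithology/"

CONCEPTS = {
    BASE + "sedimentary_rock": {
        "prefLabel": {"en": "sedimentary rock", "fr": "roche sédimentaire"},
        "broader": [],
    },
    BASE + "sandstone": {
        "prefLabel": {"en": "sandstone", "fr": "grès"},
        "broader": [BASE + "sedimentary_rock"],
    },
    BASE + "arkose": {
        "prefLabel": {"en": "arkose", "fr": "arkose"},
        "broader": [BASE + "sandstone"],
    },
    BASE + "granite": {
        "prefLabel": {"en": "granite", "fr": "granite"},
        "broader": [],
    },
}


def narrower_transitive(uri, concepts=CONCEPTS):
    """Return uri plus every concept that reaches it by following skos:broader."""
    found = {uri}
    frontier = [uri]
    while frontier:
        current = frontier.pop()
        for candidate, data in concepts.items():
            if current in data["broader"] and candidate not in found:
                found.add(candidate)
                frontier.append(candidate)
    return found


def label(uri, lang, concepts=CONCEPTS):
    """Look up the preferred label of a concept in the requested language."""
    return concepts[uri]["prefLabel"][lang]


# A search for the broad concept also picks up its narrower concepts,
# and the results can be shown in the user's language.
expanded = narrower_transitive(BASE + "sedimentary_rock")
print(sorted(label(uri, "fr") for uri in expanded))
```

Because each concept is identified by a URI rather than by its label, the same concept can carry labels in several languages and can be referenced from outside the vocabulary.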
GeoSciML was tested and applied in the eContent-Plus OneGeology-Europe project
Objectives of OneGeology-Europe
• to bring together a web-accessible, interoperable geological spatial dataset for the whole of Europe at 1:1 million scale
• to develop a harmonised specification for basic geological map data and make significant progress towards harmonising the dataset
• to accelerate the development and deployment of GeoSciML
• to facilitate re-use and addition of value by a wide spectrum of users and identify, document and disseminate strategies for the reduction of technical and business barriers to re-use
• to address the multilingual aspects of access through a multilingual discovery portal
• to move geological knowledge closer to the end-user
• to contribute to INSPIRE
OneGeology-Europe and INSPIRE
• One of the objectives of OneGeology-Europe was to ‘contribute to INSPIRE’
• Tried to use the implementing rules as much as possible
• GeoSciML is now being used as the basis for the INSPIRE Geology data specification
This will lead to GeoSciML-conformant web services, possibly using common vocabularies, across Europe.
Problem solved?
Outstanding Problems
• Mapping from ‘internal’ concepts to common ones is very time consuming and can involve significant loss of semantic resolution – experience from OneGeology-Europe
• All of this is still geologists talking to geologists!
We want to…
• make our information much more widely used and useable (semantic publishing)
• make our domain knowledge available
• be able to integrate information from other domains more readily with our information
• enable non-geoscientists to query our information, and receive answers, using language from their own domain
• be able to link similar, but not identical, concepts
Can Linked Data and the Semantic Web help with this?
There is some useful stuff out there…
Small Pilot Project to…
• Learn about Linked Data through ‘doing’
• Investigate the potential of Linked Data – does it live up to the claims?
• Demonstrate the potential to BGS Management
• Work out what we need to do to implement Linked Data properly
Pilot Project will aim to…
• Add standard URIs to some vocabulary concepts and related data items (e.g. geology map polygons)
• Create a small ontology in OWL/RDF linking concepts in our vocabularies to concepts in SWEET Geology
• Make use of the existing mapping between concepts in SWEET ontologies from different domains
• Construct some SPARQL queries using concepts in non-geology domains to query geological data
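The four aims above can be sketched end to end in a few lines of Python. This is only a toy under stated assumptions, not the pilot's implementation: the triples stand in for RDF, the `match` function imitates the way a SPARQL basic graph pattern is evaluated, and every URI, the `ex:` predicates, and the domain links themselves are hypothetical placeholders rather than real BGS or SWEET identifiers. Only `skos:closeMatch` is the name of a real SKOS property.

```python
# Hypothetical namespaces: none of these are real BGS or SWEET URIs.
BGS = "http://example.org/bgs/"
SWEET = "http://example.org/sweet/"
HYDRO = "http://example.org/hydrology/"

TRIPLES = {
    # Aim 1: URIs for vocabulary concepts and data items (map polygons).
    (BGS + "polygon/1", "ex:hasLithology", BGS + "lithology/sandstone"),
    (BGS + "polygon/2", "ex:hasLithology", BGS + "lithology/granite"),
    (BGS + "polygon/3", "ex:hasLithology", BGS + "lithology/clay"),
    # Aim 2: a small ontology linking our concepts to a geology ontology.
    (BGS + "lithology/sandstone", "skos:closeMatch", SWEET + "Sandstone"),
    (BGS + "lithology/clay", "skos:closeMatch", SWEET + "Clay"),
    # Aim 3: a cross-domain mapping assumed to exist already (illustrative).
    (SWEET + "Sandstone", "ex:relatedTo", HYDRO + "Aquifer"),
    (SWEET + "Clay", "ex:relatedTo", HYDRO + "Aquitard"),
}


def match(patterns, triples, binding=None):
    """Yield variable bindings that satisfy every pattern in turn.

    Terms starting with '?' are variables; all other terms must match exactly.
    """
    binding = binding or {}
    if not patterns:
        yield dict(binding)
        return
    first, rest = patterns[0], patterns[1:]
    for triple in triples:
        candidate = dict(binding)
        ok = True
        for term, value in zip(first, triple):
            if term.startswith("?"):
                # Bind the variable, or check it against its earlier binding.
                if candidate.setdefault(term, value) != value:
                    ok = False
                    break
            elif term != value:
                ok = False
                break
        if ok:
            yield from match(rest, triples, candidate)


# Aim 4: a query phrased with a non-geology concept ("Aquifer") that
# returns geological data items (map polygons).
query = [
    ("?polygon", "ex:hasLithology", "?lithology"),
    ("?lithology", "skos:closeMatch", "?sweet"),
    ("?sweet", "ex:relatedTo", HYDRO + "Aquifer"),
]
polygons = sorted(b["?polygon"] for b in match(query, TRIPLES))
print(polygons)
```

The point of the sketch is that the person asking never names a lithology: the chain of links from the other domain's concept to the map polygon does the translation. Polygon 2 is not returned because its lithology has no mapping, which is the kind of gap the pilot sets out to assess.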
We hope to find out….
• How much effort is involved in building ontologies?
• Can you really construct a Linked Data web without a specific question in mind?
• Could we build ontologies to link to INSPIRE codelists/vocabularies?
• Can we create useful SPARQL queries?
• What do we need to do to fill the gaps and is it practical?
• A roadmap for the way forward (which will depend on what the wider UK spatial data community does)
Questions (or answers)?