State-of-play PIDS v6.00 - Joinup

advertisement
DG Joint Research Centre
Study and guidelines on
Geospatial Linked Data as part
of ISA Action 1.17
Persistent IDentifiers (PIDs)
13th March 2014
Alice Vasilescu - Deloitte
Agenda
Click to edit Master title style
1
Introduction
2
Context
3
Approach
4
Preliminary results
5
Sharing your experience – What barriers could there be for governing PIDs?
6
Next steps
2
Introduction
Today’s webinar
Click toWeedit
Master
style
will provide
an outline title
of the study
and
1. Setting
the scene
our work so far with regards to PIDs.
This webinar is also an opportunity to provide
feedback and to exchange experiences: What
2.
Developing
proposals
β
3. Refinement &
recommendations
GUIDELINES
barriers could there be for governing
PIDs?
WEBINAR II Guidelines on
methodologies
Presentation of the first version of the
guidelines for a common RDF Vocabulary for
INSPIRE data and an approach to PID
governance: opportunities for communityled improvements and next steps towards
an official INSPIRE encoding.
Persistent
Identifiers
(PID)
3
Agenda
Click to edit Master title style
1
Introduction
2
Context
3
Approach
4
Preliminary results
5
Sharing your experience – What barriers could there be for governing PIDs?
6
Next steps
4
Objectives
This study has been prepared in the context of
the Interoperability for European PublicClick to
Administrations (ISA) Programme and, in
particular A Reusable INSPIRE Reference
Platform (ARE3NA, ISA Action 1.17)
This study should provide:
edit Master title style
Connecting Geospatial Data
1. Shared evidence about the current status
in Europe of linked (geospatial) data
related to INSPIRE.
2. An initial common/agreed methodology
and guidelines towards RDF encodings for
INSPIRE
3. Recommendations for how location PIDs
could be governed for INSPIRE and other
relevant activities.
5
Scope
PID = A persistent identifier (PI) is a long-lasting reference to a digital object—a single file or set of files.
Noted persistent identifier systems include: Archival Resource Keys (ARKs), Digital Object Identifiers (DOIs),
Persistent Uniform Resource Locators (PURLs), Uniform Resource Names (URNs), and Extensible Resource
Identifiers (XRIs)
Click to edit Master title style
Linked Data Principles
1) Use URIs as names for things
2) Use HTTP URIs, so that people can look up those names
3) When someone looks up a URI, useful information should be provided, using the standards (RDF, SPARQL)
4) Include links to other URIs, so that they can discover more things
PIDs
<<assign>>
<<assign>>
Data
Sets/Objects
Real World
Things
<<include>>
Resources on the web that
contain data about real-world
things
Real-world things such as a river
basin, train station, a country, etc.
6
Agenda
Click to edit Master title style
1
Introduction
2
Context
3
Approach
4
Preliminary results
5
Sharing your experience – What barriers could there be for governing PIDs?
6
Next steps
7
Approach
DAMA Framework
(DAMA-DMBOK Guide)
Deloitte Data Governance
Framework (COBIT based)
Click to edit Master title style
Governance
Controls
Policies &
Principles
Data
Governance
Strategy
Tools &
Technology
Organization
Processes
Governance
Financing
Operations
Architecture
Evaluation Framework
Governance refers to the control and Click to
Operations
is about the processes
edit Master title style
the steering of the decisions on PIDs
and tools needed to run PIDs
Governance
Financing
Operations
Architecture
Financing is about the resources
Architecture corresponds to a formal
needed for the operations and the
architectural updates
specifications around PIDs
9
Evaluation Framework
Click to edit Master title style
10
Literature
Author(s)
Date
Archer, P., Dekkers,
M., Goedertier, S., &
Loutas, N.
201
3
Overbeek, H. & van
den Brink, L.
201
3
Overbeek,
Brentjes, T.
H.
&
Portele, C.
Vanbockryck, J. &
Robbrecht, J.
Berendt, B.
201
3
201
3
201
2
201
4
DDGI
201
3
Geonovum
201
3
Schade, S. & Smits,
P.
201
2
Archer, P.,
Goedertier, S. &
Loutas, N.
201
2
W3C
201
2
Davidson, P.,
Murray, K. &
Williams, S.
201
1
Title
Type
Description
Study on business models for Linked
Open Data (BM4LOGD)
R
Towards a national URI Strategy for
Linked Data of the
R
Source:
https://joinup.ec.europa.eu/sites/default/files/Study_on_business_m
odels_for_Linked_Open_Government_Data_BM4LOGD_v1.00_2.p
df
Source:
Click to edit Master title style
Dutch public sector
Concept URI Strategy for the NL Public
Sector
PPT
URI strategy of INSPIRE
PPT
Concepts of Meta-SDI
PPT
USEWOD 2014: Building a Web
Observatory for research on LOD usage
Open
Government
Data
–
Verwaltungsdaten frei für Wirtschaft und
Gesellschaft
W
PP
Projectplan Linked Open Data
R
Why Linked Data should not lead to next
generation SDI
10 Rules for Persistent URIs
P
http://www.pilod.nl/images/a/aa/D1-2013-0919_Towards_a_NL_URI_Strategy.pdf
Source:
http://www.geonovum.nl/sites/default/files/2013-03-12_uri-strat.pdf
Source:
http://www.geonovum.nl/sites/default/files/4clemensportele.pdf
Source:
http://www.poweredbyinspire.eu/documents/0403linkeddata-robbrechtvanbockryck.pdf
Source:
http://people.cs.kuleuven.be/~bettina.berendt/USEWOD2014/
Source:
http://www.google.be/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1
&ved=0CCsQFjAA&url=http%3A%2F%2Fwww.ddgi.de%2Fdownloa
ds-topmenu-8%2Fdoc_download%2F1066-positionspapier-opengovernment-data-verwaltungsdaten-frei-fuer-wirtschaft-undgesellschaft&ei=Rl3vUvnAOMijhgeLx4D4Dg&usg=AFQjCNHDyPud
3AJxKZTzBI7ovAJ3z3lzWg
Source:
http://www.geonovum.nl/sites/default/files/projectplan_linked_open_
data_2012-2013.pdf
Source: IEEE International on Geoscience and Remote Sensing
Symposium (IGARSS), pp. 2894-2897
R
Source: https://joinup.ec.europa.eu/community/semic/document/10rules-persistent-uris
223 Best Practices URI Construction.
W3C GLD WG wiki.
W
Designing URI Sets for Location. Version
1.0.
R
Source:
http://www.w3.org/2011/gld/wiki/223_Best_Practices_URI_Construc
tion
Source:
http://data.gov.uk/sites/default/files/Designing_URI_Sets_for_Locati
on-V1.0_10.pdf
11
Literature
Author(s)
Date
Davidson, P.
201
0
UK CTOC
201
4
Abbas, S. & Ojo, A.
201
4
Sauermann, L. &
Cyganiak, R.
200
8
201
4
201
4
201
4
EC, DG JRC
PURL
KE – Knowledge
Exchange
201
3
201
3
Click to edit Master title style
Title
Designing URI sets for the UK public
sector. Version 1.0. Interim paper.
Type
P
URI patterns for identifiers proposal
W
Applying Design Patterns in URI
Strategies - Naming in Linked Geospatial
Data Infrastructure
P
Cool URIs for the Semantic Web. W3C
Interest Group Note
Digital Object Identifier System (DOI)
W
The Handle System
W
EZID (Easy ID) – Long Term Identifiers
Made Easy
Implementation of Identifiers using URIs in
INSPIRE – Frequently Asked Questions.
Persistent Uniform Resource Locators
W
Persistent Identifiers Studies
W
Description
Source:
https://www.gov.uk/government/uploads/system/uploads/attachment
_data/file/60975/designing-URI-sets-uk-public-sector.pdf
http://standards.data.gov.uk/proposal/uri-patterns-identifiersproposal
Source:
https://www.deri.ie/content/applying-design-patterns-uri-strategiesnaming-linked-geospatial-data-infrastructure
W
W
W
Source : http://www.w3.org/TR/cooluris/
Source: http://www.doi.org/
Source : http://www.handle.net/
http://n2t.net/ezid/
http://inspire.ec.europa.eu/index.cfm/pageid/5120
http://purl.org/
http://www.knowledge-exchange.info/Default.aspx?ID=332
12
Literature
If there is anything relevant missing from our list, please contact us:
are3na@jrc.ec.europa.eu
Click to edit Master title
style
The list will be provided together with this current presentation.
13
Agenda
Click to edit Master title style
1
Introduction
2
Context
3
Approach
4
Preliminary results
5
Sharing your experience – What barriers could there be for governing PIDs?
6
Next steps
14
Governance
Click to edit Master title style
Governance
Organization
Austria – REEEP(Renewable Energy
and Energy Efficiency Partnership)
Reegle.info
Belgium – Agiv Flemish Land
Agency
Denmark – Knowledge Exchange
Germany - German National
Library
Italy - Agenzia per l'Italia Digitale
Netherlands – SURF Initiative
United
Kingdom
Chief
Technology Officer Council
Work
W3C
Study on business models for
Linked Open Data (BM4LOGD)
Concepts of Meta-SDI
URI Strategy of INSPIRE
Designing URI sets for UK Public
Sector
Designing URI sets for Location
(UK)
Click to edit Master title style
Input reviewed
Unknown yet
16
Governance
Preliminary conclusions
Based on available data analyzed until now:
1)
2)
3)
4)
5)
Barriers
Based on available data analyzed until now:
Click to edit Master title style
Policy is often associated with URI design
patterns and the management of domains.
However, there is definitely more than these
two elements.
When a policy exists it isn’t always well
formalized depending on the size and
organizational structure.
There seem to be no examples of PID
organizational structures in the context of
INSPIRE and in e-Government.
There is a lot of theoretical work about the
“technical side” of persistent identifiers, often
associated with linked data
Lack of a holistic picture that integrates the
several GOFA dimensions.
1)
2)
PIDs governance is an inherently complex
topic given the way that “ICT governance and
management” is done in the different Member
States
There is “no silver bullet” for PID governance
17
Financing
Governance
PIDs Policy
Defines what and
how can be
assigned a PDI
Business
Case
Justifies the
investment in
persistent
identifiers
PIDs Organisational
structure
Defines who has the authority
and control (planning,
monitoring and enforcement)
over the management of PIDs
Operations
Click
to edit Master title style
Registration
Validation
Creates and registers
PIDs
Redirection
Validates the
format, location and
the uniqueness of
PIDs
Uses persistent PIDs
to redirect to the
desired Web
resource
Long Term
Preservation
Archives fullback
copies of “dead”
resources
Cost Model
Service Model
Defines the allocation of costs
and charges associated with
the provision of persistent
PIDs.
Describes the set of services that will benefit
from the use of persistent PIDs, this
encompasses both data and metadata about
services
Financing
Financing
Architecture
18
Financing – Business case
Click to edit Master title style
Organisation
Work
United Kingdom - Department of
Environment, Food and Rural
Affairs (DEFRA)
Study on business models for
Linked Open Data (BM4LOGD)
Input reviewed
Unknown yet
19
Financing – Cost model
Organisation
Click
to edit Master
title style
European Commission
Austria – REEEP (Renewable
Energy and Energy Efficiency
Partnership) Reegle.info
Belgium – INSPIRE State Report
Germany – INSPIRE State Report
Italy- Agenzia per l'Italia Digitale
Work
UK – INSPIRE State Report
Study on business models for
Linked Open Data (BM4LOGD)
Belgium – INSPIRE State Report
Germany – INSPIRE State Report
Input reviewed
Unknown yet
20
Financing
Preliminary conclusions
Based on available data analyzed until now:
1)
Barriers
No clear
understanding
on title
the investment
Click1) to
edit
Master
style
needed to set up PIDs.
There are no formalized business cases for
PIDs – we believe that the interest in linked
data is sometimes enough:
e.g. John Sheridan, Head of
Legislation Services “A business
case for using linked data would like
making a business case for using
electricity.”
2)
There seems to be no cost model for PIDs.
For Linked Data or the implementation of
INSPIRE, when they can be found, they are
very different from one case to the other.
Below are a few examples:

Development costs, Maintenance costs, Promotion
costs – Linked Data

Operating costs of the IT infrastructure, Production of
interoperability Business model for the implementation
of INSPIRE, IT structure Processing of spatial data,
Personnel – INSPIRE Germany
21
Operational
Governance
PIDs Policy
Defines what and
how can be
assigned a PDI
Business
Case
Justifies the
investment in
persistent
identifiers
PIDs Organisational
structure
Defines who has the authority
and control (planning,
monitoring and enforcement)
over the management of PIDs
Operations
Click
to edit Master title style
Registration
Validation
Creates and registers
PIDs
Redirection
Validates the
format, location and
the uniqueness of
PIDs
Uses persistent PIDs
to redirect to the
desired Web
resource
Long Term
Preservation
Archives fullback
copies of “dead”
resources
Cost Model
Service Model
Defines the allocation of costs
and charges associated with
the provision of persistent
PIDs.
Describes the set of services that will benefit
from the use of persistent PIDs, this
encompasses both data and metadata about
services
Financing
Financing
Architecture
22
Operational
Organisation
Click to edit Master
style
Germany title
- German
National
Library
Netherlands - SURF
United
Kingdom
Chief
Technology Officer Council
Work
PersID III.a – Current State and
State of the Art & III.b – User
Requirements
10 Rules for Persistent URIs
URI Strategy of INSPIRE
Designing URI sets for UK Public
Sector
Designing URI sets for Location
(UK)
Towards a national URI-‐Strategy
for Linked Data of the Dutch
public
Input reviewed
Unknown yet
23
Operational
Preliminary conclusions
Based on available data analyzed until now:
1)
2)
3)
Barriers
Click to edit Master title style
The domain owner is implicitly considered
to be responsible for registration and
validation
There is little discussion about long term
preservation and most papers mention 303
Redirection
There is little discussion about validation
of PIDs (it is much more about
standardization and very little about control)
24
Architecture
Governance
PIDs Policy
Defines what and
how can be
assigned a PDI
Business
Case
Justifies the
investment in
persistent
identifiers
PIDs Organisational
structure
Defines who has the authority
and control (planning,
monitoring and enforcement)
over the management of PIDs
Operations
Click
to edit Master title style
Registration
Validation
Creates and registers
PIDs
Redirection
Validates the
format, location and
the uniqueness of
PIDs
Uses persistent PIDs
to redirect to the
desired Web
resource
Long Term
Preservation
Archives fullback
copies of “dead”
resources
Cost Model
Service Model
Defines the allocation of costs
and charges associated with
the provision of persistent
PIDs.
Describes the set of services that will benefit
from the use of persistent PIDs, this
encompasses both data and metadata about
services
Financing
Financing
Architecture
25
Architecture
Organisation
Austria – REEEP(Renewable
Click to edit Master
title
style
Energy and
Energy
Efficiency
Partnership) Reegle.info
Belgium – Agiv Flemish Land
Agency
Denmark – Knowledge Exchange
Germany - German National
Library
Ireland - Digital Enterprise
Research Institute, National
University of Ireland, Galway
Italy - Agenzia per l'Italia Digitale
Netherlands - SURF
United
Kingdom
Chief
Technology Officer Council
W3C
Work
Input reviewed
Unknown yet
Study on business models for
Linked Open Data (BM4LOGD)
Concepts of Meta-SDI
URI Strategy of INSPIRE
Designing URI sets for UK Public
Sector
Designing URI sets for Location
(UK)
26
Architecture
Preliminary conclusions
Most work has been done in this area.
Barriers
Control
and validation
are as
important
as
Click1) to
edit
Master
title
style
standardization but often overlooked.
1) Most referenced work is the 10 Rules for
Persistent URI and Designing URI sets for
UK Public Sector
2) There is no EU agreed policy for PIDs for
centrally managed, shared resources
3) Several guidelines have been put forward
for the URI persistence ( W3C - RFC2616., UK
- Designing URI sets for UK Public Sector,
ISA), but currently these are just
recommendations and not always aligned
27
Agenda
Click to edit Master title style
1
Introduction
2
Context
3
Approach
4
Preliminary results
5
Sharing your experience – What barriers could there be for governing PIDs?
6
Next steps
28
Sharing your experience – What barriers could
there be for governing PIDs?
Sharing your experience – What barriers could there be for
governing PIDs?
Governance
12/12
Click to edit Master title style
Operations
Architecture
Financing
Or our approach
30
Agenda
Click to edit Master title style
1
Introduction
2
Context
3
Approach
4
Preliminary results
5
Sharing your experience – What barriers could there be for governing PIDs?
6
Next steps
31
Next steps
Click to edit Master title style
1. Highlights
Follow us on Joinup. We will provide the highlights of this
webinar
2. Refinement
WEBINAR II: Guidelines & methodologies 5 & 6 May 2014
AND….
At any time, your feedback is very appreciated.
32
Next steps
Click to edit Master title style
Thank you for your participation!
Join the collaborative platforms in
other areas of Open Source &
Semantic Interoperability on
https://joinup.ec.europa.eu/
Connecting Geospatial Data
33
Download