DG Joint Research Centre Study and guidelines on Geospatial Linked Data as part of ISA Action 1.17 Persistent IDentifiers (PIDs) 13th March 2014 Alice Vasilescu - Deloitte Agenda Click to edit Master title style 1 Introduction 2 Context 3 Approach 4 Preliminary results 5 Sharing your experience – What barriers could there be for governing PIDs? 6 Next steps 2 Introduction Today’s webinar Click toWeedit Master style will provide an outline title of the study and 1. Setting the scene our work so far with regards to PIDs. This webinar is also an opportunity to provide feedback and to exchange experiences: What 2. Developing proposals β 3. Refinement & recommendations GUIDELINES barriers could there be for governing PIDs? WEBINAR II Guidelines on methodologies Presentation of the first version of the guidelines for a common RDF Vocabulary for INSPIRE data and an approach to PID governance: opportunities for communityled improvements and next steps towards an official INSPIRE encoding. Persistent Identifiers (PID) 3 Agenda Click to edit Master title style 1 Introduction 2 Context 3 Approach 4 Preliminary results 5 Sharing your experience – What barriers could there be for governing PIDs? 6 Next steps 4 Objectives This study has been prepared in the context of the Interoperability for European PublicClick to Administrations (ISA) Programme and, in particular A Reusable INSPIRE Reference Platform (ARE3NA, ISA Action 1.17) This study should provide: edit Master title style Connecting Geospatial Data 1. Shared evidence about the current status in Europe of linked (geospatial) data related to INSPIRE. 2. An initial common/agreed methodology and guidelines towards RDF encodings for INSPIRE 3. Recommendations for how location PIDs could be governed for INSPIRE and other relevant activities. 5 Scope PID = A persistent identifier (PI) is a long-lasting reference to a digital object—a single file or set of files. Noted persistent identifier systems include: Archival Resource Keys (ARKs), Digital Object Identifiers (DOIs), Persistent Uniform Resource Locators (PURLs), Uniform Resource Names (URNs), and Extensible Resource Identifiers (XRIs) Click to edit Master title style Linked Data Principles 1) Use URIs as names for things 2) Use HTTP URIs, so that people can look up those names 3) When someone looks up a URI, useful information should be provided, using the standards (RDF, SPARQL) 4) Include links to other URIs, so that they can discover more things PIDs <<assign>> <<assign>> Data Sets/Objects Real World Things <<include>> Resources on the web that contain data about real-world things Real-world things such as a river basin, train station, a country, etc. 6 Agenda Click to edit Master title style 1 Introduction 2 Context 3 Approach 4 Preliminary results 5 Sharing your experience – What barriers could there be for governing PIDs? 6 Next steps 7 Approach DAMA Framework (DAMA-DMBOK Guide) Deloitte Data Governance Framework (COBIT based) Click to edit Master title style Governance Controls Policies & Principles Data Governance Strategy Tools & Technology Organization Processes Governance Financing Operations Architecture Evaluation Framework Governance refers to the control and Click to Operations is about the processes edit Master title style the steering of the decisions on PIDs and tools needed to run PIDs Governance Financing Operations Architecture Financing is about the resources Architecture corresponds to a formal needed for the operations and the architectural updates specifications around PIDs 9 Evaluation Framework Click to edit Master title style 10 Literature Author(s) Date Archer, P., Dekkers, M., Goedertier, S., & Loutas, N. 201 3 Overbeek, H. & van den Brink, L. 201 3 Overbeek, Brentjes, T. H. & Portele, C. Vanbockryck, J. & Robbrecht, J. Berendt, B. 201 3 201 3 201 2 201 4 DDGI 201 3 Geonovum 201 3 Schade, S. & Smits, P. 201 2 Archer, P., Goedertier, S. & Loutas, N. 201 2 W3C 201 2 Davidson, P., Murray, K. & Williams, S. 201 1 Title Type Description Study on business models for Linked Open Data (BM4LOGD) R Towards a national URI Strategy for Linked Data of the R Source: https://joinup.ec.europa.eu/sites/default/files/Study_on_business_m odels_for_Linked_Open_Government_Data_BM4LOGD_v1.00_2.p df Source: Click to edit Master title style Dutch public sector Concept URI Strategy for the NL Public Sector PPT URI strategy of INSPIRE PPT Concepts of Meta-SDI PPT USEWOD 2014: Building a Web Observatory for research on LOD usage Open Government Data – Verwaltungsdaten frei für Wirtschaft und Gesellschaft W PP Projectplan Linked Open Data R Why Linked Data should not lead to next generation SDI 10 Rules for Persistent URIs P http://www.pilod.nl/images/a/aa/D1-2013-0919_Towards_a_NL_URI_Strategy.pdf Source: http://www.geonovum.nl/sites/default/files/2013-03-12_uri-strat.pdf Source: http://www.geonovum.nl/sites/default/files/4clemensportele.pdf Source: http://www.poweredbyinspire.eu/documents/0403linkeddata-robbrechtvanbockryck.pdf Source: http://people.cs.kuleuven.be/~bettina.berendt/USEWOD2014/ Source: http://www.google.be/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1 &ved=0CCsQFjAA&url=http%3A%2F%2Fwww.ddgi.de%2Fdownloa ds-topmenu-8%2Fdoc_download%2F1066-positionspapier-opengovernment-data-verwaltungsdaten-frei-fuer-wirtschaft-undgesellschaft&ei=Rl3vUvnAOMijhgeLx4D4Dg&usg=AFQjCNHDyPud 3AJxKZTzBI7ovAJ3z3lzWg Source: http://www.geonovum.nl/sites/default/files/projectplan_linked_open_ data_2012-2013.pdf Source: IEEE International on Geoscience and Remote Sensing Symposium (IGARSS), pp. 2894-2897 R Source: https://joinup.ec.europa.eu/community/semic/document/10rules-persistent-uris 223 Best Practices URI Construction. W3C GLD WG wiki. W Designing URI Sets for Location. Version 1.0. R Source: http://www.w3.org/2011/gld/wiki/223_Best_Practices_URI_Construc tion Source: http://data.gov.uk/sites/default/files/Designing_URI_Sets_for_Locati on-V1.0_10.pdf 11 Literature Author(s) Date Davidson, P. 201 0 UK CTOC 201 4 Abbas, S. & Ojo, A. 201 4 Sauermann, L. & Cyganiak, R. 200 8 201 4 201 4 201 4 EC, DG JRC PURL KE – Knowledge Exchange 201 3 201 3 Click to edit Master title style Title Designing URI sets for the UK public sector. Version 1.0. Interim paper. Type P URI patterns for identifiers proposal W Applying Design Patterns in URI Strategies - Naming in Linked Geospatial Data Infrastructure P Cool URIs for the Semantic Web. W3C Interest Group Note Digital Object Identifier System (DOI) W The Handle System W EZID (Easy ID) – Long Term Identifiers Made Easy Implementation of Identifiers using URIs in INSPIRE – Frequently Asked Questions. Persistent Uniform Resource Locators W Persistent Identifiers Studies W Description Source: https://www.gov.uk/government/uploads/system/uploads/attachment _data/file/60975/designing-URI-sets-uk-public-sector.pdf http://standards.data.gov.uk/proposal/uri-patterns-identifiersproposal Source: https://www.deri.ie/content/applying-design-patterns-uri-strategiesnaming-linked-geospatial-data-infrastructure W W W Source : http://www.w3.org/TR/cooluris/ Source: http://www.doi.org/ Source : http://www.handle.net/ http://n2t.net/ezid/ http://inspire.ec.europa.eu/index.cfm/pageid/5120 http://purl.org/ http://www.knowledge-exchange.info/Default.aspx?ID=332 12 Literature If there is anything relevant missing from our list, please contact us: are3na@jrc.ec.europa.eu Click to edit Master title style The list will be provided together with this current presentation. 13 Agenda Click to edit Master title style 1 Introduction 2 Context 3 Approach 4 Preliminary results 5 Sharing your experience – What barriers could there be for governing PIDs? 6 Next steps 14 Governance Click to edit Master title style Governance Organization Austria – REEEP(Renewable Energy and Energy Efficiency Partnership) Reegle.info Belgium – Agiv Flemish Land Agency Denmark – Knowledge Exchange Germany - German National Library Italy - Agenzia per l'Italia Digitale Netherlands – SURF Initiative United Kingdom Chief Technology Officer Council Work W3C Study on business models for Linked Open Data (BM4LOGD) Concepts of Meta-SDI URI Strategy of INSPIRE Designing URI sets for UK Public Sector Designing URI sets for Location (UK) Click to edit Master title style Input reviewed Unknown yet 16 Governance Preliminary conclusions Based on available data analyzed until now: 1) 2) 3) 4) 5) Barriers Based on available data analyzed until now: Click to edit Master title style Policy is often associated with URI design patterns and the management of domains. However, there is definitely more than these two elements. When a policy exists it isn’t always well formalized depending on the size and organizational structure. There seem to be no examples of PID organizational structures in the context of INSPIRE and in e-Government. There is a lot of theoretical work about the “technical side” of persistent identifiers, often associated with linked data Lack of a holistic picture that integrates the several GOFA dimensions. 1) 2) PIDs governance is an inherently complex topic given the way that “ICT governance and management” is done in the different Member States There is “no silver bullet” for PID governance 17 Financing Governance PIDs Policy Defines what and how can be assigned a PDI Business Case Justifies the investment in persistent identifiers PIDs Organisational structure Defines who has the authority and control (planning, monitoring and enforcement) over the management of PIDs Operations Click to edit Master title style Registration Validation Creates and registers PIDs Redirection Validates the format, location and the uniqueness of PIDs Uses persistent PIDs to redirect to the desired Web resource Long Term Preservation Archives fullback copies of “dead” resources Cost Model Service Model Defines the allocation of costs and charges associated with the provision of persistent PIDs. Describes the set of services that will benefit from the use of persistent PIDs, this encompasses both data and metadata about services Financing Financing Architecture 18 Financing – Business case Click to edit Master title style Organisation Work United Kingdom - Department of Environment, Food and Rural Affairs (DEFRA) Study on business models for Linked Open Data (BM4LOGD) Input reviewed Unknown yet 19 Financing – Cost model Organisation Click to edit Master title style European Commission Austria – REEEP (Renewable Energy and Energy Efficiency Partnership) Reegle.info Belgium – INSPIRE State Report Germany – INSPIRE State Report Italy- Agenzia per l'Italia Digitale Work UK – INSPIRE State Report Study on business models for Linked Open Data (BM4LOGD) Belgium – INSPIRE State Report Germany – INSPIRE State Report Input reviewed Unknown yet 20 Financing Preliminary conclusions Based on available data analyzed until now: 1) Barriers No clear understanding on title the investment Click1) to edit Master style needed to set up PIDs. There are no formalized business cases for PIDs – we believe that the interest in linked data is sometimes enough: e.g. John Sheridan, Head of Legislation Services “A business case for using linked data would like making a business case for using electricity.” 2) There seems to be no cost model for PIDs. For Linked Data or the implementation of INSPIRE, when they can be found, they are very different from one case to the other. Below are a few examples: Development costs, Maintenance costs, Promotion costs – Linked Data Operating costs of the IT infrastructure, Production of interoperability Business model for the implementation of INSPIRE, IT structure Processing of spatial data, Personnel – INSPIRE Germany 21 Operational Governance PIDs Policy Defines what and how can be assigned a PDI Business Case Justifies the investment in persistent identifiers PIDs Organisational structure Defines who has the authority and control (planning, monitoring and enforcement) over the management of PIDs Operations Click to edit Master title style Registration Validation Creates and registers PIDs Redirection Validates the format, location and the uniqueness of PIDs Uses persistent PIDs to redirect to the desired Web resource Long Term Preservation Archives fullback copies of “dead” resources Cost Model Service Model Defines the allocation of costs and charges associated with the provision of persistent PIDs. Describes the set of services that will benefit from the use of persistent PIDs, this encompasses both data and metadata about services Financing Financing Architecture 22 Operational Organisation Click to edit Master style Germany title - German National Library Netherlands - SURF United Kingdom Chief Technology Officer Council Work PersID III.a – Current State and State of the Art & III.b – User Requirements 10 Rules for Persistent URIs URI Strategy of INSPIRE Designing URI sets for UK Public Sector Designing URI sets for Location (UK) Towards a national URI-‐Strategy for Linked Data of the Dutch public Input reviewed Unknown yet 23 Operational Preliminary conclusions Based on available data analyzed until now: 1) 2) 3) Barriers Click to edit Master title style The domain owner is implicitly considered to be responsible for registration and validation There is little discussion about long term preservation and most papers mention 303 Redirection There is little discussion about validation of PIDs (it is much more about standardization and very little about control) 24 Architecture Governance PIDs Policy Defines what and how can be assigned a PDI Business Case Justifies the investment in persistent identifiers PIDs Organisational structure Defines who has the authority and control (planning, monitoring and enforcement) over the management of PIDs Operations Click to edit Master title style Registration Validation Creates and registers PIDs Redirection Validates the format, location and the uniqueness of PIDs Uses persistent PIDs to redirect to the desired Web resource Long Term Preservation Archives fullback copies of “dead” resources Cost Model Service Model Defines the allocation of costs and charges associated with the provision of persistent PIDs. Describes the set of services that will benefit from the use of persistent PIDs, this encompasses both data and metadata about services Financing Financing Architecture 25 Architecture Organisation Austria – REEEP(Renewable Click to edit Master title style Energy and Energy Efficiency Partnership) Reegle.info Belgium – Agiv Flemish Land Agency Denmark – Knowledge Exchange Germany - German National Library Ireland - Digital Enterprise Research Institute, National University of Ireland, Galway Italy - Agenzia per l'Italia Digitale Netherlands - SURF United Kingdom Chief Technology Officer Council W3C Work Input reviewed Unknown yet Study on business models for Linked Open Data (BM4LOGD) Concepts of Meta-SDI URI Strategy of INSPIRE Designing URI sets for UK Public Sector Designing URI sets for Location (UK) 26 Architecture Preliminary conclusions Most work has been done in this area. Barriers Control and validation are as important as Click1) to edit Master title style standardization but often overlooked. 1) Most referenced work is the 10 Rules for Persistent URI and Designing URI sets for UK Public Sector 2) There is no EU agreed policy for PIDs for centrally managed, shared resources 3) Several guidelines have been put forward for the URI persistence ( W3C - RFC2616., UK - Designing URI sets for UK Public Sector, ISA), but currently these are just recommendations and not always aligned 27 Agenda Click to edit Master title style 1 Introduction 2 Context 3 Approach 4 Preliminary results 5 Sharing your experience – What barriers could there be for governing PIDs? 6 Next steps 28 Sharing your experience – What barriers could there be for governing PIDs? Sharing your experience – What barriers could there be for governing PIDs? Governance 12/12 Click to edit Master title style Operations Architecture Financing Or our approach 30 Agenda Click to edit Master title style 1 Introduction 2 Context 3 Approach 4 Preliminary results 5 Sharing your experience – What barriers could there be for governing PIDs? 6 Next steps 31 Next steps Click to edit Master title style 1. Highlights Follow us on Joinup. We will provide the highlights of this webinar 2. Refinement WEBINAR II: Guidelines & methodologies 5 & 6 May 2014 AND…. At any time, your feedback is very appreciated. 32 Next steps Click to edit Master title style Thank you for your participation! Join the collaborative platforms in other areas of Open Source & Semantic Interoperability on https://joinup.ec.europa.eu/ Connecting Geospatial Data 33