Auditing our Data Method and Toolkit

advertisement
www.derilinx.com
Auditing Our Data
Method & Toolkit
Deirdre Lee
deirdre@derilinx.com
@deirdrelee
11th Feb 2015
“Just as oil was likened to black gold, data
takes on a new importance and value in the
digital age.”
-Commissioner Neelie Kroes
2
www.derilinx.com
Open Data brings many benefits…
3
… where do we begin?
www.derilinx.com
4
https://www.gov.uk/government/publications/open-data-charter/g8-open-datacharter-and-technical-annex
Photo: http://www.telegraph.co.uk/news/worldnews/g8/10128266/G8-Open-DataCharter-why-it-matters.html
www.derilinx.com
What data do we hold?
How is data managed?
What data should we publish?
5
www.derilinx.com
May 2013: Executive Order -- Making Open and
Machine Readable the New Default for
Government Information
In the (UK) government response to the
Shakespeare Review of June 2013, the
government sets out its aim to create a National
Information Infrastructure (NII).
6
www.derilinx.com
What is Data?
7
www.derilinx.com
Data Lifecycle
8
Knud Möller. Lifecycle models of data-centric systems and domains: The abstract data
lifecycle model. 2012. Semantic Web 4.1: 67-88. URL: http://www.semantic-webjournal.net/content/lifecycle-models-data-centric-systems-and-domains
www.derilinx.com
Challenges of Data Management
• Who owns the data?
• Who is responsible for the data?
• What if data is aggregated from multiple
sources?
• How to keep data up-to-date?
• ..etc.
9
www.derilinx.com
Data Asset Framework
10
Ekmekcioglu, C. et al., 2009. Data Asset Framework Implementation Guide, Digital
Curation Centre. Available at: http://www.data-audit.eu/.
www.derilinx.com
COMSODE Methodology for Publishing Datasets as Open Data
11
Nečaský, M. et al., 2015. D5.1 Methodology for publishing datasets as open data,
COMSODE. Available at: http://www.comsode.eu/index.php/deliverables/.
www.derilinx.com
Development of Open Data Publication Plan
• Identify datasets managed/held by organisation.
• Record basic information for each dataset
• Identify datasets to publish as Open Data
• Define how and how often data should be published
• Estimate the effort of publication
• Prioritise data to publish as Open Data
12
www.derilinx.com
D/PER Open Data Audit Template
13
www.derilinx.com
14
www.derilinx.com
Administrative Data
Operational Data
15
www.derilinx.com
Identifying Datasets
• Machine-readable data, not graphs, reports,
applications, or web portals.
• A dataset should be managed and maintained
within the organisation.
• High-value data should be prioritised
16
www.derilinx.com
Machine-Readable Data



17
Table, graph and map from The Quality of Bathing Water in Ireland – An Overview for the
Year 2012
https://www.epa.ie/pubs/reports/water/bathing/thequalityofbathingwaterinireland2012.
html#.U0FxMVdgzvY
www.derilinx.com
High-value Data
• used to increase agency accountability and
responsiveness;
• improve public knowledge of the agency and its
operations;
• further the core mission of the agency;
• create economic opportunity; or
• respond to need and demand as identified through
public consultation
(The U.S. Office of Management and Budget, 2009)
www.derilinx.com
18
Tips on Identifying High-Value Data
• Reports, surveys or other public documents that have
been published by the organisation, or
• Data requests that have been received by the
organisation, e.g. ePQs or FOI requests.
• Data behind Key Performance Indicators, e.g. those used
in IrelandStat
19
www.derilinx.com
Development of Open Data Publication Plan
• Identify datasets managed/held by organisation.
• Record basic information for each dataset
• Identify datasets to publish as Open Data
• Define how and how often data should be published
• Estimate the effort of publication
• Prioritise data to publish as Open Data
20
www.derilinx.com
Metadata
21
www.derilinx.com
22
www.derilinx.com
Development of Open Data Publication Plan
• Identify datasets managed/held by organisation.
• Record basic information for each dataset
• Identify datasets to publish as Open Data
• Define how and how often data should be published
• Estimate the effort of publication
• Prioritise data to publish as Open Data
23
www.derilinx.com
Chain of economic effects of lowered PSI re-use charges
24
DeVries, M. & Hittmair, G., 2014. Open data is fact now- when does the reuse. In
SharePSI Lisbon. Lisbon, PT. Available at: http://www.w3.org/2013/sharepsi/workshop/lisbon/agenda.
www.derilinx.com
Chain of economic effects of lowered PSI re-use charges
25
DeVries, M. & Hittmair, G., 2014. Open data is fact now- when does the reuse. In
SharePSI Lisbon. Lisbon, PT. Available at: http://www.w3.org/2013/sharepsi/workshop/lisbon/agenda.
www.derilinx.com
Chain of economic effects of lowered PSI re-use charges
26
DeVries, M. & Hittmair, G., 2014. Open data is fact now- when does the reuse. In
SharePSI Lisbon. Lisbon, PT. Available at: http://www.w3.org/2013/sharepsi/workshop/lisbon/agenda.
www.derilinx.com
Open Data
--------------------Closed Data
• Personal Data
• Secure Data
27
www.derilinx.com
Upcoming Opportunities
• Open Data Workshop, Thurs 19th Feb
• Open Data Training Day, early March
• Support for internal data audits
• Support for development of Open Data Strategies
28
www.derilinx.com
Download