Data Sharing and Linkage Service

advertisement
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
Data Sharing and Linkage Service
Pathfinder Projects Update - For Information
Purpose
This paper was first provided to the board in August 2013 and has been updated with
information on the principal investigators / project leads and progress.
Background
As part of the Data Linkage Framework a fund has been established to support projects
that will deliver its aims by tackling the challenges that prevent effective and efficient data
linkages from occurring. These challenges are
1. Uncertainty about the legalities and public acceptability of data sharing and linkage.
2. Incomplete data, or data that cannot be linked.
3. Limited capacity for secure exchange and access to data.
4. Limited capacity of public sector organisations to analyse and make use of linked
data
Project Updates
 An event was held on 7th October where each of the main projects was presented to
a group of analysts and other staff from Scottish Government, NRS, NSS and other
partner organisations. These presentations will be prepared for wider publication.
 Each of the scoping projects has now produced a draft report. These will be finalised
and then shared to stimulate ideas for potential new projects in these areas
 The young people scoping project has been subsumed into a wider project being
carried out in Scottish Government around the strategic use of data in this field and
no longer requires funding
 Discussions with PAC around the implications of their feedback on the privacy
implications of this project are ongoing
 A new indexer has been recruited in NRS and will aim to take forward linkage on the
1947 mental survey as an early priority
Selection Process
All projects submit an application describing the proposed project, setting out the
challenges to be addressed, the project methodology, the timeline, funding requirements
etc. The proforma is attached in Annexe A.
Each project is considered by a panel consisting of the Chief Statistician, Chief Executive of
NRS and the Head of ISD. In most cases approval of projects involved a period of
negotiation between the researchers and the approval panel to come to an agreed project.
A formal procurement process was then completed through NRS to deliver the funding with
contract management provided by the DSLS team.
All projects have gone through this process. The total funds committed to pathfinders
across three years is £240,000.
Advertising of the opportunity for this funding has been relatively informal through exiting
networks. All bids are considered by the selection panel and each is considered on its
individual merits. No set budget has been ring-fenced for additional projects to be delivered
so there is no maximum or minimum the funding that must be allocated in a year. We are
committed to considering funding projects up to April 2015.
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
Projects
Project: NHS Greater Glasgow and Clyde Population Counts
Aim: To enable a better understanding of the differences in population counts produced by
different sources for Glasgow City Council and NHS Greater Glasgow and Clyde
Project Lead / Principal Investigator: Kirsty MacLachlan NRS
Objectives:

to provide a better understanding of the differences between population
estimates in Scotland calculated by NRS (the Census and the Mid-YearEstimates) and estimates derived from administrative data sources, such as the
CHI, NHSCR, Local Authority (LA) data, etc.

to review and document available data sources and assess whether the data
quality of administrative data sources can be improved

to establish the suitability and accuracy of relevant data sources and
determine whether it is possible to use them to contribute to population statistics

to assess whether the population of NHS service users in GG&C Health
Board area is significantly different to the population estimates produced by NRS,
particularly given the significance of the MYE in determining the allocation of
NHS resources

to identify any potential disparities in the population estimates for particular
geographical areas or subsets of the population, with the intention of
documenting, explaining and assessing the implications of these differences
where possible.

To report on any barriers to data sharing that are identified and to document
the steps taken to reduce or alleviate these barriers. This will assist the work that
is taking place in the Beyond 2011 programme and the Data Sharing & Linkage
Service (DSLS).
Costs: £70,000 (to provide staff in NRS) plus potential support in kind for linkage when
DSLS staff in place
Outcomes:
 to create an inventory of all relevant administrative data sources, including an
assessment of the potential contribution each dataset can make to the project
objectives.

to produce detailed reports which describe and analyse the differences
between the various datasets to which access is granted, including summary
tables of population estimates from different sources.

to create a set of agreed population records which enable a reliable and
accurate population estimate to be derived.

to report and present any recommendations concerning the future allocation
of NHS funding to TAGRA based on the evidence found in the analysis.
Data Linkage Framework Programme Board - 5 November 2013

PAPER 5
to report on any barriers to data sharing that are identified and to document
the steps taken to reduce or alleviate these barriers and to inform the
development of the population spine This will assist the work that is taking
place in the Beyond 2011 programme and the Data Sharing & Linkage
Service (DSLS).
Learning Points
Data quality The project is particularly focussed on the quality of data used to produce full
population statistics and will help to address data quality and completeness issues
associated with this type of data. Ensuring the data quality improvement work can be
generally applied across local authorities and health boards is an important factor being
considered. For example the project has developed our understanding of the challenges
associated with identifying the whereabouts of hard to count groups such as young men not
in regular contact with health or authority services
Data sharing This project is developing our understanding on all the challenges by
investigating sharing and linking issues relating to data held by the council, the health
board, NRS and ISD as well as other potential sources of population data.
Spine The project is informing the development of the population spine to be used both in
Beyond 2011 and to facilitate linking in the DSLS
Technicalities of linking and sharing The project gives an opportunity to test and
develop our technical approaches to file exchange, linkage etc
Timescales
Key Milestones
By Jan 2014 – Source investigation phase complete, aiming to have data access in place
By March 2014 – Inventory of data sources completed
By April 2014 – Report on Barriers encountered to sharing
By Jan 2015 – Expansion to support national level work for B2011
By Jan 2015 – Further record level linkage in support of B2011 complete
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
Project: Children vulnerable to maltreatment and/or neglect: scoping the capacity of the
Scottish Longitudinal Survey (SLS) to address research questions of interest.
Project Lead / Principal Investigator: Janice McGhee, Edinburgh University
Aims:
 To scope whether it is possible to identify vulnerable children using the SLS through
mapping relevant information within its linked administrative and health-related data.
 To explore whether data in the SLS would be useful and/or possible in creating task
specific administrative data linkage projects addressing research questions relevant
to these children
 To explore, given privacy and data issues, whether the SLS may be a suitable
vehicle to conduct longitudinal research on children who potentially may be
vulnerable to maltreatment and/or neglect in Scotland.
Objectives:
 To identify and create indicators from relevant variables/field within SLS and linked
administrative and health-related data that could potentially identify populations of
children who may be vulnerable to maltreatment and neglect. This would include
discussion of relevant derived variables (such as key hospital admission codes).

To identify and create indicators from relevant variables/field within SLS and linked
administrative and health-related data that may have the capacity to answer specific
questions about children who potentially may be vulnerable to neglect and abuse.

To gather information on frequencies, for example, the number of parents in
treatment for drug misuse who also have children.

To identify appropriate sample(s) from the SLS. Taking account of privacy issues
this may include identifying vulnerable children born to SLS members who are the
parents, or SLS members who are children and linking in information on their
mother.

After identifying the sample(s), to explore whether there would be sufficient numbers
for useful longitudinal analysis?

To explore technically as to whether either a SLS member who is the child can have
other family records linked (health data about their parents), in order to provide
informative longitudinal data about vulnerable children and their parents. For
example, are the datasets CHI seeded, or hold other appropriate identifiers, in order
to have required information for the linkage to the SLS.

Further, to explore ethical issues in consultation with the NHS/ISD Privacy Advisory
Committee (PAC) relevant to linkage beyond the SLS member, either as the child or
parent, since this will be required to track development and vulnerability

To identify new linkages from current health and social care datasets that would be
necessary to, or would enhance, the identification and tracking of these vulnerable
children.
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
Costs: £25,000
Outcomes:
The primary outcome would be a working paper scoping the capacity of the Scottish
Longitudinal Survey and associated health and education related databases to identify
children who may potentially be vulnerable to maltreatment and/or neglect. This would
include:




Discussion of practical, ethical and technical issues in linking data;
Exploration of the possibility of small scale linkages that would be informative about
specific aspects of children’s well-being, for example, child members who are in
treatment for substance misuse and who have children;
Potential research questions that could be addressed; and
Whether there is likely to be sufficient numbers of relevant children for any
longitudinal analysis.
The secondary outcome would be a paper to discuss the implications arising from the
identified practical, ethical and technical issues in the context of the proposed National
Data Linkage Centre and PAC regime.
Learning Points
Consent Explore issues relating to consent where data on another un-consented
individual (e.g. a parent) is required in order to study an individual where consent does
exist (e.g. the child)
Issue associated with very small population Inform us on how to use data linkage to
create longitudinal research to facilitate the study of very small population groups (such as
vulnerable children) and inform us on any particular privacy issues due to the small
populations
Read Through Informing the work on maintaining the ability to link between admin systems
for this purpose
Approvals Will work through existing approval processes as well as using new streamlined
processes gathering valuable experience and insights into the mechanisms around this to
inform the programme
NB This project has recently received feedback from its PAC application. It has identified a
number of significant issues relating the privacy of the individuals involved. Given the
vulnerable nature of the individuals involved the PAC felt there was an obligation to act if
the researcher identified a potential for harm to an individual (even if this was a deidentified record) and that this would potentially result in a breach of privacy.
Researchers at Edinburgh University and the SLS are currently clarifying and responding to
the PAC feedback and colleagues from Scottish Government and NRS will also contribute
to understanding the PAC response and developing a way forward to deal with the
important issues that have been identified for this project. This may result in a change in
the timeline and scope of this project but it is providing exceptionally valuable learning
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
around the nature of analysis of vulnerable individuals and small populations.
25/10/13 – Discussions with PAC are ongoing but have not yet been concluded to allow the
project to be replanned
Timescales
Key Milestones
- Initiation - December 2012
- Review of Variables - February 2013
- Completion of PAC application - March 2013
- Completion of analysis report – October 2014
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
Project: Looked After Children Data / Dental Records
Project Lead / Principal Investigator: David Conway, University of Glasgow
Aims:
A data linkage study which provides access to population wide data from NHS dental
and local authority sources. These will provide greater scope for comparison between
young people who are looked after by the state and their peers. Finally we aim to set
out the governance and technical protocols which can be used in future projects which
to link the similar data sets
Objectives:

To conduct a cross-sectoral data linkage project focusing on linking and analysing the
NHS dental service and oral health data collated in the Information Services Division
(ISD) of National Services Scotland (NSS) to the national data on looked after children
collected by local authorities and collated by the Scottish Government. This will include
a comparison of oral health and access to dental services among looked after children
with the wider child health population and assess any variation by geographic location,
age, sex, and category of looked after children.

To provide guidance which will facilitate data linkage in the longer-term and thus the
sustainability of the project.

To provide results to inform the development of care pathways for these children within
the Childsmile national oral health improvement programme. This will include producing
a report for the Childsmile Board and Chief Dental Officer, Scottish Government to
inform the development of care pathways into dental services for looked after children
Costs: £110,000
Outcomes:
1) Produce a protocol for cross-sectoral record-linkage involving health and Looked After
Children data. This will include detailing governance processes in relation to approval for
data access, linkage and analysis. It will also include outlining a mechanism for maintaining
the potential for future linkage without the maintenance of a large merged database (read
through)
2) Produce a detailed metadata report of linked dataset including variable descriptors and
definitions, and data quality, completeness, accuracy (with detailed focus on LAC data).
3) Produce a final research report on the findings of the data linkage detailed
epidemiological analysis and draft papers for peer reviewed journals. These publications
will include explanations for these findings, demonstrate the utility of this cross-sectoral
linkage, and explore the need for developing potential interventions and care-pathways for
these children. Implications for general health outcomes and services will also be explored
Learning Points
Approvals and Sharing: Our study will continue to explore the information governance
and approval processes for cross-sectoral linkage building on learning from the Scottish
Informatics Programme (SHIP) and on the first ScotXed and health linkage of SCN and CHI
numbers. Permission will be sought from the ScotXed and LAC data access controllers /
committees. NHS ethics approval is unlikely to be required for the data linkage, but a NHS
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
Ethics Committee scientific officer will be consulted for advice.
Data Quality: Our project will also include assessment of quality and completeness of the
LAC dataset.
Read Through: The project will be an exemplar for read through and seek to establish the
principle and process of read through by establishing this between health and education
Cross sectoral working: We believe our collaboration of multi-disciplinary academics with
colleagues in ISD, and Scottish Government with strong policy and front-line service links,
will deliver the additional capacity to analyse and support the use of linked data. The
techniques developed by our project are likely apply to other preventive health services for
children who are looked after by the state including for example, immunisation
programmes, sexual health initiatives and other interventions which seek to improve health
and wellbeing.
End to End Linkage Project: This project allows us to test from beginning to end the
proposed processes being developed to deliver data linkage from initiation, approvals,
sharing, linkage, analysis and disclosure control.
Contribution to Official Statistics: It is intended that the linkage between LAC and health
will ultimately contribute to LAC national statistics reporting so this project will help us to
identify particular issues relating to production of national statistics that can be more widely
applied.
Timescales
Key Milestones

Project Initiation - January 2013

Data Sharing Agreement for LAC data - March 2013

Source Metadata Report - November 2013

All remaining health approvals and pathway protocol - November 2013

Final generic read through process - January 2014

Interim Linking and Analysis Report - February 2014

Final Linking and Analysis Report - March 2014

Linked Metadata Report - June 2014

Final Research Report - September 2014
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
Project: Data Quality Improvement in Scottish Environmental Protection Agency (SEPA)
Project Lead / Principal Investigator: Duncan Taylor / Steven Fox (SEPA)
Aims: To start rolling out within a test setting ProcXed, the ScotXed data validation tool, to
demonstrate that it can be delivered outwith the existing Scottish Government limitation to
deliver improved data quality in partner organisations
Objectives:

To demonstrate the de-coupled ProcXed tool can be used externally in test and then in
the live installation

To test this within a near live environment

To understand user issues associated with using the tool externally

To understand challenges and opportunities associated with this approach

To add additional users organisations based on the experience with SEPA
Costs: No cash cost but in kind support being provided by Scotxed
Outcomes:
 SEPA to use ProcXed to collect data from a range of partners to feed into their
Aquaculture project using an automated upload, validation and data transfer process
to move the data into their backend systems to support reporting etc.
 To demonstrate the usefulness of the tool in a live data collection / improvement
environment by delivery of case studies describing the process
 Improved data quality / collection in partner organisations
 To open up potential access to new data sources
Learning Points
Data Quality To help understand the issues around improving data quality within a live
collection environment
Service Provision To help understand the challenges and opportunities associated with
providing a service to partner organisations around data quality improvement and to
provide technical feedback on the provision of the validation tool.
NB Following on from this project ScotXed are also now working with Building Standards –
this is a collection that will go live in early 2014 and have gone live with data collection on
the first benefit to be administered in Scotland – Scottish Welfare Fund. Lessons learned
from this project have contributed to successful deliver of these follow on projects and the
improved quality data already being collected
Timescales
Key Milestones


Test set up established and SLA agreed - February 2013
Data Collections started - March 2013
Data Linkage Framework Programme Board - 5 November 2013




Final technical set up work completed - August 2013
Live processes started - September 2013
Aquaculture website live – October 2013
Evaluation complete – November 2013
PAPER 5
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
Project: Cross-Sectoral Scoping Opportunities and Challenges
Aims: To start focussing on specific challenges and opportunities associated with linking of
data between specific sectors
Project Lead / Principal Investigator: Health – Jill Pell, University of Glasgow;
Environment – Marian Scott, University of Glasgow; Genetics – Pamela Linkstead,
Generation Scotland; Young People – Carrie Graham, Scottish Government
Objectives:
 Develop our network of researchers interested in data linkage

For a number of important sectors identify;
o any current sector level key policy drivers or challenges that could be better
addressed with improved evidence or research using linked data and any key /
priority opportunities to link datasets to help deliver this improved evidence from
the perspective of those within the sector
o The challenges preventing this from happening (eg legal, data quality etc)
o Any current data linkage centres, infrastructure or resource within the sector

At the dataset level identify;
o Important datasets within the sector including a brief summary explanation of
what is held in the dataset, coverage eg national or for a specific location, type of
data eg survey or census data, administrative data, purpose of collection eg for
statistics, for administration etc, and how often the data is collected or updated
eg annual collection, constantly changing admin system
o Specific useful linkages that could be made to this data set
o Benefits of making this linkage
o Barriers to creating this linkage
Costs: £30,000 (across four partners - Environment, Health, Genetics and Young People)
Outcomes:
 Increase our network of interested researchers to be further developed
 Meta data on each of the identified sectors
 Priority projects within and between sectors
 Increase visibility and recognition of the DLF
 Initial projects to be delivered through DSLS
Learning Points
Challenges and Opportunities Increase our knowledge by gathering detailed information
on the range of challenges and opportunities presenting within specific sectors, increasing
the detail of our knowledge on specifics within sectors
Expectation of Researchers Increase our knowledge of what specific researchers expect
to be able to deliver in terms of linked data and what they expect of a linkage service being
developed to deliver this
Prioritisation of Linkage Can we start to identify a key strategic core of datasets that will
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
open up linkage across a number of sectors and so should be prioritised for linkage
Timescales
Key Milestones




Health Scoping August 2013
Environmental Work Completed September 2013
Genetics Work Completed October 2013
Young People Completed January 2014
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
Project: 1947 Mental Survey Linkage
Project Lead / Principal Investigator: Ian Deary, Edinburgh University
Aims: To develop a new longitudinal linkage resource based on the 1947 national survey
of the mental ability of 11 year olds by linking to the NHSCR to allow records to be linked
up to the present and back to the 1939 population register
Objectives:

To identify broader issues and opportunities for longitudinal research and opportunities
to link to the 1947 data

To link the 1947, 1939 and NHSCR

To agree a storage strategy for this new resource
Costs: £10,000
Outcomes:
 Linked data set matching all elements of the 1947 data (main population plus
detailed sub-samples) to NHSCR
 Scoping report on longitudinal data
 Scoping report on linking to the 1947 data
 Agreed storage solution for this dataset
Learning Points
Longitudinal Data Longitudinal data projects are long term and so require data storage
solutions not required more broadly as linked data projects within the framework will not be
maintained beyond the end of the particular project
Technicalities of Linkage This project will be used as a training tool for new indexing staff
Permissions and Approvals This project will require PAC approval and will also help
develop our understanding of consent etc around longitudinal data
Capacity This project has been used to inform assumption on technical capacity required
to deliver linkage projects
Timescales
Key Milestones




Initiation - May 13
General Scoping paper – October 13
Data linkage work complete - May 14
Storage solution in place – May 14
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
Annex A: DATA LINKAGE FRAMEWORK FUNDING BID TEMPLATE
Funding is available until April 2015 to support projects that will help deliver the aims of the
Scotland-wide Data Linkage Framework for statistical and research purposes by tackling
the challenges that prevent more effective and efficient data linkages from occurring in
Scotland. These challenges are:
o Uncertainty about the legalities and public acceptability of data sharing and linkage.
o Incomplete data, or data that cannot be linked.
o Limited capacity for secure exchange and access to data.
o Limited capacity of public sector organisations to analyse and make use of linked
data.
For further information on the framework and a fuller explanation of the challenges see
http://www.scotland.gov.uk/Topics/Statistics/datalinkageframework
Funding of up to £250,000 is available. There is no minimum bid.
Funding will only be provided to projects that offer good value for money in tackling at least
one recognised challenge to effective data linkage. Projects that will have a long-term
impact on the capacity for data linkage in Scotland will be favoured. Please note that this
may include projects that involve data linkage directly if they can be shown to provide proof
of concept or pave the way for further development, but projects do not have to involve
data linkage directly. For example, projects to consider legislative or ethical issues, develop
analytical capacity for use of already linked data, or improvements to data collection
systems to allow for future data linkage will all be considered.
Projects that can synergise effectively with other elements of the data linkage programme
plan will be favoured, as will projects that involve or may support development of crosssectoral linkages (e.g. health-justice-housing or education-environment, rather than
linkages within a sector).
Bids will be assessed on the extent to which they offer value for money in tackling
the challenges that prevent more effective and efficient data linkages from occurring
in Scotland.
There is no annual deadline for bids, but early bids or early indication of interest would be
very much appreciated.
To discuss potential bids at any stage please contact Gerry Donnelly 0131 314 4312,
gerald.donnelly@scotland.gsi.gov.uk . We will be happy to work with you in developing
your proposal and aligning it with wider framework developments if feasible and
appropriate.
As a rough guide, no answer should be more than 500 words. No minimum. Further detail
will be requested if necessary, and a full research specification and/or a meeting to discuss
details may be required subsequent to this initial bid.
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
1. Contact name, organisation, phone number, email address
2. Brief overview of rationale, aim and objectives of the project
 To enable a better understanding of the differences between population counts in the
different data sources (Community Health Index (CHI), NHS Central Register (NHSCR),
Census, Council data etc) for NHS GGC and Glasgow Council.
3. Which challenge(s) to better data linkage in Scotland will this project help to tackle
(check box(es))
Uncertainty about the legalities and public acceptability of data sharing and
linkage
Incomplete data, or data that cannot be linked
Limited capacity for secure exchange and access to data
Limited capacity of public sector organisations to analyse and make use of
linked data
Briefly explain how the project will tackle this challenge (if not already covered in question
2).
3. Which National Outcome(s) will the project contribute to?
4. Proposed methodology (if the project is to link data, include which datasets will be
used, information about if/when/how ethical and access permissions will be sought, and
pilot work)
5. Funding required and top level breakdown of costs (e.g. year, staff, consultancy,
infrastructure)
6. Outline of project timetable (including key milestones)
Data Linkage Framework Programme Board - 5 November 2013
PAPER 5
7. Outline key deliverables (outputs)
8. Briefly outline main risks, their likelihood and impact and how they will be
mitigated
9. Explain if, how and when outputs, insights and lessons learned from this project
will be shared with others
10. If this project fits with any related programmes, existing or planned funding
streams, please explain how
Download