BLOG - Scotland’s portfolios: Research and Statistical Data - building a new approach to thematic data linkage
In this blog, former Scottish Government Data Discovery Team Leader in ADR Scotland, Nicola Kerr, discusses how a novel approach is being taken in Scotland to create an enduring resource of research datasets.
Why are we doing this?
We all know that linking data together is a powerful tool when it comes to research. Combining this with policy aspirations can be a potent mix when it comes to bringing widespread public benefit for the people of Scotland. We also know that it can take time to link data together. Most administrative data was not originally collected with research in mind, and a range of legal and ethical restrictions can apply to it. Data controllers are responsible for ensuring they have appropriate safeguards in place to share sensitive data safely and securely for appropriate uses. At the heart of ADR Scotland’s work in developing portfolios is laying the groundwork to create themed core datasets that researchers can use to address identified policy evidence needs.
What are we doing?
The Data Discovery team aims to build sets of defined linkable data around particular policy themes, referred to as portfolio kits. This work has the added benefit of facilitating a data access environment that is both transparent and more streamlined for data controllers and researchers. This novel approach will remove the burden around data awareness, linked dataset building and access for all users in the data landscape. It does this by addressing the legal, ethical and technical limitations that may apply to the data within a kit.
We have built a framework around the technical requirements for creating a linked dataset. In addition, there is an information governance framework that includes a streamlined access process and the safeguards needed for personal-level data. These frameworks will be the foundation that all the portfolio kits rest upon. They were produced in collaboration with experts in data governance, access and coding from Scottish Government, Public Health Scotland and National Records of Scotland.
The data contained within each portfolio kit may come from multiple data controllers but a researcher won’t need to worry about that side of things! Information packs will be produced that will clearly set out:
- the variables contained within a kit
- what research themes a kit can be used for
- terms and conditions for accessing a kit
- details of data security and data protection safeguarding through template documents and access approvals
What comes next?
This is a long-term, multi-year project. Now that the framework is built, the team is looking ahead to creating kits on a pilot basis in two main areas. The learnings from the pilot will further refine our approach. The first two topics being explored are problematic substance use and homelessness. Through constructive engagement with various policy areas with an interest in these topics, a defined portfolio theme has started to emerge. The theme for both planned portfolios is early intervention and prevention.
The team is now starting to explore the data that could be used to address this theme. Plans for the future include workshops with policy colleagues, researchers and analysts from various data controllers. This will ensure the data within each portfolio kit meets the requirements of the theme while still being able to be shared and linked.
I’ve now moved into a different area within Scottish Government. However, I’m excited to see the team take this work forward to create core datasets for researcher use that address important policy needs.
Further information
My colleagues Nora Mielke and Scott McFarlane will be leading this work. For more details, they can be contacted at ADR.DataTeam@gov.scot.
To view our recent PowerPoint presentation on Scotland's Portfolios: Research and Statistical Data, please click here.
This article was published on 16 Feb 2023