Opening up access to birth cohort study data: A UK Medical Research Council pilot project. Jack Kneeshaw, Senior Data and Support Services Officer, UK Data Archive.

Presentation transcript:

Opening up access to birth cohort study data: A UK Medical Research Council pilot project
Jack Kneeshaw, Senior Data and Support Services Officer, UK Data Archive
May

The context: From principle to action
MRC policy on data sharing recognises the value of making scientific data more widely available across the research community. Other researchers, research funders, and national and international government bodies agree that this is a necessary next step.
After a period of raising awareness in the research community about the value of timely and responsible sharing of data, the MRC now needs to move from principle to action.

The subject: The NSHD, aka the 1946 British Birth Cohort Study
Established in 1946, the National Survey of Health and Development (NSHD) is one of the longest-running large-scale longitudinal studies in existence
Since 1962, the study has been funded continuously by the MRC
Data include a wide spectrum of risk exposures and of clinically validated measures of mental and physical health, and biological and cognitive function
Survey has data on periods of the life-course that cannot be reliably accessed in retrospect or in GP records
12,000+ variables across the various component datasets

The state of play (1): Access/dissemination
The study team receives slightly more than 30 data access requests p.a., a figure increasing year-on-year
The process from request through to supply can be drawn-out and episodic: specifying and retrieving the data and documentation to be sent is time-consuming and involves a high level of manual intervention
The ship is now creaking

The state of play (2): Finding/using the data
The data collection is not well publicised: besides the NSHD web site itself, there are few finding aids that may guide potential users to the data
Potential users of the data have no means of searching for the survey instruments, topics, questions and variables that they might be interested in
Aside from the variable names, the data files supplied by the study team do not include any metadata

What might be?
Is restricted access placing a ceiling on the level of scientific output that results from the use of the NSHD data?
[Table; figures not preserved in this transcript. Columns: Cohort; Annual usage (all users); Annual usage, academic (exc. students); Publications since start of study. Rows: 1946 cohort (NSHD); NCDS cohort; BCS70 cohort.]

How do we get from here to there?
Four criteria identified in order to make the data resource widely usable in the scientific community:
(1) data have to be indexed to an international standard;
(2) searching the content of data and metadata has to be possible via the web for both in-house and remote users;
(3) once identified through a search, data have to be easily accessible, along with all the information needed for informed research use;
(4) technical and procedural (governance) arrangements need to respect data subject confidentiality and take account of statutory and other regulatory requirements.

The recurring theme: Wider access vs. risk of disclosure
Special and increased risk of disclosure for longitudinal studies (more data points, vast range of information collected) rightly concerns the study team and sponsor
Important to start from the position that disclosure risk can never be eliminated, only managed
Balance between attracting wider use of the data and retaining an appropriate level of disclosure risk becomes key

It should always be borne in mind that, once data have been collected, the risk of disclosure can never be eliminated entirely; and, indeed, elimination of risk cannot be the aim if there is a policy to share the data. Instead, the aim must be to limit or control the risk. That is to say, in the context of sharing data so as to increase the scientific output, the aim of a disclosure risk strategy ought to be to define a level of risk that is acceptable: a policy aimed at reducing the risk of disclosure to a point as close as possible to elimination is not likely to be optimal.

Solutions?: From managing to sharing data
2-year pilot project initiated; project board convened; membership includes the presenter
Specific aims of the project:
(1) prepare a subset of NSHD data, along with data descriptions and documentation (metadata), in a digital format suitable for entry into the Nesstar software;

(2) implement management tools for data security and integrity, such as logging and access controls, where appropriate;
(3) define, document and implement governance arrangements for access to NSHD data through Nesstar;
(4) evaluate the benefits of implementation from the perspectives of the NSHD research team and other data users;
(5) determine the financial costs, time and effort required, and other implications for extending the approach to the whole NSHD data library;
(6) document lessons learned to inform similar activities undertaken in the future.

Key outcomes expected
Generate a baseline measure of the current activities (NSHD research team time, cost, etc.) required for provision of data and documentation
Help define access arrangements for the whole study, including digitised genetic/phenotypic data
Widen (to include geographers, social scientists and non-UK researchers) and deepen (current users find the data more accessible; Nesstar facility to share derived variables) the user base, leading to greater scientific output

Nesstar as the data sharing tool: What? Why? How?
What's to do?
An estimated 2,000 variables (of the total of 12,000+) will be described in terms of their origins, distribution and response in the cohort, derivation methods from other variables if appropriate, and code book sources
Each variable will be assigned keyword(s)
Datasets published on the web via Nesstar

Why Nesstar?
Nesstar is primarily a tool for data discovery, with a strong focus on metadata that allows users to browse study information down to the level of the variable
Software allows users, via a standard web browser, to view frequencies, conduct simple tabulations, produce graphs, and sub-set and weight data
User-defined variables function of specific interest to the study team
Nesstar is not the only package of its type, but the study team's view is that, for searching, browsing, locating and exploratory analysis purposes, Nesstar is almost certainly the package best suited to the project's needs

How to do it?
NSHD in-house solutions (e.g. scrambling ID for issued files)
Nesstar modifications, especially to the download function, to protect against inappropriate use
Publication of new derived variables via user-defined variables
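
One way to picture the "scrambling ID for issued files" idea is the minimal Python sketch below. It is an illustration under stated assumptions, not the study team's actual procedure: the function names `scramble_id` and `scramble_file`, the `id` field, the release labels and the use of a keyed hash (HMAC-SHA256) are all hypothetical choices made for the example. The sketch gives each issued file its own set of pseudonymous IDs, so that two files released to different users cannot be linked on the ID column alone.

```python
import hashlib
import hmac


def scramble_id(original_id: str, secret_key: bytes, release_label: str,
                length: int = 12) -> str:
    """Return a pseudonymous ID for one issued file (hypothetical helper).

    The release label is mixed into the keyed hash, so the same cohort
    member receives a different ID in every issued file.
    """
    message = f"{release_label}:{original_id}".encode("utf-8")
    digest = hmac.new(secret_key, message, hashlib.sha256).hexdigest()
    return digest[:length]


def scramble_file(rows, secret_key: bytes, release_label: str,
                  id_field: str = "id"):
    """Return copies of the rows with the ID field replaced by its scrambled value."""
    return [
        {**row, id_field: scramble_id(str(row[id_field]), secret_key, release_label)}
        for row in rows
    ]


if __name__ == "__main__":
    # Assumption: the key is held only by the data owner and never issued.
    key = b"example-key-held-by-the-data-owner"
    rows = [{"id": "0001", "height_cm": 172}, {"id": "0002", "height_cm": 165}]
    print(scramble_file(rows, key, "request-A"))
    print(scramble_file(rows, key, "request-B"))
```

Because the hash is keyed, the data owner can regenerate the mapping for any release when needed, while a recipient without the key cannot reverse it. Truncating the digest keeps IDs short but makes collisions possible in principle, so a real implementation would want to check the scrambled IDs for uniqueness before a file is issued.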

Where next?
Project proper begins in July, though data/metadata preparation work is already under way
Aim to make 2,000 variables available to a selected user test group by the end of year 1
Testing in year 2: evaluation inevitably limited in scope, but user feedback very important
Findings/recommendations published at the end of year 2; a successful report may see more MRC data rolled out via a modified Nesstar product

Further details: Jack Kneeshaw –