Download presentation
Presentation is loading. Please wait.
Published byClyde Rogers Modified over 9 years ago
1
Methodbox: From open-data to open-insight MethodBox Team Jul 2011
2
Presentation Problem Data tsunami + puddles of insight Solution Collective efficient science Deployment Sense-making networks on open-data
3
Quote “…you call it Epidemiology and we call it quantitative Social Science” A leading researcher, Jul 2011 Open data Common methods Potentially complementary expertise
4
Obesity Example Fragmented understanding of public health problems such as obesity...data, methods/models and expertise split across disciplines (e.g. social vs. biomedical) and settings (e.g. academia vs. healthcare)
5
Puddles of research around the organising principle … but policies need the big picture
6
Data Example Time series data from Health Visitors from Wirral Data deposit with UKDA but no uses for 16 years Children measured at the time the obesity epidemic took hold…
7
Fifths of IDAC 2004 Red (light) = most deprived Red (dark) Purple Blue (dark) Blue (light) = most affluent Material deprivation affecting children (households with children: % on benefits in 2001-3) Wirral (0.3M), UK
8
BMI of 3 yr olds 1988 - 1989 Fifths of BMI SDS BMI fifth Red (light) = fattest Red (dark) Purple Blue (dark) Blue (light) = thinnest
9
BMI of 3 yr olds 1990 - 1991 Fifths of BMI SDS BMI fifth Red (light) = fattest Red (dark) Purple Blue (dark) Blue (light) = thinnest
10
BMI of 3 yr olds 1992 - 1993 Fifths of BMI SDS BMI fifth Red (light) = fattest Red (dark) Purple Blue (dark) Blue (light) = thinnest
11
BMI of 3 yr olds 1994 - 1995 Fifths of BMI SDS BMI fifth Red (light) = fattest Red (dark) Purple Blue (dark) Blue (light) = thinnest
12
BMI of 3 yr olds 1996 - 1997 Fifths of BMI SDS BMI fifth Red (light) = fattest Red (dark) Purple Blue (dark) Blue (light) = thinnest
13
BMI of 3 yr olds 1998 - 1999 Fifths of BMI SDS BMI fifth Red (light) = fattest Red (dark) Purple Blue (dark) Blue (light) = thinnest
14
BMI of 3 yr olds 2000 – 2001 Fifths of BMI SDS BMI fifth Red (light) = fattest Red (dark) Purple Blue (dark) Blue (light) = thinnest
15
BMI of 3 yr olds 2002 - 2003 Fifths of BMI SDS BMI fifth Red (light) = fattest Red (dark) Purple Blue (dark) Blue (light) = thinnest
16
Child Obesity: Action 6 years after signal in the data Body Mass Index (BMI) trend in Wirral 3y-olds from 1988 to 2003 -0.4 -0.3 -0.2 -0.1 0 0.1 0.2 0.3 0.4 0.5 Mar-88Jul-89Nov-90Apr-92Aug-93Jan-95May-96Sep-97Feb-99Jun-00Nov-01Mar-03Aug-04 Month of measurement by Health Visitor Three-monthly rolling average BMI SDS SDS = standard deviation score from 1990 British Growth Reference charts – adjusts for age and sex of the child Clues Actions
17
Similar Data in 2011 National Child Measurement Programme Anonymised national database Could be opened (like national pupil database) extend to other policy-relevant, timely research
18
Data Already in UK Data Archive Example: Health Surveys for England (annual) Analyses feed national policies Does evidence need to be localised?...
19
1 2 3 4 5 Men Women 25 25.5 26 26.5 27 27.5 BMI Income fifth (low to high) Women and not men from low-income households are fatter in England Data from Health Survey for England
20
1 2 3 4 5 Men Women 25 25.5 26 26.5 27 27.5 BMI Income fifth (low to high) Women from low-income households and men from high-income households are fatter in Greater Manchester Data from Health Survey for England
21
Linked-data ≠ Linked: data, methods & investigators Previous slides show social-biomedical signals about obesity from under-used datasets Biomedical Research: Data, methods & investigators Social Research: Data, methods & investigators
22
MethodBox Aim..to increase the sharing and reuse of data sources & extracts and data processing methods in one in-silico environment (‘e-Lab’) shared by social and health researchers
23
e-Lab Socially-stimulating science, in-silico Research Object Find Share Reuse Data-sources Data-preparation scripts Research protocolStatistical analysis scripts Slides Working datasets Figures/Graphics Manuscripts References Analysis-logs & notes
24
National Dataset Example Health Surveys for England – Large-scale (participants * variables) – Annual since early 90s – Under-used by NHS who fund it – Key barrier: extracting a research-ready subset of data – Data archive playground = e-Lab
25
Supporting and Developing Interdisciplinary Understanding Sharing resources – tools, methods, data Sharing expertise – discussions and reuse around shared resources Promoting interdisciplinary working Developing interdisciplinary understanding – language, tacit assumptions, methods First step - sharing of resources Shared resources provide the basis for discussion Discussions lead to deeper interdisciplinary understanding Understanding of other domains promotes more effective interdisciplinary working
26
Facilitating a social network of data archive users… …toward a reward environment for sharing data, methods, and expertise
27
Browsing for data extracts made by a social network of data archive users…
29
Shopping for variables from across different years of survey collections…
30
Instant access to relevant parts of survey documentation …
31
Making the data extract visible… Linking a data extract with a script for deriving variables… Sharing and visibility
32
Enabling user-visibility for data extraction or derivation contributions…
33
Current MethodBox Video link
34
Training Course Apr `10 Trained a mixture of NHS, academic and industry users of HSE in the use of Methodbox Course run in conjunction with CCSR Feedback forms completed by 15 of 16 attendees, asked to rate Methodbox from 1 (negative) to 7 (positive) on the following statements: – I thought MethodBox was: Terrible - Wonderful: Mean = 5.57 Difficult to understand - Easy = 5.57 Frustrating to use - Satisfying = 5.79 Dull - Stimulating = 5.29 Rigid - Flexible = 5.71 Difficult to navigate - easy to navigate = 6
35
Attitudes to Sharing DataScripts Academic social scientists YesNo Academic epidemiologists/ medical researchers NoYes NHS & Local Govt. analysts Yes
36
MethodBox Evolution Amazon-like user-prompting for other variables that may be relevant to the set being extracted More surveys/datasets incorporated User-contributed & community-curated datasets …. Feature request list exceeds resources
37
Building on Successful E-Science Most widely used scientific workflow sharing systems: myGrid, Taverna, myExperiment Over a decade of programme funding sustained world leading E-Infrastructure R&D ready to leverage more outputs from open-linked data
38
Toward Open Insight Researcher A is expert in deprivation Researcher B is expert in obesity Both use a common data archive but don’t usually meet MethodBox shares the expertise of A and B to create a more complete model of deprivation in obesity
39
Conclusion Open-data alone is not enough Social e-infrastructure for science is needed Sharing insights and methods is key, and can be achieved through systems like MethodBox + ESDS
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.