Methodbox: From open-data to open-insight MethodBox Team Jul 2011.


1 Methodbox: From open-data to open-insight MethodBox Team Jul 2011

2 Presentation. Problem: data tsunami + puddles of insight. Solution: collective efficient science. Deployment: sense-making networks on open-data

3 Quote: “…you call it Epidemiology and we call it quantitative Social Science” (a leading researcher, Jul 2011). Open data. Common methods. Potentially complementary expertise

4 Obesity Example: Fragmented understanding of public health problems such as obesity... data, methods/models and expertise split across disciplines (e.g. social vs. biomedical) and settings (e.g. academia vs. healthcare)

5 Puddles of research around the organising principle … but policies need the big picture

6 Data Example: Time series data from Health Visitors in Wirral. Data deposited with UKDA but not used for 16 years. Children measured at the time the obesity epidemic took hold…

7 Fifths of IDAC 2004, Wirral (0.3M), UK. Material deprivation affecting children (households with children: % on benefits in 2001-3). Colour key: Red (light) = most deprived, Red (dark), Purple, Blue (dark), Blue (light) = most affluent

8 BMI of 3 yr olds, 1988 - 1989. Fifths of BMI SDS (colour key: Red (light) = fattest, Red (dark), Purple, Blue (dark), Blue (light) = thinnest)

9 BMI of 3 yr olds, 1990 - 1991. Fifths of BMI SDS (colour key: Red (light) = fattest, Red (dark), Purple, Blue (dark), Blue (light) = thinnest)

10 BMI of 3 yr olds, 1992 - 1993. Fifths of BMI SDS (colour key: Red (light) = fattest, Red (dark), Purple, Blue (dark), Blue (light) = thinnest)

11 BMI of 3 yr olds, 1994 - 1995. Fifths of BMI SDS (colour key: Red (light) = fattest, Red (dark), Purple, Blue (dark), Blue (light) = thinnest)

12 BMI of 3 yr olds, 1996 - 1997. Fifths of BMI SDS (colour key: Red (light) = fattest, Red (dark), Purple, Blue (dark), Blue (light) = thinnest)

13 BMI of 3 yr olds, 1998 - 1999. Fifths of BMI SDS (colour key: Red (light) = fattest, Red (dark), Purple, Blue (dark), Blue (light) = thinnest)

14 BMI of 3 yr olds, 2000 - 2001. Fifths of BMI SDS (colour key: Red (light) = fattest, Red (dark), Purple, Blue (dark), Blue (light) = thinnest)

15 BMI of 3 yr olds, 2002 - 2003. Fifths of BMI SDS (colour key: Red (light) = fattest, Red (dark), Purple, Blue (dark), Blue (light) = thinnest)
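
The slides above group BMI SDS values into fifths. As a rough illustration of how such fifths could be derived, here is a minimal Python sketch. It assumes the fifths are simple quintiles of the values being mapped; the function name `assign_fifths` and the sample numbers are hypothetical and are not taken from the presentation or from MethodBox.

```python
from statistics import quantiles


def assign_fifths(values):
    """Return a fifth (1 = lowest, 5 = highest) for each value, in input order."""
    # Four cut points split the distribution into five groups.
    cuts = quantiles(values, n=5, method="inclusive")
    # A value's fifth is one more than the number of cut points strictly below it.
    return [1 + sum(v > c for c in cuts) for v in values]


if __name__ == "__main__":
    # Made-up BMI SDS values, for illustration only.
    sample_sds = [-0.4, -0.3, -0.2, -0.1, 0.0, 0.1, 0.2, 0.3, 0.4, 0.5]
    for sds, fifth in zip(sample_sds, assign_fifths(sample_sds)):
        print(f"BMI SDS {sds:+.1f} -> fifth {fifth}")
```

With this convention, fifth 5 would correspond to the "fattest" end of the colour key and fifth 1 to the "thinnest". Ties at a cut point fall into the lower fifth.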

16 Child Obesity: Action 6 years after signal in the data. Chart: Body Mass Index (BMI) trend in Wirral 3y-olds from 1988 to 2003. Horizontal axis: month of measurement by Health Visitor (Mar-88 to Aug-04). Vertical axis: three-monthly rolling average BMI SDS (-0.4 to 0.5). SDS = standard deviation score from 1990 British Growth Reference charts, which adjusts for age and sex of the child. Annotations: Clues, Actions
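
The trend on this slide rests on two calculations: converting each BMI measurement to a standard deviation score, and smoothing the scores with a three-monthly rolling average. The Python sketch below shows one way these could be done. It assumes the growth reference is expressed as LMS parameters (L, M, S) for the child's age and sex, which is a common form for growth references, and that the rolling average pools all measurements from a month and the two months before it. The function names are hypothetical, no real reference values are included, and this is not presented as the authors' actual procedure.

```python
import math
from collections import defaultdict


def lms_sds(value, L, M, S):
    """Standard deviation score of `value` using the LMS method.

    L, M and S must be looked up from a growth reference for the
    child's age and sex; none are supplied here.
    """
    if L == 0:
        return math.log(value / M) / S
    return ((value / M) ** L - 1) / (L * S)


def rolling_three_month_mean(records):
    """records: iterable of (year, month, sds) tuples.

    Returns {(year, month): mean SDS over that month and the two months
    before it}, for every month that has at least one measurement.
    """
    by_month = defaultdict(list)
    for year, month, sds in records:
        by_month[year * 12 + (month - 1)].append(sds)

    result = {}
    for idx in sorted(by_month):
        # Pool measurements from the current month and the previous two.
        window = [s for k in (idx - 2, idx - 1, idx) for s in by_month.get(k, [])]
        result[(idx // 12, idx % 12 + 1)] = sum(window) / len(window)
    return result
```

Pooling individual measurements, rather than averaging monthly means, gives more weight to months with more children measured; either choice is defensible and the slide does not say which was used.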

17 Similar Data in 2011: National Child Measurement Programme. Anonymised national database. Could be opened (like national pupil database) → extend to other policy-relevant, timely research

18 Data Already in UK Data Archive Example: Health Surveys for England (annual) Analyses feed national policies Does evidence need to be localised?...

19 Women and not men from low-income households are fatter in England. Chart: BMI (axis 25 to 27.5) by income fifth (1 to 5, low to high), for men and women. Data from Health Survey for England

20 Women from low-income households and men from high-income households are fatter in Greater Manchester. Chart: BMI (axis 25 to 27.5) by income fifth (1 to 5, low to high), for men and women. Data from Health Survey for England
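
The two charts above compare BMI across income fifths for men and women. A minimal Python sketch of that kind of summary follows. The column names (`sex`, `income_fifth`, `bmi`) and the rows are made up for illustration and are not real Health Survey for England variable names or data. Survey weights are ignored here, which a real analysis would need to handle.

```python
from collections import defaultdict
from statistics import mean


def mean_bmi_by_group(rows):
    """rows: iterable of dicts with 'sex', 'income_fifth' and 'bmi' keys.

    Returns {(sex, income_fifth): unweighted mean BMI}.
    """
    groups = defaultdict(list)
    for row in rows:
        groups[(row["sex"], row["income_fifth"])].append(row["bmi"])
    return {key: mean(values) for key, values in sorted(groups.items())}


if __name__ == "__main__":
    # Invented rows, for illustration only.
    rows = [
        {"sex": "F", "income_fifth": 1, "bmi": 27.0},
        {"sex": "F", "income_fifth": 1, "bmi": 28.0},
        {"sex": "F", "income_fifth": 5, "bmi": 25.5},
        {"sex": "M", "income_fifth": 1, "bmi": 26.5},
        {"sex": "M", "income_fifth": 5, "bmi": 26.0},
    ]
    for (sex, fifth), value in mean_bmi_by_group(rows).items():
        print(f"{sex} income fifth {fifth}: mean BMI {value:.2f}")
```

Restricting `rows` to one region before calling the function would give a localised version of the same summary, which is the comparison the two slides draw.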

21 Linked-data ≠ Linked: data, methods & investigators. Previous slides show social-biomedical signals about obesity from under-used datasets. Biomedical Research: data, methods & investigators. Social Research: data, methods & investigators

22 MethodBox Aim: to increase the sharing and reuse of data sources & extracts and data processing methods in one in-silico environment (‘e-Lab’) shared by social and health researchers

23 e-Lab: Socially-stimulating science, in-silico. Research Object: Find, Share, Reuse. Data-sources; Data-preparation scripts; Research protocol; Statistical analysis scripts; Slides; Working datasets; Figures/Graphics; Manuscripts; References; Analysis-logs & notes

24 National Dataset Example: Health Surveys for England. Large-scale (participants × variables); annual since early 90s; under-used by the NHS, which funds it; key barrier: extracting a research-ready subset of data; data archive → playground = e-Lab

25 Supporting and Developing Interdisciplinary Understanding. Sharing resources: tools, methods, data. Sharing expertise: discussions and reuse around shared resources. Promoting interdisciplinary working. Developing interdisciplinary understanding: language, tacit assumptions, methods. First step: sharing of resources. Shared resources provide the basis for discussion. Discussions lead to deeper interdisciplinary understanding. Understanding of other domains promotes more effective interdisciplinary working.

26 Facilitating a social network of data archive users… …toward a reward environment for sharing data, methods, and expertise

27 Browsing for data extracts made by a social network of data archive users…

28

29 Shopping for variables from across different years of survey collections…
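
To make the "shopping" idea concrete, here is a small Python sketch of assembling a data extract from variables picked across several survey years. It is only an illustration under stated assumptions: each survey year is held as a list of row dictionaries, the "cart" maps a year to the variable names chosen from it, and the names `build_extract`, `surveys`, `cart` and `survey_year` are all hypothetical. It does not describe MethodBox's actual implementation or interface.

```python
def build_extract(surveys, cart):
    """Assemble a research-ready subset from several survey years.

    surveys: {year: list of row dicts} holding the full survey data.
    cart:    {year: list of variable names} chosen by the user.
    Returns a flat list of rows containing only the chosen variables,
    each tagged with the survey year it came from.
    """
    extract = []
    for year in sorted(cart):
        for row in surveys[year]:
            record = {"survey_year": year}
            for name in cart[year]:
                # A variable missing from a row is recorded as None.
                record[name] = row.get(name)
            extract.append(record)
    return extract


if __name__ == "__main__":
    # Invented data, for illustration only.
    surveys = {
        2001: [{"bmi": 25.0, "age": 40, "other": 1}],
        2002: [{"bmi": 26.0, "age": 41}],
    }
    cart = {2001: ["bmi", "age"], 2002: ["bmi"]}
    for record in build_extract(surveys, cart):
        print(record)
```

Tagging each row with its year keeps the extract traceable back to the survey documentation for that year.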

30 Instant access to relevant parts of survey documentation …

31 Making the data extract visible… Linking a data extract with a script for deriving variables… Sharing and visibility

32 Enabling user-visibility for data extraction or derivation contributions…

33 Current MethodBox Video link

34 Training Course Apr '10. Trained a mixture of NHS, academic and industry users of HSE in the use of MethodBox. Course run in conjunction with CCSR. Feedback forms completed by 15 of 16 attendees, who were asked to rate MethodBox from 1 (negative) to 7 (positive) on the following scales. "I thought MethodBox was" (mean scores): Terrible - Wonderful = 5.57; Difficult to understand - Easy = 5.57; Frustrating to use - Satisfying = 5.79; Dull - Stimulating = 5.29; Rigid - Flexible = 5.71; Difficult to navigate - Easy to navigate = 6

35 Attitudes to Sharing (Data / Scripts). Academic social scientists: Yes / No. Academic epidemiologists/medical researchers: No / Yes. NHS & Local Govt. analysts: Yes

36 MethodBox Evolution: Amazon-like user-prompting for other variables that may be relevant to the set being extracted. More surveys/datasets incorporated. User-contributed & community-curated datasets …. Feature request list exceeds resources
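
One simple way to approach the "Amazon-like" prompting mentioned above is co-occurrence counting: suggest variables that other users extracted alongside the ones currently selected. The Python sketch below illustrates that idea only. It is an assumption about how such a feature could work, not a description of what MethodBox does or plans, and the function name and sample variable names are hypothetical.

```python
from collections import Counter


def suggest_variables(past_extracts, current, top_n=3):
    """Suggest variables that often appeared alongside the current selection.

    past_extracts: iterable of collections of variable names, one per
                   previously saved extract.
    current:       variable names in the extract being built.
    """
    current = set(current)
    counts = Counter()
    for extract in past_extracts:
        chosen = set(extract)
        overlap = len(chosen & current)
        if overlap == 0:
            continue  # this extract shares nothing with the current one
        for name in chosen - current:
            # Weight each candidate by how much the extract overlaps.
            counts[name] += overlap
    # Highest score first; ties broken alphabetically for stable output.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [name for name, _ in ranked[:top_n]]


if __name__ == "__main__":
    # Invented extracts, for illustration only.
    past = [
        ["bmi", "income", "sex"],
        ["bmi", "income", "age"],
        ["smoking", "age"],
    ]
    print(suggest_variables(past, ["bmi"]))
```

A scheme like this depends on extracts being visible and shared, which ties it to the sharing features shown on the earlier slides.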

37 Building on Successful E-Science. Most widely used scientific workflow sharing systems: myGrid, Taverna, myExperiment. Over a decade of programme funding sustained → world-leading E-Infrastructure R&D ready to leverage more outputs from open-linked data

38 Toward Open Insight: Researcher A is expert in deprivation. Researcher B is expert in obesity. Both use a common data archive but don’t usually meet. MethodBox shares the expertise of A and B to create a more complete model of deprivation in obesity

39 Conclusion: Open-data alone is not enough. Social e-infrastructure for science is needed. Sharing insights and methods is key, and can be achieved through systems like MethodBox + ESDS

