Presentation is loading. Please wait.

Presentation is loading. Please wait.

Back-End Structures and Front End Visualizations DAMA Minnesota Matthew Israelson 19 November, 2014.

Similar presentations


Presentation on theme: "Back-End Structures and Front End Visualizations DAMA Minnesota Matthew Israelson 19 November, 2014."— Presentation transcript:

1 Back-End Structures and Front End Visualizations DAMA Minnesota Matthew Israelson 19 November, 2014

2 About Us IHME is an independent global health research center at the University of Washington Vision: Provide high-quality information on population health, its determinants, and the performance of health systems. Mission: Improve the health of the world’s populations by providing the best information on population health. Method: Produce rigorous and comparable measurements. For general information Phone: +1-206-897-2800 Fax: +1-206-897-2899 ihme@healthdata.org

3 A Short History Started in 2007 and continuing to grow into 2014 July 2007: Founding of IHME with support from Bill & Melinda Gates Foundation and the state of Washington July 2009: Published the Financing Global Health (FGH) report June 2010: Graduated the first Masters of Public Health March 2011: Launched the Global Health Data Exchange (GHDx) at the Global Health Metrics and Evaluation conference December 2012: The Lancet published The Global Burden of Diseases, Injuries, and Risk Factors Study 2010 (GBD 2010) December 2014: First annual update with GBD 2013

4 Agenda Back End Structures Front End Visualizations Data Collection Infrastructure Modeling and Analysis Audience Outreach Visualizations

5 Back-End Structures Data Collection Infrastructure Process Collection Cataloging People Networking Technology Modeling & Analysis GBD Data model Deliverables

6 Locate data Acquire data Catalog dataExtract data Identify gaps Overview the data cycle Search for new health sources from: Government and NGO websites Databases Expert advice Literature Negotiate with providers for access Formal requests Collaboration DUA / MOU / IRB Payment Add to the GHDx Assign NID Create citation Add keyword Attach files Provide data to our researchers Notify teams Extract data Import to research databases Provide sourcing Analyze results to identify data gaps Years Causes Countries Etc.

7 What we collect Health Surveys Census Records Surveillance Systems Disease Registries Vital Registration Hospital Records Financial Records Literature Estimates

8 How we collect it Added 15,000 new sources of data since January Not everything is on the Internet 900+ “high-touch” requests Applications Data Use Agreements IRB Approval Restricted Data A project management tool is essential Adopted JIRA in 2013

9 Sourcing data Global Health Data Exchange (GHDx) Centralized citation database for IHME Ensures same citation for the same data Allows us to source all data points http://ghdx.healthdata.org/ GHDx Nids All metadata Citations Federated Citations Accessed date Publication status Nids Research databases Nids Not publicly accessiblePublicly accessible

10 Organizing Data LIVE DEMO http://ghdx.healthdata.org/

11 The people Board of Directors & Scientific Oversight Group 210 Employees Professors: 20+ Researchers: 90+ IT: 20+ Staff: 80+ 16 Affiliate Professors GBD Expert Collaborators

12 The GBD network GBD enrolled 1,095 collaborators from 107 countries

13 Networking as an enabler The collaborative network enhances the GBD Assess the validity of country results Identify missing datasets or incorrect interpretations of data Interpret findings and facilitate country policy translation Assist with acquiring new sources of data Publish papers using GBD results The size of the network demands new ways to manage contacts CRM is an immediate priority

14 The technical infrastructure Capacity of 250 Terabytes Access limited to IHME Limited Use Access Restricted to named researchers Controlled or sensitive datasets Cluster for running Stata and R jobs (Sun Grid Engine) Largest capacity at the University of Washington Capacity to increase 10x for projections

15 IT requirements for GBD 12+ major database 8 Servers Cluster (STATA; R) Primary databases for GBD CodShared CovariatesGHDx EpiRisk Idie2GBD results mortalityCodmod GBDvizGBDx2.0 – every day, all day

16 The Global Burden of Disease (GBD) A systematic, scientific effort to quantify the comparative magnitude of health loss due to diseases, injuries & risk factors. GBD 2010 published by The Lancet GBD 2013 to be published in 2014 Annual updates to follow GBD 2010GBD 2013 Diseases and injuries291322 Sequelae1,1602,435 Risk factors6768 Countries187188 Years1990-20101990-2013

17 Measuring burden of diseases and injuries

18 Data Inputs for GBD Population-basedEncounter-levelOther Vital registration Censuses Surveys Verbal autopsy Disease registries Surveillance systems Hospital records Ambulatory records Primary care records Claims data Literature reviews Sensor data Mortuaries/burial sites Police records

19 Defining analysis Task of the analysts Research Prep data Write code Review estimates Interpret results Publish

20 Mortality 2 Causes of death 3 Nonfatal health outcomes 4 Risk factors 5 Co- variates 1 YLLs/ YLDs/ DALYs 6 Main components of the data model

21 Processes within the data model

22 Deliverables All-cause mortality rates Deaths by cause (1980-2013) Years of life lost (YLLs) Years lived with disability (YLDs) Disability adjusted life years (DALYs) 188 Countries 322 Disease and Injuries 68 Risk Factors Men and Women 20 Age Groups 1990-2013 At least 1,000 draw calculations per estimate based on known data points and uncertainty 1.03 billion estimates

23 COOPER LIVE DEMO http://ghdx.healthdata.org/cooper

24 Vignette – Using different sources for COD TypeSite years Coun- tries Vital registration2,798130 Verbal autopsy48666 Cancer registries2,71593 Police reports1,129122 Surveys/ census1,56482 Maternal mortality surveillance 838 Deaths in health facilities 219 Burial and mortuary 3211 600M deaths back to 1980

25 Vignette – Garbage codes in VR data

26 Vignette – Garbage code redistribution

27

28 Front End Visualizations Purpose & Audience Traditional Outreach Audience Underlying principles Publications Media Other approaches Interactive Visualizations Key Uses Development Demonstrations

29 Audience

30 Communicating data for impact Audiences and characteristics Casual user Data actor Data analyst Researcher Granularity of data Type of tool or visual http://bit.ly/1mogRom

31 Designing for the right audience Casual UserData ActorAnalyst Infographics Illustrative diagrams Narrative visualizations Press releases Reports Briefs Search tools Limited interactive visualizations Query tools Exploratory visualizations API Researcher Query tools Exploratory visualizations Data catalog – repository Methods

32 IHME outreach Research Articles Policy Reports Brochures Country Profiles Infographics Newsletters Presentations Videos Visualizations

33 Policy reports, articles & profiles Note …

34 Infographics Note …

35 News Articles

36 Blogging and newsletters

37 Twitter

38 @IHME_UW

39 reddit

40 Video

41 Open Source Tools Note …

42 Key uses for visualizations 1.Review input data & launch models 2.Review results 3.Obtain feedback from collaborators/ experts 4.Communicate results 5.Use as presentation / teaching aid 6.Convince data owners to share data Researchers Different Audiences

43 The development process 1.Contact product owner 2.Identification of relevant audience(s) 3.Business and technical requirements 4.Creation of appropriate design 5.Development (using Agile/Scrum) 6.Testing & initial user feedback 7.Launch under embargo (journalists) 8.Public launch 9.Feedback collection

44 Visualizations LIVE DEMO GBD Compare http://vizhub.healthdata.org/gbd- compare/ GBD Cause Patterns https://www.healthdata.org/data- visualization/gbd-cause-patterns

45 Visualizations LIVE DEMO US Health Map http://vizhub.healthdata.org/us-health-map/ Tobacco Burden Visualization http://vizhub.healthdata.org/tobacco/ Millennium Development Goals http://vizhub.healthdata.org/mdg//

46 Summary Gather and organize the data Utilize that information Inform and empower your audience Contact me: Matthew Israelson misra@uw.edu


Download ppt "Back-End Structures and Front End Visualizations DAMA Minnesota Matthew Israelson 19 November, 2014."

Similar presentations


Ads by Google