Developing Geographical Information Systems In A Cohort Study Andy Boyd ALSPAC, Social Medicine University of Bristol.

Slides:



Advertisements
Similar presentations
Centre for Market and Public Organisation An application of geographical data: inequalities in school access Paul Gregg, and Neil Davies, University of.
Advertisements

The English Longitudinal Study of Ageing (ELSA) Data & Documentation 2008 Jibby Medina NatCen.
Matching PLASC and ALSPAC PLASC/NPD User Group Workshop 13 th September 2006 Andy Boyd David Herrick
Will 2011 be the last Census of its kind in England and Wales? Roma Chappell, Programme Director Beyond 2011 Office for National Statistics, July 2011.
What are Geographical Information Systems (GIS) & ArcView GIS software? What is a Geographical Information System (GIS)? Introduction to ESRI ArcView 3.x.
Derek OHalloran 15 th March 2006 Improving Service, Delivering on Efficiency.
Using the Self Service BMC Helpdesk
Use of Whole Population Registers:
Easy to use Ability to attach policies/procedures to call types Ability to schedule calls in advance Officer safety alerts Robust search capabilities.
SI0131 – Dissertation Week 5 Luke Sloan Using & Sourcing Secondary Data Week 5 Luke Sloan Using & Sourcing Secondary Data.
OS Places New Service Products from May 2014 Address Capture & Verification Address Matching GeoSearch Ordnance Survey 2014.
1 Cohort management and the Secondary Uses Service (SUS) Nirupa Dattani Office for National Statistics.
RGS-IBG Online CPD course in GIS Analysing Data in ArcGIS Session 6.
Social Care Census & Mental Health Benchmarking - CHI Seeding 27th February 2014 – Social care event Atlantic Quay Euan Patterson.
Conducting Income Survey’s Indiana Office of Community and Rural Affairs “Serving Indiana’s rural communities through technical, financial and personal.
Using Administrative Data to Improve Social Statistics – An Example of Collaborative Work Minda Phillips, Office for National Statistics. Paul Sinclair,
GeoConvert: Creating that Spatial Relationship David Rawnsley Mimas, University of Manchester.
March 2013 ESSnet DWH - Workshop IV DATA LINKING ASPECTS OF COMBINING DATA INCLUDING OPTIONS FOR VARIOUS HIERARCHIES (S-DWH CONTEXT)
1 VCM VIRTUAL CASE MANAGEMENT SOFTWARE VIRTUAL CASE MANAGER. COM 6 South 2 ND Street, Suite 715 Hamilton, OH PH: FX:
SciVal Experts & SciVal Funding Information Sessions.
Request Material Information Use Case Item as created in Optiva. Supplier information request(s) can happen at any time. The same process works for Optiva.
Business Intelligence Accurate Information, Accurate Decisions June 2012 Presented by: Scott Lea Government Services Infogroup Government Division.
Developing and improving data resources for social science research Enhancing, enriching and developing household sample surveys in the UK: the strategic.
The use of GIS in the Central Bureau of Statistics (CBS), Namibia By Mrs Ottilie M Mwazi Chief Statistician, Survey, Cartography/GIS
© Digital Worlds Embedding Geographical Information Systems into the Curriculum.
Shirley Crompton Source: Rob Allan. Institutional Repository Subject Repository Data Producer Repository share resources solve bigger problems integrate.
National Pupil Database: The Future Catherine Blackham Data Services Group DfES.
Database Design Concepts Info 1408 Lecture 2 An Introduction to Data Storage.
Address register: HM Land Registry’s experience Jon Atkey Head of International Unit, HM Land Registry England and Wales.
Geography and Public Health: Using Technology to Strengthen Programs ANDREW INGLIS: USAID | DELIVER PROJECTOCTOBER 8, 2010 BLAKE ZACHARY: MEASURE DHS.
Census Census of Population, Housing,Buildings,Establishments and Agriculture Huda Ebrahim Al Shrooqi Central Informatics Organization.
1 1 Establishing a register-based statistical system Example: Population and housing censuses in Norway Statistical Training Course Use of Administrative.
5 Marzo 2007 EMERGING METHODOLOGIES OF CONTINUOUS USE OF REGISTERS AND GEOCODED DATABASES IN THE ITALIAN POPULATION AND HOUSING CENSUS Fabio Crescenzi,
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
Access to the LSYPE and associated resources at the Economic and Social Data Service Jack Kneeshaw LSYPE workshop 1 October 2009 ESDS Longitudinal.
Record matching for census purposes in the Netherlands Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands.
Geo-Refer Geo-Refer: GEOgraphical REFerencing Resources for Social Scientists Samantha Cockings & Samuel Leung School of Geography, University of Southampton.
Role of Statistics in Geography
Census Mapping A Case of Zambia UN Workshop on Census Cartography and Management, Lusaka, 8-12 th October 2007.
1 Data Linkage for Educational Research Royal Statistical Society March 19th 2007 Andrew Jenkins and Rosalind Levačić Institute of Education, University.
ELSA ELSA datasets and documentation available from the archive or by special arrangement Kate Cox National Centre for Social.
Sampling Presentation on workshop in Luxembourg 10.April 2008 Johan Heldal.
General Register Office for S C O T L A N D information about Scotland's people Comparison between NHSCR and Community health index sources of migration.
Quality Assurance Programme of the Canadian Census of Population Expert Group Meeting on Population and Housing Censuses Geneva July 7-9, 2010.
GEOG3025 Geographical referencing and the modifiable areal unit problem.
May 12-15, Evaluating the Integrated Census Israel Pnina ZADKA Central Bureau of Statistics Israel.
1 For a Population Statistical Register Characteristics and Potentials for the Official Statistics Central department for administrative data and archives.
Creating Open Data whilst maintaining confidentiality Philip Lowthian, Caroline Tudor Office for National Statistics 1.
National Programme for Information Technology The Secondary Uses Service Jeremy Thorp Director of Business Requirements Technology Office.
Defining areas using postcodes John Langley 27 th April 2007 at The Riverside Centre Derby.
Costa Rica´s business registry: Directory of institutional units and establishments Contacts: Odilia Bravo:
UN ECE Seminar on New Frontiers for Statistical Data Collection 31 Oct – 2 Nov 2012 Beyond 2011 The future of population statistics Andy Teague, Office.
Using administrative data to produce official social statistics New Zealand’s experience.
3 Digital Terrain Model (DTM) products: Issues Enhanced DTM & 10m DTM are created as part of the orthophotography creation process New 50m DTM to be created.
6/13/2016 U.S. Environmental Protection Agency 1 Starting a Facilities Flow Lee David
Regional DLI Training: Introduction to PCCF St. John’s Newfoundland Berenica Vejvoda May 5-6, 2016.
Ingest – Acquisition and deposit Irena Vipavc Brvar ADP SEEDS Workshop I Belgrade, October.
( ) 1 Chapter # 8 How Data is stored DATABASE.
A ssociation of Public Health Observatories Hospital Activity data Roy Maxwell SWPHO & Bristol University Dr Richard Wilson Sandwell PCT.
Jim Haywood (Product Manager) School Census Autumn 2016 Application Version 2.3.
Establishing a register-based statistical system Example: Population and housing censuses in Norway Training workshop on censuses using administrative.
Claire McKinley, PMP, CCRP
Validation and Quality Assessment of Data
Generic Statistical Business Process Model (GSBPM)
Geographic & Resources Analysis in Primary Health Care
Capacity building on the use of Geospatial Data and Technologies
Introduction to geospatial data management and technologies for PHDs
Andrew Jenkins and Rosalind Levačić
Changes in the Canadian Census of Population Program
WHERE TO FIND IT – Accessing the Inventory
Presentation transcript:

Developing Geographical Information Systems In A Cohort Study Andy Boyd ALSPAC, Social Medicine University of Bristol

2 Geographical Data Matching - the ALSPAC resource - Overview of our data, the issues involve and our plan for the future Time for questions Time for discussion on how other studies have developed their GIS data resource

Defining GIS GIS combine mapping and a record of location with database technology. This can be used in the storage, analysis, management or presentation of data. 3 E.W.Gilbert‘s1955 version of John Snow’s 1855 Soho Cholera Outbreak Map

Scope of this presentation Not about GIS tools Not about GIS analysis or techniques It is about the capture and storage of data in an accessible manner to allow future GIS analysis Uses ALSPAC as an example 4

5 The ALSPAC GIS dataset Geographic identifiers collected directly from the cohort Data collected via external data sources Geographical data linkage Precision of geographic variables – accuracy Precision of geographic variables – ethics Providing the data as an integral part of the resource Current data availability

6 ALSPAC administered data collection Residential Address (~50000 address points) updated from cohort (self reported) team who tracks lost cases second contacts database searches (osis, electoral roll) School the young person attends / wishes to attend via questionnaire (ALSPAC questionnaires/assessments administered in schools, primary to secondary transition questionnaire) clinic attendance interview collected from the school

7 Linkage to external data sources Validation / Cleaning Validation and cleaning of self reported data using data collected via record linkage (NSTS – NHS Tracing, NPD – National Pupil DB, Royal Mail/OS products) Missing Data Enhancing the resource through record linkage Data collection via geographical identifiers Accessing existing data organised around geographical IDs (census data,neighbourhood data) Primary data collection (distance to overhead power lines, air quality, commuting, school selection)

8 Data Collection through Record Linkage Office National Statistics (ONS) Tracing Health Authority Embarkation NSTS (NHS Strategic Tracing Service) Address registered with GP National Pupil Database (DCSF, DIUS*, UCAS*) School Address Pupil Residential Address DWP* Home Office* * Linkage currently being investigated

9 G.I.S – ALSPAC Resource ~50,000 ALSPAC residential address points, associated with a date range which can then be linked to ALSPAC data collection Schools attendance data from NPD ~17000 Schools attendance data from ALSPAC collection ~ The geographic relation between household income and polluting factories – FoE 1999

10 G.I.S Precision Spatial data held at many geographic levels Geographies range in scale from 0.1 meters to regional/national data Tied together via address, postcode or grid reference as central ID Key resources include: –NSPD ( was All Fields Postcode Directory) - geo linking database –Deprivation & Socio Economic indices (IMD, Townsend, Acorn) –Census data

11 G.I.S – How we link cases to data Master file of Postcodes (NSPD) Postcodes linked to grid reference Grid references of various scales PCs/GridRef mapped to: –Electoral geographies –Census geographies Ethics: –We don’t generally identify residence at PC or equivalent level Ordinance Survey – The National Grid

12 G.I.S – How we link geographies Current Situation Use Postcode / postcode centroid grid reference as our highest precision variable Link geographies using NSPD/AFPD appropriate to the measure required Proposed Method Use property reference number (UPRN) / property centroid grid reference as highest precision variable

13 G.I.S Problems Shifting geographies across time points Royal Mail change postcode areas (and therefore postcode centroids) Postcodes are ‘recycled’ Postcode not precise enough in some cases Postcode boundaries are not contiguous with other geographic boundaries

14 Accuracy issues with analysis at postcode level Address levelPostcode level

15 Accuracy issues with analysis at postcode level Address levelPostcode level

16 Accuracy issues with analysis at postcode level Address levelPostcode level

17 Linkage problems with the cohort data Missing data –Especially problematic for the cases who didn’t enrol in the original recruitment –Gaps in the address data –Move date often date we were informed not the actual move date However… –ONS matched 99.7% mothers, so we have their old & new NHS numbers and cleaned data (original recruitment cases only)

18 GIS Data Availability Collected as administrative resource Not yet cleaned, documented and presented to usual ALSPAC standards Initiatives under way to validate and fill gaps in record Schools GIS data in the main not processed Aim to build into standard ALSPAC resource

19 GIS Ethics Postcode level or greater accuracy treated as a personal identifier Research proposals to use these data need ALSPAC Law & Ethics Approval Broader geographical data can be released in normal manner A two-stage process is used to collect and process precise data Data collected via linkage not available for all cases due to ethical decisions

20 GIS Data Access Step 1 – Postcodes (or full address) provided to researcher with unique collection ID with no other data attached Step 2 – Researcher attaches their data and returns file to ALSPAC Step 3 – ID converted to the appropriate collaborator ID, postcode data removed Step 4 – Requested ALSPAC data added to the file and data sent to the researcher

Andy Boyd