Census Interaction Data: Characteristics and Access. John Stillwell, Centre for Interaction Data Estimation and Research (CIDER), School of Geography, University of Leeds.

Census Interaction Data: Characteristics and Access John Stillwell Centre for Interaction Data Estimation and Research (CIDER) School of Geography, University of Leeds Presentation at the ‘After the Census’ session of the ‘ESRC Research Methods Festival’ University of Oxford, 3 July, 2012

Census Programme
CIDER staff: John Stillwell, Oliver Duke-Williams, Adam Dennett, Kostas Daras

Service reorganization: UK Data Service; Census Support Service; other data services?

Presentation
1. What are the interaction data sets?
2. How are these data sets accessed?
3. How are the interaction data sets used in research?
4. What are the major characteristics of the 2011 Census interaction data?
- same questions/new questions
- SDC
- licensing arrangements
- geographies
- possible tables
5. Conclusions

1. What are the interaction data sets?
Data on migration that derive from the question in the Census: Where were you living 12 months ago?
- Special Migration Statistics (SMS) in 2001
Data on commuting that derive from the question in the Census: What is the address of your place of work? (and study in Scotland)
- Special Workplace Statistics (SWS) and Special Travel Statistics (STS) (Scotland) in 2001
These data sets are unique because they have two geographies: origin and destination

SMS/SWS are large and often sparsely populated matrices, particularly for small areas. Leeds has 2,439 Output Areas, so its interaction flow matrix contains 5,948,721 (2,439 × 2,439) cells that have the potential to contain flow counts.
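The size and sparsity point can be checked with a few lines of Python. This is only an illustrative sketch: the output-area codes and flow counts below are invented, and storing flows in a dictionary keyed by (origin, destination) is an assumed representation, not the way WICID holds the data.

```python
def potential_cells(n_zones):
    """Number of origin-destination cells for n_zones areas,
    counting flows within the same area as well."""
    return n_zones * n_zones


def density(flows, n_zones):
    """Share of potential cells that hold a non-zero flow count."""
    nonzero = sum(1 for count in flows.values() if count > 0)
    return nonzero / potential_cells(n_zones)


# Leeds: 2,439 Output Areas (figure taken from the slide).
leeds_cells = potential_cells(2439)
print(leeds_cells)

# Hypothetical flows for a three-area example; only non-zero cells are stored.
example_flows = {
    ("OA_A", "OA_B"): 3,
    ("OA_B", "OA_A"): 1,
    ("OA_C", "OA_C"): 5,
}
print(density(example_flows, 3))
```

Storing only the non-zero cells keeps memory use proportional to the number of observed flows rather than to the square of the number of areas.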

Large and multi-dimensional data sets 1991 SMS Set 1 (Wards) 1991 SMS Set 2 (Districts)

Geographies of 2001 SMS/SWS/STS
England. Level 1: London Boroughs (33), Metropolitan Districts (36), Unitary Authorities (46), Other Local Authorities (239); Level 2: CAS wards (7,969); Level 3: Output areas (165,665)
Wales. Level 1: Unitary Authorities (22); Level 2: CAS wards (881); Level 3: Output areas (9,769)
Scotland. Level 1: Council Areas (32); Level 2: ST wards (1,176); Level 3: Output areas (42,604)
Northern Ireland. Level 1: Parliamentary Constituencies (18); Level 2: CAS wards (582); Level 3: Output areas (5,022)
Total. Level 1: Districts (426); Level 2: Interaction wards (10,608); Level 3: Output areas (223,060)
Key point: Interaction data sets are for the UK

CIDER’s interaction data sets
(a) Census data sets
Origin-Destination Statistics:
- 1981 SMS Set 2 and SWS Set C (County/region level)
- 1991 SMS Sets 1 and 2, SWS Sets A-C and Table 100 (students)
- 2001 SMS Sets 1 and 2, SWS/STS Levels 1-3 (and postal sectors)
Commissioned Tables: set of tables from 2001 Census including, for example:
- C0649: Commuters by religion at district level
- C0711: Migrants by ethnic group and age at district level
- C0723: Migrants by age and ethnic group at region/ward level

CIDER’s interaction data sets
(b) Derived or estimated data for census periods
- SMSGAPS: Counts for 1991 SMS Set 2 Tables 3-10 derived by Rees and Duke-Williams that include estimates of suppressed values
- MIGPOP: Counts for 1991 SMS Set 2 Table 3 derived by Simpson and Middleton that adjust for under-enumeration
- 1981 SMS Set 2 (wards) and SWS Set C (wards): re-estimated for 1991 and 2001 geography by Boyle and Feng
- 1991 SMS Set 1 (wards) and SWS Set C (wards): re-estimated for 2001 geography by Boyle and Feng

CIDER’s interaction data sets
(c) Estimated time series data sets
- Patient register/NHSCR flows between local authority districts in England and Wales, (rounded) – estimated and supplied by ONS
- Inter-NUTS2 region migration estimates for UK, mid to mid – estimated and supplied by Rees and Dennett (DEMIFER project)
- Inter-NUTS2 region migration estimates for UK, calendar year 2000 to – estimated and supplied by Rees and Dennett (DEMIFER project)
- Inter-region migration by age, sex and ethnicity for Britain, and estimated and supplied by Raymer and Giulietti (ESRC project)
- Inter-county migration by age, sex and ethnicity, , estimated and supplied by Raymer and Giulietti (ESRC project)
- Inter-county migration by age, sex and economic activity, , estimated and supplied by Raymer and Giulietti (ESRC project)

2. How are these data sets accessed?
WICID is the online interface to the Census interaction data sets, accessible from the CIDER Home Page.
Users need to be registered users of census data.

WICID Query Interface

Data selection Tables available in 2001 SMS Level 1 Cells of Table 3 in 2001 SMS Level 1

Origin and destination geography selection Area selection tools available List selection of districts

Map Selection Tool

Map Selection Tool (detail)

Postcode based selection

Finalise Screen Screen Indicating Extraction Completed

Example of simple query and data extracted
The Query: Extract the data on total migrant flows between the countries of the UK from Table MG1010 in 2001 SMS
The Data: Origin by destination matrix of migration flows in

Analysis functions for use on extracted data

Help System: opening inside a new browser window

3. How are interaction data sets used in research?
Interaction data sets are used by various researchers. See some examples in Part 2 of the CIDER book: Stillwell, J., Duke-Williams, O.W. and Dennett, A. (eds.) (2010) Technologies for Migration and Commuting Analysis: Spatial Interaction Data Applications, 357 pp., IGI Global, Hershey.

Example: What processes of white migration are taking place in London at ward level?
Panels: Net migration flows within Greater London; Net migration flows between London and rest of England and Wales; Location quotients
Source: 2001 Census Commissioned Table
Stillwell, J. (2010) Ethnic population concentration and net migration in London, Environment and Planning A, 42:

Are the same processes of migration apparent for Black migrants in London?
Panels: Net migration flows within Greater London; Net migration flows between London and rest of England and Wales; Location quotients
Source: 2001 Census Commissioned Table

Are the same processes of migration apparent for Chinese migrants in London?
Panels: Net migration flows within Greater London; Net migration flows between London and rest of England and Wales; Location quotients
Source: 2001 Census Commissioned Table
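The two measures named on these slides, net migration and location quotients, can be sketched in Python as follows. The slides do not give formulas, so this assumes the usual definitions: net migration for an area is inflow minus outflow, and the location quotient is the group's share of an area's population divided by its share of the whole study population. The ward codes and counts are invented for illustration.

```python
def net_migration(flows, zones):
    """Inflow minus outflow per zone, from flows keyed by (origin, destination).
    Flows that start and end in the same zone do not affect the net figure."""
    net = {zone: 0 for zone in zones}
    for (origin, destination), count in flows.items():
        if origin == destination:
            continue
        net[origin] -= count
        net[destination] += count
    return net


def location_quotients(group_pop, total_pop):
    """Group share in each zone relative to the group share overall.
    Values above 1 indicate relative concentration (assumed standard definition)."""
    overall_share = sum(group_pop.values()) / sum(total_pop.values())
    return {
        zone: (group_pop[zone] / total_pop[zone]) / overall_share
        for zone in group_pop
    }


# Hypothetical ward-level flows and populations.
flows = {("W1", "W2"): 10, ("W2", "W1"): 4, ("W1", "W3"): 1, ("W3", "W3"): 7}
net = net_migration(flows, ["W1", "W2", "W3"])
lq = location_quotients({"W1": 50, "W2": 10}, {"W1": 100, "W2": 100})
print(net)
print(lq)
```

Within a closed system of zones the net figures sum to zero, which is a useful sanity check on an extracted matrix.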

4. What are the major characteristics of the 2011 Census interaction data?
4.1 What interaction questions were asked?
4.2 What about statistical disclosure control?
4.3 What geographies will be used?
4.4 What migration and commuting tables will be available?

4.1 Interaction questions
Main questions for migration and commuting in 2011 are the same as in 2001

Student ‘migration’ picked up by separate questions
Student term time/boarding school address in the UK: enter term time address below
This means that it will be possible to generate flows of:
(i) those who left HE/FE/boarding school and changed usual residence
(ii) those in HE/FE or at boarding school who changed term time address

‘Another address’ question enables further ‘interaction’ data to be generated? Questions 5 and 6 ask about another address Potential to produce matrices of interaction flows between usual address and other address – very useful for analyses of mobility (weekly commuting, shared custody of children, second homes, international mobility) hitherto uncaptured

Questions about international immigration Potential to produce tables of immigrants by country of birth and country of previous usual residence

4.2 Statistical disclosure control?
Small cell adjustment abandoned in 2011 in favour of record swapping:
- Households swapped
- Targeted to ‘risky’ records
- Construct risk score for every individual; combine to household score
- Imputation considered as part protection
- Households swapped only as far as their risk is considered ‘high’
- Individuals swapped between communal establishments
Work on SDC on Origin-Destination Tables still ongoing
Source: Spicer, K. (2011) Statistical Disclosure Control for 2011 UK Census, consultation---main-statistical-outputs---second-round/index.html

Data licensing arrangements
Initial idea (tier, name, data availability):
1. Public: Download without restriction
2. Safeguarded: Download with terms and conditions
3. Safeguarded (Approved researcher): Download only with approved researcher status
4. Approved researcher: Access only with approved researcher status in a secure setting
Current thinking (tier, name, data availability):
1. Public: Data available under open government license
2. Safeguarded: Data available with Special user license
3. Secure/VML: Data available to approved researcher only in a secure setting
Key question: Which data will be available at which tier of licensing?

4.3 What geographies will be used?
- Fundamental building blocks for origin-destination migration flows will be output areas (OAs), with data aggregated to wards and districts
- Problem of LG reorganisation since 2001, which means there is a user requirement that flows for wards should be generated so as to be able to reconstitute old LG districts for comparison
- Preference for LG districts in Northern Ireland (rather than Parliamentary Constituencies as in 2001)
- New geography for commuting destinations: Workplace Zones (WPZs)

Workplace Zones (WPZs)
- OAs are based on where people live, not where they work, and can be unsuitable for workplace statistics
- Some OAs contain no/few businesses; some contain many businesses or a large employer, e.g. business parks, City of London
- Workplace Zones project looking at splitting/merging OAs for a new geography constrained to MSOAs
- Pilot areas: Tower Hamlets, City of London, Southampton, Nottingham, Suffolk Coastal
- Disclosure control: Population threshold same as OAs (100 workers min; 625 max; no household threshold)
Source: Spicer, K. (2011) Statistical Disclosure Control for 2011 UK Census, main-statistical-outputs---second-round/index.html
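As a rough illustration of how the worker thresholds quoted above might drive the splitting/merging decision, the sketch below flags each output area against the 100 minimum and 625 maximum. It is a toy classification only: the actual Workplace Zones procedure (including the MSOA constraint) is not described on the slide, and the area codes and worker counts are hypothetical.

```python
MIN_WORKERS = 100  # lower threshold quoted on the slide
MAX_WORKERS = 625  # upper threshold quoted on the slide


def wpz_action(workers):
    """Suggest what a zone-design step would need to do with an output area,
    judged only on its count of workers."""
    if workers < MIN_WORKERS:
        return "merge"  # too few workers: combine with neighbouring OAs
    if workers > MAX_WORKERS:
        return "split"  # too many workers: subdivide into several zones
    return "keep"       # within thresholds: OA can serve as a workplace zone


# Hypothetical worker counts by output area.
workers_by_oa = {"OA_residential": 12, "OA_mixed": 340, "OA_business_park": 4800}
actions = {oa: wpz_action(count) for oa, count in workers_by_oa.items()}
print(actions)
```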

4.4 Migration and commuting tables
ONS still undecided about table specifications for interaction data sets
2011 Census Prospectus indicates Migration and Workplace Statistics will be released after October 2013
ONS currently reviewing the table specifications proposed by Oliver Duke-Williams (UCL):
- Special Migration Statistics (SMS)
- Special Workplace Statistics/Special Travel Statistics (SWS/STS)
- Special Student Statistics (SSS)
- Special Residence Statistics (SRS)
Important distinction between different types of counts and their relationship with spatial scale and tier of licensing

Three types of tables for each set of SMS/SWS/STS/SSS/SRS
Likely to be important distinction between:
(i) Flow (or headcount) tables, i.e. origin-destination flows of total persons only
(ii) Univariate tables, i.e. origin-destination flows disaggregated by a single variable, e.g. sex, or age or ethnic group
(iii) Multivariate tables, i.e. origin-destination flows disaggregated by more than one variable, e.g. age by sex or ethnic group by sex
Each of these flow data sets likely to be produced for flows at different spatial scales: OA-OA; ward-ward; UA/LA-UA/LA with different access/licensing conditions
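The relationship between the three table types can be shown with a short Python sketch: a multivariate table collapses to a univariate table, and then to a headcount table, by summing over the variables that are dropped. The ward codes, categories, and counts are invented, and the `collapse` helper is a hypothetical illustration, not part of any ONS or CIDER tool.

```python
from collections import Counter


def collapse(table, keep):
    """Sum counts over every attribute whose position is not listed in keep.
    Keys are (origin, destination, attribute_0, attribute_1, ...)."""
    out = Counter()
    for (origin, destination, *attrs), count in table.items():
        key = (origin, destination) + tuple(attrs[i] for i in keep)
        out[key] += count
    return dict(out)


# Hypothetical multivariate table: flows by age group (position 0) and sex (position 1).
multivariate = {
    ("W1", "W2", "16-24", "F"): 3,
    ("W1", "W2", "16-24", "M"): 2,
    ("W1", "W2", "25-44", "F"): 4,
    ("W2", "W1", "25-44", "M"): 1,
}

by_sex = collapse(multivariate, keep=[1])   # univariate: flows by sex
headcount = collapse(multivariate, keep=[])  # flow table: total persons only
print(by_sex)
print(headcount)
```

In released census outputs the three table types are published separately, so this arithmetic consistency should be treated as an idealisation rather than a guarantee.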

5. Conclusions
- Anticipate substantial demand for access to 2011 Census interaction data sets
- Collaboration underway with ONS about table design as well as joint dissemination strategy
- Interaction data service soon to be part of the Census Support Service (CSS)
- Key advantage of CSS is provision of user access to data from previous censuses
- Recognise the ‘new’ environment, with 2011 Census likely to be the last of its kind and results of ONS ‘Beyond 2011’ project due in September 2014
- Changing focus of data collection from Census to surveys and administrative sources

Contact details John Stillwell Oliver Duke-Williams CIDER Web site: