Weighting and Imputation for CORE Social Housing Statistics Julia Bowman & Niall Goulding.

Presentation transcript:

Weighting and Imputation for CORE Social Housing Statistics Julia Bowman & Niall Goulding

What CORE is
- COntinuous REcording of Social Housing Lettings
- Census: a hybrid of interview and administrative data
- Household-level data collected
- Private Registered Providers and Local Authorities
- Collected from all housing providers in England since 2004
- Many types of information are collected, not just the number of lettings…

Lettings log

2012/13 Headline stats
- Context: 378,700 lettings
- Household characteristics: 91% UK nationals, 22% in work, 3% under 18
- Most common reason given for why the household left their last settled home: overcrowding
- Average weekly rent: £79.58 / £
- Length of time vacant: 32 days
- Staying within local authority: 90%
378,700 lettings | Overcrowding | £79.58 per week | 32 days vacant | 90% remain in LA

Complementary data sets
- Local Authority Housing Statistics (LAHS)
- English Housing Survey (EHS)

Users

Interests around household characteristics
- And media interest…

QIF bid
- Two problems we sought to resolve…
- Placed bid to the UKSA’s Quality Improvement Fund (QIF)
- Work carried out by the ONS Methodology Advisory Board

Problem 1: LA missing records
- Lettings volume varies greatly by local authority
- Local Authority Housing Statistics (LAHS): nearly complete lettings data at LA level
- CORE: lettings data at household level

Problem 1: LA missing records
- Some LAs do not provide logs for every letting in CORE
- This introduces bias into demographic statistics
- Lettings are grossed to LAHS counts on urban/rural classification
- This does not account for the demographics of the population

Solution 1: Improved Weighting
- Geographic approach maintained
- ONS area classifications (OACs) are used to replace urban/rural classifications
- Areas grouped on many factors using a cluster methodology

Solution 1: Improved Weighting
- What is our best estimate for lettings per ONS cluster area?
- The highest of LAHS or CORE for each LA
- If neither, we use an imputed LAHS figure
- Sum these to get total lettings per ONS cluster area

Solution 1: Improved Weighting
- Highest of LAHS, CORE, imputed LAHS for each LA
- Sum lettings per ONS cluster area group
- Compare to reported CORE figure per area group
- Ratio of best estimate to CORE figure = weight
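The weighting steps on the two slides above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used for CORE: the function names, the dictionary keys (`cluster`, `lahs`, `core`, `imputed_lahs`) and the use of `None` for an unreported figure are all hypothetical.

```python
from collections import defaultdict


def best_estimate(lahs, core, imputed_lahs):
    """Best lettings estimate for one LA (hypothetical helper).

    Takes the higher of the reported LAHS and CORE counts; the imputed
    LAHS figure is used only when neither count is reported (None).
    """
    reported = [v for v in (lahs, core) if v is not None]
    if reported:
        return max(reported)
    return imputed_lahs


def cluster_weights(las):
    """Return {cluster: weight} from per-LA records.

    Each record is a dict with the assumed keys 'cluster', 'lahs',
    'core' and 'imputed_lahs'. The weight is the summed best estimate
    divided by the summed reported CORE count for the cluster.
    """
    best_total = defaultdict(float)
    core_total = defaultdict(float)
    for la in las:
        cluster = la["cluster"]
        best_total[cluster] += best_estimate(
            la["lahs"], la["core"], la["imputed_lahs"]
        )
        core_total[cluster] += la["core"] or 0
    # Clusters with no CORE lettings at all cannot be weighted this way.
    return {c: best_total[c] / core_total[c] for c in best_total if core_total[c] > 0}
```

With this sketch, a cluster whose LAs report 180 lettings to CORE but have a best estimate of 200 would receive a weight of 200/180, so each CORE record in that cluster counts for slightly more than one letting.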

Problem 2: Record-level missing data
- Both LAs and PRPs submit logs with missing household characteristics
- Age, sex, ethnicity, nationality and economic status
- This can happen because:
  - the tenant refuses to provide the information
  - some LAs do not interview
  - admin data constraints
  - IT constraints

Solution 2: Imputation
- So how do we account for this?
- Donor imputation: Neighbour Imputation Method
- Canadian Census Edit and Imputation System: CanCEIS (Canadian Census 2001, UK Census 2011)
- Efficient, free license, variety of record editing rules

Solution 2: Imputation
- Raw data comes to DCLG (SPSS)
- Data reformatted for CanCEIS (ASCII)
- CanCEIS finds incomplete and donor records
- CanCEIS matches records on:
  - household characteristics that are available (age, sex, ethnicity, nationality, economic status)
  - area classification, provider type (LA/PRP), previous tenure, size of property, asylum seeker, refugee status (and client type)
- Record randomly picked from pool of donors
- Imputed output data set

Age  Sex  Nationality  Area  Asylum
45   M    UK           6     N
35   M    EEA          2     N
27   F    MISSING      4     N
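The donor idea on this slide can be illustrated with a minimal nearest-neighbour sketch in Python. It is not CanCEIS and does not use its interface or its distance measure: the function names, the simple mismatch-count distance, the choice of matching variables and the use of `None` for a missing value are all assumptions made for illustration.

```python
import random

# Hypothetical matching variables, loosely following the slide.
MATCH_VARS = ("sex", "area", "asylum")


def distance(recipient, donor, match_vars=MATCH_VARS):
    """Number of matching variables on which the two records disagree.

    Variables that are missing (None) on the recipient are skipped.
    """
    return sum(
        1
        for var in match_vars
        if recipient.get(var) is not None and recipient[var] != donor[var]
    )


def impute(records, target, match_vars=MATCH_VARS, rng=None):
    """Fill missing `target` values from the closest complete records.

    Donors are records with the target and all matching variables
    present. Among the donors at minimum distance, one is picked at
    random. The input list is not modified; new dicts are returned.
    """
    rng = rng or random.Random(0)
    donors = [
        r
        for r in records
        if r.get(target) is not None
        and all(r.get(var) is not None for var in match_vars)
    ]
    result = []
    for record in records:
        filled = dict(record)
        if filled.get(target) is None and donors:
            nearest = min(distance(filled, d, match_vars) for d in donors)
            pool = [d for d in donors if distance(filled, d, match_vars) == nearest]
            filled[target] = rng.choice(pool)[target]
        result.append(filled)
    return result
```

Applied to the three records in the table, the 27-year-old female record with missing nationality would take its nationality from whichever complete record agrees with it on the most matching variables, with ties broken at random.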

The complete process
- Raw data comes to DCLG
- Weighting
- Imputation
- Complete records
- Weights assigned
- Final data set

Results
What happens when we weight and impute?

Original reported data
                PRP       LA        Total %
UK              113,071   69,       %
A10             4,258     2,547     3.4%
Other EEA       1,                  %
Other           3,537     3,710     3.6%
Missing         4,324     17,131    9.7%
Total lettings  220,056

Weighted and imputed data
                PRP       LA        Total %
UK              116,944   96,       %
A10             4,427     3,569     3.4%
Other EEA       1,347     1,369     1.2%
Other           3,758     5,510     4.0%
Total lettings  233,334

Imputed data
                PRP       LA        Total %
UK              116,944   84,       %
A10             4,427     3,118     3.4%
Other EEA       1,347     1,204     1.2%
Other           3,758     4,819     3.9%
Total lettings  220,056

Testing
- But what further tests can we do?
- Remove logs from a complete data set and then test the weighting against the complete version
- Delete data and then impute it to check the error rate
- Find other unaccounted biases that need weighting
- Any other thoughts?
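The delete-then-impute check could look roughly like the Python sketch below. It is a hedged illustration only: `masked_error_rate` and `modal_impute` are hypothetical names, the 20% masking fraction is arbitrary, and the modal imputer is a deliberately simple stand-in for whatever imputation method is actually being tested.

```python
import random
from collections import Counter


def masked_error_rate(complete, target, impute_fn, frac=0.2, seed=0):
    """Blank out a random share of known values, impute, and score.

    `complete` is a list of dicts with no missing `target` values.
    `impute_fn(records, target)` must return a list of records in the
    same order with the gaps filled. Returns the share of masked
    values that were imputed incorrectly.
    """
    rng = random.Random(seed)
    n_mask = max(1, round(frac * len(complete)))
    masked = set(rng.sample(range(len(complete)), n_mask))
    damaged = [
        {**rec, target: None} if i in masked else dict(rec)
        for i, rec in enumerate(complete)
    ]
    imputed = impute_fn(damaged, target)
    wrong = sum(1 for i in masked if imputed[i][target] != complete[i][target])
    return wrong / n_mask


def modal_impute(records, target):
    """Stand-in imputer: fill every gap with the most common observed value."""
    observed = Counter(r[target] for r in records if r[target] is not None)
    fill = observed.most_common(1)[0][0]
    return [{**r, target: fill} if r[target] is None else dict(r) for r in records]
```

Running the same harness with different imputers on the same masked records gives a like-for-like comparison of their error rates.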

Future work
- CORE is now National Statistics: improvements pending
- Use areas from 2011 Census data
- Affordable rent weighting and imputation
- Improve data quality and volume from LAs: 2013/14 is the first year all LAs will participate
- Ongoing disclosure control investigations
- Make CORE data more easily available via Open Data Communities

Thank you. Questions and comments please!