Identifying the cash-rich and the cash-poor: Lessons from the Census Rehearsal Dr Paul Williamson Department of Geography ESRC Census Development Programme.

Slides:



Advertisements
Similar presentations
Professor David Gordon and Dr Eldin Fahmy Townsend Centre for International Poverty Research University of Bristol Updating The Fuel Poverty Indicator.
Advertisements

Is there a Scottish effect for self-reported health? Frank Popham, University of Edinburgh 12th November 2007 Talk based on paper published in BMC Public.
Households Below Average Income 23 rd April 2007 Presentation at FRS User Group Meeting By Nick Herbert - DWP.
Longitudinal LFS Catherine Barham and Paul Smith ONS.
LFS User Group meeting 21 October 2003 Measuring ethnicity in the LFS Vivienne Avery Labour Market Division, ONS.
The Census Area Statistics Myles Gould Understanding area-level inequality & change.
GDP by Income Approach and Accounts of Household Sector For Qatar Experience Prepared by : Aisha Al-Mansoori Statistical Researcher Population & Social.
Dungannon & South Tyrone Borough Council Mr Vinny Beggs Mr Vinny Beggs Enterprise, Investment & Grants Manager October 2009.
Centre for Housing Research, University of St Andrews Occupational mobility and neighbourhood effects: a longitudinal study ESRC Seminar Series – 4 & 5.
University of Oxford Centre for the Analysis of South African Social Policy What can Social Science Contribute to Neighbourhood renewal? Indices of Multiple.
Housing Market Demand and Needs Study 2007 Planning sub-group - Scrutiny Committee 11th December 2007 Shahla Zandi / Matt Doherty.
Measuring socio-economic background and its influence on school education outcomes South Australian Institute for Education Research Spring Seminar Series.
The micro-geography of UK demographic change Paul Norman School of Geography, University of Leeds Understanding Population Trends and Processes.
Social Security Policy Branch – Random Sample Surveys.
Two Worlds of Ageing: Spatial Microsimulation Estimates of Small Area Advantage and Disadvantage Among Older Australians JUSTINE MCNAMARA, CATHY GONG,
Sample of Anonymised Records: User Meeting Propensity to migrate by ethnic group: 1991 & 2001 Paul Norman 1, John Stillwell 2 & Serena Hussain 2 School.
Northern Ireland Neighbourhood Information Service - NINIS Fiona Johnston Neighbourhood Statistics NISRA.
Adding Census Geographical Detail into the British Crime Survey for Modelling Crime Charatdao Kongmuang Naresuan University, Thailand Graham Clarke and.
Identifying new migrant populations in UK cities David Owen and Audrey Lenoël.
Identifying the cash-rich and the cash-poor: Lessons from the Census Rehearsal Dr Paul Williamson Department of Geography ESRC Census Development Programme.
Geography and Geographical Analysis using the ONS Longitudinal Study Christopher Marshall & Julian Buxton CeLSIUS.
The Social Profile of Rural Britain: Insights from longitudinal datasets Heather Joshi Gareth Hughes & Brian Dodgeon Centre for Longitudinal Studies Institute.
Creating synthetic sub-regional baseline populations Dr Paul Williamson Dept. of Geography University of Liverpool Collaborators: Robert Tanton (NATSEM,
Palestinian Central Bureau of Statistics (PCBS) Palestine Poverty Maps 2009 March
The micro-geography of UK demographic change Paul Norman School of Geography, University of Leeds understanding population trends and processes.
General Register Office for S C O T L A N D information about Scotland's people Producing small area housing and household statistics from Council Tax.
Socio-Economic & Demographic Data Tools for Proactive Planning Robin Blakely-Armitage STATE OF NEW YORK CITIES: Creative Responses to Fiscal Stress March.
GIS in Prevention, County Profiles, Series 3 (2006) A. Census Definitions The following is an excellent source of definitions and explanations of geography-related.
Understanding address accuracy: an investigation of the social geography of mismatch between census and health service records Ian Shuttleworth, David.
1 Economy and Poverty Bratislava, May 2003 Jean-Etienne Chapron Statistical Division UNECE.
What’s new in the Child Poverty Unit – Research and Measurement Team Research and Measurement Team Child Poverty Unit.
The new HBS Chisinau, 26 October Outline 1.How the HBS changed 2.Assessment of data quality 3.Data comparability 4.Conclusions.
The Risk Factors of Severe Child Poverty in the UK Monica Magadi Department of Sociology School of Social Sciences City University, London.
CHILD AND FAMILY POVERTY IN NORTHERN IRELAND Marina Monteith Child Poverty Researcher Save the Children, Northern Ireland Programme Co-author: late Prof.
What would full employment look like in contemporary Britain? Amna Silim 27 th November 2013.
The micro-geography of UK demographic change Paul Norman School of Geography, University of Leeds understanding population trends and processes.
MEASURING INCOME AND POVERTY AT A NATIONAL LEVEL Sian Rasdale Social Justice Analysis, Scottish Government.
Self-funders and relative needs Project Advisory Panel 17 April 2013.
Poverty Ms. C. Rughoobur Africa Statistics Day 18 November 2013.
Family background and young adults’ housing outcomes, Rory Coulter Housing Studies Association conference, University of York,
Record matching for census purposes in the Netherlands Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands.
Parents’ basic skills and children’s test scores Augustin De Coulon, Elena Meschi and Anna Vignoles.
Denbighshire’s Anti-poverty Statement and action plan 2009 to 2012 National Association of Welfare Rights Advisers Workshop 5/03/2010 Poverty Estimates.
Poverty measurement: experience of the Republic of Moldova UNECE, Measuring poverty, 4 May 2015.
Scot Exec Course Nov/Dec 04 Survey design overview Gillian Raab Professor of Applied Statistics Napier University.
Additional analysis of poverty in Scotland 2013/14 Communities Analytical Services July 2015.
New and easier ways of working with aggregate data and geographies from UK censuses Justin Hayes UK Data Service Census Support.
Centre for Housing Research, University of St Andrews The Effect of Neighbourhood Housing Tenure Mix on Labour Market Outcomes: A Longitudinal Perspective.
Simon Power Managing Consultant John Rae Director Understanding Communities Through PayCheck
General Register Office for S C O T L A N D information about Scotland's people Household Estimates and Projections Esther Roughsedge General Register.
Poverty and Income Inequality in Edinburgh September 2015.
United Nations Workshop on Revision 3 of Principles and Recommendations for Population and Housing Censuses and Evaluation of Census Data, Amman 19 – 23.
The micro-geography of UK demographic change Paul Norman Cathie Marsh Centre for Census & Survey Research (CCSR), University of Manchester ESRC.
United Nations Workshop on Evaluation and Analysis of Census Data, 1-12 December 2014, Nay Pyi Taw, Myanmar DATA VALIDATION-I Evaluation of editing and.
Analysis of the characteristics of internet respondents to the 2011 Census to inform 2021 Census questionnaire design Orlaith Fraser & Cal Ghee.
Using microsimulation model to get things right: a wage equation for Poland Leszek Morawski, University of Warsaw Michał Myck, DIW - Berlin Anna Nicińska,
© Statistisches Bundesamt, VI A Statistisches Bundesamt The new method of the next german Population census Johann Szenzenstein, Federal Statistical Office,
Exploring Microsimulation Methodologies for the Estimation of Household Attributes Dimitris Ballas, Graham Clarke, and Ian Turton School of Geography University.
CMSSE Summer School Dots to boxes: Do the size and shape of spatial units jeopardize economic geography estimations? A.Briant, P.-P. Combes, M. Lafourcade.
The vulnerability of indebted households during the crisis: evidence from the euro area The vulnerability of indebted households during the crisis: evidence.
Western Ontario CFDCs Rural-Urban Divisions 2011 (Source: Statistics Canada Census of Population, 2011, Census Subdivision Data) 1.
Samples of Anonymised Records from the U.K. Census 1991 and 2001 Integrating Census Microdata Workshop Barcelona th July 2005 Dr. Ed Fieldhouse Cathie.
Social disparities in private renting amongst young families in England and Wales, Rory Coulter Housing, Wealth and Welfare.
Parental background and young adults’ housing outcomes in England and Wales, Rory Coulter SLLS Annual Conference, Dublin, 19/10/2015.
Taking Part 2008 Multivariate analysis December 2008
Parental background and young adults’ housing outcomes
Evangelos Charalambakis
Joseph B Nichols 2008 NASM of the Econometric Society June 21, 2008
Some challenges for small area estimation
Lifecycle Deficit (Consumption & Labor Income)
Presentation transcript:

Identifying the cash-rich and the cash-poor: Lessons from the Census Rehearsal Dr Paul Williamson Department of Geography ESRC Census Development Programme

Most requested addition to 2001 Census INCOME…

The 2001 Census Geography of income:

Other sources of data on income Benefits data Government surveys (e.g. GHS, LFS, FES, FRS, NES) Commercially-held data [Postcode sector and postcode unit estimates] The Census Rehearsal (1999)

Objectives Evaluation of: Extant methods for small-area income estimation New approaches Utility of non-census information (e.g. council tax; house price; benefits data) [ Methods of imputing income band means ]

Definition of ‘income’ Income  Wealth Gross or net income? Pre or post housing costs? Adult or Household? Household? –Total –Equivalised [Per capita / OECD / McClements]

Surrogates Univariate –% unemployed –% 2+ car households –% residents in Social Classes I + II –% owner-occupation

Multivariate (deprivation indices) –Carstairs –Townsend –Breadline –DLTR Index of Multiple Deprivation 2000 –Green (Wealth) [owning 2+ cars; NS-SEC I or II; High qualifications]

Geodemographic –SuperProfiles –MOSAIC –GB Profiles

Model Individual income –Dale (SOC2000; Economic activity; age; sex; Region] –Lee (SOC2000; Economic activity] –Regression (individual and/or ecological) Household income –Regression (household and/or ecological) –Bramley & Smart (H/h comp.; earners; tenure; area level deprivation)

The 1999 Census Rehearsal Key features full census questionnaire + INCOME Large achieved sample Spatially contiguous –c. 65,000 households –c. 140,000 individuals

Clustered sampling strategy: –7 part districts [Excluding NI] –38 wards –650 EDs

non-response rate –overall (~ 50%) –income (~15%) –other variables (5-20%) –full responses for ~ 55 % of achieved sample [individuals and households] non-response bias Potential problems

Banding of income question What is your total current gross income from all sources? Per week or Per year (approximately) Nil_ Nil Less than £60_ Less than £3,000 £60 to £119_ £3,000 to £5,999 £120 to £199_ £6,000 to £9,999 £200 to £299_ £10,000 to £14,999 £300 to £479_ £15,000 to £24,999 £480 or more_ £25,000 or more

–Only 10% of adults in top band –but problem compounded when individual incomes aggregated to estimate household income –band mid-point  band mean –value of band means area sensitive?

Source: FRS 1998/9 (Crown Copyright)

Digression: modelling income band means Alternative modelling strategies include: National mean Sub-group mean (e.g. by council tax band) Statistical distributions (log-normal; pareto) New variant of log-normal approach with addition of modelled median etc.

Results For all bands sub-group mean best – if possible For closed-bands, national mean is next best For open (top) band, new proposed log- normal approach is best, particularly where there is evidence of strong spatial clustering

–At what scale does income vary most? MAUP –1991 vs 1998/9 boundaries –zones with <10 households or 25 residents excluded from analysis SOC 2000 / NS-SEC –Lack of alternative SOC2000 coded data –Therefore have to use Census Rehearsal data –Use partitioned data to avoid unduly advantaging SOC2000 based approaches Spatial scale

Results

Census Rehearsal Income Distribution

At ward level the % household reps. in top income-band averaged 9.1% – but ranged from 2.8% to 21.6% 89% of EDs contained one or more household reps. in top income-band –i.e. in top income-decile of the population Heterogeneity rules OK!

Missing data Missing data have minimal impact on results –From ‘Raw’ to ‘Ideal’ data, most correlations change by <0.02 –Very few values change by >0.05 –Exception is NS-SEC 8 [by definition!] –Correlations lower for ‘Ideal’ than ‘Raw’ Surrogates calculated direct from Rehearsal –circumvents data response bias?

Scale Higher correlations at higher geographies District effect small but significant –BUT none of districts in SE England Overfitting No significant impact

MAUP Correlations vary by up to 0.1 between alternative boundaries at same spatial scale BUT No detectable effect on rankings of surrogate income measures

Adult income (r 2 )

Caveats ‘Best’ performing surrogates in danger of over-fitting? –For Dale, Lee and Voas mean occupational income calculated directly from Census Rehearsal dataset (no other SOC2000 sources available at time of analysis) BUT –No significant difference if SOC minor or unit codes used –No significant difference if data partitioned

Household income (r 2 )

Accuracy For many purposes relative, rather than absolute, accuracy is most important  ranking

< 1% of unexplained spatial variation in income attributable to area level effects House price has no significant impact –could be due to data problems Council tax band has small but significant effect [for areas of enumeration district size and below] Lack of utility counter-intuitive? –current value  purchase price –purchase income  current income Other data sources

Conclusions (I) Best approaches capture 80-90% of spatial variation in income, even for smallest spatial units But considerable within-area heterogeneity Best approaches are regression or sub- group mean based Conventional deprivation indices a poor second to % social class / NS-SEC I+II

Conclusions (II) Geodemographic classifications at best perform as well as % NS-SEC I+II, and perform best for areas of ward size and above Qualified support for use of statistical distributions in modelling top income band means

Implications Moral for marketers: Target people, not places Moral for policy makers: Deprivation indices not the best proxy for income ONS ward income estimates (based on ecological regression) likely to perform well

Longer term Consider external correlates (e.g. IMD 2000; benefits data) Lobby for Census Office to create small- area income estimate –by imputing income on Census microdata –include non-census information (?)

Acknowledgements House price data were taken from the Experían Limited Postal Sector Data, ESRC/JISC Agreement. Grateful thanks are due to the Census Custodians of England, Wales and Scotland for granting permission to access the Census Rehearsal dataset. A debt of gratitude is also owed to a number at the Office for National Statistics, in particular Keith Whitfield and Philip Clarke. Finally, thanks are due to David Voas for undertaking some of the preparatory work for this project. All analyses and conclusions remain my sole responsibility.

Definitions (I) NS-SEC I+II: % persons aged in NS-SEC I or II Townsend: Multiple deprivation indicator based on % economically active unemployed; % overcrowded households; % households with no car and % of households not owner occupied Green (Wealth): Affluence indicator based on % households with 2+ cars; % persons aged in NS-SEC I and % adults with high educational qualifications PCA_96: Geodemographic classification based on principal components analysis of 20 normalised census variables, individuals in each of 96 area types assumed to have mean income of all persons in area type Voas: Alternative geodemographic classification, in which five census variables are divided into above or below median, one variable into thirds; with all cross-tabulated to give a total of 96 discrete area types

Definitions (II) Dale: Income imputed given mean income for population sub-group defined by sex, SOC 2000 minor group, economic activity (missing; employed full-time; employed part-time; self-employed; other), age (missing; 0-15; 16-19; 20-29; 30-49; 50+) [Maximum of 4860 valid sub-groups] Lee: Income imputed given mean income for population sub-group defined by SOC 2000 minor group, economic activity (child; not applicable; employed full-time; employed part-time; self-employed; unemployed; retired; other inactive) [maximum of 649 valid sub- groups]

Definitions (III) Voas (individual): Regression model for adult income (children assumed to have 0 income); INCOME 0.5 predicted given: mean income by SOC2000 unit; mean income by Industry category, age, age 2, residents, residents 2, rooms and cars plus dummy variables for sex, white, full-time student, married, Single/Widowed/Divorced, Long- term ill, No qualifications, GCSE or equivalent, A levels or equivalent, Undergraduate degree or equivalent, employed full-time, employed part-time, self-employed, unemployed, retired, permanently sick, other economically inactive excluding pensioners and students, Semi-detached, terrace, flat, caravan, privately rented, social rented, employed manager or supervisor and district of residence Voas (household): Regression model for total household income; HHINC 0.5 predicted given same set of predictors as for Voas (individual), but based only upon head of household’s characteristics