Adding Census Geographical Detail into the British Crime Survey for Modelling Crime Charatdao Kongmuang Naresuan University, Thailand Graham Clarke and.

Slides:



Advertisements
Similar presentations
Experiences of Discrimination: The Impact of Metropolitan and Non- Metropolitan Location Brian Ray, University of Ottawa Valerie Preston, York University.
Advertisements

Longitudinal LFS Catherine Barham and Paul Smith ONS.
The Census Area Statistics Myles Gould Understanding area-level inequality & change.
Household Projections for England Yolanda Ruiz DCLG 16 th July 2012.
Commodification: a new phase in London’s gentrification? Antoine Paccoud LSE Geography & Environment
Weighting Methodology for the Private Landlords Survey Robert Bucknall, ONS.
Centre for Housing Research, University of St Andrews Occupational mobility and neighbourhood effects: a longitudinal study ESRC Seminar Series – 4 & 5.
Prepared by Kim Gilchrist Epidemiologist Public Health, MLHD May 2013 Socio-economic Disadvantage.
A comparison of the characteristics of childless women and mothers in the ONS Longitudinal Study Simon Whitworth Martina Portanti Office for National Statistics.
Introduction to STINMOD and Microsimulation Modelling in Australia Ben Phillips: Principal Research Fellow, NATSEM, 21 Feb 2015.
1.2.4 Statistical Methods in Poverty Estimation 1 MEASUREMENT AND POVERTY MAPPING UPA Package 1, Module 2.
A Tale of Two Sources Bringing Together Scotland’s Crime Statistics Trish Campbell, Justice Analytical SGJusticeAnalys.
A model-based approach for estimating international emigration for local authorities Brian Foley, Office for National Statistics BSPS day meeting London.
*Wahida Kihal 1, Cindy Padilla 1,2, Benoit Lalloué 1,2,3, Marcello Gelormini1, Denis Zmirou-Navier 1,2,3, Séverine Deguen 1,2 1 EHESP School of Public.
The micro-geography of UK demographic change Paul Norman School of Geography, University of Leeds Understanding Population Trends and Processes.
Modelling Crime: A Spatial Microsimulation Approach Charatdao Kongmuang School of Geography University of Leeds Supervisors Dr. Graham Clarke, Dr. Andrew.
Two Worlds of Ageing: Spatial Microsimulation Estimates of Small Area Advantage and Disadvantage Among Older Australians JUSTINE MCNAMARA, CATHY GONG,
Sample of Anonymised Records: User Meeting Propensity to migrate by ethnic group: 1991 & 2001 Paul Norman 1, John Stillwell 2 & Serena Hussain 2 School.
Northern Ireland Neighbourhood Information Service - NINIS Fiona Johnston Neighbourhood Statistics NISRA.
History &Policy connecting historians, policymakers and the media Editors: Alastair Reid (University of Cambridge) Simon.
Methods of Geographical Perturbation for Disclosure Control Division of Social Statistics And Department of Geography Caroline Young Supervised jointly.
Creating synthetic sub-regional baseline populations Dr Paul Williamson Dept. of Geography University of Liverpool Collaborators: Robert Tanton (NATSEM,
Individual and Household Level Estimates Based on 2001 UK Human Population Census Data Andy Turner CSAP Seminar on Microsimulation: Problems and Solutions.
Labor Statistics in the United States Grace York March 2004.
Socioeconomic inequalities in mortality: Men aged (4) Chris White Principal Research Officer Office for National Statistics LS Clearance 20105D.
Secondary Data Analysis Using the Census Stephen Drinkwater WISERD School of Business and Economics Swansea University.
National Statistical Office, Thailand 2-6 December 2013, Hanoi, Viet Nam Census Evaluation.
Spatial Simulation for Education Policy Analysis in Ireland An Initial Exploration Gillian Golden University College Dublin
Presentation title Social class groupings Social class groupings Joan Garrod Fotolia.
Multilevel models for predicting personal victimisation in England and Wales Andromachi Tseloni Analysis of crime data ESRC Research Methods Festival 2010.
SPATIAL MICROSIMULATION: A METHOD FOR SMALL AREA LEVEL ESTIMATION Dr Karyn Morrissey Department of Geography and Planning University of Liverpool Research.
The micro-geography of UK demographic change Paul Norman School of Geography, University of Leeds understanding population trends and processes.
Joint UNECE/Eurostat Meeting on Population and Housing Censuses (28-30 October 2009) Accuracy evaluation of Nuts level 2 hypercubes with the adoption of.
C2ER 52 nd Annual Conference & LMI Training Institute Annual Forum Regional Socioeconomic Statistics Update on U.S. Census Bureau Programs June 8, 2012.
Comparing approaches of different (partly) register-based countries Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
Record matching for census purposes in the Netherlands Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands.
1 Statistical Disclosure Control for Communal Establishments in the UK 2011 Census Joe Frend Office for National Statistics.
2001 CENSUS/NeSS OUTPUT ROADSHOW KEY STATISTICS HIGHLIGHTS Ian White/Sarah Crofts ONS, Census Division.
Longitudinal Data Recent Experience and Future Direction August 2012.
Sustainable rural populations: the case of two National Park areas Alan Marshall Ludi Simpson Cathie Marsh Centre for Census and Survey Research.
What is … small area estimation Dimitris Ballas Department of Geography University of Sheffield
Additional analysis of poverty in Scotland 2013/14 Communities Analytical Services July 2015.
Application of the ESeC on data of the Dutch Labour Force Survey: a comparison between years Sue Westerman Roel Schaart Service Centre for the Classifications.
New and easier ways of working with aggregate data and geographies from UK censuses Justin Hayes UK Data Service Census Support.
Census.ac.uk The UK Census Longitudinal Studies Chris Dibben, University of St Andrews.
Introduction to Spatial Microsimulation Dr Kirk Harland.
Centre for Housing Research, University of St Andrews The Effect of Neighbourhood Housing Tenure Mix on Labour Market Outcomes: A Longitudinal Perspective.
Evaluating Transportation Impacts of Forecast Demographic Scenarios Using Population Synthesis and Data Simulation Joshua Auld Kouros Mohammadian Taha.
Teaching Research Methods: Resources for HE Social Sciences Practitioners Workshop 2: Using Census 2011.
Simon Power Managing Consultant John Rae Director Understanding Communities Through PayCheck
Analysis of 2011 Census Data Irish Community Statistics, England & Selected Urban Areas Prof. Louise Ryan, Alessio D’Angelo, Mike Puniskis, Neil Kaye Wednesday.
WP 19 Assessment of Statistical Disclosure Control Methods for the 2001 UK Census Natalie Shlomo University of Southampton Office for National Statistics.
Aspects of victim survey methodology: results from the inventory of victimisation surveys Jonathan Allen UK Home Office.
General Register Office for S C O T L A N D information about Scotland's people Household Estimates and Projections Esther Roughsedge General Register.
United Nations Workshop on Revision 3 of Principles and Recommendations for Population and Housing Censuses and Evaluation of Census Data, Amman 19 – 23.
What is my neighbourhood like? Read this if you want to learn: 1)Why statistical data about your local area is important 2)What statistical information.
Creating Open Data whilst maintaining confidentiality Philip Lowthian, Caroline Tudor Office for National Statistics 1.
Characterizing Rural England using GIS Steve Cinderby, Meg Huby, Anne Owen.
An ecological analysis of crime and antisocial behaviour in English Output Areas, 2011/12 Regression modelling of spatially hierarchical count data.
Analysis of the characteristics of internet respondents to the 2011 Census to inform 2021 Census questionnaire design Orlaith Fraser & Cal Ghee.
© Statistisches Bundesamt, VI A Statistisches Bundesamt The new method of the next german Population census Johann Szenzenstein, Federal Statistical Office,
Exploring Microsimulation Methodologies for the Estimation of Household Attributes Dimitris Ballas, Graham Clarke, and Ian Turton School of Geography University.
Mapping Social Indicators for South East Coastal Adaptation project Yogi Vidyattama, Binod Nepal and Itismita Mohanty The National Centre for Social and.
Social Issues (ST2): Wealth and Health Inequalities Aims of Study Theme (Pupils should be able to): Give evidence of inequalities in wealth and health.
The 2011 Census: Estimating the Population Alexa Courtney.
Sinclair Sutherland Labour supply: Finding and using statistics.
Samples of Anonymised Records from the U.K. Census 1991 and 2001 Integrating Census Microdata Workshop Barcelona th July 2005 Dr. Ed Fieldhouse Cathie.
European Socio-Economic Classification: A Validation Exercise Figen Deviren Office for National Statistics.
Improved Method for the Geographical Distribution of Out-Migrants
Presentation transcript:

Adding Census Geographical Detail into the British Crime Survey for Modelling Crime Charatdao Kongmuang Naresuan University, Thailand Graham Clarke and Andrew Evans University of Leeds, UK

Background Crime and risk of victimisation Crime and risk of victimisation  Unevenly distributed across population, time and space  Varies dramatically with demographic, socio-economic and area characteristics Crimes at the small area scale often do not match expectations based on national averages

Background (cont.) Although, the British Crime Survey (BCS) provides rich information about levels of crime and crime victimisation, it cannot be used to explain crime victimisation for small geographical units (not currently released below the national level)

Solution Attach the information from the BCS to the more geographically disaggregated census data using spatial microsimulation technique.

What is Microsimulation? A methodology aimed at building large-scale datasets on individual units such as persons, households or firms and can be used to simulate the effect of changes in policy or other changes on these micro units

What is Spatial Microsimulation? A microsimulation that takes space into account It contains geographical information that can be used to investigate the policy impacts

SimCrime Model A static spatial microsimulation model designed to estimate the likelihood of being a victim of crime and crime rates at the small area level in Leeds.

SimCrime Model (cont.) Simulated Annealing C ombines individual microdata from the British Crime Survey (BCS) with spatially small area aggregated census data to create synthetic microdata estimates for output areas (OAs) in Leeds using a Combinatorial Optimisation Simulated Annealing method.

SimCrime Model Specification ‘Simulated Annealing-Based Reweighting Program’ The synthetic microdata dataset was generated at the census output area for Leeds with the use of ‘Simulated Annealing-Based Reweighting Program’. 514,523 individuals aged living in households found in Leeds in the UK 2001 Census were recreated

Simulated Annealing Based Reweighting Program (generate a population microdata dataset at the Output Area Level) Implemented in Java. The process involves selecting the combination of individuals from the BCS microdata which best fits the known constraints in the selected small areas (of the 2001 UK Census). gradually improving fit The process is repeated with the aim of gradually improving fit between the observed data and the selected combination of individual from the BCS.

Data for generating synthetic micro-population:  Census Area Statistics of the 2001 UK Census  Constrained Tables  Number of total population in small-areas  Microdata from the 2001 British Crime Survey

Census Area Statistics (CAS) Equivalent to the Small Area Statistics (SAS) of the 1971, 1981, and 1991 Censuses. Available for geographical levels down to output area (OA), the smallest unit of the 2001 Census. Note: Note: Each OA contains approximately 290 persons or 125 households

Category IndicatorHigh Propensity Demographic Characteristics of Offender Age Sex Marital Status Family Status Family Size Young adult Male Single Broken Home, divorce (weak family life) Large Socio-Economic Status of Offender Income Employment status Education Deprivation Low income Unemployed Less High level of deprivation Household Characteristics Density of living Tenure Substandard Rented Victim Characteristics Age Sex Ethnicity Lifestyle Tenure Young adult Male Minority Group Away home Rented, not owner occupied Neighbourhood types and characteristics Urbanisation Population Density Proximity High Inner city, proximity to disadvantage areas Variables related to crime

Constrained Tables: CS004: CS004:Age by Sex and Living Arrangements (16 categories) CS047: CS047: National Statistic-Socioeconomic Classification by Tenure (18 categories) CS061: CS061: Tenure and Car or Van Availability by Economic Activity (24 categories)

SimCrime Constrained Variables Categories Age Aged Aged Aged Aged Sex Male Female Living Arrangement Couple Not couple Economic Activity Employed Unemployed Inactive Full-time Student Tenure Type Owned Rented Car or Van availability No Car One Car Two or more car Socio-economic Classification Higher Managerial and professional occupations Lower Managerial and professional occupations Intermediate occupations Small employers and own account workers Lower supervisory and technical occupations Semi-routine occupations Routine occupations Never worked and long-term unemployed Not classified

Discrepancies in census counts between tables Source: 2001 Census Area Statistics Source: 2001 Census Area Statistics Note: Note: Each cell shows the number of people aged living in households

The constraint tables should be adjusted to minimise discrepancies between the total populations in small areas.

Constraint Tables Adjustment Total number of people in the small areas (GroupNumber) Each table cell

Number of people in each cell = Number of people from the constraint table x GroupNumber Total Sum for each area Constraint Tables Adjustment (cont.)

Constraint Table Adjusted Table 116/230 * 228 = 115

What can we get? more consistent The adjustment method ensures the constraint tables are more consistent or at least can be guaranteed to produce the smallest discrepancy.

The British Crime Survey  One of the largest social research surveys conducted in England and Wales (Sample 40,000 households)  A victimisation survey (whether or not reported to the police)  Covers a wide range of topics (1,642 variables) NOT  The BCS can now provide limited information at the police force area level, but NOT for smaller geographical units.

Microdata (The 2001 BCS) 1,642 variables with 32,824 records

The Program: The microdata filtering process  Goes through the entire micro-database and checks whether an individual fits into each column of constraining tables for the current area. Simulated Annealing process  Searches for the best combinations of individuals based on the result of the filtering process.

Output: from the Simulated Annealing Based Reweighting Program  Synthetic Population: A list of individuals which contains the demographic and socio-economic characteristics ( crime variables from the BCS are attached).  Error Report: Provides information on the difference between distributions of constrained table and synthetic microdata at the output area level.

The absolute differences between estimated & expected counts Error Report

Distribution of female single, widow, or divorce aged living in rented house

Headingley

Distribution of high class households, owner occupier having at least 1 car

Evaluation of Synthetic Microdata Evaluate in terms of their match to the constraint tables from the census at the output area level.

Evaluation of Synthetic Microdata (cont.) Total Absolute Error (TAE) The measure of difference between distributions of constrained table and synthetic microdata is the Total Absolute Error (TAE)  The sum of absolute differences between estimated and observed counts. Standardised Absolute Error (SAE) To compare across the tables: Standardised Absolute Error (SAE)  TAE / Total expected count

SAE of 0 or perfect fit =1,318 output areas Note: The number in the bracket show number of output area for each SAE group. There are 2,439 output areas in Leeds. Source: SimCrime

Spatial distribution of SAE for all constraints at output area level SAE of 0 or perfect fit =1,212 output areas Note: The number in the bracket show number of output area for each SAE group. There are 2,439 output areas in Leeds. Source: SimCrime

Modelling Crime  Each individual in the BCS has crime variables associated with them, the microsimulation allows us to make small area estimates victims of crime and high-risk areas.  Assume that if the synthetic population have the same characteristics as the population from the BCS, they will have the same propensity to be a victim of crime.

Headingley University Estimated victim rate per 1,000 households by ward of ‘burglary dwelling’ in Leeds

Conclusion SimCrime effectively adds ‘geography’ to the British Crime Survey The spatial aspect of the data make it possible to do analysis at different spatial scales. Demonstrated a method to minimise discrepancies between the totals of the constraint tables

Conclusion (cont.) small area levels The spatial microsimulation has enabled the modelling of crime victimisation at small area levels. Before this the smallest area of modelling crime in the UK was at the police force area level.

More information Modelling Crime: A Spatial Microsimulation Approach (Completed PhD thesis) (Completed PhD thesis) SimCrime: A Spatial Microsimulation for Crime in Leeds (Working Paper 06/1) Dept. Natural Resources and Environment Fac. of Agriculture, Natural Resources and Environment Naresuan University, Muang, Phitsanulok, 65000, THAILAND

Thank you