T HE DEVELOPMENT OF A NEW MIGRATION CLASSIFICATION FOR LOCAL AUTHORITY DISTRICTS IN B RITAIN Presentation to the Office for National Statistics – 5 th.

Slides:



Advertisements
Similar presentations
The Census Area Statistics Myles Gould Understanding area-level inequality & change.
Advertisements

Are Area-Based Deprivation Indices A Nonsense? Dennis Pringle Dept. of Geography, NUI Maynooth; National Institute For Regional And Spatial Analysis; and.
Household Projections for England Yolanda Ruiz DCLG 16 th July 2012.
School of Geography FACULTY OF EARTH & ENVIRONMENT Using OAC for analysis of the 2001 Census interaction data Oliver Duke-Williams
A comparison of the characteristics of childless women and mothers in the ONS Longitudinal Study Simon Whitworth Martina Portanti Office for National Statistics.
In the case of Holywell and Heslington, there are clear differences in the ages of migrants involved in these flows, and the pattern fits well with this.
A model-based approach for estimating international emigration for local authorities Brian Foley, Office for National Statistics BSPS day meeting London.
The micro-geography of UK demographic change Paul Norman School of Geography, University of Leeds Understanding Population Trends and Processes.
England’s Changing Social Geology Dr Daniel Vickers RCUK Academic Fellow in Social and Spatial Inequalities Department of Geography University of Sheffield.
AEB 37 / AE 802 Marketing Research Methods Week 7
Statistics for Marketing & Consumer Research Copyright © Mario Mazzocchi 1 Cluster Analysis (from Chapter 12)
T HE W EB - BASED I NTERFACE TO C ENSUS I NTERACTION D ATA - WICID Presentation to the ESRC Research Methods Festival Adam Dennett Centre for Interaction.
Sample of Anonymised Records: User Meeting Propensity to migrate by ethnic group: 1991 & 2001 Paul Norman 1, John Stillwell 2 & Serena Hussain 2 School.
Adding Census Geographical Detail into the British Crime Survey for Modelling Crime Charatdao Kongmuang Naresuan University, Thailand Graham Clarke and.
Identifying new migrant populations in UK cities David Owen and Audrey Lenoël.
Software Quality Control Methods. Introduction Quality control methods have received a world wide surge of interest within the past couple of decades.
Geography and Geographical Analysis using the ONS Longitudinal Study Christopher Marshall & Julian Buxton CeLSIUS.
The Social Profile of Rural Britain: Insights from longitudinal datasets Heather Joshi Gareth Hughes & Brian Dodgeon Centre for Longitudinal Studies Institute.
Migration, methodologies and health inequality SEED Group
Decreasing segregation and increasing integration in England and Wales: what evidence of ‘White flight’? Dr Gemma Catney Leverhulme Trust Early Career.
Secondary Data Analysis Using the Census Stephen Drinkwater WISERD School of Business and Economics Swansea University.
The Gender Gap in Educational Attainment: Variation by Age, Race, Ethnicity, and Nativity in the United States Sarah R. Crissey, U.S. Census Bureau Nicole.
1 Applying Circular Statistics to the Study of Graduate Job Search: The Case of Great Britain. Alessandra Faggian 1, Jonathan Corcoran 2 and Philip McCann.
Migration and the Economy in Cornwall A quantitative analysis on the relationship between migration and the economy in the small areas of Cornwall Stuart.
Analysis of travel-to-work patterns and the identification and classification of REDZs Dr David Meredith, Teagasc, Spatial Analysis Unit, Rural Economy.
Changes to Internal Migration methodology for English Subnational Population Projections Robert Fry & Lucy Abrahams.
‘Estimating with Confidence’ and hindsight: Population estimates for areas smaller than districts, revisions to levels of 1991 Census non-response Paul.
The new HBS Chisinau, 26 October Outline 1.How the HBS changed 2.Assessment of data quality 3.Data comparability 4.Conclusions.
Are the results valid? Was the validity of the included studies appraised?
Improving the estimation of long-term international emigration at local authority level Joshua Turner Population Statistics Research Unit (PSRU) Local.
SPATIAL MICROSIMULATION: A METHOD FOR SMALL AREA LEVEL ESTIMATION Dr Karyn Morrissey Department of Geography and Planning University of Liverpool Research.
The micro-geography of UK demographic change Paul Norman School of Geography, University of Leeds understanding population trends and processes.
Internal migration of Britain’s ethnic populations Serena Hussain and John Stillwell School of Geography University of Leeds Presentation for the UPTAP.
COMP53311 Clustering Prepared by Raymond Wong Some parts of this notes are borrowed from LW Chan ’ s notes Presented by Raymond Wong
1 New York State Growth Model for Educator Evaluation 2011–12 July 2012 PRESENTATION as of 7/9/12.
2011 CENSUS Coverage Assessment – What’s new? OWEN ABBOTT.
Secondary data Relevance: A-Level Case study: 2011 UK census Topic: Geographical skills.
A Statistical Analysis of Seedlings Planted in the Encampment Forest Association By: Tony Nixon.
Monitoring UK internal migration in the twenty-first century John Stillwell Centre for Interaction Data Estimation and Research (CIDER), School of Geography,
IPC The Regional Housing Market for Older People in the South West Exeter 24th th September 2008.
Providing Access to Census- based Interaction Data in the UK: That’s WICID! John Stillwell School of Geography, University of Leeds Leeds, LS2 9JT, United.
1 Measuring Quality Issues Associated with Internal Migration Estimates Joanne Clements, Amir Islam, Ruth Fulton & Jane Naylor Demographics Methods Centre.
Methodology for producing the revised back series of population estimates for Julie Jefferies Population and Demography Division Office for.
T OWARDS AN AREA CLASSIFICATION FOR MIGRATION ANALYSIS AND INTER - CENSAL MONITORING Presentation to the BSPS Annual Conference, Manchester 2008 Adam Dennett.
Sustainable rural populations: the case of two National Park areas Alan Marshall Ludi Simpson Cathie Marsh Centre for Census and Survey Research.
Additional analysis of poverty in Scotland 2013/14 Communities Analytical Services July 2015.
WICID AND THE 2001 INTERACTION DATA John Stillwell and Oliver Duke-Williams School of Geography, University of Leeds Presentation at the Ninth International.
Paper presented at the BSPS Annual Conference, University of Kent at Canterbury, September 2005 Are our cities still losing human capital? The evidence.
General Register Office for S C O T L A N D information about Scotland's people Household Estimates and Projections Esther Roughsedge General Register.
ETHNIC MIGRATION IN BRITAIN: Analyses of census data at district and ward scales John Stillwell and Adam Dennett School of Geography, University of Leeds,
General Register Office for S C O T L A N D information about Scotland's people Scottish Demography - Local Perspectives Explores differences between parts.
Characterizing Rural England using GIS Steve Cinderby, Meg Huby, Anne Owen.
The dynamics of Britain’s ethnic group populations: the roles of natural change and net migration in producing the ethnic mosaic Nissa Finney and Ludi.
Household Projections Dorothy Watson General Register Office for Scotland Household Estimates and Projections Branch.
New estimates of housing requirements in England, 2012 to 2037 Neil McDonald and Christine Whitehead.
Exploring Microsimulation Methodologies for the Estimation of Household Attributes Dimitris Ballas, Graham Clarke, and Ian Turton School of Geography University.
Hukou Identity, Education and Migration: The Case of Guangdong
IAOS Shanghai – Reshaping Official Statistics Some Initiatives on Combining Data to Support Small Area Statistics and Analytical Requirements at.
By Dr Hidayathulla Shaikh. Objectives At the end of the lecture student should be able to –  Explain types of examination  Discuss different types of.
7.1 How have cities grown and what challenges do they face? 7.1a The world is increasingly urbanised as cities grow due to different processes Lesson 1-
Jo Watson sepho South East Public Health Observatory Solutions for Public Health Day 2: Session 2 Populations and geography.
INTRODUCTION Despite recent advances in spatial analysis in transport, such as the accounting for spatial correlation in accident analysis, important research.
Basic statistical concepts Variance Covariance Correlation and covariance Standardisation.
Unsupervised Learning
Taking Part 2008 Multivariate analysis December 2008
School of Geography, University of Leeds
Cluster Analysis.
Social Role Evolution of an Ideological Online Community Brittany I
NHS DUDLEY CCG Latest survey results August 2018 publication.
Unsupervised Learning
Presentation transcript:

T HE DEVELOPMENT OF A NEW MIGRATION CLASSIFICATION FOR LOCAL AUTHORITY DISTRICTS IN B RITAIN Presentation to the Office for National Statistics – 5 th May 2009 Adam Dennett Centre for Interaction Data Estimation and Research School of Geography University of Leeds

P RESENTATION O UTLINE Background: area classifications and their use in migration research Introduction to current research. Classification rationale and considerations for a migration classification. Developing a migration classification - methodology and issues Results – a new migration classification for local authority districts Conclusions and future research agenda

B ACKGROUND Classification of areas according to the individuals residing within has a long history: – Charles Booth 19 th Century London – Burgess and the Chicago School in 1920s – CACI (ACORN), Experian (MOSAIC), Eurodirect (CAMEO) – UK government (ONS) since the 1960s. Recent classifications for Local Authorities, Wards, SOAs, OAs & Health Authorities. Area Classifications

B ACKGROUND Dennett and Stillwell (2008) Population Trends 134 Dennett and Stillwell (2009) Population Space and Place – available online ‘Early View’ 6 th May Dennett and Stillwell (forthcoming) in Stillwell, J., Duke-Williams, O. and Dennett, A. (eds) Technologies for Migration and Commuting Analysis: Spatial Interaction Data Applications. Area classifications and migration research

At a relatively aggregate (Family) level, it can be argued that the process of counterurbanisation is continuing. At Group and Class levels, however, this is less so. There is evidence for a two tier ‘rural’ when Commuter belt is taken into consideration (even outside of city regions). BUT…Everything varies by age Net migrants and net migration rates by district classification – all ages,

A MIGRATION DATA CLASSIFICATION Interaction data are inherently more complex than standard count data – origin and destination. One of the principal purposes of any classification is to simplify complex data to aid understanding. District level classifications such as those developed by ONS and Vickers et al. have been created without the use of migration/commuting variables. Underlying populations in areas may or may not be representative of migrant populations; thus classifying areas separately by their migrants may reveal things not shown in other general purpose classifications. Rationale – Classification to aid understanding

A MIGRATION DATA CLASSIFICATION An new area classification will be useful as a framework for monitoring migration measured from other data sources post-2001 Further research need not be limited to using the classification directly: – The development of any classification necessitates the evaluation and selection of variables. Any variables selected are likely to be more helpful in explaining the migration system and thus useful for future migration research – e.g. the development of projection models. Rationale – Classification as part of the research process

A MIGRATION DATA CLASSIFICATION Where the processes of internal migration in Britain are complex, our understanding can be enhanced through examining the particular characteristics migrants and migrant flows can give to defined areas. Britain can be usefully classified by the types of migrant and the particular flows that they exhibit to, from and within defined geographical areas, and that classified areas will be distinct in the profiles they exhibit. Research Hypothesis

C ONSIDERATIONS FOR THE NEW CLASSIFICATION Interaction data available at variety of scales in UK (OA to Region), although more variables available at coarser scales. Census provides more variables at a wider range of scales than any other data source in the UK. Finer grained data less attribute rich and more prone to small count perturbation. A classification of coarser areal units may encourage more false ecological inferences. Data at some levels not available for same units of measurement in UK (districts – Britain, Parliamentary constituencies – NI). Scale

C ONSIDERATIONS FOR THE NEW CLASSIFICATION Internal migration data available from a variety of sources disaggregated by numerous variables (especially when census data are used). Largest influence on population change in the UK. International migration data available from fewer sources, although census data are directly comparable with internal migration census data. Evidence for linkage with internal migration (Stillwell and Duke- Williams, 2005). Commuting data are similarly available from census and non-census sources. Phenomenon closely related to migration – especially where longer distance, longer stay commutes are concerned. Internal migration, International migration, Commuting?

C ONSIDERATIONS FOR THE NEW CLASSIFICATION It could be argued that the defining element of interaction data is the linkage between origins and destinations and the flow between those points. Classification of flows has been carried out before – ‘functional regionalisation’ used by Coombes et al. (1986, 2002) to create Travel To Work Areas creates sets of contiguous areas based upon commuting flow data. The same technique used with migration data to create Housing Market Areas (Coombes et al., 2004). Classifying flows tends to create sets of contiguous areas, although contiguity may be an unnecessary constraint in an interaction data classification. Classifying flows or individuals?

A MIGRATION DATA CLASSIFICATION Scale: District level data used initially due to increased attribute availability, but only for Britain. Scope for trialling a ward level classification (including NI) at a later stage. Data: Internal and international migration data used initially. Classification type: This classification will concentrate on classifying individuals rather than flows. Contiguity constraints not useful at this stage. Initial decisions for a trial classification

A TRIAL CLASSIFICATION A standard clustering methodology similar to the one suggested by Milligan (1996) was used to create the classification. This consists of 6 steps: Local authority districts in Britain were chosen as objects to cluster 2.56 variables were selected from an original list of 5559 taken from Census SMS level 1 tables. Domains included: – In/out/within/international/no usual address migration rates for age, ethnicity, economic activity and long term illness – Migration efficiency rates (due to no suitable denominator) for Socio-economic status, family status and housing tenure Selection through a combination of correlation analysis, principal components analysis and analysis of standard deviation stats. Methodology

A TRIAL CLASSIFICATION 3. All variables were standardised by z-scores where measurements were on different scales 4. Euclidean distance was chosen as measure of proximity used to compute clusters 5. Ward’s hierarchical clustering algorithm was used in the initial partition as it does not require definition of the number of clusters prior to the clustering procedure – a suitable number of clusters chosen from Ward’s algorithm output. Methodology

A TRIAL CLASSIFICATION Data input to SPSS; algorithm run; cluster solution produced. Job done? Initial classification has produced a set of areas with distinctive characteristics in line with what might be expected (e.g. London, London periphery, university towns, ex-industrial economically more depressed areas, rural periphery all forming distinct clusters). BUT… how do we know this is a reliable classification? Are all districts clustered appropriately?

R EFINING AND CREATING A ROBUST CLASSIFICATION The robustness of any classification will depend on the quality of the decisions made at each point of the construction process. Initial trial classification can be seen as a draft – some decisions were made hastily in order that some output be produced which could then be improved upon. In refining the classification Important decisions need to be made in relation to: – data – methodology

R EFINING AND CREATING A ROBUST CLASSIFICATION Trial classification ignored variable distributions. Various authors (Vickers, 2006 and Založnik, 2006) advocate transformation of skewed variables as it stated they can bias clusters. Others (Openshaw and Wymer, 1995) say not necessary… Who is correct? Only solution to test the data… Data - badly behaved variables

R EFINING AND CREATING A ROBUST CLASSIFICATION Transforming data (log and sqrt transformations) failed to improve variable distributions – therefore not necessary to transform But, do skewed variables bias clusters? Next step to experiment with dropping very skewed variables… 8 dropped first Transforming or removing variables

R EFINING AND CREATING A ROBUST CLASSIFICATION Further 4 very skewed variables dropped (international variables) Dropping skewed variables has little effect, however removing these variables reduces complexity in the final solution Dropping international variables means that this is now an internal migration classification Transforming or removing variables

R EFINING AND CREATING A ROBUST CLASSIFICATION Ward’s algorithm can produce sub-optimal solutions - cases cannot be reallocated to more suitable clusters at a later stage in the algorithm. K-means algorithm can reallocate cases. BUT… Implementation of k-means in some software packages can produce different results from the same data through starting with different cluster centroids… Methodology – An optimising algorithm

C LUSTER O PTIMISATION K-means in SPSS with cases ordered differently

C HOOSING K MATLAB implements K- means such that a solution is chosen with cases surrounding the optimum cluster centroids. Solution with minimum average distance to these centroids is selected In a single tier classification, the number of clusters is key… Comparison of silhouette plot data enables the choice of the most appropriate number of clusters k Optimal Clustering Sub-optimal Clustering

F INAL SOLUTION Selection of most the appropriate variables and the best clustering techniques through an exhaustive process of testing and evaluation ensures the most robust final solution. But what does the classification reveal?

A ‘ FUZZY ’ CAVEAT Even though this is the best classification possible, some districts exhibit stronger cluster membership than others Silhouette values give an indication of the strength of cluster membership for each district Cluster profiles will have a greater or lesser association with each district accordingly

C LUSTER P ROFILES The cluster is characterised by in- migrants and within-area migrants in the older age groups – 45 and above. Younger in-migrants are very much underrepresented. Migrants into these areas are from across the socio-economic spectrum, although the very high socio-economic groups are less common. Migrants preferentially move into owner occupied accommodation, and tend to be either or alone or in couples, far more than parent families. Cluster 1 - Coastal and Rural Retirement Migrants

C LUSTER P ROFILES The cluster is characterised by high levels of student in- migration, and young person within-area migration. Non-household moving groups into privately rented accommodation are common in this cluster, as are non-family households and individuals moving into communal establishments – all characteristics of a student population. In addition, non-white within-area migration is important, as is in- migration of economically inactive migrants. Cluster 3 – Student Towns

C LUSTER P ROFILES Districts in this cluster have very much below average in-migration and out-migration for all age groups, signifying a degree of isolation from the rest of the clusters in Britain. Shorter distance, within-area migration is slightly above average. Moves into these areas come from individuals in the lower socio-economic groups, with moves of one-parent families being above average. Moves of economically inactive individuals are very much below average. Cluster 5 - Constrained, Working-Class, Local Britain

C LUSTER P ROFILES The cluster is defined by some of the highest and lowest z-score values, indicating it is the most dynamic cluster in the classification. Very high rates of in-migration for the migrants under 30, but below average in- migration rates for those over 30. Out-migration rates are very high for all groups, but especially for those between 30 and 45. This cluster features the highest rates of movement of the economically inactive. Across the four highest socio-economic groups there are positive efficiency rates for other moving groups, but negative rates for wholly moving households, Moves into privately rented accommodation are common. Cluster 7 – Dynamic London

C ONCLUSIONS AND MOVING FORWARD It is possible to classify districts in Britain by their migration characteristics Clusters in the classification exhibit very different profiles which are useful for understanding the complex internal migration processes and patterns going on in Britain BUT… A classification can only reliable once care has been taken at each stage of the construction process Even the most robust classification will have a degree of ‘fuzziness’ Back to the research hypothesis…

C ONCLUSIONS AND MOVING FORWARD The district level classification now provides a framework for the analysis of internal migration in Britain post-2001 Where annual internal migration data from Patient Registers are attribute poor, an attribute rich classification can add significant value The classification can also provide a framework for estimating and projecting internal migration into the future

C ONCLUSIONS AND MOVING FORWARD Comparison with other classifications ONS classificationVickers et al. Classification ONS IMPS LA clusters

C ONCLUSIONS AND MOVING FORWARD Producing a sister, ward-level classification – potentially including flow data from a new set of functional regions. This could be used for validation of district level classification. Analysis of patient register data using the classification as a framework - What are changes over time? Analysis of other migration data – HESA? Calibration and testing of spatial interaction models which can be used to estimate migration between classified areas and project migration patterns leading up to the next census in 2011.

T HANK YOU A NY QUESTIONS, COMMENTS OR SUGGESTIONS ? Working paper detailing all of this work will be available here: Me: