Modelling international migration to produce local level estimates Ruth Fulton Office for National Statistics
Outline of presentation Context Immigration –Current method –Using administrative data –Modelling approach Data sources, Fitting the model, Diagnostics/ validation, Impact on estimates Emigration
Context of work Improving Migration and Population Statistics (IMPS) Previous improvements to immigration and emigration methodology (2007) Forthcoming package of improvements (May 2010)
The importance of international migration Key driver of population change
Current method: immigration National level –International Passenger Survey (IPS) data only Government Office Region (GOR) & Wales level –IPS data calibrated to Labour Force Survey (LFS) data –LFS data averaged over three years Intermediate geography level –IPS data averaged over three years Local authority level –2001 Census data
Current method: issue Current method uses 2001 Census data to distribute to LA level Clear changes in migration trends since 2001 e.g. EU accession Bias introduced to LA estimates where Census distribution has changed
Improving the current method:
Use of administrative data Potential use of administrative data: GP registrations (Flag 4s) National Insurance Number (NINo) allocations to overseas nationals Improves timeliness at LA level Differences in coverage and definitions
Comparison of Flag 4s and NINos
A modelling approach Produces estimates for LAs (IPS data cannot be used directly at this level) ‘Borrows strength’ from other data sources (covariates) Model fitted at the LA level describing relationship between IPS and covariates Fitted model can be used to obtain LA estimates
Model specification where = direct IPS estimate, no. immigrants going to LADj = expected total count of immigrants, LADj = set of covariates for LADj
Alternative approaches Modelling IPS sample counts Modelling IPS sample counts, with average LA weight as offset OR additional covariate Scaling IPS direct estimate to a count scale (or standardising IPS sample count) Fitting model at NMGi level, estimating coefficients and applying model at LA level
Choice of covariates Covariates selected which are associated with immigration Direct - counts of actual migrants Indirect - factors associated with migration
Variables entered for potential selection NINos Country of Birth Ethnic Population Flag 4s UK-born Immigrants Population Density Foreign Armed Forces Industry Mid-year Pop Est Foreign Students Job Centre Vacancies Home Armed Forces Internal Migration Unemp Estimates
Choice of covariates Model fitted for each year to identify most important covariates Fixed set of covariates then selected for use in all models
Fixed covariates currently in the model NINoS Country of Birth Ethnic Population Flag 4s UK-born Immigrants Population Density Foreign Armed Forces Industry Mid-year Pop Est Foreign Students Job Centre Vacancies Home Armed Forces Internal Migration Unemp Estimates
Diagnostic and Validation tests Model diagnostics Pseudo R 2 Residual plots Model vs Sample estimate plots Comparing the 2001 model based estimates with the 2001 Census data Comparing the sum of the model based estimates for LAs with the NMGi estimate Checking the time-series
Time series check
Model vs Sample Estimate plot (04/05)
04/05 ExistingNew
Preliminary Impacts Assessment ≥ to to to to to ≤
Current methods: emigration National level International Passenger Survey (IPS) data only Government Office Region (GOR) & Wales level IPS data only Intermediate geography level IPS data averaged over three years Local authority level Model based distribution (propensity to migrate)
Improvements Fits model at local authority level rather than intermediate geography level Uses Poisson modelling and models number of migrants rather than propensity to migrate Tested some additional covariates, e.g. more detailed ethnic group and fixes covariates
Differences from immigration model Averages IPS data over 3 years Includes an Intermediate geography effect Includes covariates which are associated with emigration Does not include any direct counts of emigrants
Impact of model based distribution Only affects the distribution of number of immigrants and emigrants within the intermediate geography Migration estimates for local authorities will change for mid-2002 to mid-2008 as a result
Further Information Quarterly updates and other information at Consultation papers (December 09)