Administrative Data Sources ONS Centre for Demography.

Slides:



Advertisements
Similar presentations
Statistics NZs experience in using Administrative Data in an Integrated Programme of Economic Vince Galvin General Manager Strategy & Communications.
Advertisements

An Introduction to the UK Data Archive and the Economic and Social Data Service November 2007 Jack Kneeshaw, UKDA.
Will 2011 be the last Census of its kind in England and Wales? Roma Chappell, Programme Director Beyond 2011 Office for National Statistics, July 2011.
Research into an alternative sampling frame for the FRS Antonia Simon, Development Team, DWP.
Improved Methods for Population and Migration Estimates ONS Centre for Demography May 2007.
Update on Population Statistics Research Projects Jonny Tinsley, Population Statistics Research Unit
Improving Migration and Population Statistics: Mid-Year Estimates Office for National Statistics Centre for Demography.
1 Cohort management and the Secondary Uses Service (SUS) Nirupa Dattani Office for National Statistics.
ALB-3 Paper 1 Headline Measures and Business Intelligence Data Pack.
Population Estimates Jonathan Swan, ONS Mid-year population estimates The ONS mid-year population estimates: ●At national level for England, Wales ●At.
A model-based approach for estimating international emigration for local authorities Brian Foley, Office for National Statistics BSPS day meeting London.
What are Wage Records? Wage records are an administrative database used to calculate Unemployment Insurance benefits for employees who have been laid-off.
Internal Migration Research Update Kostas Loukas, Population Statistics Research Unit
Using Administrative Data to Improve Social Statistics – An Example of Collaborative Work Minda Phillips, Office for National Statistics. Paul Sinclair,
Family Resources Survey Data Linking Jo Cockerham.
EGM – Population & Housing Censuses Eurostat / UNECE - Geneva - 24/25 May 2012 Beyond 2011 The future of population statistics (England & Wales) Alistair.
Developments with ONS’ Small Area Population Estimates Project Andy Bates.
Identifying new migrant populations in UK cities David Owen and Audrey Lenoël.
Quality assuring the UK business register Andrew Allen.
Beyond 2011 – A new paradigm for population statistics? Pete Benton, Beyond 2011 Programme Director Office for National Statistics, UK.
Household projections for Scotland Hugh Mackenzie April 2014.
Becoming Canadian Citizens: Intent, process and outcome Kelly Tran, Tina Chui: Statistics Canada Stan Kustec, Martha Justus: Citizenship and Immigration.
Population Estimates and Projections in the U. S. John F. Long
Improving the estimation of long-term international emigration at local authority level Joshua Turner Population Statistics Research Unit (PSRU) Local.
Understanding Population Trends and Processes WHAT HAPPENS WHEN INTERNATIONAL MIGRANTS SETTLE? ETHNIC GROUP POPULATION TRENDS AND PROJECTIONS FOR UK LOCAL.
Beyond 2011 The Future of Population Statistics Martin Ralphs, Office for National Statistics.
1 1 BSPS Seminar, GLA May 2 nd 2006 Professor Philip Rees Dr Peter Boden Estimating London’s New Migrant Population.
UNITE and the benefit cap: building a new data platform Daniel Brown & William Baxter - DWP.
Quality Measures for ONS population estimates: Introduction Local Insight Reference Panels Autumn
Migration Statistics Improvement Programme – Overview of Phase 2 ONS Centre for Demography.
National Statistics Quality Review on International Migration Estimates Update on taking forward the recommendations of the review Emma Wright & Giles.
Record matching for census purposes in the Netherlands Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands.
Coverage assessment and adjustment methodology Owen Abbott Methodology Directorate, ONS.
2011 CENSUS Coverage Assessment – What’s new? OWEN ABBOTT.
1 Sources of gender statistics Angela Me UNECE Statistics Division.
United Nations Economic Commission for Europe Statistical Division Sources of gender statistics Angela Me UNECE Statistics Division.
2011 Census 2007 Census Test – emerging findings Garnett Compton, ONS Updated 4 September 2007 BSPS – 12 September 2007.
1 Measuring Quality Issues Associated with Internal Migration Estimates Joanne Clements, Amir Islam, Ruth Fulton & Jane Naylor Demographics Methods Centre.
Inflation Report February Output and supply.
Register-based migration statistics and using additional administrative data sources Barica Razpotnik Statistical Office of the Republic of Slovenia UNECE.
Longitudinal Data Recent Experience and Future Direction August 2012.
General Register Office for S C O T L A N D information about Scotland's people Demographic Statistics 2005 Cecilia Macintyre General Register Office for.
1 Do UK higher education students overestimate their starting salary? John Jerrim Institute of Education, University of London.
Plausibility Ranges for Population Estimates Focusing on ranges for children.
General Register Office for S C O T L A N D information about Scotland's people BSPS Review of migration methods using health registrations Nick.
Interstate Statistical Committee of the Commonwealth of Independent States (CIS-Stat) WAYS AND MEANS OF IMPROVING THE QUALITY OF MIGRATION STATISTICS:
Projections of the Ethnic Minority Populations of Britain D.A. Coleman and M.D. Smith Department of Social Policy and Social Work, University of Oxford.
An Improved Method for Estimating Immigration to Local Authorities in England and Wales Nigel Swier British Society of Population Studies (BSPS) Conference.
DESTINATION MEASURES AND RAISING THE PARTICATION AGE REQUIREMENTS Simon Gentry Business Manager, Services for Young People, Education.
General Register Office for S C O T L A N D information about Scotland's people Comparison between NHSCR and Community health index sources of migration.
1 Understanding and Measuring Uncertainty Associated with the Mid-Year Population Estimates Joanne Clements Ruth Fulton Alison Whitworth.
UNSD/STATISTICS KOREA International Seminar on Population and Housing Censuses: Beyond the 2010 Round Seoul, November 2012 Beyond 2011: The future.
Modelling international migration to produce local level estimates Ruth Fulton Office for National Statistics.
Creating Open Data whilst maintaining confidentiality Philip Lowthian, Caroline Tudor Office for National Statistics 1.
Jonathan Smith and Cal Ghee Migration Statistics Improvement, ONSCD Centre for Demography Improving internal migration estimates of students.
1 1 Topics difficult to measure in a register-based census Harald Utne Census Project Statistics Norway UNECE-Eurostat Meeting on Population.
Beyond 2011 Administrative data sources and low-level aggregate models for producing population counts.
The 2011 Census: Estimating the Population Alexa Courtney.
UN ECE Seminar on New Frontiers for Statistical Data Collection 31 Oct – 2 Nov 2012 Beyond 2011 The future of population statistics Andy Teague, Office.
Management Information and Data Jon Carling Head of NERIP Michael Johnson Research Fellow IPPR North.
Using administrative data to produce official social statistics New Zealand’s experience.
Methodology of estimating the annual number of usual resident population in Latvia Baiba Zukula Deputy Director of Social Statistics Department Central.
Beyond 2011 Voluntary Sector Statistics User Event Minda Phillips Amelia Ash.
Developments in the measuring and reporting of international migration statistics Royal Statistical Society Sarah Crofts and Paul Vickers 6 December.
Sarah Crofts Head of Migration Analysis and Development of Sources
Dominik Rozkrut Central Statistical Office of Poland
Building on the Migration Statistics Improvement Programme
Beyond 2011 Administrative data sources and low-level aggregate models for producing population estimates.
Pete Benton , Beyond 2011 Programme Director
Presentation transcript:

Administrative Data Sources ONS Centre for Demography

Overview Assess statistical data quality of record level data ONS have access to: School Census Higher Education Statistics Agency Migrant Workers Scan Lifetime Labour Market Database Patient Registers Detailed knowledge of these data sets gained during phase 1 will inform their potential use in phase 2

Statistical Quality Measures Relevance - Purpose/ uses/ definitions Accessibility - Collection procedures Accuracy - Coverage/ completeness/ error Timeliness - Frequency/ lag Comparability - Legislative/ administrative changes Coherence - Common identifiers

School Census: Overview The School Census (SC) collects pupil level data on all pupils attending maintained schools in England, whilst this is only a subset of the population it collects a wide range of demographic information for each child.

School Census: Key Features All pupils attending maintained schools in England Annual ‘Snap shot’ Much validation is carried out by the schools software, LEA and DfE. Wide range of variables available to ONS Introduction of new variables Accuracy of variables very high and improving

School Census: Key Issues Extract as at January, unlike timing for MYEs Excludes Independent schools and several other institutions Reliant on pupil/parent to up-date school of any changes Pupil reference matching number not always unique No migrant identifier Pupil number not available on other sources

Higher Education Statistics Agency Data: Overview The Higher Education Statistics Agency (HESA) collect data on all students in Higher Education Institutions in the UK (full and part time, UK domiciled and overseas students) – we have access to data for E&W.

HESA: Key Features All students registered at a Higher Education Institution in England and Wales HESA also record number of students who complete studies each year by term-time local authority Wide range of variables available High quality data source Domicile and study addresses used to identify internal migration Linking to other sources to identify international student inflows

HESA: Key Issues Data by academic year, so not directly comparable to ONS estimates Some student groups excluded Term-time address assumed as at 30 th June Student identifier not always unique Student identifier not available on other sources

Migrant Workers Scan: Overview The Migrant Workers Scan (MWS) is a subset of data from the National Insurance Recording and PAYE System (NPS) and contains information on all overseas nationals who have registered for and been allocated a National Insurance Number (NINo).

MWS: Key Features All overseas nationals who are allocated a NINo Quarterly extracts Key demographic variables supplied MWS considered a high quality source New registrations used to identify international immigration Address change used to identify internal migration of overseas nationals

MWS: Key Issues Not all overseas nationals captured Registration lag Arrival date is self reported Address at first extract is a proxy for arrival address Address changes are self reported (not compulsory) Includes short and long term migrants arrivals No measure of outflows Encrypted NINo not available on other sources

Lifetime Labour Market Database (L2): Overview The Lifetime Labour Market Database (L2) is a 1% extract of data from the National Insurance Recording and PAYE System (NPS) DWP create the L2 database to inform policy decisions and expenditure forecasts on NI registrations, National Insurance contributions, benefit entitlement and take-up, employment & self employment trends.

L2: Key Features L2 has over 700,000 individuals in the 1% sample. Holds detailed records for each tax year from 1975 & summary data between 1948 & 1974 Each person may have the following data present: –Registration details (for all people) and arrival dates & country of origin (for migrants) –Individual employment level data in each tax year –Benefit spells & number of weeks credited with NI contributions –Self employed & Voluntary NI contributions –Periods of NI liability / Non-liability, Child Benefit & Tax Credits –Address details and histories L2 also has 100% of all employer PAYE details by tax year (from 1997 onwards).

L2: Data Recording Data on benefits are recorded by DWP. Data is communicated to NPS via the relevant benefit system or the Customer Information System (CIS) Data on employment is submitted to the NPS by employers via their PAYE P14 returns. The P14 data is validated by HMRC prior to recording on the NPS system Data on Self Employment is submitted by individuals via NI contribution or self assessment. HMRC generates Class 2 bills & receipts for the payment of NI contributions. Data on Tax Credits is recorded on NPS by HMRC after the Tax Credit award Data on Child Benefit is captured on NPS by HMRC when the claim for benefit starts and when a claim is amended or ceases. Address data is updated via any interaction with either a DWP or HMRC system.

Creation of L2 } * Selection of data on NPS – not a complete list of data tables Tax Credit claims & awards. Self employment & Self assessment data Employee & Employer PAYE data (P14 etc) Changes of address notified by employer or person Changes of name or circumstances Births & claims to Child Benefit NPS*DWP ProcessesL2 database Benefit Systems NINO registrations (including migrants) Benefit claims and awards } Data exchange Identification data PAYE & NI data Liabilities data Address data Employer PAYE data IBJSAetc 1% sample Load data Validate & split Remove Audit trail Add archived records from old L2 Derive analysis variables Postcode matching Anonymise 100% extract Postcode matching Anonymise 1% sample Extract 1% from 5% sample of benefit systems Anonymise Identification data (1%) PAYE & NI data (1%) Liabilities data (1%) Address data (1%) Employer PAYE data (100%) Benefit spell data (1%)

L2: Coverage, Advantages & Limitations Coverage –Holds records for anyone who has ever had a National Insurance number –Records all employment, self employment, tax credit & benefit activities since 1975 –Holds address histories for all people who interact with NPS or DWP. Advantages –Large sample compared to survey data, spanning over 30 years. –Not subject to survey anomalies and response rates –Highly accurate. The data is used for benefit calculation and entitlement & State Pension accrual and so is highly accurate at individual record level –Individual record level data allows cohort and longitudinal analysis of activities over time. (L2 is anonymised to comply with Data Protection) Limitations –Doesn’t hold contextual data (education, household status etc) –Not collected specifically for population and migration purposes, so not always defined in the same way as ONS survey data

L2: Data Relevance Allows activities within the tax year to be analysed, and assessment made of whether the person is resident in the year or not, based on the type of activity. Enables methodological testing of data at individual record level to assess the most suitable and robust method of generating population and migration estimates using administrative data, based on activities that leave a ‘footprint’ on government computer systems. Enables cohort analyses to monitor changes in interaction behaviours over time, and to make adjustments accordingly to improve methodologies and population estimates using L2 data Early results from L2 suggest that this data source will provide robust population and migration estimates that will be highly beneficial to the MSIP & Beyond 2011 projects

L2: Timelines & Next steps Timeliness –NPS is a live system receiving continual updates –L2 extracts are taken in January and May. The extract takes around 2 months to load & process –The L2 extract is generally provided at least 9 months after the last tax year being analysed. This is due to the time it takes employers to submit PAYE returns. Next Steps –Refine the current L2 population & migration methodology to reflect (as near as possible) ONS survey/census selection criteria –Investigate the possibility of a larger sample size for the Beyond 2011 project. This would allow more robust analysis at Local Authority level and would improve the accuracy of internal migration measures.

Patient Registers: Overview ONS create a Patient Register (PR) dataset by collating data from each Primary Care Organisation, who hold records of patients registered with an NHS doctor within England and Wales.

Patient Register – Key features Annual ‘mid-year’ snap-shot Captures basic demographic variables Has been used by ONS for a number of years (main source for measuring internal migration) Other potential uses: –Population stock measure –International migration (Flag 4) Covers all ages

Patient Register – Key Issues Some gaps in coverage Reliant on patient to update their record and to re-register with a doctor Migrants may delay registering List inflation List cleaning (ad-hoc) Duplicate NHS numbers NHS number not present in other sources

How will the administrative data sources be used in phase 2?

These data sources will be considered, either singularly, or in conjunction with other sources, in forming plausibility ranges for each LA by age & sex: –Patient Register –School Census –HESA –L2 –Electoral Roll? These data sources can be use as an indirect contribution for informing the quality of each other, and also in comparison to mid-year estimates Phase 2 – Reconciling Administrative Sources

Age Coverage OTHER DATA SOURCES ? / PATIENT REGISTER 45 SCHOOL CENSUS HESA LIFETIME LABOUR MARKET DATABASE (L2) Phase 2 – Reconciling Administrative Sources

Phase 2 – Migrant Distribution Aim to deliver an improved method for distributing long and short term migrants at Regional and Local Authority level Research how administrative sources can be used to distribute the national IPS figure

Distribution Methods (LTM) National Immigration Estimate (IPS) Region/UK country Intermediate Geography Local Authority Current*Proposed Distribute using 3 yr LFS average Use 3 yr IPS average Model-based distribution Distribute using administrative data * For non-London LAs. A similar, but slightly more complex method is used for London LAs

Outline Proposals Immigrant streamPotential Sources (Long-term migrants) Potential Sources (Short-term migrants) WorkersMWS &L2 (Workers) StudentsHESAHESA & data on English language schools? Children under 16Flag 4s, School Census (ages 5-16) School Census? Returning migrantsIPS uncalibrated, L2 population data, Census Data from the IPS is distributed to local level using a regression model approach based on a wide range of sources e.g. Flag 4s, IDBR, Population estimates by ethnic group Other migrantsFlag 4, L2 (claiming benefits)

Further Information More information on the MSIP can be found at: Initial feasibility report on use of MWS: reports/historical-updates-reports/updates-reports-09/initial-feasibility-report--- october-2009.pdf Research paper on use of School Census: reports/historical-updates-reports/updates-reports-09/research-paper-on-the- use-of-school-census-data-to-improve-population-statistics---october-2009.pdf

Questions ?

Phase 2 – Other uses Internal migration adjustments –MWS - Student adjustment – end of study moves –MWS - First onward moves –HESA – already used to improve internal migration adjustments –Compare internal moves of SC/MWS to PR Other uses of administrative data: MWS - NPP assumption MWS - Other migration research SC - Continue to quality assure mid-Year Estimates for 5 to 15 year olds in England and Wales SC - Pursuing access to Welsh microdata L2 – creating sex ratios for comparison L2 – assess feasibility of gaining a larger sample of NPS

Administrative Data Sources ONS Centre for Demography