Design of the sampling frame of the 2008 French survey on disability and health INSEE Loic Midy.

Slides:



Advertisements
Similar presentations
Data: Quantitative (Histogram, Stem & Leaf, Boxplots) versus Categorical (Bar or Pie Chart) Boxplots: 5 Number Summary, IQR, Outliers???, Comparisons.
Advertisements

Innovation data collection: Advice from the Oslo Manual South East Asian Regional Workshop on Science, Technology and Innovation Statistics.
Innovation Surveys: Advice from the Oslo Manual National training workshop Amman, Jordan October 2010.
SADC Course in Statistics Common complications when analysing survey data Module I3 Sessions 14 to 16.
Unit of EPIDEMIOLOGY SCIENTIFIC INSTITUTE OF PUBLIC HEALTH Comparison of HLY from different health surveys 4th Meeting of the Task Force on Health Expectancies.
Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data.
Research on Improvements to Current SIPP Imputation Methods ASA-SRM SIPP Working Group September 16, 2008 Martha Stinson.
Surveys Sample Size By R. Heberto Ghezzo Ph.D. R. Heberto Ghezzo Ph.D. Meakins-Christie laboratories McGill University - Montreal - Canada.
The estimation strategy of the National Household Survey (NHS) François Verret, Mike Bankier, Wesley Benjamin & Lisa Hayden Statistics Canada Presentation.
Logistic Regression.
N ational T ransfer A ccounts Data Review (Hands On) Amonthep Chawla East-West Center & Nihon University Population Research Institute.
Riku Salonen Regression composite estimation for the Finnish LFS from a practical perspective.
4. FREQUENCY DISTRIBUTION
1 Scottish Social Survey Network: Master Class 1 Data Analysis with Stata Dr Vernon Gayle and Dr Paul Lambert 23 rd January 2008, University of Stirling.
Maintaining high quality surveys with optimized interviewers replacements : the new French sample monitoring strategy Sébastien Faivre, INSEE, Head of.
15 de Abril de A Meta-Analysis is a review in which bias has been reduced by the systematic identification, appraisal, synthesis and statistical.
Health related quality of life and satisfaction with life in Croatia Gorka Vuletic, BPsychol, PhD Andrija Stampar School of Public Health Medical Faculty,
Chengbin Hu Obese and overweight: problem and how to control? Problem: More than 2 in 3 adults are considered to be overweight or obese. More.
A new sampling method: stratified sampling
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. How to Get a Good Sample Chapter 4.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western College Publishing/Thomson Learning.
An article by Ben Heather published by the Dominion Post raises awareness of the child poverty crisis. He states “A new rigorous report has found that.
Data collection on homelessness in statistical offices in France 1. National Institute of Statistics and Economic Studies (INSEE) 2. Statistical Office.
Survey Experiments. Defined Uses a survey question as its measurement device Manipulates the content, order, format, or other characteristics of the survey.
Aim: How do we find confidence interval? HW#9: complete question on last slide on loose leaf (DO NOT ME THE HW IT WILL NOT BE ACCEPTED)
1 CHAPTER 7 Homework:5,7,9,11,17,22,23,25,29,33,37,41,45,51, 59,65,77,79 : The U.S. Bureau of Census publishes annual price figures for new mobile homes.
C M Clarke-Hill1 Collecting Quantitative Data Samples Surveys Pitfalls etc... Research Methods.
MEASURING INCOME AND POVERTY AT A NATIONAL LEVEL Sian Rasdale Social Justice Analysis, Scottish Government.
Copyright 2010, The World Bank Group. All Rights Reserved. Estimation and Weighting, Part I.
Andrey Veykher National Research University «Higher School оf Economics» in St-Petersburg, Russia The representative study of households based on the data.
September 151 Screening for Disability Washington Group on Disability Statistics.
Measurement and analysis of household welfare: possible approaches using GGS data L. Ovcharova, A. Pishniak, D. Popova Independent Institute.
18b. PROC SURVEY Procedures in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Multivariate Data Analysis CHAPTER seventeen.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Sampling Class 7. Goals of Sampling Representation of a population Representation of a population Representation of a specific phenomenon or behavior.
EvergreenEcon.com ESA 2011 Impact Evaluation Draft Report Public Workshop #2 August 7, 2013 Presented By: Steve Grover, President.
15 may 2008 The validation of the french population census Olivier Lefebvre, Insee.
Applications of Integration 6. Probability Probability Calculus plays a role in the analysis of random behavior. Suppose we consider the cholesterol.
Today Ensemble Methods. Recap of the course. Classifier Fusion
Sampling Methods and Sampling Distributions
5-4-1 Unit 4: Sampling approaches After completing this unit you should be able to: Outline the purpose of sampling Understand key theoretical.
New sources – administrative registers Genovefa RUŽIĆ.
Learning Objectives Explain the role of sampling in the research process Distinguish between probability and nonprobability sampling Understand the factors.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Philippe Brion Johara Khélif Insee 28/04/2014 Questions raised by the implementation of the data editing device for French structural business statistics.
The challenge of a mixed-mode design survey and new IT tools application: the case of the Italian Structure Earning Surveys Fabiana Rocci Stefania Cardinleschi.
1 G Lect 13W Imputation (data augmentation) of missing data Multiple imputation Examples G Multiple Regression Week 13 (Wednesday)
Measuring Disability: Results from the 2001 Census and the 2001 Post-Censal Disability Survey Statistics Canada January 10, 2003.
5.3 – Solving Multi-Step Inequalities. *Just like solving equations*
Changes to the collection of short walk data in the NTS Glenn Goodman, DfT.
Statistics Canada Citizenship and Immigration Canada Methodological issues.
Copyright © 2009 Pearson Education, Inc. 8.1 Sampling Distributions LEARNING GOAL Understand the fundamental ideas of sampling distributions and how the.
Armando Teixeira-Pinto AcademyHealth, Orlando ‘07 Analysis of Non-commensurate Outcomes.
SECTION 7.2 Estimating a Population Proportion. Practice  Pg  #6-8 (Finding Critical Values)  #9-11 (Expressing/Interpreting CI)  #17-20.
An assessment of the robustness of weights in the Famille et Employeurs survey Nicolas Razafindratsima & Elisabeth Morand.
Chapter Seventeen Copyright © 2004 John Wiley & Sons, Inc. Multivariate Data Analysis.
MOROCCAN EXPERIENCE ON DISABILITY STATISTICS THE KINGDOM OF MOROCCO HIGH COMMISSION OF PLANNING BY ZINEB EL OUAZZANI TOUAHAMI Statistician Engineer Directorate.
Review Statistical inference and test of significance.
A framework for multiple imputation & clustering -Mainly basic idea for imputation- Tokei Benkyokai 2013/10/28 T. Kawaguchi 1.
Weighting and reweighting in surveys
Workshop on Residential Property Price Indices
The European Statistical Training Programme (ESTP)
STAT262: Final Project
French recent surveys on disability A few lessons
l l l l l l l l l l l 18.
Metadata used throughout statistics production
Chapter 5: The analysis of nonresponse
Variable Storage Memory Locations (Logical) Variable Classes Stack
Presentation transcript:

design of the sampling frame of the 2008 French survey on disability and health INSEE Loic Midy

 Problem to solve: How can we get a high number of people with disability in the sample of our survey?  Our solution: we do a two phase survey: –Step 1: we did in 2007 a short survey called VQS on households –Step 2: we classify VQS respondants into stratums of assumed severity of disability –Step 3 : the sample of the disability survey (size ) is a stratified sampling on those stratums

This methodology will be of good quality only if all the 3 steps have themselves a good quality. This raises 3 issues tackled in this presentation  Issue 1 (linked to step 1): how should we design the short set of questions of VQS in order to detect people with disability?  Issue 2 (linked to step 2): how should we do the classification of VQS respondants into stratums of severity of disability?  Issue 3 (linked to step 3):what rules should the stratified sampling of the disability survey follow?

Issue 1: design of VQS questions Global philosophy: disability\handicap is a multi dimensionnal issue. Backed by a study of the 1999 VQS data on the people aged 16 or more: 2,4% persons declared a global limitation for 6 months or more but not a single functionnal limitations (7 questions). 13,3% persons declared a least one functionnal limitation but no global limitation for 6 months or more. 14% of the persons who have an official recognition of a disability\ handicap don’t think they have a disability\ handicap !!!

Issue 1 (2) Dimensions included in the 2007 VQS  Questions on health (mini module 1 and 2)  Question on limitation (mini module 3)  Questions on functionnal limitations  Questions on help (human, technical, and housing adaptations)  Question on self perception of handicap  Question on official recognition of handicap

Issue 1 (3) the intellectual and psychical handicap were not well detected with the 1999 VQS questions: 48% of HID respondants who declared that they often put themselves in danger answered NO to all 1999 VQS questions. 63% of HID respondants who declared that people often reproached them to be to agressive or to impulsive answered NO to all 1999 VQS questions.

Issue 1 (4) => we added 4 questions to detect those handicap. Those 4 questions where asked in another survey on adults asking an official recognition of handicap where the real impairment is available (coded by a general practitionner using the medical files)

Issue 1 (5) questionsIntellectual impairment Psychical impairment sensibi lity specifi city sensibi lity specifi city Concentration0,540,590,580,64 Memory0,590,500,620,53 initiatives0,680,610,600,65 Problem solving0,720,650,540,67 A least one positive answer 0,870,330,840,37

Issue 2 We used 2 methods to classify VQS respondant: One is based on a score The second is a mixed classification

Issue 2: score Principle:  A note is assigned to each answer. Ex: question How is your health in general –Answer very bad=>5 points –Answer bad=>3 points  Final score for a person=(sum of note)\ (maximum sum of note)*100 => partial non response taken into account  Allocation of each person to a stratum according to his score (ex: if score>80 => stratum 4). –Arbitrary threshold –OR hierachical classification on a sample of persons on variable final score

Issue 2 (2) How can we assign a weight to each answer? In 1999 we had a VQS survey and then a disability survey called HID. We detect the VQS 1999 questions explaining the best one HID variable by a logistic regression => Gives an idea of the most important VQS questions BUT not perfect nor complete because VQS 1999 and VQS 2007 are not all the same (questions added, deleted, reformulated)

Issue 2 : mixed classification The steps are:  Basic imputation on missing data : non response => imputation answer NO.  Analysis of multiples correspondancies (ACM)  Classification (mobile centers méthod: SAS: FASTCLUS) on the firsts axes of the ACM=> clusters  Hierarchical classification (SAS: CLUSTER) on the clusters

Issue 2 (2): comparisons (temporary data)

Issue 3  what rules should the stratified sampling of the disability survey follow?  We have 4 stratums of severity of handicap from 1 (no handicap at all) to 4 (important handicap).  We sample 100% of people in stratum 4. The main question is which percentage of people should we sample in stratum 1?

Issue 3 (2) We should estimation confidence interval on variable of interest to answer BUT is this case the calculus are very difficult=> study of the dispersion of the weight.

Estimation for 2008 disability survey (temporary data) Data from the 1999 disability survey Quantile Estimate of weights 100% Max % Max % % % % % % % Q % Q % Median % Median % Q % Q % 29310% 188 5% 2215% 110 1% 1031% 51 0% Min 290% Min 28