Canadian Community Health Survey A new program for collecting health information Interuniversity Research Data Seminar University of British Columbia Béland.

Slides:



Advertisements
Similar presentations
Multiple Indicator Cluster Surveys Survey Design Workshop
Advertisements

Labour Force Historical Review Sandra Keys, University of Waterloo DLI OntarioTraining University of Guelph, Guelph, ON April 12, 2006.
National Population Health Survey 2004 Congress of the Humanities and Social Sciences All Congress Symposium - A Journey through data: the riches of the.
An Assessment of the Impact of Two Distinct Survey Design Modifications on Health Insurance Coverage Estimates in a National Health Care Survey Steven.
An example of longitudinal LFS weights
Determining How to Select a Sample
Brian A. Harris-Kojetin, Ph.D. Statistical and Science Policy
SAMPLING.
Sampling Strategy for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
Riku Salonen Regression composite estimation for the Finnish LFS from a practical perspective.
Multiple Indicator Cluster Surveys Survey Design Workshop
Canada’s Health by Region By: Jack Wei and Colin McClenaghan.
Meeting the Challenge The National Population Health Survey and Data Access E. Hamilton UNB Libraries IASSIST 2003.
Canadian Community Health Survey
Population Health Surveys at STC Prepared for: B.C. Research Data Centre Date: Nov. 15, 2000.
Sample Design (Click icon for audio) Dr. Michael R. Hyman, NMSU.
Providing Context for Understanding The Data Life Cycle and the National Population Health Survey E. Hamilton IASSIST 2005.
Canadian Community Health Survey Cycle 1.1 Overview of methodological issues and more...
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
2014 Survey on Living with Chronic Diseases in Canada (SLCDC): Mood & Anxiety Disorders National Mental Health and Addictions Information Collaborative.
A new sampling method: stratified sampling
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
08/08/2015 Statistics Canada Statistique Canada Paradata Collection Research for Social Surveys at Statistics Canada François Laflamme International Total.
The Weighting Strategy of the Canadian Community Health Survey Cathlin Sarafin Methodologist Statistics Canada March 25, 2008.
Operational and Methodological Lessons Learned from the 2003 Joint Canada/U.S. Survey of Health Catherine Simile, Ph.D. National Center for Health Statistics.
Producing migration data using household surveys Experience of the Republic of Moldova UNECE Work Session on Migration Statistics, Geneva, October.
Joint Canada/U.S. Health Survey Catherine Simile, National Center for Health Statistics Patrice Mathieu, Statistics Canada Ed Rama, Statistics Canada NCHS.
18/08/2015 Statistics Canada Statistique Canada Responsive Collection Design (RCD) for CATI Surveys and Total Survey Error (TSE) François Laflamme International.
National Household Survey: collection, quality and dissemination Laurent Roy Statistics Canada March 20, 2013 National Household Survey 1.
Sample Design.
CANADIAN COMMUNITY HEALTH SURVEY Data and Products Sylvie Lafortune Laurentian University DLI Spring Meeting (ON) April 13, 2010.
Household Surveys ACS – CPS - AHS INFO 7470 / ECON 8500 Warren A. Brown University of Georgia February 22,
Canadian Community Health Survey (CCHS) – Healthy Aging Data Liberation Initiative Webinar Leslie Geran Health Statistics Division, Statistics Canada April.
ICVS IN SLOVENIA Tatjana Škrbec. Content of presentation  Short history  Crime victim survey 2001 within SORS  Methodology and content of questionnaire.
STATISTICSSTATISTIQUECANADA Aboriginal Labour Force Survey Province of Alberta.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
6. Implications for Analysis: Data Content. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2.
Sampling Techniques LEARNING OBJECTIVES : After studying this module, participants will be able to : 1. Identify and define the population to be studied.
Chapter Nine Copyright © 2006 McGraw-Hill/Irwin Sampling: Theory, Designs and Issues in Marketing Research.
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
Health Statistics Information on STC website Calgary–DLI training–Dec 2003 Michel B. Séguin, Statistics Canada,
Data and Social Research Chuck Humphrey Data Library Rutherford North Library.
Collecting Electronic Data From the Carriers: the Key to Success in the Canadian Trucking Commodity Origin and Destination Survey François Gagnon and Krista.
1 The 2001 Census PUMFS Odyssey Sponsored by HAL and PALS Presented by Chuck Humphrey.
New National Approaches to Immigrant Health Assessment M. DesMeules, J. Gold, B. Vissandjée, J. Payne, A. Kazanjian, D. Manuel Health Canada, University.
The Census of Canada and Immigration & Ethno-cultural Data Chuck Humphrey University of Alberta February 10, 2006.
Population and Sampling
May 27, 2014 Surveillance and Analysis Division Centre for Chronic Disease Prevention Public Health Agency of Canada Positive Mental Health in Canada.
Current Population Survey Sponsor: Bureau of Labor Statistics Collector: Census Bureau Purpose: Monthly Data for Analysis of Labor Market Conditions –CPS.
National design, fieldwork and data harmonization for Labour Force Survey Irena Svetin Statistical Office of the Republic of Slovenia September 2014.
October 28-30, 2009 UNECE Geneva Quality Assessment of 2008 Integrated Census - Israel Pnina ZADKA Central Bureau of Statistics Israel.
Copyright 2010, The World Bank Group. All Rights Reserved. Reducing Non-Response Section B 1.
Chapter Eleven The entire group of people about whom information is needed; also called the universe or population of interest. The process of obtaining.
Household Surveys: American Community Survey & American Housing Survey Warren A. Brown February 8, 2007.
Lynn Lethbridge SHRUG November, What is Bootstrapping? A method to estimate a statistic’s sampling distribution Bootstrap samples are drawn repeatedly.
Introduction to Survey Sampling
Statistics Canada Citizenship and Immigration Canada Methodological issues.
CASE STUDY: NATIONAL SURVEY OF FAMILY GROWTH Karen E. Davis National Center for Health Statistics Coordinating Center for Health Information and Service.
1 of 22 INTRODUCTION TO SURVEY SAMPLING October 6, 2010 Linda Owens Survey Research Laboratory University of Illinois at Chicago
Anticipating Great Things: A 2006 Census Preview June, 2006 DLI, Ottawa, ON Paul Schwets // Stuart Fyffe.
University of Warwick, Department of Sociology, 2012/13 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Survey Design: Some Implications for.
RESEARCH METHODS Lecture 28. TYPES OF PROBABILITY SAMPLING Requires more work than nonrandom sampling. Researcher must identify sampling elements. Necessary.
Sampling Design and Procedure
Sampling Chapter 5. Introduction Sampling The process of drawing a number of individual cases from a larger population A way to learn about a larger population.
Statistics Canada National Population Health Surveys (NPHS) Amir Erfani, PhD. Department of Sociology Nipissing University North Bay,
Health Statistics Division
Telling Canada’s story in numbers Marie-Josée Major
Chapter 5: The analysis of nonresponse
Presentation transcript:

Canadian Community Health Survey A new program for collecting health information Interuniversity Research Data Seminar University of British Columbia Béland Yves Household Survey Methods Division Statistics Canada February 19, 2002

Presentation Outline &Health Information Roadmap –Origin of the CCHS –Objectives / Content –CCHS two-year plan &CCHS Cycle Sample Design –Allocation, frame –Selection - Oversampling –Data Collection –Imputation –Weighting, sampling error –Bootstrap Variance Estimation –Data Quality –Data Dissemination &CCHS Cycle Overview &Future Cycles of CCHS

Health Information Roadmap &Four-year action plan to strengthen Canada’s health information system &Earmarks funds for specific priorities/activities based on national vision and provincial/regional consultations &Partners: Health Canada, Canadian Institute on Health Information (CIHI) and Statistics Canada &Key elements: –fill critical data gaps in health services and address population health data gaps at a sub-provincial level –foster common data and technical standards –develop indicators and conduct special studies

Canadian Community Health Survey Results of the Consultation Process &Assess health measure variations at many levels of geography &Collect data on issues unique to a health region or province &Respond quickly to emerging issues &Explore certain key health issues in-depth &Analyse the effects of shocks including policy changes

Canadian Community Health Survey Two-year Plan &Cycle Health region-level survey –Produce reliable estimates for sub-provincial areas –Continuous monthly collection : Sept Nov –Sample size : 133,300 respondents –Questionnaire content health determinantshealth determinants health statushealth status utilization of health servicesutilization of health services socio-demographic / socio-economic characteristicssocio-demographic / socio-economic characteristics &Cycle Provincial-level survey –Produce reliable provincial estimates from a sample of 30,000 respondents –Monthly collection : May Dec –In-depth focus content: minute interviews on mental health and well-being

CCHS and NPHS A More Robust Health Survey Program &CCHS –cross-sectional –sample of 160,000 respondents over two years –national, provincial and regional level estimates –customized questionnaires at regional level –built-in flexibility for buy- in sample and/or content –continuous development of in-depth health content &NPHS - Household –« goes longitudinal » only, starting in wave 4 –sample of 20,000 persons –national and provincial level estimates &NPHS - Health Care Institutions –longitudinal and cross-sectional –sample of 2,500 –national level estimates

CCHS - Cycle 1.1 Health Region-level survey &Produce timely cross-sectional estimates for 136 health regions &Target population –individuals living in private occupied dwellings aged 12 years old or over –Exclusions: those living on Indian Reserves and Crown Lands, residents of institutions, full-time members of the Canadian Armed Forces and residents of some remote areas &CCHS 1.1 covers ~98% of the Canadian population

CCHS - Questionnaire content &45-minute interview questionnaire –30 minutes of common modules common to all health regions –10 minutes of optional items selected by health regions from a predefined list of modules –5 minutes of standard socio-economic items & 27 different versions of the questionnaire The complete questionnaire can be found at

CCHS - Sample Allocation to Provinces ProvPop # of 1st Step2nd StepTotal Size HRs500/HRX-propSample NFLD551K 6*2,7801,2304,010 PEI135K 21,0001,0002,000 NS909K 63,0002,0405,040 NB738K 73,5001,6505,150 QUE7,139K 168,00016,28024,280 ONT10,714K 3718,50023,76042,260 MAN1,114K 115,5002,5008,000 SASK990K 11*5,4002,3207,720 ALB2,697K 17*8,1506,05014,200 BC3,725K 2010,0008,09018,090 CAN29,000K 13365,83064,920130,750 * The sampling fraction in some small HRs was capped at 1 in 20 households

CCHS - Sample Allocation to Health Regions Pop. Size# of Mean RangeHRsSample Size Smallless than 75, Medium75, , Large240, , ,500 X-Large640,000 and more7 2,500

CCHS - Sample Allocation to Territories Population Sample Yukon25, NWT36, Nunavut22,000800

CCHS - Sample Frame &CCHS sample selected from three frames: Area frame (Labour Force Survey structure)Area frame (Labour Force Survey structure) RDD frame of telephone numbers (Random Digit Dialling)RDD frame of telephone numbers (Random Digit Dialling) List frame of telephone numbersList frame of telephone numbers Three frames are needed for CCHS for the following reasons: 1. To yield the desired sample sizes in all health regions 2. Have a telephone data collection structure in place to quickly address provincial/regional requests for buy-in sample and/or content at any point in time 3. Optimize collection costs

Area frame - Sampling of households &83% of CCHS sampled households &Stratified multistage sample design Stratum #1 Stratum #2 #1: Each health region is divided into strata #2: Clusters selected within strata (PPS sampling) (1st stage)        #3: Dwellings selected within clusters (2nd stage)  

RDD frame of telephone numbers Sampling of households &Elimination of non-working banks method –7% of CCHS sampled households –Telephone bank: area code + first 5 digits of a 7-digit phone # 1- Keep the banks with at least one valid phone # 2- Group the banks to encompass as closely as possible the health region areas - RDD strata 3- Within each RDD stratum, first select one bank at random and then generate at random one number between 00 and Repeat the process until the required number of telephone numbers within the RDD stratum is reached

List frame of telephone numbers Sampling of households &Simple random sample of telephone numbers –10% of CCHS sampled households –Telephone companies’ billing address files and Telephone Infobase (repository of phone directories) 1- Create a list of phone numbers 2- Stratify the phone numbers by health region using the residential postal codes 3- Select phone numbers at random within a health region 4- Repeat the process until the required number of telephone numbers is reached

CCHS - Sampling of persons &Area frame  SRS of one person aged 12 years of age or older (82% of households)  SRS of two persons aged 12 years of age or older (18%) &RDD / List frames  SRS of one person aged 12 years of age or older

CCHS - Sampling of persons Age1996LFS* CCHS groupCensussamplesimulated (all persons) sample ( only 1 person) * averaged distribution over 100 repetitions using the May 99 LFS sample

CCHS - Representativity of sub-populations To address users’ needs, two sub-population groups needed larger effective sample sizes: & &Youths (12-19 years old) – –Decision > Oversample youths by selecting a second person (12-19) in some households based on their composition & &Elderlies (65 years old and +) – –Decision > Do not oversample - let the general sample selection process address the issue by itself

Sampling strategy based on household composition Number of persons aged 20 or over Number of persons aged 20 or over Number of Number of AAAAB 1 1AACCCB 2 2ACCCCC 3+ 3+ACCCCC A: SRS of one person aged 12+ B: SRS of two persons aged 12+ C: SRS of one person in the age group and SRS of one person 20+

CCHS - Sample Distribution after Oversampling Age1996* CCHS* CCHS groupCensussimulated simulated samplesample ( only 1 person)( some 2 persons) * averaged distribution over 100 repetitions using the May 99 LFS sample

CCHS - Initial data collection plan &12 monthly samples &12 collection months + 1 Area frame !CAPI !STC field interviewers !targeted response rate: 90% !anticipated vacancy rate: 13% (09 / / 2001) + 09 / 2001 RDD / List frames !CATI !STC call centres !targeted response rate: 85% !telephone hit rate: 15-60%

CCHS data collection - Observed situation &Field interviewers –workload exceeded field staff capacity &Call centres –new collection infrastructure –unequal allocation of work among call centres

CCHS - Final response rates Field Call centres Total Field Call centres Total NFLD PEI NS NB QUE ONT MAN SASK ALB BC YUK NWT NUN* CAN

CCHS - Proxy interviews &Higher number of proxy interviews than expected –~ 6% instead of 2-3% &Major consequence: one third of the questionnaire is missing which could be proble- matic for small health regions &Solution : Imputation

CCHS - Imputation & 3-step strategy –common modules / mental health related optional modules / other optional modules & more than 2,000 imputation classes (region, age, sex, questionnaire type, skip patterns, etc…) & hot-deck imputation using nearest neighbour approach according to key characteristics

CCHS - Weighting and Estimation & &Three separate weighting systems: – –Area frame design – –RDD frame design – –List frame design & &Several adjustments – – non-response (household and person) – – seasonal factor – – etc... & &Integration of the two weighting systems based on Deffs & &Calibration using a one-dimensional poststratification adjustment of ten age/sex poststrata within each health region & &Variance estimation : bootstrap re-sampling approach – –set of 500 bootstrap weights for each individual

CCHS Weighting Strategy

Weighting & Estimation

CCHS - Special Weights & &For various reasons, many other weights are produced – – Quarter 4 special weight – – PEI special weight – – Share weights (master, Q4 and PEI special) – – Link weights (master, Q4 and PEI special)

Sampling Error &Difference in estimates obtained from a sample as compared to a census &The extent of this error depends on four factors: –sample size –variability of the characteristic of interest –sample design –estimation method &Generally, the sampling error decreases as the size of the sample increases

Sampling Error &Measure of precision, reliability of the estimates –Variance (standard deviation) –Coefficient of variation Standard deviation of estimate x 100% / estimate itselfStandard deviation of estimate x 100% / estimate itself CV allows comparison of precision of estimates with different scalesCV allows comparison of precision of estimates with different scales –Example: 24% of population are daily smokers, std dev. = % of population are daily smokers, std dev. = CV=0.003/0.24 x 100%=1.25%CV=0.003/0.24 x 100%=1.25%

Sampling Variability Guidelines Type of estimate CVGuidelines Acceptable General unrestricted release Marginal General unrestricted release but with warningcautioning users of the high sampling variablitity. Should be identified by letter M. Unacceptable> 33.3 No release. Should be flagged with letter U. Should be flagged with letter U.

Sampling Error &Measuring sampling error for complex sample designs: –Simple formulas not available –Most software packages do not incorporate design effect (and weights adjustments) appropriately for calculations –Solution for CCHS: the Bootstrap method

Bootstrap method &Principle: –You want to estimate how precise is your estimation of the number of smokers in Canada –You could draw 500 totally new CCHS samples, and compare the 500 estimations you would get from these samples. The variance of these 500 estimations would indicate the precision. –Problem: drawing 500 new samples is $$$ –Solution: Use your sample as a population, and take many smaller subsamples from it.

Bootstrap method T = 40 Var =  (B i - B) 2 / 499 &How CCHS Bootstrap weights are created (the secret is now revealed!!!)

Bootstrap Method &How Bootstrap replicates are built (cont’d) lThe “real” recipe 1- Subsampling of clusters (SRS) within strata 2- Apply (initial design) weight 3- Adjust weight for selection of n-1 among n 4- Apply all standard weight adjustments (nonresponse, share, etc.) 5- Post-stratification to population counts lThe bootstrap method intends to mimic the same approach used for the sampling and weighting processes

Bootstrap Method &Sampling weight vs. Bootstrap weights –Sampling weight used to compute the estimation of a parameter (e.g.: number of smokers) –Bootstrap weights used to compute the precision of the estimation (e.g.: the CV of the number of smokers estimation)

CCHS - Data Dissemination Strategy & &Wide range of users and capacity – –136 health regions – –13 provincial/territorial Ministries of Health – –Health Canada and CIHI – –Internal STC analysts – –Academics – –Others & &Data products – –Microdata – –Analytical products (Health Reports, How Healthy are Canadians, etc…) – –Tabular statistics (ePubs, Cansim II, community profiles, etc…) – –Client support (head and regional offices, CCHS website, workshops, etc…)

CCHS - Access to microdata & &Master file – –all records, all variables Statistics Canada university research data centres remote access & &Share / Link files – –respondents who agreed to share / link provincial/territorial Ministries of Health health regions (through the STC third-party share agreement) & &Public Use Microdata File (PUMF) – –all records, subset of variables with collapsed response categories free for 136 health regions cost recovery for others

CCHS - Overview of Cycle 1.2 & &Produce provincial cross-sectional estimates from a sample of 30,000 respondents & &Area frame sample only / one person per household & &CAPI only & & minute in-depth interviews on mental health and well- being based on WMH2000 questionnaire & &Scheduled to begin collection in May 2002

CCHS - Future Plans & &Same two-year cycle approach: – –health region level survey starting in January 2003 – –provincial level survey starting in January 2004 & &New consultation process with provincial and regional authorities & &Flexible sample designs (adaptable to regional needs) & &Development of an in-depth nutrition focus content (Cycle 2.2)

CCHS Web site

Contacts in Methodology &Yves Béland: &François Brisebois: