The Ontario Cancer Risk Factor Surveillance Program Michael Spinks Senior Research Analyst Cancer Care Ontario at 5 th Annual RRFSS Workshop Institute.

Slides:



Advertisements
Similar presentations
Calculation of Sampling Errors MICS3 Regional Workshop on Data Archiving and Dissemination Alexandria, Egypt 3-7 March, 2007.
Advertisements

Calculation of Sampling Errors MICS3 Data Analysis and Report Writing Workshop.
Multiple Indicator Cluster Surveys Survey Design Workshop
Variance Estimation in Complex Surveys Third International Conference on Establishment Surveys Montreal, Quebec June 18-21, 2007 Presented by: Kirk Wolter,
Qualitative predictor variables
9. Weighting and Weighted Standard Errors. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Estimates and sampling errors for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
Results Introduction Tobacco use is the leading preventable cause of death in Wisconsin and the United States. Given the risk of smoking initiation during.
Riku Salonen Regression composite estimation for the Finnish LFS from a practical perspective.
Chapter 13 Multiple Regression
© 2003 Prentice-Hall, Inc.Chap 14-1 Basic Business Statistics (9 th Edition) Chapter 14 Introduction to Multiple Regression.
Why sample? Diversity in populations Practicality and cost.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Joint Canada/U.S. Health Survey Catherine Simile, National Center for Health Statistics Patrice Mathieu, Statistics Canada Ed Rama, Statistics Canada NCHS.
CANADIAN COMMUNITY HEALTH SURVEY Data and Products Sylvie Lafortune Laurentian University DLI Spring Meeting (ON) April 13, 2010.
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
Aspects of the National Health Interview Survey (NHIS) Chris Moriarity National Conference on Health Statistics August 16, 2010
Complexities of Complex Survey Design Analysis. Why worry about this? Many government studies use these designs – CDC National Health Interview Survey.
Inference for regression - Simple linear regression
A Profile of Health among Massachusetts Adults: Highlights from the Massachusetts Behavioral Risk Factor Surveillance System (BRFSS) Health Survey.
1 CHAPTER 7 Homework:5,7,9,11,17,22,23,25,29,33,37,41,45,51, 59,65,77,79 : The U.S. Bureau of Census publishes annual price figures for new mobile homes.
Definitions Observation unit Target population Sample Sampled population Sampling unit Sampling frame.
Understanding Multivariate Research Berry & Sanders.
Deanna E. White, Adam Stevens, John Barbaro, Kristy McGill and Lynne Russell.
“Analyzing Health Equity Using Household Survey Data” Owen O’Donnell, Eddy van Doorslaer, Adam Wagstaff and Magnus Lindelow, The World Bank, Washington.
1/26/09 1 Community Health Assessment in Small Populations: Tools for Working With “Small Numbers” Region 2 Quarterly Meeting January 26, 2009.
From Surveys to Surveillance Time Series Analysis Eric Holowaty, Sr. Scientist, Informatics Unit, CCO Michael Spinks, Sr. Res. Assoc., CCO. Under Construction.
18b. PROC SURVEY Procedures in SAS ®. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
Design Effects: What are they and how do they affect your analysis? David R. Johnson Population Research Institute & Department of Sociology The Pennsylvania.
Secondary Data Analysis Linda K. Owens, PhD Assistant Director for Sampling and Analysis Survey Research Laboratory University of Illinois.
1 Things That May Affect Estimates from the American Community Survey.
Tobacco Use Supplement to the Current Population Survey User’s Workshop June 9, 2009 Tips and Tricks Analyzing TUS-CPS Data Lloyd Hicks Westat.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
JENNIFER SAYLOR, PHD, RN, ANCS-BC UNIVERSITY OF DELAWARE SEPTEMBER 14, 2012 Essentials of Complex Data Analysis Utilizing National Survey.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Bangkok,
Lohr 2.2 a) Unit 1 is included in samples 1 and 3.  1 is therefore 1/8 + 1/8 = 1/4 Unit 2 is included in samples 2 and 4.  2 is therefore 1/4 + 3/8 =
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
MEPS WORKSHOP Household Component Survey Estimation Issues Household Component Survey Estimation Issues Steve Machlin, Agency for Healthcare Research and.
5-4-1 Unit 4: Sampling approaches After completing this unit you should be able to: Outline the purpose of sampling Understand key theoretical.
Today - Messages Additional shared lab hours in A-269 –M, W, F 2:30-4:25 –T, Th 4:00-5:15 First priority is for PH5452. No TA or instructor Handouts –
Core Indicators- PHAC Project (July 4, 2008-July 4, 2009): “Core Indicators for Chronic Disease and Risk Factor Surveillance” Harleen Sahota December 5,
Acute and Chronic Disability Among US Farmers and Pesticide Applicators: The National Health Interview Survey O Gómez-Marín, D Zheng, W LeBlanc, D Lee,
Introduction to Secondary Data Analysis Young Ik Cho, PhD Research Associate Professor Survey Research Laboratory University of Illinois at Chicago Fall,
Lynn Lethbridge SHRUG November, What is Bootstrapping? A method to estimate a statistic’s sampling distribution Bootstrap samples are drawn repeatedly.
Quality control & Statistics. Definition: it is the science of gathering, analyzing, interpreting and representing data. Example: introduction a new test.
Introduction to Survey Sampling
Analytical Example Using NHIS Data Files John R. Pleis.
Lecture 4 Ways to get data into SAS Some practice programming
Statistics Canada Citizenship and Immigration Canada Methodological issues.
CASE STUDY: NATIONAL SURVEY OF FAMILY GROWTH Karen E. Davis National Center for Health Statistics Coordinating Center for Health Information and Service.
1 of 22 INTRODUCTION TO SURVEY SAMPLING October 6, 2010 Linda Owens Survey Research Laboratory University of Illinois at Chicago
Statistical Weights and Methods for Analyzing HINTS Data HINTS Data Users Conference January 21, 2005 William W. Davis, Ph.D. Richard P. Moser, Ph.D. National.
Sample Design of the National Health Interview Survey (NHIS) Linda Tompkins Data Users Conference July 12, 2006 Centers for Disease Control and Prevention.
Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
Appropriate use of Design Effects and Sample Weights in Complex Health Survey Data: A Review of Articles Published using Data from Add Health, MTF, and.
Statistics Canada National Population Health Surveys (NPHS) Amir Erfani, PhD. Department of Sociology Nipissing University North Bay,
Cancer Risk Factors in Ontario Alcohol. Proportion of cancer cases attributable to alcohol consumption, by sex and cancer type, Ontario, Cancer.
Cancer Risk Factors in Ontario Healthy Weights, Healthy Eating and Active Living.
ESTIMATION.
Working with the ECLS-B Datasets Weights and other issues.
Analysis of Complex Sample Data
Introduction to Survey Data Analysis
Estimation of Sampling Errors, CV, Confidence Intervals
LAMAS Working Group 7-8 December 2015
Random sampling Carlo Azzarri IFPRI Datathon APSU, Dhaka
New Techniques and Technologies for Statistics 2017  Estimation of Response Propensities and Indicators of Representative Response Using Population-Level.
Presentation transcript:

The Ontario Cancer Risk Factor Surveillance Program Michael Spinks Senior Research Analyst Cancer Care Ontario at 5 th Annual RRFSS Workshop Institute for Social Research, York University June, 2006

Contents Risk Factor Surveillance at CCO CCO analysis of RRFSS data Generating complex survey estimates using SPSS Risk factor indicator inference and trends CCO Risk Factor Surveillance Reporting System Next Steps

CCO Cancer Risk Factor Surveillance System CCO is very supportive of RRFSS Risk Factor Surveillance Project established at CCO Important to liaise with suppliers and users of risk factor data

Risk Factor Surveillance Methodology Data Sources RRFSS (monthly survey, available in 6 weeks) CCHS (annual survey, available in 6 months) Other Survey and Related Data (OHS, NPHS, OBSP, SHAPES, OSDUS) Population Estimates and Projections Census data

Risk Factor Surveillance Methodology Indicator Development Cancer 2020 project Review of indicator definitions from other agencies – CWIG(APHEO), RRFSS, Statcan, camh Develop indicators using flow diagrams and existing survey data Indicator refinement and standardization (Beth Theis – CCO representative on CWIG)

Current Risk Factor Indicators by Survey IndicatorCCHSRRFSS Adult smoking Teen smoking Quitting smoking Exposure to 2 nd hand smoke Adult obesity Teen obesity Physical activity Alcohol consumption Fruit & Vegetable Intake Mammography screening Cervical screening Colorectal screening Sun safety Tanning equipment usage

Risk Factor Surveillance Methodology Survey Analysis Review Single-stage sampling - random selection of individuals from the population is sampled - for a simple random sample, each sample of a given size is equally likely to be selected from the population - each individual has the same probability of being selected - computation of point and variance estimates relatively straightforward Multistage sampling - units at the first stage are clusters of individuals (or clusters of smaller clusters) - mainly used for cost and logistical reasons - individuals have unequal probabilities of being selected - variability or estimates greater compared with simple random sample of same size - computations of point and variance estimates more complex

Risk Factor Surveillance Methodology RRFSS Survey Design At provincial level RRFSS considered to be a multistage cluster sample design stage 1 cluster (PHU) and stage 2 cluster (household)

PHU and CCO Weighting Procedures What is the sampling weight - each individual represents other persons not in sample - computed as the inverse of the inclusion probability - used to obtain unbiased estimates of risk factor indicators Sample weight used by PHU (monthly/annual) - inclusion probability of selecting an adult member from sample of households - weights total to number of respondents in sample Sample weight used by CCO (annual) - inclusion probability of selecting an adult member in the population - adjusted so each month is equally represented - adjusted to size of population age/sex structure - weights total to number of adults in population

Respondents by PHU and Wave, 2004 Number of respondents vary slightly by month

Comparison of Estimates – PHU and CCO Point estimates - Both methods yield almost identical point estimates Variance estimates - Assuming simple random sampling (PHU) - Taylor’s series linearization (CCO) - Bootstrap resampling (CCHS) - Jack-knife resampling - Balanced half-sample

Comparison of estimates - PHU and CCO Approaches PHU approach underestimates variance of multistage survey design

Comparison of estimates - PHU and CCO Approaches Was the percentage of smokers in Durham significantly lower in 2003 than in 2001?

Tools for computing estimates from complex surveys SAS (CCO) – proc surveyfreq, surveymeans, surveyreg, surveylogistic SPSS (PHU) - CSPlan then - CSDescriptives, CSTables, CSTabulate, CSGLM, CSLogistic Sudaan – proc crosstab, descript, ratio, regress, logistic Stata – svyset, then svy: mean, proportion, ratio, total, regress, logit, etc.

* Analysis Preparation Wizard. CSPLAN ANALYSIS /PLAN FILE='M:\RRFSS\SPSS\rrfssplan.csaplan' /PLANVARS ANALYSISWEIGHT=fwgt /PRINT PLAN /DESIGN STRATA= h_unit CLUSTER= idnum /ESTIMATOR TYPE=WR. Computing estimates from complex surveys in SPSS SPSS Syntax 1 3 2

Computing estimates from complex surveys in SPSS

Computing estimates from complex surveys in SPSS - Results

Comparison of estimates generated from SPSS and SAS % of current smokers, Durham Regional Health Unit, 2004 SexAge groupSPSSSAS %se% male female Estimate of point statistic identical Estimate of standard error identical to the 5 th decimal place

CCO Risk Factor Measures Indicator definitionsPoint statisticsStatistics for evaluating precision Multiple combinations of numerators and denominators as required e.g. for female low alcohol risk 1. <=9 drinks/week 2. <=9 drinks/week and <=2 drinks daily in last week Counts/Prevalence ratios Sex and/or age specific Crude/Age standardized PHU/LHIN/Province 95% confidence intervals Standard errors Coefficient of Variation (CV) Numerator/Denominator sample sizes Compute range of statistics for different indicators to be able to respond to the majority of analytical needs

Almost 100% of population and 100% of Health Units represented in CCHS 85% of population and 67% (24) Public Health Units represented in RRFSS 2004 Estimates from RRFSS Public Health Units are not usually used as a proxy for the province RRFSS not representative of northern PHUs Risk Factor Estimates at the Provincial Level

Comparison of Risk Factor Estimates between RRFSS Health Units and Non-RRFSS Health Units using CCHS 2.1 Significantly different at 5% Data Source: CCHS 2.1, Statistics Canada Prevalence of Selected Risk Factor Indicators with 95% CI

Comparison of Risk Factor Estimates Overlapping confidence intervals Compute age-standardized rates (age groups-12-17, 18-44, 45-64, 65+) Funnel plots for comparing PHUs Significance testing using logistic regression and controlling for age and sex differences

Risk Factor Surveillance Methodology Trends Annual plots of RRFSS and CCHS estimates Quarterly plots of RRFSS estimates Change point analysis Control charts Box-jenkins time series analysis

PrevCan - CCO Risk Factor Surveillance Reporting System

Next Steps Collaboration with CE RRFSS Group Establish agreement with RRFSS for sharing of data and technical support Share developments with MOHLTC Refine methods for testing and dissemination of results Expand indicators to include socio-economic and environmental factors Include GIS in risk factor surveillance