Panel Study of Entrepreneurial Dynamics Richard Curtin University of Michigan.

Slides:



Advertisements
Similar presentations
Survey Methodology Nonresponse EPID 626 Lecture 6.
Advertisements

Issues in factorial design
Pengolahan dan Analisa Data Indra Budi Fasilkom UI.
Sampling.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Processing and Fundamental Data Analysis CHAPTER fourteen.
1 CDBG Income Survey Requirements For Grant Administrators.
7.Implications for Analysis: Parent/Youth Survey Data.
EBI Statistics 101.
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 8 Using Survey Research.
Why sample? Diversity in populations Practicality and cost.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
The Excel NORMDIST Function Computes the cumulative probability to the value X Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc
Multiple Regression – Basic Relationships
Sampling Design.
Multiple Indicator Cluster Surveys Data Interpretation, Further Analysis and Dissemination Workshop Overview of Data Quality Issues in MICS.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Sampling Design.
Quantifying Data.
STATISTICS CANADA SURVEY LIFECYCLE WOLFVILLE, APRIL 2008 SURVEY LIFECYCLE Michel B. Séguin Atlantic DLI Training.
Chapter 5: Descriptive Research Describe patterns of behavior, thoughts, and emotions among a group of individuals. Provide information about characteristics.
Measures of Central Tendency
Joint Canada/U.S. Health Survey Catherine Simile, National Center for Health Statistics Patrice Mathieu, Statistics Canada Ed Rama, Statistics Canada NCHS.
Aspects of the National Health Interview Survey (NHIS) Chris Moriarity National Conference on Health Statistics August 16, 2010
Household Surveys ACS – CPS - AHS INFO 7470 / ECON 8500 Warren A. Brown University of Georgia February 22,
McGraw-Hill/Irwin © 2004 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 9 Processing the Data.
Organizing Your Data for Statistical Analysis in SPSS
There are no two things in the world that are exactly the same… And if there was, we would say they’re different. - unknown.
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
Chapter 2: Statistics of One Variable
Sampling: Theory and Methods
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
Shari carlos Mark shawky Nate stocks
Chapter Nine Copyright © 2006 McGraw-Hill/Irwin Sampling: Theory, Designs and Issues in Marketing Research.
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
Chapter Twelve Census: Population canvass - not really a “sample” Asking the entire population Budget Available: A valid factor – how much can we.
 Collecting Quantitative  Data  By: Zainab Aidroos.
Benchmarking in European Service of public Transport (BEST) Main results of the BEST 2010 Survey.
Why Draw A Sample? Why not just the get the whole enchilada? – Pragmatic reasons – The true population is typically “unknowable” When done right, a small.
Tobacco Use Supplement to the Current Population Survey User’s Workshop June 9, 2009 Tips and Tricks Analyzing TUS-CPS Data Lloyd Hicks Westat.
Statistics Networking Day Question design Jacki Schirmer Senior Research Fellow Health Research Institute & Institute for Applied Ecology
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
Quantitative analysis Sonia Williams Northern College of Acupuncture 19 th February 2011.
ELSA ELSA datasets and documentation available from the archive or by special arrangement Kate Cox National Centre for Social.
DTC Quantitative Methods Survey Research Design/Sampling (Mostly a hangover from Week 1…) Thursday 17 th January 2013.
The Challenge of Non- Response in Surveys. The Overall Response Rate The number of complete interviews divided by the number of eligible units in the.
American Community Survey “It Don’t Come Easy”, Ringo Starr Jane Traynham Maryland State Data Center March 15, 2011.
Assessing SES differences in life expectancy: Issues in using longitudinal data Elsie Pamuk, Kim Lochner, Nat Schenker, Van Parsons, Ellen Kramarow National.
DATA PREPARATION: PROCESSING & MANAGEMENT Lu Ann Aday, Ph.D. The University of Texas School of Public Health.
Building Wave Response Rates in a Longitudinal Survey: Essential for Nonsampling Error Reduction or Last In - First Out? Steven B. Cohen Fred Rohde and.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
RESEARCH METHODS Lecture 29. DATA ANALYSIS Data Analysis Data processing and analysis is part of research design – decisions already made. During analysis.
Copyright 2010, The World Bank Group. All Rights Reserved. Testing and Documentation Part II.
1 Public Library Use in Oregon Results from the 2006 Oregon Population Survey Oregon State Library March 2007.
National Boot camp Vancouver Heather Dryburgh and Michel B. Séguin May 31 st, 2011 Survey Life cycle.
TIMOTHY SERVINSKY PROJECT MANAGER CENTER FOR SURVEY RESEARCH Data Preparation: An Introduction to Getting Data Ready for Analysis.
Analysis of the characteristics of internet respondents to the 2011 Census to inform 2021 Census questionnaire design Orlaith Fraser & Cal Ghee.
1 Responsive Design and Survey Management in the National Survey of Family Growth (NSFG) William D. Mosher, NCHS FCSM Statistical Policy Seminar Washington,
1 Introduction to Statistics. 2 What is Statistics? The gathering, organization, analysis, and presentation of numerical information.
Statistical Weights and Methods for Analyzing HINTS Data HINTS Data Users Conference January 21, 2005 William W. Davis, Ph.D. Richard P. Moser, Ph.D. National.
1 Data Collection and Sampling ST Methods of Collecting Data The reliability and accuracy of the data affect the validity of the results of a statistical.
Sampling Design and Procedure
Chapter 29 Conducting Market Research. Objectives  Explain the steps in designing and conducting market research  Compare primary and secondary data.
Sampling and Sampling Distributions. Sampling Distribution Basics Sample statistics (the mean and standard deviation are examples) vary from sample to.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Sampling: Final and Initial Sample Size Determination
Surveys of Consumers: Mixed Mode Experiments
15.1 The Role of Statistics in the Research Process
Presentation transcript:

Panel Study of Entrepreneurial Dynamics Richard Curtin University of Michigan

Research Design Basic Sample Design Used large representative sample to identify nascent entrepreneurs Supplemental Samples Women Minorities Mixed gender control group Minority control group Survey Waves Telephone and mail interviews Third annual wave completed in 2001 Wisconsin conducted wave 1 & 2; Michigan wave 3

Completed Interviews Main Sample Wave 1: 446 Wave 2: 342 or 77% of eligible wave 1 respondents Wave 3: were wave 1 & 2 respondents 93% of eligible (275 = 342 – 67 deaths by wave 2) 25 from wave 1 but lost in wave 2 Women’s Sample Wave 1: 223 Wave 2: 159 or 71% of eligible wave 1 respondents Wave 3: were wave 1 & 2 respondents 93% of eligible (128 = 159 –31 deaths by wave 2) 22 from wave 1 but lost in wave 2

Completed Interviews Minority Sample Wave 1: 161 Wave 3: 114 or 71% of eligible wave 1 respondents Total Sample of Entrepreneurs Wave 1: 830 Wave 2: 501 Wave 3: 511 Deaths by wave 3: 224 Control Group Samples (wave 1) Both genders: 223 Minority: 208

PSED Weights No weights needed if: Equal probability of selection (EPSEM) No differential non response PSED needs weights because: Differential selection probabilities Additional samples of women and minorities Differential non response Some types of respondents were harder to reach and maintain in panel

PSED Weights Weights Differential selection probabilities Weight by inverse of selection probability If selection probability is 2.0 then weight is 0.5 If selection probability is 0.5 then weight is 2.0 Differential non response/panel attrition Weight by inverse of response rate If response rate is 0.5 then weight is 2.0 If response rate is 2.0 then weight is 0.5 Two factors often interact Often more difficult to contact special samples Don’t know impact of differential non response Can combine correction factors

Weight Strategy Sample weights needed for: Base screening sample Nascent entrepreneurs Additional sample of women Additional sample of minorities Control groups Rather than multiple weights developed integrated weights Enables analysis of all cases across sample types for example, all nascent entrepreneurs in wave 1 regardless of whether in added samples or main sample Can still do separate analysis of subsamples

Calculation of Screener Weights Use Census (CPS) data to determine weight targets Used: sex, race, age, and education Too much missing data on income Creates multi-way table of distributions: Example Male, white, 45-54, college = 5% of actual population If sample = 10% then weight = 0.5 If sample = 2.5% then weight = 2.0 The single set of target distributions correct for both differences in selection and non response Outcome: when use weights get same distributions as in CPS data

Calculation of Sample Weights Used weighted screener data to determine targets Sex, minority, age, and education Corrects for selection differentials, Non response bias, and panel attrition Procedure repeated for all waves separately Results in weights that represents national sample Same procedure used for control groups

Pitfalls of Weighting Extreme weights How large a weight are you willing to accept Highest three times lowest? Five times? Ten times? Hundred times? Same problems as extreme values in analysis Current weights: Highest three times lowest. How much should weights contribute to the variance of a variable? Example: compare two different weights Same estimate of mean or proportion But one has much higher variance Variance 1: 95% “true” & 5% due to weights Variance 2: 65% “true” & 35% due to weights New weights are like 1 and old weights like 2

Age Distribution for Entrepreneurs In Screening Sample AgeOriginalNew SRC %12.99% %28.29% %30.19% %19.80% 55 or older8.88%8.73% Old and new weights give nearly identical point estimates as the example below indicates

Original versus New Weights Original MF Weights by SRC’s Weights Variance of Old weights is much greater than new weights Old range from 0.1 to 10.0 (left scale) new from 0.7 to 1.7

Use of Weights in Analysis Always use weighted data in analysis of: Nascent entrepreneurs Women and minority samples Control groups Screener sample Weight variables integrate all sample types WTW1, WTW2, WTW3 WTCG WT_scrn Weights for women and minority subsamples If want to do women’s analysis, simply select all women in total sample For analysis of minorities, select all minorities

Use Centered Weights in Analysis Use of weights always yield correct estimate of mean, but they may not yield correct standard error Need to have mean weight equal to 1.0 in analysis If mean weight within each analysis not equal 1.0 then get bias estimate of standard errors because sum of cases not equal to sum of weights How to center weights on 1.0 Define analysis subgroup Calculate mean weight in subgroup, call it WTsub Then define NewWT = WTW1 / WTsub

Relative and Absolute Weights Centering weights changes absolute value not relative value of weights Relative value of weights provide accurate estimate of means and proportions Absolute value of weights provide accurate estimate of standard errors of means and proportions and hence accurate statistical tests Over-samples yield more precise estimates, not less

Documentation Project Internet Site Complete project documentation Most up-to-date version Questionnaires Screening questionnaire Phone interviews from wave 1, 2, 3 (100+ pages each) Mail forms: wave 1 (long) wave 2,3 (short) PDF format; book marked files Data File and Codebook Complete SPSS data file (system file and portable) Comprehensive codebooks (450 pages) PDF format; book marked files

Codebook Codebook includes All variables in all waves Excludes names and other confidential information Variable numbers and questions Uses question number in questionnaire Same numbers as in waves 1 & 2 data files Displays identical questions across waves together Codebook order and index Table of contents – grouped by topic Index gives page numbers for every variable Listed in alphabetical order in appendix Can use questionnaire, then look up page number

Codebook Example

Codebook Items Frequencies Sum of frequencies add to the total number of respondents eligible to be asked the question Ineligible respondents coded SPSS system missing Eligibility Respondents answer to prior question If question answered in prior wave If respondent decline to participate If respondent was not eligible for subsequent wave Frequencies not included for open-ended numerical codes Dollar amounts, dates, etc.

Questionnaire Example

Matching Codebook Entries