Q2008 1 USE OF PROCESS DATA TO DETERMINE THE NUMBER OF CALL ATTEMPTS IN A TELEPHONE SURVEY Annica Isaksson Linköping University, Sweden Peter Lundquist.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

Nonparametric estimation of non- response distribution in the Israeli Social Survey Yury Gubman Dmitri Romanov JSM 2009 Washington DC 4/8/2009.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Chapter 7. Statistical Estimation and Sampling Distributions
Sampling: Final and Initial Sample Size Determination
Estimation  Samples are collected to estimate characteristics of the population of particular interest. Parameter – numerical characteristic of the population.
Fundamentals of Data Analysis Lecture 12 Methods of parametric estimation.
Ch. 19 Unbiased Estimators Ch. 20 Efficiency and Mean Squared Error CIS 2033: Computational Probability and Statistics Prof. Longin Jan Latecki Prepared.
Sampling Strategy for Establishment Surveys International Workshop on Industrial Statistics Beijing, China, 8-10 July 2013.
Quality indicators for measuring and enhancing the composition of survey response Q2008 – Special topic session, July 9 Jelke Bethlehem and Barry Schouten.
Economics 105: Statistics Review #1 due next Tuesday in class Go over GH 8 No GH’s due until next Thur! GH 9 and 10 due next Thur. Do go to lab this week.
Chapter 5 Stratified Random Sampling n Advantages of stratified random sampling n How to select stratified random sample n Estimating population mean and.
Econ 140 Lecture 61 Inference about a Mean Lecture 6.
Quality in Italian consumer price survey: optimal allocation of resources and indicators to monitor the data collection process Federico Polidoro, Rosabel.
QBM117 Business Statistics Statistical Inference Sampling 1.
AP Statistics C5 D2 HW: p.287 #25 – 30 Obj: to understand types of samples and possible errors Do Now: How do you think you collect data?
The Mean Square Error (MSE):. Now, Examples: 1) 2)
Today Today: Chapter 9 Assignment: Recommended Questions: 9.1, 9.8, 9.20, 9.23, 9.25.
Part 2b Parameter Estimation CSE717, FALL 2008 CUBS, Univ at Buffalo.
Descriptive statistics Experiment  Data  Sample Statistics Experiment  Data  Sample Statistics Sample mean Sample mean Sample variance Sample variance.
SAMPLING DISTRIBUTIONS. SAMPLING VARIABILITY
Fundamentals of Sampling Method
STAT262: Lecture 5 (Ratio estimation)
STAT 4060 Design and Analysis of Surveys Exam: 60% Mid Test: 20% Mini Project: 10% Continuous assessment: 10%
Cluster Sampling The basics. What are we trying to achieve in a survey? A sample that is representative of the larger population A sample that is representative.
~ Draft version ~ 1 HOW TO CHOOSE THE NUMBER OF CALL ATTEMPTS IN A TELEPHONE SURVEY IN THE PRESENCE OF NONRESPONSE AND MEASUREMENT ERRORS Annica Isaksson.
REPRESENTATIVE RESPONSE Some Examples ITSEW Peter Lundquist Statistics Sweden.
The Lognormal Distribution
Standard error of estimate & Confidence interval.
Measurement Error.
Statistical Methods, part 1 Module 2: Latent Class Analysis of Survey Error Models for measurement errors Dan Hedlin Stockholm University November 2012.
Lecture Presentation Software to accompany Investment Analysis and Portfolio Management Seventh Edition by Frank K. Reilly & Keith C. Brown Chapter 7.
AP Statistics Chapter 9 Notes.
A Monte Carlo exploration of methods to determine the UHECR composition with the Pierre Auger Observatory D.D’Urso for the Pierre Auger Collaboration
Topic 5 Statistical inference: point and interval estimate
Random Sampling, Point Estimation and Maximum Likelihood.
AP Statistics Section 9.3A Sample Means. In section 9.2, we found that the sampling distribution of is approximately Normal with _____ and ___________.
Chapter 7 The Logic Of Sampling. Observation and Sampling Polls and other forms of social research rest on observations. The task of researchers is.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
: Chapter 3: Maximum-Likelihood and Baysian Parameter Estimation 1 Montri Karnjanadecha ac.th/~montri.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Eurostat Weighting and Estimation. Presented by Loredana Di Consiglio Istituto Nazionale di Statistica, ISTAT.
Chapter 7 Point Estimation of Parameters. Learning Objectives Explain the general concepts of estimating Explain important properties of point estimators.
Chapter 6: 1 Sampling. Introduction Sampling - the process of selecting observations Often not possible to collect information from all persons or other.
Improving of Household Sample Surveys Data Quality on Base of Statistical Matching Approaches Ganna Tereshchenko Institute for Demography and Social Research,
Confidence Interval & Unbiased Estimator Review and Foreword.
Chapter 5 Sampling Distributions. The Concept of Sampling Distributions Parameter – numerical descriptive measure of a population. It is usually unknown.
McGraw-Hill/IrwinCopyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved. SAMPLING Chapter 14.
Survey and Sampling Methods Session 9. Introduction Nonprobability Sampling and Bias Stratified Random Sampling Cluster Sampling Systematic Sampling Nonresponse.
ICCS 2009 IDB Seminar – Nov 24-26, 2010 – IEA DPC, Hamburg, Germany Training Workshop on the ICCS 2009 database Weights and Variance Estimation picture.
CHAPTER 7, THE LOGIC OF SAMPLING. Chapter Outline  A Brief History of Sampling  Nonprobability Sampling  The Theory and Logic of Probability Sampling.
Chapter 7 Data for Decisions. Population vs Sample A Population in a statistical study is the entire group of individuals about which we want information.
Week 21 Order Statistics The order statistics of a set of random variables X 1, X 2,…, X n are the same random variables arranged in increasing order.
Designing ICT Surveys: An Introduction to the Basic Theory Phillippa Biggs, Economist, ITU MCIT, Cairo, Egypt 10 March 2009.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Chapter 4: Designing Studies... Sampling. Convenience Sample Voluntary Response Sample Simple Random Sample Stratified Random Sample Cluster Sample Convenience.
Week 21 Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced.
Bias-Variance Analysis in Regression  True function is y = f(x) +  where  is normally distributed with zero mean and standard deviation .  Given a.
Fundamentals of Data Analysis Lecture 11 Methods of parametric estimation.
STATISTICAL INFERENCE
Random Sampling Population Random sample: Statistics Point estimate
An Online CNN Poll.
Sampling Distribution
Sampling Distribution
Additional notes on random variables
CHAPTER 15 SUMMARY Chapter Specifics
Additional notes on random variables
The estimate of the proportion (“p-hat”) based on the sample can be a variety of values, and we don’t expect to get the same value every time, but the.
1-Way Random Effects Model
Presentation transcript:

Q USE OF PROCESS DATA TO DETERMINE THE NUMBER OF CALL ATTEMPTS IN A TELEPHONE SURVEY Annica Isaksson Linköping University, Sweden Peter Lundquist Statistics Sweden Daniel Thorburn Stockholm University, Sweden

Q The Problem Consider a telephone survey of individuals, in which a maximum number A of call attempts is to be made to sampled individuals. Part of a larger problem of designing efficient call scheduling algorithms. HOW SHALL A BE CHOSEN?

Q Prerequisites (Single-occasion survey) Direct sampling from a frame with good population coverage Estimation of a population total by the direct weighting estimator Response set after A call attempts Inclusion probability for individual k Estimated response probability for individual k after A call attempts Observed value for individual k (proxy for the true value µ k )

Q The Survey as a Three-Stage Process Stage 1: Sample selection Stage 2: Contact and response Maximally A call attempts are made. Individuals respond in accordance with an unknown response distribution. Stage 3: Measurement Observed values are related to the true values according to a measurement error model.

Q Response Model all individuals within the same group have the same probability of responding individuals respond independently of each other individuals respond independently of each other after different numbers of call attempts The sample can be divided into H s response homogeneity groups (RHG) such that, for all A, given the sample,

Q Measurement Error Model For an individual k in RHG h, given the sample and that the individual responds at call attempt a, Indicates if individual k responds at attempt a=a k Random interviewer effect with expectation 0 and variance Random response error with expectation 0 and variance True value for individual k

Q Bias and Variance Bias if the RHG model does not hold: The variance of is derived in the paper Sample covariance between response probabilities and design weighted true values Average response probability within RHG

Q Cost Function

Q Optimum A for RHG h Assume: of the costs are allocated to RHG h

Q Optimum A for RHG h: Result The optimum number of call attempts for RHG h is the number A h that gives the lowest value on the function

Q Our Data Annual salary 2006 according to the Swedish Tax Register (our y) Process data from WinDati (WD). LFS data from March-Dec. 2007, supplemented with: Note: not all WD events are call attempts

Q Data Processing and Estimation. Each monthly sample viewed as a SRS Parameter: = total annual salary 2006 Bias within RHG h and month l estimated by

Q Relative Bias, Monthly Averages

Q =.002 = 55,267,619,616 = 110,979,155. Measurement Error Model Parameters Intraclass correlation, ICC (Biemer and Trewin, 1997):

Q No Bias, ICC =.002

Q Bias, ICC =.002

Q Tentative Results Efficient planning requires high-quality data on processes and costs Perhaps the choice of A should be based on variance rather than MSE

Q Discussion and Future Work Do the results hold for other study variables, other survey settings? Improved models for measurement errors, response and costs? Develop a planning tool?

Q Thank you for your attention! Annica Isaksson, Peter Lundquist,