Measuring change in sample survey data. Underlying Concept A sample statistic is our best estimate of a population parameter If we took 100 different.

Slides:



Advertisements
Similar presentations
A Sampling Distribution
Advertisements

Estimation in Sampling
Sampling: Final and Initial Sample Size Determination
RESEARCH METHODOLOGY & STATISTICS LECTURE 6: THE NORMAL DISTRIBUTION AND CONFIDENCE INTERVALS MSc(Addictions) Addictions Department.
Chapter 10: Sampling and Sampling Distributions
Chapter 7 Introduction to Sampling Distributions
The standard error of the sample mean and confidence intervals
BHS Methods in Behavioral Sciences I
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Central Limit Theorem The Normal Distribution The Standardised Normal.
How to calculate Confidence Intervals and Weighting Factors
Understanding sample survey data
Standard error of estimate & Confidence interval.
Sampling: Theory and Methods
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
Estimation Statistics with Confidence. Estimation Before we collect our sample, we know:  -3z -2z -1z 0z 1z 2z 3z Repeated sampling sample means would.
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.
Chapter Nine Copyright © 2006 McGraw-Hill/Irwin Sampling: Theory, Designs and Issues in Marketing Research.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination CHAPTER Eleven.
Sampling Distributions
A Sampling Distribution
Introduction to Statistical Inference Chapter 11 Announcement: Read chapter 12 to page 299.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Introduction to Inferential Statistics. Introduction  Researchers most often have a population that is too large to test, so have to draw a sample from.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Sample Size Determination CHAPTER thirteen.
Sampling Distributions Chapter 7. The Concept of a Sampling Distribution Repeated samples of the same size are selected from the same population. Repeated.
Understanding sample survey data. Underlying Concept A sample statistic is our best estimate of a population parameter If we took 100 different samples.
LECTURE 3 SAMPLING THEORY EPSY 640 Texas A&M University.
The Logic of Sampling. Methods of Sampling Nonprobability samplesNonprobability samples –Used often in Qualitative Research Probability or random samplesProbability.
Chapter 7 Estimation Procedures. Basic Logic  In estimation procedures, statistics calculated from random samples are used to estimate the value of population.
CONFIDENCE INTERVAL It is the interval or range of values which most likely encompasses the true population value. It is the extent that a particular.
Confidence Intervals and Tests of Proportions. Assumptions for inference when using sample proportions: We will develop a short list of assumptions for.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 7 - Sampling Distribution of Means.
Determination of Sample Size: A Review of Statistical Theory
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
OPENING QUESTIONS 1.What key concepts and symbols are pertinent to sampling? 2.How are the sampling distribution, statistical inference, and standard.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
Determining the Size of a Sample 1 Copyright © 2014 Pearson Education, Inc.
Sampling Distribution WELCOME to INFERENTIAL STATISTICS.
Chapter Thirteen Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination.
STA Lecture 171 STA 291 Lecture 17 Chap. 10 Estimation – Estimating the Population Proportion p –We are not predicting the next outcome (which is.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Confidence Intervals (Dr. Monticino). Assignment Sheet  Read Chapter 21  Assignment # 14 (Due Monday May 2 nd )  Chapter 21 Exercise Set A: 1,2,3,7.
Chapter 10: Introduction to Statistical Inference.
Learning Objective Chapter 12 Sample Size Determination Copyright © 2000 South-Western College Publishing Co. CHAPTER twelve Sample Size Determination.
What is a Confidence Interval?. Sampling Distribution of the Sample Mean The statistic estimates the population mean We want the sampling distribution.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Various Topics of Interest to the Inquiring Orthopedist Richard Gerkin, MD, MS BGSMC GME Research.
ICCS 2009 IDB Seminar – Nov 24-26, 2010 – IEA DPC, Hamburg, Germany Training Workshop on the ICCS 2009 database Weights and Variance Estimation picture.
From the population to the sample The sampling distribution FETP India.
1 Probability and Statistics Confidence Intervals.
1 VI. Why do samples allow inference? How sure do we have to be? How many do I need to be that sure? Sampling Distributions, Confidence Intervals, & Sample.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Chapter Eleven Sample Size Determination Chapter Eleven.
Sampling and Sampling Distributions
Advanced Quantitative Techniques
Ch5.4 Central Limit Theorem
Chapter 7 Review.
ESTIMATION.
Sample Size Determination
Introduction to Sampling Distributions
Statistics in Applied Science and Technology

Calculating Probabilities for Any Normal Variable
Sampling Distribution of the Mean
Determining Which Method to use
Sample vs Population (true mean) (sample mean) (sample variance)
Chapter 4 (cont.) The Sampling Distribution
How Confident Are You?.
Presentation transcript:

Measuring change in sample survey data

Underlying Concept A sample statistic is our best estimate of a population parameter If we took 100 different samples from the same population to measure, for example, the mean height of men, we would get a 100 different estimates of mean height. The mean of these means would be very close to the real population mean.

Population of men (Population Size = N)

Sample of men from population (Sample size = n) We take a sample from the population and measure the heights of all our sample members. The mean height from this sample is cm.

We take a another sample from the same population and measure the heights of all sample members. The mean height from this sample is cm. Sample of men from population (Sample size = n)

Sample of sample means We take a another 100 samples from the same population and measure the heights of all sample members. Sample 1 mean was cm Sample 2 mean was cm Sample 3 mean was cm Sample 4 mean was cm Sample 5 mean was cm Etc

We don’t ever take hundreds of samples. We just take 1. The concept of the mean of sample means is central to all survey statistics. The central limit theorem says that if we took a sufficiently large number of samples, the mean of the sample means would be normally distributed. This is true even if the thing we are measuring is not normally distributed. The central limit theorem can be proved mathematically. It is the basis of how we calculate our required sample size and how we calculate confidence intervals around our estimates………………. Sample of sample means

Variance, standard deviation and standard error Variance = the sum of squared differences from the mean divided by n-1 Variance = 30 / 4 = 7.5 Standard deviation = the square root of the variance SD = √ variance = √7.5 = 2.74 Standard error = the square root of the variance divided by the sample size SE = √ (variance / n) = √ (7.5 / 5) = 1.22 Sample values (n=5) Difference from mean Squared difference from the mean = x -1 = Mean = 171 Sum = 0Sum of squares = 30

Standard Error The standard error is our best estimate of the standard deviation of the sample means. In other words if we took 100 samples from the same population and got 100 estimates of men’s mean height, the standard deviation of that mean is the standard error.

Confidence Intervals Because the means of sample means are normally distributed, we can use the characteristics of the normal distribution to look at our mean and standard error. We know that in a normal distribution 68.3% of values fall within one standard deviation of the mean and 95% fall within 1.96 standard deviations of the mean. So 1.96 times the standard error gives us the 95% confidence limits. Our standard error is x 1.22 = 2.4 Our sample mean is – 2.4 = = So.. If we took 100 samples, 95 of them would have a mean somewhere between and Or… we can be 95% confident that the true mean (the population mean) lies between and

It works the same for proportions The 95% confidence interval around a proportion is 1.96 times the standard error of the estimate. The standard error of a proportion is √ ( (p (100-p)) / n ) Where p is the percentage and n is the sample size. So if we estimate that 75% of people prefer dogs from a sample of 45, p=75 and n= 45. = √ (( 75 x (100-75)) / 45 ) = √ ( (75 x 25) / 45 ) = √ ( 1875 / 45 ) = √ 42.7 = – 6.5 = 68.5 and = 81.5 So.. if we took 100 samples, 95 of them would have a percentage somewhere between 68.5 and Or… we can be 95% confident that the true percentage of people who prefer dogs (the population percentage) lies between 68.5 and 81.5.

Design effects If the sample is not a simple random sample then an adjustment will need to be made to the standard error Proportionate stratification will decrease the standard error Disproportionate stratification will increase the standard error Clustering will increase the standard error See PEAS website for information about design effects

Finite Population Correction If the sample size is a large proportion of the population size (>5%) then applying the finite population correction will reduce the standard error

Showing confidence intervals graphically

Measuring change over time As a rule of thumb, if two confidence intervals do not overlap we can be confident that there has been a change in the population This requires that broadly similar sample methodology was used, and exactly the same survey questions If different methodologies are used or the question changes, it becomes very difficult to say whether change in the population has occurred

Setting a target 1.Calculate the confidence interval around the baseline estimate 2.Estimate what the confidence interval will be around the target figure 3.Make sure they don’t overlap

Word of caution Don’t mistake “statistically significant” for “meaningful” A change of 0.01% can be statistically significant if the survey is large and precise enough, but most people wouldn’t call that meaningful A meaningful change in the population could be missed if the survey isn’t designed to be precise enough: Make sure the survey is designed with the purpose of monitoring change in mind And don’t forget all the non-statistical issues!!!