Understanding sample survey data. Underlying Concept A sample statistic is our best estimate of a population parameter If we took 100 different samples.

Slides:



Advertisements
Similar presentations
THE CENTRAL LIMIT THEOREM
Advertisements

Estimation of Means and Proportions
Chapter 8: Estimating with Confidence
Chapter 10: Estimating with Confidence
HUDM4122 Probability and Statistical Inference March 30, 2015.
RESEARCH METHODOLOGY & STATISTICS LECTURE 6: THE NORMAL DISTRIBUTION AND CONFIDENCE INTERVALS MSc(Addictions) Addictions Department.
AP Statistics – Chapter 9 Test Review
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 19 Confidence Intervals for Proportions.
Confidence Intervals for Proportions
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 18, Slide 1 Chapter 18 Confidence Intervals for Proportions.
Chapter 19: Confidence Intervals for Proportions
Confidence Intervals for
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Performance Measurement: Indicators Sara Grainger Statistical Support for Public Bodies Office of the Chief Statistician.
Confidence Intervals for Proportions
Chapter 10: Estimating with Confidence
Understanding sample survey data
Full time and part time employment Coventry population in employment by gender Source: Annual Population Survey, Office for National Statistics
Chapter 19: Confidence Intervals for Proportions
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 9 Section 1 – Slide 1 of 39 Chapter 9 Section 1 The Logic in Constructing Confidence Intervals.
Chapter Nine Copyright © 2006 McGraw-Hill/Irwin Sampling: Theory, Designs and Issues in Marketing Research.
A Sampling Distribution
16-1 Copyright  2010 McGraw-Hill Australia Pty Ltd PowerPoint slides to accompany Croucher, Introductory Mathematics and Statistics, 5e Chapter 16 The.
Sampling Distribution ● Tells what values a sample statistic (such as sample proportion) takes and how often it takes those values in repeated sampling.
Normal Distributions Z Transformations Central Limit Theorem Standard Normal Distribution Z Distribution Table Confidence Intervals Levels of Significance.
Source: Annual Population Survey, Office for National Statistics. Full time and part time employment Coventry population.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Chapter 10 – Sampling Distributions Math 22 Introductory Statistics.
Confidence Intervals and Tests of Proportions. Assumptions for inference when using sample proportions: We will develop a short list of assumptions for.
Determination of Sample Size: A Review of Statistical Theory
LECTURE 25 THURSDAY, 19 NOVEMBER STA291 Fall
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Chapter 10: Introduction to Statistical Inference.
+ DO NOW. + Chapter 8 Estimating with Confidence 8.1Confidence Intervals: The Basics 8.2Estimating a Population Proportion 8.3Estimating a Population.
What is a Confidence Interval?. Sampling Distribution of the Sample Mean The statistic estimates the population mean We want the sampling distribution.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Copyright © 2009 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
1 Chapter 9: Introduction to Inference. 2 Thumbtack Activity Toss your thumbtack in the air and record whether it lands either point up (U) or point down.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Measuring change in sample survey data. Underlying Concept A sample statistic is our best estimate of a population parameter If we took 100 different.
Comparing Systems Using Sample Data Andy Wang CIS Computer Systems Performance Analysis.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Estimating a Population Proportion ADM 2304 – Winter 2012 ©Tony Quon.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
10.1 Estimating with Confidence Chapter 10 Introduction to Inference.
Statistics 19 Confidence Intervals for Proportions.
CHAPTER 8 (4 TH EDITION) ESTIMATING WITH CONFIDENCE CORRESPONDS TO 10.1, 11.1 AND 12.1 IN YOUR BOOK.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Political Science 30: Political Inquiry. The Magic of the Normal Curve Normal Curves (Essentials, pp ) The family of normal curves The rule of.
 Confidence Intervals  Around a proportion  Significance Tests  Not Every Difference Counts  Difference in Proportions  Difference in Means.
 Normal Curves  The family of normal curves  The rule of  The Central Limit Theorem  Confidence Intervals  Around a Mean  Around a Proportion.
Statistics Unit Check your understanding…. Can you answer these? What does the standard deviation of a sample represent? What is the difference between.
Comparing Two Proportions
Confidence Intervals for Proportions
Comparing Systems Using Sample Data
ESTIMATION.
Confidence Intervals for Proportions
Confidence Intervals for Proportions
Sampling Distribution Models
The estimate of the proportion (“p-hat”) based on the sample can be a variety of values, and we don’t expect to get the same value every time, but the.
Confidence Intervals for Proportions
AP Statistics Chapter 12 Notes.
Confidence Intervals for Proportions
Presentation transcript:

Understanding sample survey data

Underlying Concept A sample statistic is our best estimate of a population parameter If we took 100 different samples from the same population to measure, for example, the mean height of men, we would get a 100 different estimates of mean height. The mean of these means would be very close to the real population mean.

Population of men (Population Size = N)

Sample of men from population (Sample size = n) We take a sample from the population and measure the heights of all our sample members. The mean height from this sample is cm.

We take a another sample from the same population and measure the heights of all sample members. The mean height from this sample is cm. Sample of men from population (Sample size = n)

Sample of sample means We take a another 100 samples from the same population and measure the heights of all sample members. Sample 1 mean was cm Sample 2 mean was cm Sample 3 mean was cm Sample 4 mean was cm Sample 5 mean was cm Etc

We don’t ever take hundreds of samples. We just take 1. The concept of the mean of sample means is central to all survey statistics. The central limit theorem says that if we took a sufficiently large number of samples, the mean of the sample means would be normally distributed around the true population mean. This is true even if the thing we are measuring is not normally distributed. The central limit theorem can be proved mathematically. It is the basis of how we calculate our required sample size, how we check our surveys are giving us sufficiently robust estimates, how we set targets, how we assess whether there has been any change between time………………. Sample of sample means

Confidence Intervals Central Limit Theorem allows us to calculate values around our estimates that tell us how close to the population parameter our sample statistic is likely to be

Confidence Intervals For example, Our survey suggests that 18% of people are poor. The confidence interval is 2 percentage points either side. So.. If we took 100 samples, 95 of them would have a proportion somewhere between 16 and 20% Or… we can be 95% confident that the true proportion of poor people in the population is somewhere between 16 and 20%

Measuring change over time As a rule of thumb, if two confidence intervals do not overlap we can be confident that there has been a change in the population This requires that broadly similar sample methodology was used, and exactly the same survey questions If different methodologies are used or the question changes, it becomes very difficult to say whether change in the population has occurred

Showing confidence intervals graphically

Setting a target 1.Calculate the confidence interval around the baseline estimate 2.Estimate what the confidence interval will be around the target figure 3.Make sure they don’t overlap

For example There is a local initiative to increase community cohesion by encouraging neighbours to get to know each other A 2009 survey reveals 30% (+ or – 3 percentage points) of people regularly speak to their neighbours By 2012 we want that to have increased What is the lowest target we can reliably measure? a)33% b)34% c)37%

Answer = 37% In 2009 we can be 95% sure that between 27 and 33% of people regularly speak to their neighbours In 2012 we would need our estimate to be 37% in order for us to be 95% sure that more people are speaking to their neighbours than in 2009

Word of caution Don’t mistake “statistically significant” for “meaningful” A change of 0.01% can be statistically significant if the survey is large and precise enough, but most people wouldn’t call that meaningful A meaningful change in the population could be missed if the survey isn’t designed to be precise enough: Make sure the survey is designed with the purpose of monitoring change in mind

Further Information Statistics Without Tears by Derek Rowntree (Penguin, 1981) Scottish Government Statistics Glossary Forthcoming Analytical Guidance Library on the ScotStat Network website