The Normal Curve and Sampling A.A sample will always be different from the true population B.This is called “sampling error” C.The difference between a.

Slides:



Advertisements
Similar presentations
Chapter 5 Some Key Ingredients for Inferential Statistics: The Normal Curve, Probability, and Population Versus Sample.
Advertisements

Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 6 Hypothesis Tests with Means.
Sampling: Final and Initial Sample Size Determination
1. Exams 2. Sampling Distributions 3. Estimation + Confidence Intervals.
Hypothesis Testing It is frequently expected that you have clear hypotheses when you have a study using quantitative data. Older citizens are more likely.
Objectives Look at Central Limit Theorem Sampling distribution of the mean.
Chapter 7 Introduction to Sampling Distributions
1. Estimation ESTIMATION.
Confidence Intervals Mon, March 22 nd. Point & Interval Estimates  Point estimate – use sample to estimate exact statistic to represent pop parameter.
Topics: Inferential Statistics
Sampling Distributions
PPA 415 – Research Methods in Public Administration Lecture 5 – Normal Curve, Sampling, and Estimation.
Need to know in order to do the normal dist problems How to calculate Z How to read a probability from the table, knowing Z **** how to convert table values.
Review Measures of Central Tendency –Mean, median, mode Measures of Variation –Variance, standard deviation.
Chapter 4 SUMMARIZING SCORES WITH MEASURES OF VARIABILITY.
Standard error of estimate & Confidence interval.
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
Quiz 2 Measures of central tendency Measures of variability.
Variance Formula. Probability A. The importance of probability Hypothesis testing and statistical significance Probabilistic causation - because error.
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
Many times in statistical analysis, we do not know the TRUE mean of a population of interest. This is why we use sampling to be able to generalize the.
Estimation Statistics with Confidence. Estimation Before we collect our sample, we know:  -3z -2z -1z 0z 1z 2z 3z Repeated sampling sample means would.
Chapter 11: Estimation Estimation Defined Confidence Levels
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Confidence Intervals for Means. point estimate – using a single value (or point) to approximate a population parameter. –the sample mean is the best point.
Statistics 101 Chapter 10. Section 10-1 We want to infer from the sample data some conclusion about a wider population that the sample represents. Inferential.
LECTURE 16 TUESDAY, 31 March STA 291 Spring
Estimation in Sampling!? Chapter 7 – Statistical Problem Solving in Geography.
Inferential Statistics 2 Maarten Buis January 11, 2006.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Slide 1 © 2002 McGraw-Hill Australia, PPTs t/a Introductory Mathematics & Statistics for Business 4e by John S. Croucher 1 n Learning Objectives –Identify.
Chapter 7 Estimation Procedures. Basic Logic  In estimation procedures, statistics calculated from random samples are used to estimate the value of population.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 7 - Sampling Distribution of Means.
Determination of Sample Size: A Review of Statistical Theory
PPA 501 – Analytical Methods in Administration Lecture 6a – Normal Curve, Z- Scores, and Estimation.
Chapter Thirteen Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Confidence Interval Estimation For statistical inference in decision making:
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
Chapter 10: Confidence Intervals
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 6 Hypothesis Tests with Means.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
CONFIDENCE INTERVALS.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 8. Parameter Estimation Using Confidence Intervals.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Statistics for Political Science Levin and Fox Chapter Seven
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
The Normal Probability Distribution. What is a distribution? A collection of scores, values, arranged to indicate how common various values, or scores.
m/sampling_dist/index.html.
Chapter Eleven Sample Size Determination Chapter Eleven.
Lab Chapter 9: Confidence Interval E370 Spring 2013.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Review Day 2 May 4 th Probability Events are independent if the outcome of one event does not influence the outcome of any other event Events are.
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Samples and Populations Statistics for Political Science Levin and Fox Chapter 6 1.
GOVT 201: Statistics for Political Science
Inference: Conclusion with Confidence
Week 10 Chapter 16. Confidence Intervals for Proportions
Statistics in Applied Science and Technology
Chapter 7 Sampling Distributions.
Samples and Populations
Calculating Probabilities for Any Normal Variable
Chapter 7 Sampling Distributions.
Chapter 7 Sampling Distributions.
Chapter 12 Inference for Proportions
Chapter 7 Sampling Distributions.
Chapter 4 (cont.) The Sampling Distribution
How Confident Are You?.
Presentation transcript:

The Normal Curve and Sampling A.A sample will always be different from the true population B.This is called “sampling error” C.The difference between a sample and the true population, regardless of how well the survey was designed or implemented D.Different from measurement error or sample bias

Sampling distribution of Means The existence of sampling error means that if you take a 1000 random samples from a population and calculate a 1000 means and plot the distribution of those means you will get a consistent distribution that has the following characteristics:

Characteristics of a Sampling Distribution 1. the distribution approximates a normal curve 2. the mean of a sampling distribution of means is equal to the true population 3. the standard deviation of a sampling distribution is smaller than the standard deviation of the population. Less variation in the distribution because we are not dealing with raw scores but rather central tendencies.

Probability and the Normal Curve In chapter 6 – we are not interested in the distribution of raw scores but rather the distribution of sample means and making probability statements about those sample means.

Probability and the Sampling Distribution Why is making probabilistic statements about a central tendency important? 1. it will allow us to engage in inferential statistics (later in ch. 7) 2. it allows us to produce confidence intervals

Example of number 1: President of UNLV states that the average salary of a new UNLV graduate is $60,000. We are skeptical and test this by taking a random sample of a 100 UNLV students. We find that the average is only $55,000. Do we declare the President a liar?

Not Yet!!!! We need to make a probabilistic statement regarding the likelihood of Harter’s statement. How do we do that? With the aid of the standard error of the mean we can calculate confidence intervals - the range of mean values within with our true population mean is likely to fall.

How do we do that? First, we need the sample mean Second, we need the standard deviation of the sampling distribution of means (what’s another name for this?) a.k.a standard error of the mean

What’s the Problem? The problem is… We don’t have the standard deviation of the sampling distribution of means? What do we do?

First – let’s pretend Let’s pretend that I know the Standard Deviation of the Sampling Distribution of Means (a.k.a. the standard error of the mean). It’s 3000 For a 95% confidence interval we multiply the standard error of the mean by 1.96 and add & subtract that product to our sample mean Why 1.96?

So is the President Lying? CI = Mean + or – 1.96 (SE) = 55,000 +/ (3000) = 55,000 +/ = $49,120 to 60,880

Estimating the SE We Can Estimate the Standard Error of the Mean. –Divide the standard deviation of the sample by √n-1 –For example a sample standard deviation of would produce a estimate of the SE of around 3000 [29849 divide by √n-1] [remember n = 100] Then multiply this estimate by t rather than 1.96 and then add this product to our sample mean. Why t?

The t Distribution Empirical testing and models shows that a standard deviation from a sample underestimates the standard deviation of the true population This is why we use N-1 not N when calculating the standard deviation and the standard error So in reality, we are calculating t-scores, not z-scores since we are not using the true sd.

So when we are using a sample and calculating a 95% confidence interval (CI) we need to multiply the standard error by t, not 1.96 How do we know what t is? Table in back of book (Appendix C; Table C) Df = N – = 99; Use the df of 60 and level of significance of.05 (why?) T = 2

Confidence Intervals for Proportions Calculate the standard error of the proportion: Sp = 95% conf. Interval = P +/- (1.96)S p

Example National sample of 531 Democrats and Democratic-leaning independents, aged 18 and older, conducted Sept , 2007 Clinton 47%; Obama 25%; Edwards 11% P(1-P) =.47(1-.47) =.47(.53) =.2491 Divide by N =.2491/531 = Take square root = % CI =.47 +/ (.0217).47 +/ or to.511