Ch. 20 Chance errors in sampling

Slides:



Advertisements
Similar presentations
SADC Course in Statistics Estimating population characteristics with simple random sampling (Session 06)
Advertisements

Inference about Means/Averages Chapter 23 Looking at means rather than percentages.
6-1 Stats Unit 6 Sampling Distributions and Statistical Inference - 1 FPP Chapters 16-18, 20-21, 23 The Law of Averages (Ch 16) Box Models (Ch 16) Sampling.
Ch. 17 The Expected Value & Standard Error Review of box models 1.Pigs – suppose there is a 40% chance of getting a “trotter”. Toss a pig 20 times. –What.
Ch.18 Normal approximation using probability histograms Review measures center and spread –List of numbers (histogram of data) –Box model For a “large”
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
STAT 1301 Confidence Intervals for Averages. l we usually DON’T KNOW the population AVG l we take simple random sample of size n l find sample AVG - this.
Unit 7: Chance Variation. Repeating with Excel Two EV’s and two SE’s Suppose we know the box (model) of a game, and we make many draws from the box with.
Objectives Look at Central Limit Theorem Sampling distribution of the mean.
Ch 6 Introduction to Formal Statistical Inference.
Chapter 10 Sampling and Sampling Distributions
Ch.18 Normal approximation using probability histograms Review measures of center and spread For a “large” number of draws, a histogram of observed sums.
Stat 217 – Day 18 t-procedures (Topics 19 and 20).
The Normal Curve and Sampling A.A sample will always be different from the true population B.This is called “sampling error” C.The difference between a.
Ch. 17 The Expected Value & Standard Error Review box models –Examples 1.Pigs – assume 40% chance of getting a “trotter” -20 tosses 2.Coin toss – 20 times.
Review Measures of Central Tendency –Mean, median, mode Measures of Variation –Variance, standard deviation.
Sampling and Estimating Population Percentages and Averages Math 1680.
QUIZ CHAPTER Seven Psy302 Quantitative Methods. 1. A distribution of all sample means or sample variances that could be obtained in samples of a given.
Review of normal distribution. Exercise Solution.
Math 3680 Lecture #11 The Central Limit Theorem for Sample Counts and Sample Proportions.
10.1 & 10.2 Probability & Permutations. WARM UP:
Hypothesis testing – mean differences between populations
T-test Mechanics. Z-score If we know the population mean and standard deviation, for any value of X we can compute a z-score Z-score tells us how far.
Chapter 5 Sampling Distribution of the Proportion
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Understanding the scores from Test 2 In-class exercise.
Stat 13, Tue 5/8/ Collect HW Central limit theorem. 3. CLT for 0-1 events. 4. Examples. 5.  versus  /√n. 6. Assumptions. Read ch. 5 and 6.
FPP Expected Values, Standard Errors, Central Limit Theorem.
Expected Value and Standard Error for a Sum of Draws (Dr. Monticino)
The Central Limit Theorem © Christine Crisp “Teach A Level Maths” Statistics 1.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
Sample Proportions Target Goal: I can FIND the mean and standard deviation of the sampling distribution of a sample proportion. DETERMINE whether or not.
READING HANDOUT #5 PERCENTS. Container of Beads Container has 4,000 beads 20% - RED 80% - WHITE Sample of 50 beads with pallet. Population - the 4,000.
Confidence Intervals (Dr. Monticino). Assignment Sheet  Read Chapter 21  Assignment # 14 (Due Monday May 2 nd )  Chapter 21 Exercise Set A: 1,2,3,7.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
Example Random samples of size n =2 are drawn from a finite population that consists of the numbers 2, 4, 6 and 8 without replacement. a-) Calculate the.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
ISMT253a Tutorial 1 By Kris PAN Skewness:  a measure of the asymmetry of the probability distribution of a real-valued random variable 
Term 1 Week 7 Warm Ups. Warm Up 9/21/15 1. Give the percent of area under the normal curve represented by the arrows: 2. A survey shows that 35% will.
Math 3680 Lecture #15 Confidence Intervals. Review: Suppose that E(X) =  and SD(X) = . Recall the following two facts about the average of n observations.
Lecture 5 Introduction to Sampling Distributions.
1 VI. Why do samples allow inference? How sure do we have to be? How many do I need to be that sure? Sampling Distributions, Confidence Intervals, & Sample.
Hypothesis test flow chart
6 - 1 © 2000 Prentice-Hall, Inc. Statistics for Business and Economics Sampling Distributions Chapter 6.
Example A population has a mean of 200 and a standard deviation of 50. A random sample of size 100 will be taken and the sample mean x̄ will be used to.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics Statistics & Econometrics.
Tests of Significance We use test to determine whether a “prediction” is “true” or “false”. More precisely, a test of significance gets at the question.
The expected value The value of a variable one would “expect” to get. It is also called the (mathematical) expectation, or the mean.
Review Statistical inference and test of significance.
Introduction Sample surveys involve chance error. Here we will study how to find the likely size of the chance error in a percentage, for simple random.
Review Design of experiments, histograms, average and standard deviation, normal approximation, measurement error, and probability.
Review Law of averages, expected value and standard error, normal approximation, surveys and sampling.
LESSON 19: UNDERSTANDING VARIABILITY IN ESTIMATES Student Outcomes Students understand the term sampling variability in the context of estimating a population.
Unit 7: Chance Variation
Inference for the Mean of a Population
STANDARD ERROR OF SAMPLE
Probability 9/22.
Probability and Estimation
LBSRE1021 Data Interpretation Lecture 9
Behavioral Statistics
Probability and Estimation
Chance Errors in Sampling (Dr. Monticino)
Chapter 8: Estimating With Confidence
Counting and Classification (Dr. Monticino)
Accuracy of Averages.
Presentation transcript:

Ch. 20 Chance errors in sampling Finding the EV and SE of a percent Our temporary assumptions are that we know how to find the average & SD of a box.

Example 1: We have a group of 100,000 people, of which 20% have college degrees and 80% do not. Suppose we draw an SRS of 100 people. Fill in the blanks: The percent of college educated people in the sample is around ____, give or take ____ or so.

Set up the box model and determine how many tickets there are of each type. Find the average, SD of the box, EV of the sum, SE of the sum. Change to percents.

In general, EV of percent = x100% SE of percent = x100% A flaw in SRS is that it is not realistic. If the population size is large compared to the sample size, EV & SE are good approximations. (Correction for SE discussed later & EV doesn’t need to corrected. See Ex. A #4 solution)

EV for percentage for a sample is equal to population percentage. EV & SE for percentages depend on average and SD of box and sample size, not population size. EV for percentage for a sample is equal to population percentage. Increasing the sample size (# of draws) increases the EV of sum increases the SE of sum, but more slowly than EV. no effect on EV for percent SE for percent decreases

Example 2: Same situation as Example 1 except the sample size is 400. When we multiply the sample size by 4, the SE for the percentage is the original samples SE for %

Activity: Work in pairs on Exercise Set A #2, 6 p. 361 #2. A university has 25,000 students, of whom 10,000 are older than 25. The registrar draws an SRS of 400 students. a) Find the EV & SE for the number of students in the sample older than 25. b) Find the EV & SE for the percentage of students in the sample older than 25. c) The percentage of students in the sample who are older than 25 will be around ____ , give or take ______ or so.

#6. 900 draws are made at random with replacement from a box which has 1 red marble and 9 blue ones. The SE for the percentage of red marbles in the sample is 1%. A sample percentage which is 1 SE above its EV equals: _________ CHOICES: 10%+1% (2) 1.01x10% Explain.

Using the Normal Curve Example 3: Return to Example 1. Estimate the chance that more than 22% of the sample will have college degrees. Why is it okay to use the normal curve for estimates? We know that it is okay for sums and a percent is just a re-scaling of a sum.

In our college degree example, the variable was qualitative, so the variable was clear cut. Compare this to the example in the book on p. 363. Here the variable is quantitative and the cut-point is arbitrary. This could lead to potential interpretation issues and possibly biases results. Note: We do not yet have a way to work with average income in a sample, just net income.

Correction factor SE when drawing without replacement = (correction factor)x(SE when drawing with replacement) Correction factor =