Bias and Variability Lecture 28 Section 8.3 Tue, Oct 25, 2005.

Slides:



Advertisements
Similar presentations
“Students” t-test.
Advertisements

CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
 These 100 seniors make up one possible sample. All seniors in Howard County make up the population.  The sample mean ( ) is and the sample standard.
AP Statistics: Section 9.1 Sampling Distributions
Sampling: Final and Initial Sample Size Determination
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
Suppose we are interested in the digits in people’s phone numbers. There is some population mean (μ) and standard deviation (σ) Now suppose we take a sample.
Chapter 8 Estimation: Single Population
9.1 Sampling Distributions A parameter is a number that describes the population. A parameter is a fixed number, but in practice we do not know its value.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Central Limit Theorem The Normal Distribution The Standardised Normal.
The Basics  A population is the entire group on which we would like to have information.  A sample is a smaller group, selected somehow from.
1 Psych 5500/6500 Statistics and Parameters Fall, 2008.
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Tue, Oct 23, 2007.
Chapter 7 Estimation: Single Population
Chapter 11: Estimation Estimation Defined Confidence Levels
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
STA291 Statistical Methods Lecture 16. Lecture 15 Review Assume that a school district has 10,000 6th graders. In this district, the average weight of.
1 Introduction to Estimation Chapter Concepts of Estimation The objective of estimation is to determine the value of a population parameter on the.
Estimation Bias, Standard Error and Sampling Distribution Estimation Bias, Standard Error and Sampling Distribution Topic 9.
LECTURE 16 TUESDAY, 31 March STA 291 Spring
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
Bias and Variability Lecture 38 Section 8.3 Wed, Mar 31, 2004.
AP Statistics: Section 9.1 Sampling Distributions.
Selecting Input Probability Distribution. Simulation Machine Simulation can be considered as an Engine with input and output as follows: Simulation Engine.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
Testing Hypotheses about a Population Proportion Lecture 29 Sections 9.1 – 9.3 Fri, Nov 12, 2004.
Paired Samples Lecture 39 Section 11.3 Tue, Nov 15, 2005.
Testing Hypotheses about a Population Proportion Lecture 31 Sections 9.1 – 9.3 Wed, Mar 22, 2006.
Review of Statistical Terms Population Sample Parameter Statistic.
Sampling Theory and Some Important Sampling Distributions.
1 Virtual COMSATS Inferential Statistics Lecture-4 Ossam Chohan Assistant Professor CIIT Abbottabad.
Sampling Distributions: Suppose I randomly select 100 seniors in Anne Arundel County and record each one’s GPA
Ex St 801 Statistical Methods Inference about a Single Population Mean (CI)
MATH Section 4.4.
Making Decisions about a Population Mean with Confidence Lecture 35 Sections 10.1 – 10.2 Fri, Mar 25, 2005.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Heights  Put your height in inches on the front board.  We will randomly choose 5 students at a time to look at the average of the heights in this class.
Chapter 9 Estimation using a single sample. What is statistics? -is the science which deals with 1.Collection of data 2.Presentation of data 3.Analysis.
Chapter 9 Lesson 9.1 Estimation Using a Simple Sample 9.1: Point Estimation.
Bias and Variability Lecture 27 Section 8.3 Wed, Nov 3, 2004.
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Statistics for Business and Economics 7 th Edition Chapter 7 Estimation: Single Population Copyright © 2010 Pearson Education, Inc. Publishing as Prentice.
Student’s t Distribution Lecture 32 Section 10.2 Fri, Nov 10, 2006.
Student’s t Distribution
Lecture 28 Section 8.3 Fri, Mar 4, 2005
CHAPTER 6 Random Variables
Testing Hypotheses about a Population Proportion
Section 1: Estimating with large samples
Behavioral Statistics
Chapter 9 Hypothesis Testing.
Independent Samples: Comparing Proportions
CHAPTER 22: Inference about a Population Proportion
Chapter 9.1: Sampling Distributions
One-Way Analysis of Variance
Sampling Distribution of a Sample Proportion
Sampling Distribution of a Sample Proportion
Making Decisions about a Population Mean with Confidence
CHAPTER 15 SUMMARY Chapter Specifics
Sampling Distribution of a Sample Proportion
Testing Hypotheses about a Population Proportion
Chapter 7: Sampling Distributions
Continuous Random Variables 2
Sampling Distribution of a Sample Proportion
Sampling Distributions
Warmup Which of the distributions is an unbiased estimator?
Testing Hypotheses about a Population Proportion
Section 9.2: Sample Proportions
Testing Hypotheses about a Population Proportion
Presentation transcript:

Bias and Variability Lecture 28 Section 8.3 Tue, Oct 25, 2005

Unbiased Statistics Unbiased statistic – A statistic whose average value equals the parameter that it is estimating. Unbiased statistic – A statistic whose average value equals the parameter that it is estimating. We have already seen that p ^ is an unbiased estimator of p, because  p^ = p. We have already seen that p ^ is an unbiased estimator of p, because  p^ = p. Would the sample range be an unbiased estimator of the population range? Would the sample range be an unbiased estimator of the population range?

Variability of a Statistic The variability of a statistic is a measure of how spread out the sampling distribution of that statistic is. The variability of a statistic is a measure of how spread out the sampling distribution of that statistic is. All estimators exhibit some variability. All estimators exhibit some variability. The less variability, the better. The less variability, the better.

The Parameter The parameter

Unbiased, Low Variability The parameter The sampling distribution

Unbiased, High Variability The parameter

Biased, High Variability The parameter

Biased, Low Variability The parameter

Accuracy and Precision An unbiased statistic allows us to make accurate estimates. An unbiased statistic allows us to make accurate estimates. A low variability statistic allows us to make precise estimates. A low variability statistic allows us to make precise estimates. The best estimator is one that is unbiased and with low variability. The best estimator is one that is unbiased and with low variability. Then we can make estimates that are both accurate and precise. Then we can make estimates that are both accurate and precise.

The Sampling Distribution of p ^ Since the mean of p ^ equals p, then p ^ is an unbiased estimator of p. Since the mean of p ^ equals p, then p ^ is an unbiased estimator of p. Because n appears in the denominator of the standard deviation, Because n appears in the denominator of the standard deviation, The standard deviation of p ^ decreases as n increases. The standard deviation of p ^ decreases as n increases. Therefore, for large samples (large n), p ^ has a lower variability than it does for small samples. Therefore, for large samples (large n), p ^ has a lower variability than it does for small samples. In that respect, larger samples are better. In that respect, larger samples are better.

Experiment I will use randBin(50,.1, 200) to simulate selecting 50 people from a population that is 10% male 200 times, and counting the males. I will use randBin(50,.1, 200) to simulate selecting 50 people from a population that is 10% male 200 times, and counting the males. Volunteer #1: randBin(50,.3, 200) (30% male) Volunteer #1: randBin(50,.3, 200) (30% male) Volunteer #2: randBin(50,.5, 200) (50% male) Volunteer #2: randBin(50,.5, 200) (50% male) Volunteer #3: randBin(50,.7, 200) (70% male) Volunteer #3: randBin(50,.7, 200) (70% male) Volunteer #4: randBin(50,.9, 200) (90% male) Volunteer #4: randBin(50,.9, 200) (90% male) It will take the TI-83 about 6 minutes. It will take the TI-83 about 6 minutes.

Experiment Divide the list by 50 to get proportions. Divide the list by 50 to get proportions. Store the results in list L 1. Store the results in list L 1. STO L 1. STO L 1. Compute the statistics for L 1. Compute the statistics for L 1. 1-Var Stats L 1. 1-Var Stats L 1. What are the means and standard deviations? What are the means and standard deviations? Do they seem to change, depending on the population proportion? Do they seem to change, depending on the population proportion?

Sampling Distributions and Hypothesis Testing Suppose we choose a sample of n students from an unknown population. Suppose we choose a sample of n students from an unknown population. However, we know that the population consists of either 1/3 freshmen or 2/3 freshmen. However, we know that the population consists of either 1/3 freshmen or 2/3 freshmen. Our purpose is to test the following hypotheses: Our purpose is to test the following hypotheses: H 0 : p = 1/3. H 0 : p = 1/3. H 1 : p = 2/3. H 1 : p = 2/3.

Sampling Distributions and Hypothesis Testing Under H 0, the sampling distribution of p ^ should be Under H 0, the sampling distribution of p ^ should be normal, normal,  p^ = 1/3,  p^ = 1/3,  p^ =  ((1/3)(2/3)/n) = /  n.  p^ =  ((1/3)(2/3)/n) = /  n. Under H 1, the sampling distribution of p ^ should be Under H 1, the sampling distribution of p ^ should be normal, normal,  p^ = 2/3,  p^ = 2/3,  p^ =  ((2/3)(1/3)/n) = /  n.  p^ =  ((2/3)(1/3)/n) = /  n.

Sampling Distributions and Hypothesis Testing The likelihood of being able to tell the difference based on p ^ will depend on the sample size. The likelihood of being able to tell the difference based on p ^ will depend on the sample size. The larger the sample, the more likely it is that we will be able to distinguish between the two hypothetical populations. The larger the sample, the more likely it is that we will be able to distinguish between the two hypothetical populations.

PDFs of p ^ for n = 5 H0H0 H1H1

PDFs of p ^ for n = 10 H0H0 H1H1

PDFs of p ^ for n = 20 H0H0 H1H1

PDFs of p ^ for n = 50 H0H0 H1H1

PDFs of p ^ for n = 100 H0H0 H1H1

Let’s Do It! Let’s do it! 8.5, p. 521 – Probabilities about the Proportion of People with Type B Blood. Let’s do it! 8.5, p. 521 – Probabilities about the Proportion of People with Type B Blood. Let’s do it! 8.6, p. 523 – Estimating the Proportion of Patients with Side Effects. Let’s do it! 8.6, p. 523 – Estimating the Proportion of Patients with Side Effects. Let’s do it! 8.7, p. 525 – Testing hypotheses about Smoking Habits. Let’s do it! 8.7, p. 525 – Testing hypotheses about Smoking Habits. See Example 8.5, p. 524 – Testing Hypotheses about the Proportion of Cracked Bottles. See Example 8.5, p. 524 – Testing Hypotheses about the Proportion of Cracked Bottles.