Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics.

Slides:



Advertisements
Similar presentations
Inference about a Population Proportion
Advertisements

Confidence Intervals for Population Means
Ch. 8 – Practical Examples of Confidence Intervals for z, t, p.
Statistics 1: Introduction to Probability and Statistics Section 3-3.
Part 7: Bernoulli and Binomial Distributions 7-1/32 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department.
Inference for a Population Proportion
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 19 Confidence Intervals for Proportions.
Math 161 Spring 2008 What Is a Confidence Interval?
Statistics and Data Analysis
1 The Basics of Regression Regression is a statistical technique that can ultimately be used for forecasting.
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
Professor William Greene Stern School of Business IOMS Department Department of Economics Statistical Inference and Regression Analysis: Stat-GB ,
Part 18: Regression Modeling 18-1/44 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics.
1 Econ 240A Power Four Last Time Probability.
 a fixed measure for a given population  ie: Mean, variance, or standard deviation.
Copyright ©2005 Brooks/Cole, a division of Thomson Learning, Inc. How to Get a Good Sample Chapter 4.
Determining the Size of
The arithmetic mean of a variable is computed by determining the sum of all the values of the variable in the data set divided by the number of observations.
Agenda Review Exam I Sampling Start Probabilities.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
10.3 Estimating a Population Proportion
Basics of Statistical Analysis. Basics of Analysis The process of data analysis Example 1: –Gift Catalog Marketer –Mails 4 times a year to its customers.
Part 17: Regression Residuals 17-1/38 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics.
Ch 8 Estimating with Confidence. Today’s Objectives ✓ I can interpret a confidence level. ✓ I can interpret a confidence interval in context. ✓ I can.
Confidence Intervals about a Population Proportion
Chapter Twelve Census: Population canvass - not really a “sample” Asking the entire population Budget Available: A valid factor – how much can we.
Ch 8 Estimating with Confidence. Today’s Objectives ✓ I can interpret a confidence level. ✓ I can interpret a confidence interval in context. ✓ I can.
Section 2 Part 2.   Population - entire group of people or items for which we are collecting data  Sample – selections of the population that is used.
CONFIDENCE INTERVALS Feb. 18 th, A STATS PROFESSOR ASKED HER STUDENTS WHETHER OR NOT THEY WERE REGISTERED TO VOTE. IN A SAMPLE OF 50 OF HER STUDENTS.
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Day 3: Sampling Distributions. CCSS.Math.Content.HSS-IC.A.1 Understand statistics as a process for making inferences about population parameters based.
Making Inferences. Sample Size, Sampling Error, and 95% Confidence Intervals Samples: usually necessary (some exceptions) and don’t need to be huge to.
Mar. 22 Statistic for the day: Percent of Americans 18 or older who believe Martha Stewart’s sentence should include jail time: 53% Assignment: Read Chapter.
Part 2: Model and Inference 2-1/49 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
Confidence Intervals and Tests of Proportions. Assumptions for inference when using sample proportions: We will develop a short list of assumptions for.
Determination of Sample Size: A Review of Statistical Theory
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
Biostatistics Unit 5 – Samples. Sampling distributions Sampling distributions are important in the understanding of statistical inference. Probability.
February 2012 Sampling Distribution Models. Drawing Normal Models For cars on I-10 between Kerrville and Junction, it is estimated that 80% are speeding.
Chapter 5 Parameter estimation. What is sample inference? Distinguish between managerial & financial accounting. Understand how managers can use accounting.
Inference about a Population Proportion BPS chapter 19 © 2010 W.H. Freeman and Company.
Physics 270 – Experimental Physics. Let say we are given a functional relationship between several measured variables Q(x, y, …) x ±  x and x ±  y What.
Statistics : Statistical Inference Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University 1.
CONFIDENCE STATEMENT MARGIN OF ERROR CONFIDENCE INTERVAL 1.
Learning Objective Chapter 12 Sample Size Determination Copyright © 2000 South-Western College Publishing Co. CHAPTER twelve Sample Size Determination.
Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O’Halloran.
What is a Confidence Interval?. Sampling Distribution of the Sample Mean The statistic estimates the population mean We want the sampling distribution.
Honors Advanced Algebra Presentation 1-7.  The mean of the quiz grades was 33.77, and the standard deviation was  The mean of Algebra grades is.
Chapter 19 Confidence intervals for proportions
Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5.
Ch. 18 – Sampling Distribution Models (Day 1 – Sample Proportions) Part V – From the Data at Hand to the World at Large.
Copyright © 2012 by Nelson Education Limited. Chapter 5 Introduction to inferential Statistics: Sampling and the Sampling Distribution 5-1.
Warm Up In May 2006, the Gallup Poll asked 510 randomly sampled adults the question “Generally speaking, do you believe the death penalty is applied fairly.
Sample Size Mahmoud Alhussami, DSc., PhD. Sample Size Determination Is the act of choosing the number of observations or replicates to include in a statistical.
9.1: Sampling Distributions. Parameter vs. Statistic Parameter: a number that describes the population A parameter is an actual number, but we don’t know.
 Here’s the formula for a CI for p: p-hat is our unbiased Estimate of p. Z* is called the critical value. I’ll teach you how to calculate that next. This.
Chapter Eleven Sample Size Determination Chapter Eleven.
Statistic for the day: % of Americans who didn’t attend college who say they believe in extraterrestrials: 37 Who did attend college: 47 Assignment: Read.
Margin of Error S-IC.4 Use data from a sample survey to estimate a population mean or proportion; develop a margin of error through the use of simulation.
Copyright © 2010 Pearson Education, Inc. Slide
Chapter Seven Point Estimation and Confidence Intervals.
 Normal Curves  The family of normal curves  The rule of  The Central Limit Theorem  Confidence Intervals  Around a Mean  Around a Proportion.
Sampling and Sampling Distributions
CHAPTER 8 Estimating with Confidence
Ch. 18 – Sampling Distribution Models (Day 1 – Sample Proportions)
Central Limit Theorem General version.
Confidence Intervals for Proportions
EXAMPLE: The weight of a can of Coca Cola is supposed to have mean = 12 oz with std. dev.= 0.82 oz. The product is declared underweight if it weighs.
Standard Deviation Lecture 18 Sec Wed, Oct 4, 2006.
Presentation transcript:

Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics

The Margin of Error The CNN/Opinion Research Corp. said 51 percent of those polled thought Biden did the best job, while 36 percent thought Palin did the best job. On the question of the candidates' qualifications to assume the presidency, 87 percent of those polled said Biden is qualified and 42 percent said Palin is qualified. The poll had a margin of error of plus or minus 4 percentage points. (9:30 AM, Friday, October 3, 2008)

What Does the “Margin of Error” Tell You? Did Biden do the better job? If we could ask every individual all over the world who had an opinion, the proportion who think yes would be θ. We assume that such a value exists. It has to be as of a moment in time. The next day, the same question might get a different answer.

The Margin of Error We can’t ask everyone, so we ask a sample of people, i = 1,…,n. Do you think Biden did the better job? X i = 1 if the person answers yes, X i = 0 if they answer no. 51% said yes, so P =(1/n)Σ i x i = 0.51 Is π = 0.51? No, we didn’t ask everyone, we just asked a sample is an estimate of π.

Why the 4% Margin of Error? We acknowledge that the 0.51 might be inaccurate because it is based on a sample. We assume that whatever n is, the sample was drawn randomly. We use our empirical rule to figure out what the real value of π might be.

Some Theory P = (1/n)Σ i x i = 0.51 P is a random variable. It is the sum of n Bernoulli random variables, divided by n. E[x i ] = the probability that person i will answer yes. This is π. So, the expected value of P is (1/n)Σ i θ = θ. They think P is a good estimate of θ.

Theory Continued Since P is a random variable it has a variance. Var[P] = (1/n 2 ) Σ i θ (1- θ ) = θ (1- θ )/n The standard deviation is the square root. Use P to estimate this. The estimated standard deviation is sqr(.51(.49)/n).

That Margin of Error They use the same empirical rule we do. The margin of error is ±.04 is ±2 standard deviations. So, one standard deviation is = P(1-P)/n. If P is.51, n = 625. (According to David Gregory, they asked 560).

What they Found Based on a survey of 625 people, we believe the proportion of people who think Biden did a better job is between 47% and 55%. Is this CERTAIN? Based on the same logic, the proportions for Palin are 28% to 36%. Is this CERTAIN? What would these ranges be if they had asked 6,250 people instead of 625?

Regression: θ|State ≠ θ Overall