Copyright (c) Bani Mallick1 Stat 651 Lecture 5. Copyright (c) Bani Mallick2 Topics in Lecture #5 Confidence intervals for a population mean  when the.

Slides:



Advertisements
Similar presentations
CHAPTER 14: Confidence Intervals: The Basics
Advertisements

Chapter 6 Sampling and Sampling Distributions
Sampling: Final and Initial Sample Size Determination
Sampling Distributions and Sample Proportions
Sampling Distributions
Sampling Distributions (§ )
Terminology A statistic is a number calculated from a sample of data. For each different sample, the value of the statistic is a uniquely determined number.
Inference: Confidence Intervals
Chapter 7 Introduction to Sampling Distributions
Math 161 Spring 2008 What Is a Confidence Interval?
Copyright (c) Bani Mallick1 Lecture 2 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #2 Population and sample parameters More on populations.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture 9.
Copyright (c) Bani Mallick1 STAT 651 Lecture 7. Copyright (c) Bani Mallick2 Topics in Lecture #7 Sample size for fixed power Never, ever, accept a null.
Chapter 7 Sampling and Sampling Distributions
Copyright (c) Bani Mallick1 STAT 651 Lecture 10. Copyright (c) Bani Mallick2 Topics in Lecture #10 Comparing two population means using rank tests Comparing.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #16.
Copyright (c) Bani Mallick1 Lecture 4 Stat 651. Copyright (c) Bani Mallick2 Topics in Lecture #4 Probability The bell-shaped (normal) curve Normal probability.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #15.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #13.
Sampling Distributions
The Excel NORMDIST Function Computes the cumulative probability to the value X Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc
Copyright (c) Bani K. Mallick1 STAT 651 Lecture # 12.
Copyright (c) Bani Mallick1 STAT 651 Lecture # 11.
BCOR 1020 Business Statistics
Inference on averages Data are collected to learn about certain numerical characteristics of a process or phenomenon that in most cases are unknown. Example:
Chapter 10: Estimating with Confidence
Chapter 11: Random Sampling and Sampling Distributions
Objectives (BPS chapter 14)
Chapter 7 Confidence Intervals and Sample Sizes
Sociology 5811: Lecture 7: Samples, Populations, The Sampling Distribution Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Section 8.2 Estimating  When  is Unknown
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 9 Section 1 – Slide 1 of 39 Chapter 9 Section 1 The Logic in Constructing Confidence Intervals.
Chapter 11: Estimation Estimation Defined Confidence Levels
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
Dan Piett STAT West Virginia University
Estimation of Statistical Parameters
16-1 Copyright  2010 McGraw-Hill Australia Pty Ltd PowerPoint slides to accompany Croucher, Introductory Mathematics and Statistics, 5e Chapter 16 The.
Sampling Distribution ● Tells what values a sample statistic (such as sample proportion) takes and how often it takes those values in repeated sampling.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
PARAMETRIC STATISTICAL INFERENCE
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Vegas Baby A trip to Vegas is just a sample of a random variable (i.e. 100 card games, 100 slot plays or 100 video poker games) Which is more likely? Win.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.
Statistical estimation, confidence intervals
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Distributions of the Sample Mean
Confidence intervals: The basics BPS chapter 14 © 2006 W.H. Freeman and Company.
1 Chapter 6 Estimates and Sample Sizes 6-1 Estimating a Population Mean: Large Samples / σ Known 6-2 Estimating a Population Mean: Small Samples / σ Unknown.
Section 10.1 Confidence Intervals
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 10: Confidence Intervals
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall
Copyright (c) Bani K. Mallick1 STAT 651 Lecture 6.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. PPSS The situation in a statistical problem is that there is a population of interest, and a quantity or.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.
Lecture 5 Introduction to Sampling Distributions.
Copyright (c) Bani Mallick1 STAT 651 Lecture 8. Copyright (c) Bani Mallick2 Topics in Lecture #8 Sign test for paired comparisons Wilcoxon signed rank.
1 Probability and Statistics Confidence Intervals.
Estimation by Intervals Confidence Interval. Suppose we wanted to estimate the proportion of blue candies in a VERY large bowl. We could take a sample.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
10.1 Estimating with Confidence Chapter 10 Introduction to Inference.
Statistics 200 Objectives:
This Week Review of estimation and hypothesis testing
Confidence Intervals for a Population Mean,
Sampling Distributions (§ )
Interval Estimation Download this presentation.
Presentation transcript:

Copyright (c) Bani Mallick1 Stat 651 Lecture 5

Copyright (c) Bani Mallick2 Topics in Lecture #5 Confidence intervals for a population mean  when the population standard deviation  is known. Properties of confidence intervals: what things make them longer and shorter. Sample size calculation for a population mean  when the population standard deviation  is known : a simple illustration of a method.

Copyright (c) Bani Mallick3 Book Sections Covered in Lecture #5 Chapter 5.1 Chapter 5.2 Chapter 5.3

Copyright (c) Bani Mallick4 Lecture 4 Review: Pr(X < c) for Normal Populations Compute the z-score Look up value in Table 1

Copyright (c) Bani Mallick5 Lecture 4 Review: Pr(X > c) for Normal Populations Compute the z-score Look up the value for z in Table 1 Subtract this value from 1.0

Copyright (c) Bani Mallick6 Lecture 4 Review: Inference The sample mean is a random variable Its own “population” mean is  It’s standard deviation is Note how the standard deviation of the sample mean becomes smaller as the sample size becomes larger More data = more precision!!!!!

Copyright (c) Bani Mallick7 Lecture 4 Review: Central Limit Theorem The sample mean is a random variable Its own “population” mean is  It’s standard deviation is In “large enough” samples, the sample mean is very nearly normally distributed, i.e., has a bell--shaped histogram

Copyright (c) Bani Mallick8 Confidence Interval for a Population Mean A considerable part of basic statistics is to make inferences about the population mean  It is impossible to know the value of  exactly. This is a key factoid: why do I say this with such certainty?

Copyright (c) Bani Mallick9 Confidence Interval for a Population Mean A considerable part of basic statistics is to make inferences about the population mean  It is impossible to know the value of  exactly. Because (almost) every sample will give you a unique sample mean, and that sample mean will not equal the population mean.

Copyright (c) Bani Mallick10 Confidence Interval for a Population Mean What we can do is to construct an interval of possible values for the population mean . The interval is determined by how much “confidence” we want in saying that the population mean  is in the interval. The interval is always of the form

Copyright (c) Bani Mallick11 Confidence Interval for a Population Mean The confidence factor is determined by how much confidence we want in concluding that the population mean is actually in the interval Which interval has higher confidence of including the population mean? -100 to -50 OR -150 to 0

Copyright (c) Bani Mallick12 Confidence Interval for a Population Mean: Formal Method The first method assumes that the population standard deviation  is known. Suppose we want to be 95% confident that our interval includes the population mean , i.e., the probability is 95% that the population mean  is in the interval. Here is the interval:

Copyright (c) Bani Mallick13 WOMEN’S INTERVIEW SURVEY OF HEALTH (WISH)  computed the reported mean caloric intake at the start of the study, and the mean reported caloric intake at the end My random variable X was the change (difference) My hypothesis is that the population mean of X is < 0. In other words, I think women report less calories the more they are asked about their diet (Hawthorne Effect).

Copyright (c) Bani Mallick14 WISH: Change in Caloric Intake Does it look like a big change? Note that the scale of the box plot is to 2000

Copyright (c) Bani Mallick15 WISH The sample size is n = 271 The sample mean change = -180 I am going to pretend that the population standard deviation is  = 600.

Copyright (c) Bani Mallick16 WISH: Change in Reported Caloric Intake n = 271,  = 600, = % CI = -251 to -109

Copyright (c) Bani Mallick17 Review  = 600, n = 271, = -180 Then, with 95% probability, true population mean change is in the interval from -251 to The chance is 95% that the population mean change is between 251 and 109 calories lower Is there a Hawthorne effect?

Copyright (c) Bani Mallick18 Confidence Intervals You can construct a confidence interval for the population mean with any level of confidence. Generally, people report the 95% CI, but sometimes they report the 90% and 99% confidence intervals. This is easy to do via a formula, and even easier to do via SPSS.

Copyright (c) Bani Mallick19 Confidence Interval for a Population Mean  when  is Known Want 90%, 95% and 99% chance of interval including . 90% 95% 99%

Copyright (c) Bani Mallick20 Confidence Intervals There is a general formula given on page 200 If you want a (1-  )100% confidence interval for the population mean  when the population s.d.  is known, use the formula The term z  is the value in Table 1 that gives probability 1 -  /2.  = 0.10, z  = 1.645:  = 0.05, z  = 1.96,  = 0.01, z  = 2.58

Copyright (c) Bani Mallick21 WISH The sample size is n = 271 The sample mean change = -180 I am going to pretend that the population standard deviation is  = 600. I want a 99% confidence interval: z  = 2.58

Copyright (c) Bani Mallick22 WISH: Change in Reported Caloric Intake n = 271,  = 600, = % CI = -274 to -86

Copyright (c) Bani Mallick23 WISH: Change in Reported Caloric Intake 99% CI = -274 to -86 The chance is 99% that the population mean change in reported caloric intake is between 274 and 86 calories The chance is less than 1% that there is no change in the population mean.

Copyright (c) Bani Mallick24 WISH: Change in Reported Caloric Intake 99% CI = -274 to % CI = -251 to -109 Note that the 99% CI is longer than the 95% CI. This is clear(!): the more confidence you want, the longer the CI has to be. Put another way, the less willing you are to be wrong, the more conservative your claims.

Copyright (c) Bani Mallick25 Effect of Sample Size 95% CI = -251 to -109 with n = 271 If n = 1000, the 95% CI would be from -217 to -143 Note how the CI gets shorter in length as the sample size gets larger. This is a general fact: the larger the sample size the shorter the CI.

Copyright (c) Bani Mallick26 Effect of Population Standard Deviation 95% CI = -251 to -109 with  = 600 If  = 2000, the 95% CI would be from -418 to +58 Note how the CI gets longer in length as the population standard deviation  gets larger. This is a general fact: the larger the population standard deviation  the longer the CI.

Copyright (c) Bani Mallick27 Using SPSS to Construct CI SPSS actually assumes that the population standard deviation is unknown: we will consider this case later. Its default is a 95% CI You can easily change to any level of confidence SPSS demo using Wish Data

Copyright (c) Bani Mallick28 Sample Size Determination In general, this is a relatively complex issue, depending very heavily on the experiment. I will show you a simple calculation in the special case that the population standard deviation  is known. Of course,  is not known in practice, and more complex methods are required, but this will give you a feel for the process.

Copyright (c) Bani Mallick29 Sample Size Determination The usual answer to “what sample size should I take” is “what can you afford”. Remember, more precision with larger sample sizes Less precision with smaller sample sizes

Copyright (c) Bani Mallick30 Sample Size Determination The length of a confidence interval is 2 x confidence factor Thus, our 95% CI for WISH was -251 to -109, so that the length was 142 calories What if I wanted the length to be 100 calories? Then the CI would have to be

Copyright (c) Bani Mallick31 Sample Size Determination The length of the CI is If I want the length of a confidence interval to be 2 x E then I have to set Now I do some algebra

Copyright (c) Bani Mallick32 Sample Size Determination I want the length of a confidence interval to be 2 x E then the sample size I need is

Copyright (c) Bani Mallick33 Sample Size Determination Consider WISH, where = 600. Suppose I want the confidence interval length of 95% CI to be 2xE = 100 E = 50, z  = 1.96

Copyright (c) Bani Mallick34 Sample Size Determination Consider WISH, where = 600. Suppose I want the confidence interval length of 95% CI to be 2xE = 60 E = 30, z  = 1.96

Copyright (c) Bani Mallick35 Sample Size Determination 95% confidence Length = 100, E = 50, n = 553 Length = 60, E = 30, n = 1,557 General fact: the more precise you want to be (shorter CI), the larger the sample size you will need.

Copyright (c) Bani Mallick36 Sample Size Determination General fact: the larger the population standard deviation, the larger the sample size you will need to have a CI of length 2xE

Copyright (c) Bani Mallick37 Reactiver Oxygen Species (ROS) Data Rats fed with Fish oil enhanced diets Response is the change in ROS for an animal when the cells are exposed to butyrate

Copyright (c) Bani Mallick38 ROS Data

Copyright (c) Bani Mallick39 ROS Data Sample mean = 3.21 Sample size is n = 20 Pretend  = 3.33 Then = % interval for population mean change is [ * 1.96, * 1.96] = [1.76, 4.66]: Does butyrate increase ROS? How certain are we?

Copyright (c) Bani Mallick40 ROS Data  = 3.33, n = 20 95% interval for population mean change is [1.76, 4.66] The length of the CI is 2xE = 2.90 What sample size would I need to make the length of the CI = 1.00? Here 2xE = 1.00, E = 0.50, and