Having Confidence in our Means: Confidence Intervals

Slides:



Advertisements
Similar presentations
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Confidence Intervals Chapter 8.
Advertisements

Objectives Look at Central Limit Theorem Sampling distribution of the mean.
Estimating the Population Mean Assumptions 1.The sample is a simple random sample 2.The value of the population standard deviation (σ) is known 3.Either.
1 (Student’s) T Distribution. 2 Z vs. T Many applications involve making conclusions about an unknown mean . Because a second unknown, , is present,
Section 8.2 Estimating  When  is Unknown
Estimation Statistics with Confidence. Estimation Before we collect our sample, we know:  -3z -2z -1z 0z 1z 2z 3z Repeated sampling sample means would.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 7. Using Probability Theory to Produce Sampling Distributions.
Topic 5 Statistical inference: point and interval estimate
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 7.4 Estimation of a Population Mean  is unknown  This section presents.
Quick and Simple Statistics Peter Kasper. Basic Concepts Variables & Distributions Variables & Distributions Mean & Standard Deviation Mean & Standard.
Section 9.3: Confidence Interval for a Population Mean.
 At the beginning of the term, we talked about populations and samples  What are they?  Why do we take samples?
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
1 Mean Analysis. 2 Introduction l If we use sample mean (the mean of the sample) to approximate the population mean (the mean of the population), errors.
1 Probability and Statistics Confidence Intervals.
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
GOSSET, William Sealy How shall I deal with these small batches of brew?
1 Day 3 QMIM Confidence Interval for a Population mean of Large and Small Samples by Binam Ghimire.
The Statistical Imagination Chapter 7. Using Probability Theory to Produce Sampling Distributions.
Dr.Theingi Community Medicine
Simulation – Output Analysis. Point Estimate Point estimates are single value estimates of parameters of interest (in this case the mean and standard.
CHAPTER 8 Estimating with Confidence
GOVT 201: Statistics for Political Science
And distribution of sample means
Summary of t-Test for Testing a Single Population Mean (m)
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2016 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
AP Biology Intro to Statistics
Confidence Interval Estimation
Statistical Intervals Based on a Single Sample
Hypothesis Testing: One Sample Cases
Practice & Communication of Science From Distributions to Confidence
Using the t-distribution
Chapter 6 Confidence Intervals.
Practice & Communication of Science
Estimates and Sample Sizes Sections 6-2 & 6-4
Chapter 8: Estimating with Confidence
Higher National Certificate in Engineering
From Distributions to Confidence
Elementary Statistics
AP Biology Intro to Statistics
Chapter 5 Normal Distribution
Inference for distributions: - Optional topics in comparing distributions IPS chapter 7.3 © 2006 W.H. Freeman and Company.
Hypothesis Tests for a Population Mean in Practice
The Detail of the Normal Distribution
CI for μ When σ is Unknown
Elementary Statistics
Review: What influences confidence intervals?
Chapter 6 Confidence Intervals.
CHAPTER 8 Estimating with Confidence
Estimation Goal: Use sample data to make predictions regarding unknown population parameters Point Estimate - Single value that is best guess of true parameter.
Chapter 8: Estimating with Confidence
8.3 – Estimating a Population Mean
Chapter 8: Estimating with Confidence
Basic Practice of Statistics - 3rd Edition Inference for Regression
Normal Distribution Z-distribution.
Chapter 8 Confidence Intervals.
ESTIMATION OF THE MEAN AND PROPORTION
Intro to Confidence Intervals Introduction to Inference
Chapter 8: Estimating with Confidence
GENERALIZATION OF RESULTS OF A SAMPLE OVER POPULATION
Chapter 8: Estimating with Confidence
Variance and Hypothesis Tests
Chapter 8: Confidence Intervals
5: Introduction to estimation
Chapter 8: Estimating with Confidence
Objectives 6.1 Estimating with confidence Statistical confidence
Chapter 8: Estimating with Confidence
Objectives 6.1 Estimating with confidence Statistical confidence
Scientific Practice The Detail of the Normal
STA 291 Spring 2008 Lecture 14 Dustin Lueker.
Presentation transcript:

Having Confidence in our Means: Confidence Intervals Scientific Practice Having Confidence in our Means: Confidence Intervals

Samples are Estimates When we sample a population, we end up with a sample mean, x it’s our ‘best guess’ of the real population mean, µ the ‘real’ mean of the population is ‘hidden’ Our sample also has a measure of the variability of the data comprising it the sample Standard Deviation, s which is also an estimate of the population SD, σ s can be also be used to indicate the variability of the mean itself SEM = s / √ N can then use SEM to determine confidence limits

Confidence Limits and the SEM The SEM reflects the ‘fit’ of a sample mean, x , to the underlying population mean, µ if we calculate two sample means and they are the same, but for one the SEM is high, we are less ‘confident’ about how well that one estimates the population mean Just like the ‘raw’ data used to calculate a sample mean follows a distribution, so will repeat estimates of the population mean itself this is the t Distribution

The t Distribution Yet another distribution!  but distributions are important because they define how we expect our data to behave if we know that, then we gain insight into our expts! Generally ‘flatter’ than the Normal Distribution any particular area is more ‘spread out’ (less clear) the more ‘pointed’ a curve, the clearer the peak

t Distribution Pointedness Varies! Logic… the number of samples influences the ‘accuracy’ of our estimate of the population mean from the sample mean as N increases, the ‘peak’ becomes sharper a given area of the curve is less ‘spread out’ At high N, t Distribution = Normal Distribution

Using the t Distribution When we calculate a sample mean and call it our estimate of the population mean… it’s nice to know how ‘confident’ we are in that estimate One measure of confidence is the 95% Confidence Interval (95%CI) the range over which we are 95% confident the true population mean lies derived from our sample mean (we calculate) and our SEM (we calculate) and the N (though it’s the ‘degrees of freedom’, N-1) we use this to look up a ‘critical t vlaue’

From t Distribution to 95%CI The t Distribution is centred around our mean and its shape is influenced by N-1 95%CI involves chopping off the two 2.5% tails Need a t table to look up how many SEMs along the x-axis this point will be Value varies with N-1 And level of confidence sought

Step 1: The t Table t value varies with… For N = 10, α = 0.05 Row… DoF is N-1 Column… level of ‘confidence’ 95%CI involves chopping off the two 2.5% tails α = 0.05 (5%) For N = 10, α = 0.05 t(N-1),0.05 = 2.262 when N large, t=1.96

Step 2: Using the t value The t value is the number of SEMs along the x-axis (in each direction) that encompasses that % of the t distribution centred on our mean 2.262 in the case of t(N-1),0.05 Eg we measure the FVC (litres) of 10 people… mean = 3.83, SD = 1.05, N = 10 SEM = 1.05/√10 = 0.332 litres t(N-1),0.05 = 2.262 standard errors to cover 95% curve So, litres either side of the mean = 2.262 * 0.332 = 0.751 litres either side of mean covers 95% of dist So, 95%CI is 3.83 ± 0.751 = 3.079  4.581 litres (3.079, 4.581)

Effect of Bigger N A larger sample size gives us greater confidence in any population mean we estimate so 95%CI should be smaller In previous example… mean = 3.83, SD = 1.05, N =10, SEM = 0.332 95%CI is (3.079, 4.581) But say we measured 90 more people… mean = 3.55, SD = 0.915, N = 100 mean and SD similar to before, but SEM now a lot smaller, at 0.915/√100 = 0.0915 so too is t(N-1),0.05 = 1.96 (rather than 2.262) 95%CI = 3.55 ± (1.96 * 0.0915) = 3.371  3.729

Effect of Bigger N A bigger N ‘sharpens’ the t distribution so that the 95% boundaries are less far apart ie our confidence interval will become smaller 95%CI also shrinks because SEM = SD/√N

Summary Sample means are estimates of population means Bigger samples give more confident estimates SEM reflects the distribution of mean estimates SEM = s / √ N Estimates of means follow the t Distribution t Distribution becomes ‘sharper’ with higher N ‘Width’ of t dist covering 95% is called 95%CI range in which 95/100 mean estimates would fall 95%CI = mean ± (t(N-1),0.05 * SEM) t is the number of SEMs along dist covering that %