Inferring the Mean and Standard Deviation of a Population.

Slides:



Advertisements
Similar presentations
Statistics and Quantitative Analysis U4320
Advertisements

© 2013 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the World through.
Confidence Interval and Hypothesis Testing for:
PSY 307 – Statistics for the Behavioral Sciences
Estimating the Population Mean Assumptions 1.The sample is a simple random sample 2.The value of the population standard deviation (σ) is known 3.Either.
The Normal Distribution. n = 20,290  =  = Population.
Tuesday, October 22 Interval estimation. Independent samples t-test for the difference between two means. Matched samples t-test.
HIM 3200 Normal Distribution Biostatistics Dr. Burton.
EEM332 Design of Experiments En. Mohd Nazri Mahmud
Chapter 2 Simple Comparative Experiments
BPS - 5th Ed. Chapter 171 Inference about a Population Mean.
Chapter 11: Inference for Distributions
1 (Student’s) T Distribution. 2 Z vs. T Many applications involve making conclusions about an unknown mean . Because a second unknown, , is present,
Hypothesis Testing Using The One-Sample t-Test
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
AM Recitation 2/10/11.
Two Sample Tests Ho Ho Ha Ha TEST FOR EQUAL VARIANCES
Statistical inference: confidence intervals and hypothesis testing.
Statistical Analysis Statistical Analysis
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Confidence Interval Estimation
Slide 23-1 Copyright © 2004 Pearson Education, Inc.
Confidence Intervals for Means. point estimate – using a single value (or point) to approximate a population parameter. –the sample mean is the best point.
Statistics 101 Chapter 10. Section 10-1 We want to infer from the sample data some conclusion about a wider population that the sample represents. Inferential.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Chapter 9 Hypothesis Testing and Estimation for Two Population Parameters.
Chapter 11 Inference for Distributions AP Statistics 11.1 – Inference for the Mean of a Population.
BPS - 5th Ed. Chapter 171 Inference about a Population Mean.
Statistical Analysis Mean, Standard deviation, Standard deviation of the sample means, t-test.
When σ is Unknown The One – Sample Interval For a Population Mean Target Goal: I can construct and interpret a CI for a population mean when σ is unknown.
1 Happiness comes not from material wealth but less desire.
For 95 out of 100 (large) samples, the interval will contain the true population mean. But we don’t know  ?!
Pengujian Hipotesis Dua Populasi By. Nurvita Arumsari, Ssi, MSi.
Statistics in Biology. Histogram Shows continuous data – Data within a particular range.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
BPS - 3rd Ed. Chapter 161 Inference about a Population Mean.
Essential Statistics Chapter 161 Inference about a Population Mean.
AP Statistics Chapter 24 Comparing Means.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Chapter 8 Parameter Estimates and Hypothesis Testing.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Mystery 1Mystery 2Mystery 3.
MTH3003 PJJ SEM II 2014/2015 F2F II 12/4/2015.  ASSIGNMENT :25% Assignment 1 (10%) Assignment 2 (15%)  Mid exam :30% Part A (Objective) Part B (Subjective)
Inferences Concerning Variances
Estimating a Population Mean. Student’s t-Distribution.
Confidence Intervals for a Population Mean, Standard Deviation Unknown.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 13 Sampling distributions
Statistics: Unlocking the Power of Data Lock 5 Section 6.4 Distribution of a Sample Mean.
Inference About Means Chapter 23. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it’d be nice.
Essential Statistics Chapter 171 Two-Sample Problems.
6.3 One- and Two- Sample Inferences for Means. If σ is unknown Estimate σ by sample standard deviation s The estimated standard error of the mean will.
Ex St 801 Statistical Methods Inference about a Single Population Mean (CI)
Inference for distributions: - Comparing two means.
Chapter 7 Inference Concerning Populations (Numeric Responses)
Inference about the mean of a population of measurements (  ) is based on the standardized value of the sample mean (Xbar). The standardization involves.
AP Statistics Chapter 24 Comparing Means. Objectives: Two-sample t methods Two-Sample t Interval for the Difference Between Means Two-Sample t Test for.
16/23/2016Inference about µ1 Chapter 17 Inference about a Population Mean.
Review of Power of a Test
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Chapter 2 Simple Comparative Experiments
Basic Practice of Statistics - 3rd Edition Two-Sample Problems
Essential Statistics Two-Sample Problems - Two-sample t procedures -
Chapter 23 Inference About Means.
Presentation transcript:

Inferring the Mean and Standard Deviation of a Population

Central Problem Two important numbers tell us a lot about a distribution of data: Mean tells us the central tendency of the data Standard deviation tells us the spread in the data The problem is … we don’t normally know either of these and must infer them from a SRS of the population

Baby Paradox Two hospitals in the same city deliver, on average, a 50:50 ratio of baby girls and baby boys. Hospital A delivers 120 babies a day (on average) while hospital B delivers 12 babies a day (on average). One day there were twice as many boys as girls born in one of the hospitals. In which hospital is this more likely to happen?

Measuring the mean… How do we know the mean of a population? Answer: We can either measure every single sample in the population or estimate the mean from a suitable SRS  We will assume that the population is normally distributed so X has a normal distribution N( ,  /√n)

Standard Error and Standard Deviation These are two very distinct and different ideas:  Standard error measures the uncertainty in the measure of the mean This depends on how YOU measure and sample size  Standard deviation measures the spread in the data This is a property of the data set – does not change We can often estimate the standard deviation by measuring the standard error.

Standard error is always less than standard deviation SE gets smaller as n grows  does not change! SE measures the uncertainty in location of mean  measures spread in data

t-Distributions If we know  then setting a confidence interval on how well our sample mean X measures the true mean is easy: But – if we don’t know  then we estimate use the t- distribution:

Closer look at t-distributions The t-distribution looks very much like the Normal distribution and as the number of degrees of freedom (df) gets large the two become indistinguishable t-distribution tables are used much the same way as N(0,1) – major difference is the df value

Example… You are inspecting a shipment of precision machined rods to be used in an engine assembly plant. You select a random sample of 20 and measure the diameters. You find that the average diameter of the sample is cm with a standard deviation in the measurements of cm. It is critical that the diameters do not exceed cm. You are willing to accept a 1% failure rate. Should you accept the shipment?

Solution: This would be an example of a 1-tailed t- distribution,  = 0.01, t 19,0.01 = 2.539t 19,0.01 = A 1% failure rate looks like this:

Test the numbers… This implies that % of the sample will not exceed the threshold diameter Accept!

Two-tailed t-Tests In the previous example we looked at whether or not the diameter was less than a maximum allowable value. Just as we have done earlier with confidence intervals we can also specify a maximum allowable range (“plus or minus”) for our mean. Let’s test the mean diameter at a 95% confidence level that is implied by our measurement Use following formula: Margin of error

We measured mean diameter as cm, s = so the upper and lower margins are: We can be 95% confident that the diameters of the parts are in the range (5.463,5.467) cm

Example 7.9 Plot data: Identify variables, etc:  df = (50-1) = 49   = 0.05   = 23.56, s =  t = Interval = (20.00,27.12) ?

Example of a Matched Pairs t-test: Exercise 7.40 Formulate appropriate hypotheses  H 0 : no difference  H  : LH > RH Re-arrange data:  find  and s (see next page)

H o :  = 0 df = = 24 Find  Use Excel =tdist(t, df, #tails)  Use Table D The probability of the null hypothesis is only LH thread takes longer

Robustness… A statistical test is considered robust if:  It is insensitive to deviations from original assumptions being made. This could include smaller sample size or deviation from normality

Rules of thumb – When to use the t-test Small sample sizes (n≈15) and close to normal Mid range sample size (n ≥ 15) as long as distribution not strongly skewed and no outliers Large sample size (n > 40) even if skewed or with some outliers Fine print: Rules of thumb do not obviate the need to always inspect your data! Stemplots or histograms give you insight into just how “skewed” or “outlier-riddled” is your data. Always know what the data set looks like before applying tests.

In conclusion… Read 7.1 carefully – we skipped over some terms and discussions of applicability of the t- test Be sure you understand when (and why) we need the t-test Know the difference between standard deviation and Standard Error Try: 7.4, 7.12, 7.26, 7.42