The T Distribution ©Dr. B. C. Paul 2005. Wasn’t the Herby Assembly Line Problem Fun But there is one little problem But there is one little problem We.

Slides:



Advertisements
Similar presentations
Chapter 10 Introduction to Inference
Advertisements

Introductory Mathematics & Statistics for Business
1 Confidence Interval for Population Mean The case when the population standard deviation is unknown (the more common case).
Estimation of Means and Proportions
Partly based on material by Sherry O’Sullivan
T-Tests For Dummies As in the books, not you personally!
It was mid-year, I’d say around November when I would have to face the consequences of a life time. I was off on my own a lot more and having fun. More.
Module 16: One-sample t-tests and Confidence Intervals
Dr Richard Bußmann CHAPTER 12 Confidence intervals for means.
Psych 5500/6500 t Test for Two Independent Groups: Power Fall, 2008.
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Simple Linear Regression 1. 2 I want to start this section with a story. Imagine we take everyone in the class and line them up from shortest to tallest.
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
1 Hypothesis Testing Chapter 8 of Howell How do we know when we can generalize our research findings? External validity must be good must have statistical.
1 One Tailed Tests Here we study the hypothesis test for the mean of a population when the alternative hypothesis is an inequality.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #22.
Using Statistics to Analyze your Results
Review: What influences confidence intervals?
Statistics for the Social Sciences
Point estimation, interval estimation
An Inference Procedure
1 The Basics of Regression Regression is a statistical technique that can ultimately be used for forecasting.
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Inference about a Mean Part II
The one sample t-test November 14, From Z to t… In a Z test, you compare your sample to a known population, with a known mean and standard deviation.
1 Confidence Interval for Population Mean The case when the population standard deviation is unknown (the more common case).
P-value Method 2 means, sigmas unknown. Sodium levels are measured in millimoles per liter (mmol/L) and a score between 136 and 145 is considered normal.
Applications of the Normal Distribution Model (The Confidence Interval) ©Dr. B. C. Paul 2003 revision 2009 Note – The concepts found in these slides are.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 21 More About Tests.
14. Introduction to inference
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Proportions for the Binomial Distribution ©2005 Dr. B. C. Paul.
One Way ANOVA ©2005 Dr. B. C. Paul modified 2009 Note – The concepts presented in these slides are considered common knowledge to those familiar with statistics.
1 Statistical Inference. 2 The larger the sample size (n) the more confident you can be that your sample mean is a good representation of the population.
Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control.
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
Inferential Statistics Part 1 Chapter 8 P
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
What is a Random Sample (and what if its not) ©Dr. B. C. Paul 2005.
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Chapter 21: More About Tests
Welcome to MM570 Psychological Statistics
The Impact of Sample Size ©Dr. B. C. Paul 2005 revised 2009 Note – The information supplied in these slides is regarded as common knowledge to those familiar.
Chapter 11: Estimation of Population Means. We’ll examine two types of estimates: point estimates and interval estimates.
Hypothesis Testing and the T Test. First: Lets Remember Z Scores So: you received a 75 on a test. How did you do? If I said the mean was 72 what do you.
Sight Words.
Quadratic Regression ©2005 Dr. B. C. Paul. Fitting Second Order Effects Can also use least square error formulation to fit an equation of the form Math.
7.2 The Standard Normal Distribution. Standard Normal The standard normal curve is the one with mean μ = 0 and standard deviation σ = 1 We have related.
Week 6. Statistics etc. GRS LX 865 Topics in Linguistics.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
AGENDA Review In-Class Group Problems Review. Homework #3 Due on Thursday Do the first problem correctly Difference between what should happen over the.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
Inference About Means Chapter 23. Getting Started Now that we know how to create confidence intervals and test hypotheses about proportions, it’d be nice.
Problems with Variance ©2005 Dr. B. C. Paul. Determining What To Do We have looked at techniques that depend on normally distributed data with variance.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Welcome to MM570 Psychological Statistics Unit 5 Introduction to Hypothesis Testing Dr. Ami M. Gates.
Chapter Seven Point Estimation and Confidence Intervals.
INF397C Introduction to Research in Information Studies Spring, Day 12
Hypothesis Testing: Preliminaries
Chapter 21 More About Tests.
Hypothesis Testing.
Analyzing the Association Between Categorical Variables
Reasoning in Psychology Using Statistics
Presentation transcript:

The T Distribution ©Dr. B. C. Paul 2005

Wasn’t the Herby Assembly Line Problem Fun But there is one little problem But there is one little problem We knew that our mean value could have been all over the map relative to the real true mean We knew that our mean value could have been all over the map relative to the real true mean We calculated our standard deviation from the same sample We calculated our standard deviation from the same sample How come our mean could be anything and yet our standard deviation is God’s own value for the standard deviation? How come our mean could be anything and yet our standard deviation is God’s own value for the standard deviation?

It Isn't When our value for the standard deviation is just an estimate we have another chance for things to be way out in the tails When our value for the standard deviation is just an estimate we have another chance for things to be way out in the tails Sadisticians – woops I mean statisticians figured out probability distribution for what would happen then Sadisticians – woops I mean statisticians figured out probability distribution for what would happen then Called it the T distribution Called it the T distribution First published in 1908 perfected in 1926 First published in 1908 perfected in 1926 We look up values for areas under the curve of a T distribution just like we did with a normal distribution. We look up values for areas under the curve of a T distribution just like we did with a normal distribution.

Let’s Redo Herby’s Problem Right This Time We will use the T distribution We will use the T distribution S is the estimated standard deviation The test statistic has a T distribution (assuming the underyling population Really is normally distributed) The distribution has n-1 degrees of freedom

Degrees of Freedom! What are you talking about? – this isn’t an Amnesty International Class Consider # of equations and # of unknowns Consider # of equations and # of unknowns To uniquely solve 3 unknowns you need 3 independent equations To uniquely solve 3 unknowns you need 3 independent equations Each sample is like an equation Each sample is like an equation If I have one sample I first use it as an estimate of the mean. If I have one sample I first use it as an estimate of the mean. I can’t calculate a standard deviation – I don’t have enough data I can’t calculate a standard deviation – I don’t have enough data If I have two samples If I have two samples I can estimate std deviation and still have one degree of freedom to measure something else I can estimate std deviation and still have one degree of freedom to measure something else Happens to be the mean Happens to be the mean How much extra data do I have above the bear minimum? How much extra data do I have above the bear minimum?

So How Do I Use This? (I have a really bad feeling your going to tell me) Note that this table is set up Different from Z values for normal Distribution. Area under the curve comes from The top line. Degrees of Freedom from the side Value in the middle is the T value (equivalent to the Z value) Remember in the normal table The Z value was on the edge And the area under the curve In the middle of the table

Lets Do the Problem X = 3.8 S= 0.73 N= 7 OK – So What Is t?

Finding t If we do this as a two tailed test (ie we would be concerned if our Balls were to hard or to soft) we Can only have 2.5% in each tail Pick 97.5 We have 7 samples hence n-1 or 6 degrees of freedom Read into the table 2.45

Plug and Chug 4.48 We can still reject the null hypothesis with an Alpha Level of 5% but it is now much closer Than before

Some Observations About Degrees of Freedom and the T statistic 95% of a normal distribution is within 1.96 standard deviations of the mean 95% of a normal distribution is within 1.96 standard deviations of the mean 95% of a T distribution is within 2.45 estimated standard deviations of the mean if the standard deviation estimate came from 7 samples 95% of a T distribution is within 2.45 estimated standard deviations of the mean if the standard deviation estimate came from 7 samples With 20 samples it is 2.09 estimated standard deviation units With 20 samples it is 2.09 estimated standard deviation units With 50 samples it is 2.01 With 50 samples it is 2.01 With 100 samples it is 1.98 With 100 samples it is 1.98 With 500 samples it is 1.96 With 500 samples it is 1.96 Note that as the number of samples increases the T distribution converges to a normal distribution Note that as the number of samples increases the T distribution converges to a normal distribution

So When Do I Use a T Distribution The underlying population must be realistic to model as having a normal distribution The underlying population must be realistic to model as having a normal distribution The standard deviation of the population must have been estimated from a standard deviation calculation using a sample of the population The standard deviation of the population must have been estimated from a standard deviation calculation using a sample of the population You can get out of using the T distribution and pretend that God gave you the standard deviation if you used about 100 or more samples to calculate your estimate of the standard deviation You can get out of using the T distribution and pretend that God gave you the standard deviation if you used about 100 or more samples to calculate your estimate of the standard deviation People with a lot of experience with a distribution often ignore the T distribution completely because they have seen results from hundreds of samples People with a lot of experience with a distribution often ignore the T distribution completely because they have seen results from hundreds of samples They are not “doing it wrong” using a simple normal distribution if they have that kind of data supporting their standard deviation value They are not “doing it wrong” using a simple normal distribution if they have that kind of data supporting their standard deviation value

Why Did You Do a Two Tailed Test? Herby was going Bananas because he thought the line might be putting out soft balls Herby was going Bananas because he thought the line might be putting out soft balls That sounds to me like he is only concerned about 1 side of the distribution. That sounds to me like he is only concerned about 1 side of the distribution. We may be upset about one particular thing but that doesn’t mean nothing else is important We may be upset about one particular thing but that doesn’t mean nothing else is important One problem with things that are too hard is that they are often brittle One problem with things that are too hard is that they are often brittle Premature ball failure could be due to the balls being too soft or breaking up because they are too hard Premature ball failure could be due to the balls being too soft or breaking up because they are too hard We have to ask our own case specific question about what we are concerned about – You plan a one tailed test only if you are only concerned about events on just one tail We have to ask our own case specific question about what we are concerned about – You plan a one tailed test only if you are only concerned about events on just one tail

Common Cheating on Random Samples Experiments should be planned before we look at the data Experiments should be planned before we look at the data If we look at the data and then decide what the experiment should have been we are “political spin doctors” not scientists If we look at the data and then decide what the experiment should have been we are “political spin doctors” not scientists A spin doctor looks at a result and then tries to make it say what he wants A spin doctor looks at a result and then tries to make it say what he wants A scientist sets up the test and lets the truth be what ever it is A scientist sets up the test and lets the truth be what ever it is Often we had a theory that made us want to look deeper Often we had a theory that made us want to look deeper Many theories are based on observations Many theories are based on observations But the scientific method causes you to then plan an experiment and go out and get the data you need to test the theory But the scientific method causes you to then plan an experiment and go out and get the data you need to test the theory It’s a subtle difference but its often ignored It’s a subtle difference but its often ignored The doctrine of “political correctness” is causing us all to loose our integrity The doctrine of “political correctness” is causing us all to loose our integrity

Back to Herby and the Two Tailed Test If it is true that hard balls make no difference – only soft ones then the test should have been set up as one tailed only If it is true that hard balls make no difference – only soft ones then the test should have been set up as one tailed only If the concern was the line being out of spec and that causing unhappy customers we could not know the sample would come out below 4.5 unless we peaked first If the concern was the line being out of spec and that causing unhappy customers we could not know the sample would come out below 4.5 unless we peaked first If at that point we decided we only cared about soft balls we distort the reliability of our analysis If at that point we decided we only cared about soft balls we distort the reliability of our analysis The data would have not only determined what the values of the test statistics were – it would have determined the test The data would have not only determined what the values of the test statistics were – it would have determined the test Normal distribution theory only accounts for the data determining the test statistic Normal distribution theory only accounts for the data determining the test statistic We in fact do not have good models for exactly what the consequences are if we let the data set up the test – we can say we are taking a chance of something bad happening We in fact do not have good models for exactly what the consequences are if we let the data set up the test – we can say we are taking a chance of something bad happening

My Choice So why did I do this example as a two tailed test So why did I do this example as a two tailed test 1- because that sample size analysis I did is nastier to explain if I’m only working on one side 1- because that sample size analysis I did is nastier to explain if I’m only working on one side 2- Because it sets up a great discussion on random samples and peaking and cherry picking data 2- Because it sets up a great discussion on random samples and peaking and cherry picking data 3- Because it allowed me to discuss when I should run one and two tailed tests 3- Because it allowed me to discuss when I should run one and two tailed tests The story problem told is inconclusive about whether Herby was vulnerable to the line being out of spec on one side only or on both sides The story problem told is inconclusive about whether Herby was vulnerable to the line being out of spec on one side only or on both sides

Look at the Problems We Have Run So Far We looked at a storm washing out the drainage system in a subdivision We looked at a storm washing out the drainage system in a subdivision Only too much rain would create the disaster – we really only were worried about too big rain events Only too much rain would create the disaster – we really only were worried about too big rain events (And we ran a one tailed test on the upper side) (And we ran a one tailed test on the upper side) We looked at a Mine and the amount of ore below cut-off grade that would go to the dump We looked at a Mine and the amount of ore below cut-off grade that would go to the dump We aren’t going to dump our high grade ore – we really only care about how much stuff is on the lower end We aren’t going to dump our high grade ore – we really only care about how much stuff is on the lower end (And we ran a one tailed test on the lower side) (And we ran a one tailed test on the lower side) We looked at tolerance on a machined part We looked at tolerance on a machined part The spec said we had to be plus or minus so our customer would be upset if the pegs were too big or too little The spec said we had to be plus or minus so our customer would be upset if the pegs were too big or too little (And we ran a two tailed test) (And we ran a two tailed test) Determine whether to run a one or two tailed test based on the concerns for the process or design you are working on – not from peaking at the data. Determine whether to run a one or two tailed test based on the concerns for the process or design you are working on – not from peaking at the data.