Statistical Tests How to tell if something (or somethings) is different from something else.

Slides:



Advertisements
Similar presentations
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Advertisements

Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Hypothesis Testing making decisions using sample data.
Review of the Basic Logic of NHST Significance tests are used to accept or reject the null hypothesis. This is done by studying the sampling distribution.
Thursday, September 12, 2013 Effect Size, Power, and Exam Review.
Review: What influences confidence intervals?
Business 205. Review Sampling Continuous Random Variables Central Limit Theorem Z-test.
Statistics for the Social Sciences
Using Statistics in Research Psych 231: Research Methods in Psychology.
What z-scores represent
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Don’t spam class lists!!!. Farshad has prepared a suggested format for you final project. It will be on the web
Sampling Distributions
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
PSY 307 – Statistics for the Behavioral Sciences
Probability Population:
Chapter 5For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Suppose we wish to know whether children who grow up in homes without access to.
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 10 Sampling Distributions.
Introduction to Hypothesis Testing
Hypothesis Testing:.
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
CHAPTER 11: Sampling Distributions
Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of times would result in heads half the time (i.e.,
The Hypothesis of Difference Chapter 10. Sampling Distribution of Differences Use a Sampling Distribution of Differences when we want to examine a hypothesis.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Jan 17,  Hypothesis, Null hypothesis Research question Null is the hypothesis of “no relationship”  Normal Distribution Bell curve Standard normal.
EDUC 200C Friday, October 26, Goals for today Homework Midterm exam Null Hypothesis Sampling distributions Hypothesis testing Mid-quarter evaluations.
1 Statistical Inference Greg C Elvers. 2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
STA Statistical Inference
Individual values of X Frequency How many individuals   Distribution of a population.
Stat 13, Tue 5/8/ Collect HW Central limit theorem. 3. CLT for 0-1 events. 4. Examples. 5.  versus  /√n. 6. Assumptions. Read ch. 5 and 6.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.
Distributions of the Sample Mean
Revisiting Sampling Concepts. Population A population is all the possible members of a category Examples: the heights of every male or every female the.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Jeopardy Hypothesis Testing t-test Basics t for Indep. Samples Related Samples t— Didn’t cover— Skip for now Ancient History $100 $200$200 $300 $500 $400.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Example You give 100 random students a questionnaire designed to measure attitudes toward living in dormitories Scores range from 1 to 7 –(1 = unfavorable;
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
Hypothesis Testing and the T Test. First: Lets Remember Z Scores So: you received a 75 on a test. How did you do? If I said the mean was 72 what do you.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Statistical Analysis – Chapter 6 “Hypothesis Testing” Dr. Roderick Graham Fashion Institute of Technology.
INFERENTIAL STATISTICS DOING STATS WITH CONFIDENCE.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
One Sample Inf-1 In statistical testing, we use deductive reasoning to specify what should happen if the conjecture or null hypothesis is true. A study.
Hypothesis test flow chart
SPSS Problem and slides Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of times would result in.
SPSS Homework Practice The Neuroticism Measure = S = 6.24 n = 54 How many people likely have a neuroticism score between 29 and 34?
m/sampling_dist/index.html.
Inferential Statistics Psych 231: Research Methods in Psychology.
SPSS Homework Practice The Neuroticism Measure = S = 6.24 n = 54 How many people likely have a neuroticism score between 29 and 34?
Chapter 5: Introduction to Statistical Inference
Is this quarter fair?. Is this quarter fair? Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of.
Central Limit Theorem, z-tests, & t-tests
Significance Tests: The Basics
Hypothesis Testing.
Practice The Neuroticism Measure = S = 6.24 n = 54
Psych 231: Research Methods in Psychology
Is this quarter fair?. Is this quarter fair? Is this quarter fair? How could you determine this? You assume that flipping the coin a large number of.
Presentation transcript:

Statistical Tests How to tell if something (or somethings) is different from something else

Populations vs. Samples Remember that a population is all the possible members of a category that we could measure Examples: the heights of every male or every female the temperature on every day since the beginning of time Ever person who ever has, and ever will, take a particular drug

Populations vs. Samples So a population is kind of abstract - typically you couldn’t ever hope to measure the entire population Notable exceptions include: Standardized tests (mean IQ is 100 with std. dev. of 15) Special populations such as rare diseases or isolated groups of people

Populations vs. Samples A sample is some subset of a population Examples: The heights of 10 students picked at random The participants in a drug trial

Populations vs. Samples The notation Sample statistics are usually regular letters like s and Population statistics are usually greek letters like: X  - the population mean  - the population standard deviation

Populations vs. Samples Test your intuition: Under what circumstances does the mean of a sample equal the mean of the population from which it was drawn? What about the standard deviation? What if your sample was very small relative to the population?

Populations vs. Samples Test your intuition: Most importantly: What if you took more than one sample

Central Limit Theorem There is a distribution of sample means

Central Limit Theorem There is a distribution of sample means The population of IQ scores: 100

Central Limit Theorem There is a distribution of sample means Your = 95 The population of IQ scores: 100

Central Limit Theorem There is a distribution of sample means Your = 103 The population of IQ scores: 100

Central Limit Theorem There is a distribution of sample means Your = 99 The population of IQ scores: 100

Central Limit Theorem There is a distribution of sample means This is the sampling distribution of the mean

Central Limit Theorem What is the mean of the sampling distribution of the mean? mean of the sampling distribution approaches the mean of the population with many resamplings

Central Limit Theorem What is the standard deviation of the sampling distribution of the mean? The standard error of the mean Notice it will always be less than the standard deviation of the population!

Central Limit Theorem What is the shape of the sampling distribution of the mean? Central Limit Theorem: the sampling distribution of the mean is normal regardless of the shape of the underlying distribution ! This means you can use the Z transform and use the Z table

The Logic of Statistical Tests

Statistical Tests Consider a simple example: you are testing the hypothesis that eating walnuts makes people smarter by feeding walnuts to a group of 30 subjects and then testing their IQ

Statistical Tests Consider a simple example: you are testing the hypothesis that eating walnuts makes people smarter by feeding walnuts to a group of 30 subjects and then testing their IQ If you are right, then eating walnuts will make the average IQ of your subjects be higher than the average IQ of all people (the population) since, mostly, those other people don’t eat walnuts much

Statistical Tests Consider a simple example: Put another way: Is this sample (entirely) of walnut eaters different from the population of mostly non-walnut-eaters

Types of Errors There are two “mistakes” you could make:

Types of Errors There are two “mistakes” you could make: Type I error or False-Positive - you decide the walnut treatment works when it doesn’t really Type II error or False-Negative - you decide the walnuts don’t work when really they do

Types of Successes There are two ways to succeed: Hit or True-Positive: You decide the walnuts do make people smarter and, in fact, they really do Correct-Rejection or True-Negative: You decide the walnuts don’t work and, in fact they really don’t

Outcome Matrix Actual Situation Works Doesn’t Work “Works” True Positive Type I “Doesn’t Work” Type II True-Negative Your Conclusion

Statistical Tests Consider a simple example: Your subjects turn out to have a mean IQ of 107.5 (1/2 S.D. from the mean of the population) after eating walnuts

Statistical Tests What are two reasons why the mean IQ of your subjects might be greater than the mean of the population? you happened to pick 30 very smart people (i.e. university students) WARNING: Type I error is possible!

Statistical Tests What are two reasons why the mean IQ of your subjects might be greater than the mean of the population? you happened to pick 30 very smart people (i.e. university students) WARNING: Type I error is possible! the walnuts worked

Statistical Tests Usually we are worried about making a type I error so we need to know: What fraction of all possible groups of 30 subjects would have a mean IQ of 105 or less?

Statistical Tests Usually we are worried about making a type I error so we need to know: What fraction of all possible groups of 30 subjects would have a mean IQ of 105 or less? In other words, we are interested not in the distribution of IQ scores themselves, but rather in the distribution of mean IQ scores for groups of 30 subjects

…as it is more formally known The Z Test …as it is more formally known

Example Z Test Using our example in which we are testing the hypothesis that walnuts make people smarter null hypothesis is that they don’t X = 107.5  = 100  = 15

Example Z Test Using our example in which we are testing the hypothesis that walnuts make people smarter (null hypothesis was that they don’t) We want to know how many standard errors from the mean (of the sampling distribution of means) is 107.5 X = 107.5  = 15

Example Z Test Here’s what we’ve got: X = 107.5  = 15 n = 30 Here’s what we can compute: That’s what we’re after so that we can use the Z table

Example Z Test Here’s what we’ve got: X = 107.5  = 15 n = 30 Here’s what we can compute: Which is much less than 15!

Example Z Test Here’s what we’ve got: X = 107.5  = 15 n = 30 Here’s what we can compute:

Example Z Test Here’s what we’ve got: X = 107.5  = 15 n = 30 Thus X = 107.5 isn’t half a standard deviation from the sampling distribution mean! It’s actually more than two and a half standard deviations from the sampling distribution mean !

Example Z Test Here’s what we’ve got: X = 107.5  = 15 n = 30 Looking up 2.739 in the Z table reveals that only .0031 or .31% of the means in the sampling distribution of mean IQs (for groups of 30 people each) would have a mean equal to or greater than 107.5!

Example Z Test What this means is that you have only a 0.31% chance of making a type I error if you conclude that walnuts made your subjects smarter !

Example Z Test What this means is that you have only a 0.31% chance of making a type I error if you conclude that walnuts made your subjects smarter ! Put another way, there is only a 0.31% chance that this sample of IQs is taken from the regular population…walnut eaters are different

Alpha Is .31% small enough? What risk of making a Type I error is too great?

Alpha Is .31% small enough? What risk of making a Type I error is too great? There is no absolute answer - it depends entirely on the circumstances

Alpha Is .31% small enough? What risk of making a Type I error is too great? There is no absolute answer - it depends entirely on the circumstances 5% or probability (p) = .05 is generally accepted

Alpha Is .31% small enough? What risk of making a Type I error is too great? There is no absolute answer - it depends entirely on the circumstances 5% or probability (p) = .05 is generally accepted This rate of making Type I errors (ie. number of Type I errors per 100 experiments) is called the Alpha Level

Statistical Significance So we conclude that walnuts have a statistically significant effect on IQ with a probability of a Type I error of less than 5% In a research article we might say “the effect of walnuts on IQ was significant (one-tailed Z test, p = .0031)”