Exam 1 Median: 74 Quartiles: 68, 84 Interquartile range: 16 Mean: 74.9 Standard deviation: 12.5 z = -1: 62.4 z = -1: 87.4 z = -1z = +1 Worst Question:

Slides:



Advertisements
Similar presentations
Chapter 7 Hypothesis Testing
Advertisements

Statistics Hypothesis Testing.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Hypothesis Testing making decisions using sample data.
Inference Sampling distributions Hypothesis testing.
1 Hypothesis Testing Chapter 8 of Howell How do we know when we can generalize our research findings? External validity must be good must have statistical.
Review You run a t-test and get a result of t = 0.5. What is your conclusion? Reject the null hypothesis because t is bigger than expected by chance Reject.
Testing Hypotheses About Proportions Chapter 20. Hypotheses Hypotheses are working models that we adopt temporarily. Our starting hypothesis is called.
Statistical Techniques I EXST7005 Lets go Power and Types of Errors.
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Hypothesis Testing: One Sample Mean or Proportion
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Statistics for the Social Sciences Psychology 340 Spring 2005 Hypothesis testing.
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
PSY 307 – Statistics for the Behavioral Sciences
The problem of sampling error in psychological research We previously noted that sampling error is problematic in psychological research because differences.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
Overview of Statistical Hypothesis Testing: The z-Test
Confidence Intervals and Hypothesis Testing - II
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Introduction to Hypothesis Testing for μ Research Problem: Infant Touch Intervention Designed to increase child growth/weight Weight at age 2: Known population:
Introduction to Biostatistics and Bioinformatics
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Sampling Distributions and Hypothesis Testing. 2 Major Points An example An example Sampling distribution Sampling distribution Hypothesis testing Hypothesis.
Chapter 8 Introduction to Hypothesis Testing
1 Chapter 10: Section 10.1: Vocabulary of Hypothesis Testing.
Let’s flip a coin. Making Data-Based Decisions We’re going to flip a coin 10 times. What results do you think we will get?
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Chapter 8 Introduction to Hypothesis Testing
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
Chapter 20 Testing hypotheses about proportions
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Chapter 20 Testing Hypothesis about proportions
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Testing the Differences between Means Statistics for Political Science Levin and Fox Chapter Seven 1.
Introduction Suppose that a pharmaceutical company is concerned that the mean potency  of an antibiotic meet the minimum government potency standards.
© Copyright McGraw-Hill 2004
Hypothesis Testing. Outline of Today’s Discussion 1.Logic of Hypothesis Testing 2.Evaluating Hypotheses Please refrain from typing, surfing or printing.
Statistical Techniques
Chapter 8: Introduction to Hypothesis Testing. Hypothesis Testing A hypothesis test is a statistical method that uses sample data to evaluate a hypothesis.
Today: Hypothesis testing. Example: Am I Cheating? If each of you pick a card from the four, and I make a guess of the card that you picked. What proportion.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Today: Hypothesis testing p-value Example: Paul the Octopus In 2008, Paul the Octopus predicted 8 World Cup games, and predicted them all correctly Is.
+ Testing a Claim Significance Tests: The Basics.
Introduction to Hypothesis Testing: The Binomial Test
Introduction to Hypothesis Testing: The Binomial Test
9.3 Hypothesis Tests for Population Proportions
Hypothesis Testing.
Section Testing a Proportion
Review Ordering company jackets, different men’s and women’s styles, but HR only has database of employee heights. How to divide people so only 5% of.
Review You run a t-test and get a result of t = 0.5. What is your conclusion? Reject the null hypothesis because t is bigger than expected by chance Reject.
Introduction to Hypothesis Testing: The Binomial Test
Hypothesis Testing: Hypotheses
1 Chapter 8: Introduction to Hypothesis Testing. 2 Hypothesis Testing The general goal of a hypothesis test is to rule out chance (sampling error) as.
Statistical Test A test of significance is a formal procedure for comparing observed data with a claim (also called a hypothesis) whose truth we want to.
Presentation transcript:

Exam 1 Median: 74 Quartiles: 68, 84 Interquartile range: 16 Mean: 74.9 Standard deviation: 12.5 z = -1: 62.4 z = -1: 87.4 z = -1z = +1 Worst Question: Operational definition (33/84)

Introduction to Hypothesis Testing: The Binomial Test 9/30

ESP Your friend claims he can predict the future You flip a coin 5 times, and he’s right on 4 Is your friend psychic?

Two Hypotheses Hypothesis – A theory about how the world works – Proposed as an explanation for data – Posed as statement about population parameters Psychic – Some ability to predict future – Not perfect, but better than chance Luck – Random chance – Right half the time, wrong half the time Hypothesis testing – A method that uses inferential statistics to decide which of two hypotheses the data support

Likelihood – Probability distribution of a statistic, according to each hypothesis – If result is likely according to a hypothesis, we say data “support” or “are consistent with” the hypothesis Likelihood for f(correct) – Psychic: hard to say; how psychic? – Luck: can work out exactly; 50/50 chance each time HypothesesStatistics Likelihood Support PopulationSample Probability Inference

Likelihood According to Luck Flip YYYYYYNYYYNYYYYNNYYY YYYYNYNYYNNYYYNNNYYN YYYNYYNYNYNYYNYNNYNY YYYNNYNYNNNYYNNNNYNN YYNYYYNNYYNYNYYNNNYY YYNYNYNNYNNYNYNNNNYN YYNNYYNNNYNYNNYNNNNY YYNNNYNNNNNYNNNNNNNN f(Y)likelihood 51/32 45/32 310/ /32 01/32

Binomial Distribution Binary data – A set of two-choice outcomes, e.g. yes/no, right/wrong Binomial variable – A statistic for binary samples – Frequency of “yes” / “right” / etc. Binomial distribution – Probability distribution for a binomial variable – Gives probability for each possible value, from 0 to n A family of distributions – Like Normal (need to specify mean and SD) – n: number of observations (sample size) – q: probability correct each time

Binomial Distribution n = 5, q =.5n = 20, q =.5 n = 20, q =.25 Formula (optional): Frequency

Binomial Test Hypothesis testing for binomial statistics Null hypothesis – Some fixed value for q, usually q =.5 – Nothing interesting going on; blind chance (no ESP) Alternative hypothesis – q equals something else – One outcome more likely than expected by chance (ESP) Goal: Decide which hypothesis the data support Strategy – Find likelihood distribution for f(Y) according to null hypothesis – Compare actual result to this distribution – If actual result is too extreme, reject null hypothesis and accept altenative hypothesis “Innocent until proven guilty” – Believe null hypothesis unless compelling evidence to rule it out – Only accept ESP if luck can’t explain the data

Testing for ESP Null hypothesis: Luck, q =.5 Alternative hypothesis: ESP, q >.5 Need to decide rules in advance – If too extreme, abandon luck and accept ESP – How unlikely before we will give up on Luck? Where to draw cutoff? – Critical value: value our statistic must exceed to reject null hypothesis (Luck) Likelihood according to Luck: Binomial(n = 20, q =.5) Frequency Not at all unexpected Luck still a viable explanation Nearly impossible by luck alone

Another Example: Treatment Evaluation Do people tend to get better with some treatment? – Less depression, higher WBC, better memory, etc. – Measure who improves and who worsens – Want more people better off than worse Sign test – Ignore magnitude of change; just direction – Same logic as other binomial tests Count number of patients improved Compare to probabilities according to chance

Structure of a Binomial Test Binary data – Each patient is Better or WorseEach coin prediction is Correct or Incorrect Population parameter q – Probability each patient will improveProbability each guess will be correct Null hypothesis, usually q =.5 – No effect of treatmentNo ESP – Better or worse equally likelyCorrect and incorrect equally likely Alternative hypothesis, here q >.5 – Effective treatment; more people improveESP; guess right more often than chance Work out likelihood according to null hypothesis – Probability distribution for f(improve)Probability distribution for f(correct) Compare actual result to these probabilities – If more improve than likely by chance,If more correct than likely by chance, accept treatment is usefulabandon luck and accept ESP Need to decide critical value – How many patients must improve?How many times correct? Frequency Probability

Errors Whatever the critical value, there will be errors – All values 0 to n are possible under null hypothesis – Even 20/20 happens once in 1,048,576 times Can only minimize how often errors occur Two kinds of errors: Type I error – Null hypothesis is true, but we reject it – Conclude a useless treatment is effective Type II error – Null hypothesis is false, but we don’t reject it – Don’t recognize when a treatment is effective

Frequency Null Hypothesis (Bogus Treatments) Critical Value and Error Rates Type I Errors Frequency ? ? ? ? ? ? ? ? ? ? ? ? ? Alternative Hypothesis (Effective Treatments) Type II Errors

Critical Value and Error Rates Frequency Null Hypothesis (Bogus Treatments) Type I Errors Frequency ? ? ? ? ? ? ? ? ? ? ? ? ? Alternative Hypothesis (Effective Treatments) Type II Errors Increasing critical value reduces Type I error rate but increases Type II error rate (and vice versa) So, how do we decide critical value? Two principles – Type I errors are more important to avoid – Can’t figure out Type II error rate anyway Strategy – Decide how many Type I errors are acceptable – Choose critical value accordingly

Controlling Type I Error Type I error rate – Proportion of times, when null hypothesis is true, that we mistakenly reject it – Fraction of bogus treatments that we conclude are effective Type I error rate equals total probability beyond the critical value, according to null hypothesis Strategy – Decide what Type I error rate we want to allow – Pick critical value accordingly Alpha level (  ) – Chosen Type I error rate – Usually.05 in Psychology – Determines critical value Frequency Null Hypothesis (Bogus Treatments) Type I Error Rate

Summary of Hypothesis Testing Determine Null and Alternative Hypotheses – Competing possibilities about a population parameter – Null is always precise; usually means “no effect” Find probability distribution of test statistic according to null hypothesis – Likelihood of the statistic under that hypothesis Choose acceptable rate of Type I errors (  ) Pick critical value of test statistic based on  – Under Null, probability of a result past critical value equals  Compare actual result to critical value – If more extreme, reject Null as unable to explain data – Otherwise, stick with Null because it’s an adequate explanation Frequency Null Hypothesis (Bogus Treatments) Type I Error Rate (  ) Reject Null Keep Null

Review To test whether infants can read, you show pairs of good and bad words, and count how many times the baby crawls to the good word. What’s the null hypothesis? A.Baby is more likely to crawl to good word B.Baby is more likely to crawl to bad word C.Baby is equally likely to crawl to either word D.Baby will not choose either word Teddy bear Monster

Review What would be a Type II Error for this experiment? A.Concluding babies can’t read when they really can B.Concluding babies can read when they really can’t C.Correctly concluding babies can read D.Correctly concluding babies can’t read Teddy bear Monster

Review Each baby gets 6 trials. If any baby chooses the good word ≥5 times, you declare (s)he can read. Here are the probabilities of what will happen, according to the null hypothesis: What’s the Type I error rate? A.0.09 B.0.11 C.0.82 D.0.89 Number of good words Probability