Review Nine men and nine women are tested for their memory of a list of abstract nouns. The mean scores are Mmale = 15 and Mfemale = 17. The mean square.

Slides:



Advertisements
Similar presentations
Hypothesis Testing W&W, Chapter 9.
Advertisements

Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Hypothesis Testing making decisions using sample data.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
Review You run a t-test and get a result of t = 0.5. What is your conclusion? Reject the null hypothesis because t is bigger than expected by chance Reject.
Review Find the mean square (MS) based on these two samples. A.26.1 B.32.7 C.43.6 D.65.3 E M A = 14 M B = Flood.
Chapter Seventeen HYPOTHESIS TESTING
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 10 Notes Class notes for ISE 201 San Jose State University.
Hypothesis Tests for Means The context “Statistical significance” Hypothesis tests and confidence intervals The steps Hypothesis Test statistic Distribution.
Major Points An example Sampling distribution Hypothesis testing
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Chapter 9 Hypothesis Testing.
Ch. 9 Fundamental of Hypothesis Testing
PSY 307 – Statistics for the Behavioral Sciences
Major Points Formal Tests of Mean Differences Review of Concepts: Means, Standard Deviations, Standard Errors, Type I errors New Concepts: One and Two.
Determining Statistical Significance
Inferential Statistics
AM Recitation 2/10/11.
Hypothesis Testing:.
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Overview Definition Hypothesis
Introduction to Biostatistics and Bioinformatics
1/2555 สมศักดิ์ ศิวดำรงพงศ์
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Statistics for the Behavioral Sciences Second Edition Chapter 11: The Independent-Samples t Test iClicker Questions Copyright © 2012 by Worth Publishers.
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Review Nine men and nine women are tested for their memory of a list of abstract nouns. The mean scores are M male = 15 and M female = 17. The mean square.
Chapter 20 Testing Hypothesis about proportions
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Overview.
Chapter 8 Parameter Estimates and Hypothesis Testing.
1 Hypothesis Testing A criminal trial is an example of hypothesis testing. In a trial a jury must decide between two hypotheses. The null hypothesis is.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Introduction to hypothesis testing Hypothesis testing is about making decisions Is a hypothesis true or false? Ex. Are women paid less, on average, than.
Sampling Distribution (a.k.a. “Distribution of Sample Outcomes”) – Based on the laws of probability – “OUTCOMES” = proportions, means, test statistics.
Chapter Ten McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. One-Sample Tests of Hypothesis.
Chapter 12 Tests of Hypotheses Means 12.1 Tests of Hypotheses 12.2 Significance of Tests 12.3 Tests concerning Means 12.4 Tests concerning Means(unknown.
6.2 Large Sample Significance Tests for a Mean “The reason students have trouble understanding hypothesis testing may be that they are trying to think.”
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Lecture Slides Elementary Statistics Twelfth Edition
Putting Things Together
Chapter 9 Hypothesis Testing.
Hypothesis Testing.
Effect Size 10/15.
Two-sample t-tests 10/16.
Independent-Samples t-test
INF397C Introduction to Research in Information Studies Spring, Day 12
Putting Things Together
Review Ordering company jackets, different men’s and women’s styles, but HR only has database of employee heights. How to divide people so only 5% of.
Effect Size.
Review You run a t-test and get a result of t = 0.5. What is your conclusion? Reject the null hypothesis because t is bigger than expected by chance Reject.
Two-sample t-tests.
Three Views of Hypothesis Testing
Hypothesis Testing: Hypotheses
Statistical Inference
Inferential Statistics
Chapter 9 Hypothesis Testing.
Hypothesis Testing A hypothesis is a claim or statement about the value of either a single population parameter or about the values of several population.
Chapter 12 Power Analysis.
Reasoning in Psychology Using Statistics
Last Update 12th May 2011 SESSION 41 & 42 Hypothesis Testing.
Testing Hypotheses I Lesson 9.
Calculating t X -  t = s x X1 – X2 t = s x1 – x2 s d One sample test
Introduction To Hypothesis Testing
Presentation transcript:

Review Nine men and nine women are tested for their memory of a list of abstract nouns. The mean scores are Mmale = 15 and Mfemale = 17. The mean square based on both samples is MS = 25. What is your best estimate of the (non-standardized) effect size, µfemale – µmale? 0.40 1.33 2.00 18.0

Review Nine men and nine women are tested for their memory of a list of abstract nouns. The mean scores are Mmale = 15 and Mfemale = 17. The mean square based on both samples is MS = 25. Calculate the standardized effect size. 0.08 0.10 0.40 0.49

Review A sample of 16 subjects has a mean of 53 and a sample variance of 9. Calculate a 95% confidence interval for the population mean. The critical value for t (at 97.5% with 15 df) is 2.13. [51.40, 54.60] [48.21, 57.79] [51.80, 54.20] [50.16, 55.84]

Three Views of Hypothesis Testing 10/22

What are we doing? In science, we run lots of experiments In some cases, there's an "effect”; in others there's not Some manipulations have an impact; some don't Some drugs work; some don't Goal: Tell these situations apart, using the sample data If there is an effect, reject the null hypothesis If there’s no effect, retain the null hypothesis Challenge Sample is imperfect reflection of population, so we will make mistakes How to minimize those mistakes?

Analogy: Prosecution Think of cases where there is an effect as "guilty” No effect: "innocent" experiments Scientists are prosecutors We want to prove guilt Hope we can reject null hypothesis Can't be overzealous Minimize convicting innocent people Can only reject null if we have enough evidence How much evidence to require to convict someone? Tradeoff Low standard: Too many innocent people get convicted (Type I Error) High standard: Too many guilty people get off (Type II Error)

Binomial Test Example: Halloween candy Sack holds boxes of raisins and boxes of jellybeans Each kid blindly grabs 10 boxes Some kids cheat by peeking Want to make cheaters forfeit candy How many jellybeans before we call kid a cheater?

Binomial Test ? How many jellybeans before we call kid a cheater? Work out probabilities for honest kids Binomial distribution Don’t know distribution of cheaters Make a rule so only 5% of honest kids lose their candy For each individual above cutoff Don’t know for sure they cheated Only know that if they were honest, chance would have been less than 5% that they’d have so many jellybeans Therefore, our rule catches as many cheaters as possible while limiting Type I errors (false convictions) to 5% Honest Cheaters ? Jellybeans 0 1 2 3 4 5 6 7 8 9 10

Testing the Mean ? Example: IQs at various universities Some schools have average IQs (mean = 100) Some schools are above average Want to decide based on n students at each school Need a cutoff for the sample mean Find distribution of sample means for Average schools Set cutoff so that only 5% of Average schools will be mistaken as Smart 100 Average Smart ? p(M)

Unknown Variance: t-test Want to know whether population mean equals m0 H0: m = m0 H1: m ≠ m0 Don’t know population variance Don’t know distribution of M under H0 Don’t know Type I error rate for any cutoff Use sample variance to estimate standard error of M Divide M – m0 by standard error to get t We know distribution of t exactly p(t) Distribution of t when H0 is True m0 p(M) Critical value Critical Region a

Two-tailed Tests H0 H1 ? Often we want to catch effects on either side Split Type I Errors into two critical regions Each must have probability a/2 H0 H1 ? Test statistic Critical value a/2

An Alternative View: p-values Probability of a value equal to or more extreme than what you actually got Measure of how consistent data are with H0 p >  t is within tcrit Retain null hypothesis p <  t is beyond tcrit Reject null hypothesis; accept alternative hypothesis tcrit for a = .05 p = .03 t t t = 2.57

Three Views of Inferential Statistics Confidence interval M Effect size & confidence interval Values of m0 that don’t lead to rejecting H0 Test statistic & critical value Measure of consistency with H0 p-value & a Type I error rate Can answer hypothesis test at any level Result predicted by H0 vs. confidence interval Test statistic vs. critical value p vs.  m0 m0 m0 m0 Test statistic Critical value 1 a p

Review Which of the following would lead you to reject the null hypothesis? t < tcrit p < a M outside the confidence interval t > a p > tcrit

Review In a paired-samples experiment, you get a confidence interval of [-2.4, 3.6] for µdiff. What is your conclusion? t < tcrit, p > a, and you keep the null hypothesis t < tcrit, p < a, and you reject the null hypothesis t > tcrit, p > a, and you reject the null hypothesis t > tcrit, p < a, and you keep the null hypothesis

Review In a one-tailed t-test, you get a result of t = 2.8. The critical value is tcrit = 2.02. What region(s) correspond to alpha? Blue Blue + red Blue + green Blue + red + orange + green t (from data) tcrit