SUMMARY Hypothesis testing. Self-engagement assesment.

Slides:



Advertisements
Similar presentations
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Advertisements

Chapter 9 Hypothesis Testing Understandable Statistics Ninth Edition
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Inferential Statistics
Tests of Significance about Percents Reading Handout # 6.
Chapter 10 Section 2 Hypothesis Tests for a Population Mean
AP Statistics – Chapter 9 Test Review
Summary.
Chapter 8 Hypothesis Testing I. Significant Differences  Hypothesis testing is designed to detect significant differences: differences that did not occur.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
Statistics for the Social Sciences Psychology 340 Spring 2005 Hypothesis testing.
Statistics for the Social Sciences Psychology 340 Fall 2006 Hypothesis testing.
Introduction to Hypothesis Testing CJ 526 Statistical Analysis in Criminal Justice.
Ch. 9 Fundamental of Hypothesis Testing
© 1999 Prentice-Hall, Inc. Chap Chapter Topics Hypothesis Testing Methodology Z Test for the Mean (  Known) p-Value Approach to Hypothesis Testing.
Getting Started with Hypothesis Testing The Single Sample.
Probability Population:
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
One Sample Z-test Convert raw scores to z-scores to test hypotheses about sample Using z-scores allows us to match z with a probability Calculate:
Quiz 7 Hypothesis testing of single samples z-test and t-test.
Introduction to Hypothesis Testing
Statistics for the Social Sciences
Hypothesis Testing:.
Overview of Statistical Hypothesis Testing: The z-Test
Hypothesis Testing A hypothesis is a conjecture about a population. Typically, these hypotheses will be stated in terms of a parameter such as  (mean)
Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.
Descriptive statistics Inferential statistics
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Mid-semester feedback In-class exercise. Chapter 8 Introduction to Hypothesis Testing.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 8 Introduction to Hypothesis Testing
Chapter 8 Hypothesis Testing. Section 8-1: Steps in Hypothesis Testing – Traditional Method Learning targets – IWBAT understand the definitions used in.
14. Introduction to inference
Testing Hypotheses Tuesday, October 28. Objectives: Understand the logic of hypothesis testing and following related concepts Sidedness of a test (left-,
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Chapter 10 Hypothesis Testing
Chapter 8 Introduction to Hypothesis Testing
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
SUMMARY Hypothesis testing. Self-engagement assesment.
Hypothesis Testing Introduction to Statistics Chapter 8 Mar 2-4, 2010 Classes #13-14.
Thursday, September 19, 2013 Introduction to t-tests & one-sample t-test.
1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
SUMMARY. Central limit theorem Statistical inference If we can’t conduct a census, we collect data from the sample of a population. Goal: make conclusions.
© Copyright McGraw-Hill 2004
WS 2007/08Prof. Dr. J. Schütze, FB GW KI 1 Hypothesis testing Statistical Tests Sometimes you have to make a decision about a characteristic of a population.
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Course Overview Collecting Data Exploring Data Probability Intro. Inference Comparing Variables Relationships between Variables Means/Variances Proportions.
Psych 230 Psychological Measurement and Statistics Pedro Wolf October 21, 2009.
Major Steps. 1.State the hypotheses.  Be sure to state both the null hypothesis and the alternative hypothesis, and identify which is the claim. H0H0.
Welcome to MM207 Unit 7 Seminar Dr. Bob Hypothesis Testing and Excel 1.
If we fail to reject the null when the null is false what type of error was made? Type II.
Statistics (cont.) Psych 231: Research Methods in Psychology.
Review Statistical inference and test of significance.
Hypothesis Testing Chapter Hypothesis Testing  Developing Null and Alternative Hypotheses  Type I and Type II Errors  One-Tailed Tests About.
Chapter 9: Hypothesis Tests for One Population Mean 9.5 P-Values.
Presentation transcript:

SUMMARY Hypothesis testing

Self-engagement assesment

Null hypothesis no song song Null hypothesis: I assume that populations without and with song are same. At the beginning of our calculations, we assume the null hypothesis is true.

Hypothesis testing song Because of such a low probability, we interpret 8.2 as a significant increase over 7.8 caused by undeniable pedagogical qualities of the 'Hypothesis testing song'.

Four steps of hypothesis testing 1. Formulate the null and the alternative (this includes one- or two-directional test) hypothesis. 2. Select the significance level α – a criterion upon which we decide that the claim being tested is true or not. --- COLLECT DATA Compute the p-value. The p-value is the probability that the data would be at least as extreme as those observed, if the null hypothesis were true. 4. Compare the p-value to the α-level. If p ≤ α, the observed effect is statistically significant, the null is rejected, and the alternative hypothesis is valid. Za předpokladu platnosti nulové hypotézy je p-hodnota pravděpodobnost, že data jsou nejméně tak extrémní jako ta pozorovaná.

One-tailed and two-tailed one-tailed (directional) test two-tailed (non-directional) test Z-critical value, what is it?

NEW STUFF

Decision errors Hypothesis testing is prone to misinterpretations. It's possible that students selected for the musical lesson were already more engaged. And we wrongly attributed high engagement score to the song. Of course, it's unlikely to just simply select a sample with the mean engagement of 8.2. The probability of doing so is , pretty low. Thus we concluded it is unlikely. But it's still possible to have randomly obtained a sample with such a mean.

Four possible things can happen Decision Reject H 0 Retain H 0 State of the world H 0 true 13 H 0 false 24 In which cases we made a wrong decision?

Four possible things can happen Decision Reject H 0 Retain H 0 State of the world H 0 true 1 H 0 false 4 In which cases we made a wrong decision?

Four possible things can happen Decision Reject H 0 Retain H 0 State of the world H 0 true Type I error H 0 false Type II error

Type I error When there really is no difference between the populations, random sampling can lead to a difference large enough to be statistically significant. You reject the null, but you shouldn't. False positive – the person doesn't have the disease, but the test says it does

Type II error When there really is a difference between the populations, random sampling can lead to a difference small enough to be not statistically significant. You do not reject the null, but you should. False negative - the person has the disease but the test doesn't pick it up Type I and II errors are theoretical concepts. When you analyze your data, you don't know if the populations are identical. You only know data in your particular samples. You will never know whether you made one of these errors.

The trade-off If you set α level to a very low value, you will make few Type I/Type II errors. But by reducing α level you also increase the chance of Type II error. Your decision whether to allow more false positives (Type I error) or more false negatives (Type II error) is based on practical consequences of these errors. See next example

Clinical trial for a novel drug Drug that should treat a disease for which there exists no therapy. If the result is statistically significant, drug will me marketed. If the result is not statistically significant, work on the drug will cease. Type I error: treat future patients with ineffective drug Type II error: cancel the development of a functional drug for a condition that is currently not treatable. Which error is worse? I would say Type II error. To reduce its risk, it makes sense to set α = 0.10 or even higher. Harvey Motulsky, Intuitive Biostatistics

Clinical trial for a me-too drug Similar to the previous example, but now our prospective drug should treat a disease for which there already exists another therapy. Type I error: treat future patients with ineffective drug Type II error: cancel the development of a functional drug for a condition that can be treated adequately with existing drugs. Thinking scientifically (not commercially) I would minimize the risk of Type I error (set α to a very low value). Harvey Motulsky, Intuitive Biostatistics

population of students that did not attend the musical lesson population of students that did attend the musical lesson parameters are known sample statistic is known

Test statistic test statistic Z-test

New situation An average engagement score in the population of 100 students is 7.5. A sample of 50 students was exposed to the musical lesson. Their engagement score became 7.72 with the s.d. of 0.6. DECISION: Does a musical performance lead to the change in the students' engagement? Answer YES/NO. Setup a hypothesis test, please.

Hypothesis test

Formulate the test statistic but this is unknown! population of students that did attend the musical lesson sample population of students that did not attend the musical lesson known unknown

t-statistic one sample t-test jednovýběrový t-test

One-sample t-test

Quiz

Z-test vs. t-test

Typical example of an one-sample t-test

Two-sample t-test So far, we have been working with just one sample. Now, we want to compare the sample of students without song with the sample of students with the song. population of students that did not attend the musical lesson population of students that did attend the musical lesson unknown sample statistic is known sample statistic is known dvouvýběrový t-test

Two-sample t-test

Dependent t-test for paired samples Two samples are dependent when the same subject takes the test twice. For example, give one person two different conditions to see how he/she reacts. paired t-test (párový t-test) Examples: Each subject is assigned to two different conditions Give a person two types of treatment. Growth over time. Compare the effect of the treatment (drug) after 12 hours and after 24 hours.

Engagement example

Test statistic

Paired t-test

Dependent samples Advantages we can use fewer subjects cost-effective less time-consuming Disadvantages carry-over effects order may influence results