Introduction to Hypothesis Testing

Slides:



Advertisements
Similar presentations
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Advertisements

Beyond Null Hypothesis Testing Supplementary Statistical Techniques.
EPIDEMIOLOGY AND BIOSTATISTICS DEPT Esimating Population Value with Hypothesis Testing.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
10 Hypothesis Testing. 10 Hypothesis Testing Statistical hypothesis testing The expression level of a gene in a given condition is measured several.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Lecture 2: Thu, Jan 16 Hypothesis Testing – Introduction (Ch 11)
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 8-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview of Lecture Independent and Dependent Variables Between and Within Designs.
TESTING HYPOTHESES FOR A SINGLE SAMPLE
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th Edition Chapter 9 Hypothesis Testing: Single.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Chapter 9 Hypothesis Testing.
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
Chapter 8 Introduction to Hypothesis Testing
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 8-1 TUTORIAL 6 Chapter 10 Hypothesis Testing.
© 1999 Prentice-Hall, Inc. Chap Chapter Topics Hypothesis Testing Methodology Z Test for the Mean (  Known) p-Value Approach to Hypothesis Testing.
Probability Population:
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Chapter 5For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Suppose we wish to know whether children who grow up in homes without access to.
Statistical hypothesis testing – Inferential statistics I.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Hypothesis Testing:.
Probability Distributions and Test of Hypothesis Ka-Lok Ng Dept. of Bioinformatics Asia University.
Overview of Statistical Hypothesis Testing: The z-Test
Chapter 10 Hypothesis Testing
Business Statistics - QBM117 Introduction to hypothesis testing.
Confidence Intervals and Hypothesis Testing - II
Descriptive statistics Inferential statistics
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Mid-semester feedback In-class exercise. Chapter 8 Introduction to Hypothesis Testing.
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 8 Introduction to Hypothesis Testing
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap th Lesson Introduction to Hypothesis Testing.
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Lecture 3: Review Review of Point and Interval Estimators
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Chapter 10 Hypothesis Testing
1 Introduction to Hypothesis Testing. 2 What is a Hypothesis? A hypothesis is a claim A hypothesis is a claim (assumption) about a population parameter:
Chapter 8 Introduction to Hypothesis Testing
Lecture 7 Introduction to Hypothesis Testing. Lecture Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses.
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
Psy B07 Chapter 4Slide 1 SAMPLING DISTRIBUTIONS AND HYPOTHESIS TESTING.
10.2 Tests of Significance Use confidence intervals when the goal is to estimate the population parameter If the goal is to.
1 ConceptsDescriptionHypothesis TheoryLawsModel organizesurprise validate formalize The Scientific Method.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Chap 8-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 8 Introduction to Hypothesis.
Lecture 9 Chap 9-1 Chapter 2b Fundamentals of Hypothesis Testing: One-Sample Tests.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Math 4030 – 9a Introduction to Hypothesis Testing
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
© 2004 Prentice-Hall, Inc.Chap 9-1 Basic Business Statistics (9 th Edition) Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests.
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Course Overview Collecting Data Exploring Data Probability Intro. Inference Comparing Variables Relationships between Variables Means/Variances Proportions.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
 What is Hypothesis Testing?  Testing for the population mean  One-tailed testing  Two-tailed testing  Tests Concerning Proportions  Types of Errors.
Chapter 9 Hypothesis Testing Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Chapter 9 Introduction to the t Statistic
Presentation transcript:

Introduction to Hypothesis Testing The One-Sample z Test

The One-Sample z Test Conditions of Applicability: One group of subjects Comparing to population with known mean and variance. Note: this is not a common situation in Psychology! PSYC 6130, PROF. J. ELDER

Example: Finish times for the 2005 Toronto Marathon (Oct 16, 2005) Suppose your population of interest are women who ran the marathon (slightly artificial). You hypothesize that women in their early twenties (20-24) are faster than the average woman who ran the marathon. Here the ‘treatment’ is ‘youth’. PSYC 6130, PROF. J. ELDER

Null Hypothesis Testing Largely due to English mathematician Sir R.A. Fisher (1890-1962) ‘Proof by contradiction’ Suppose the null hypothesis is true In our example, the null hypothesis is that the finishing times for young women are drawn from the same distribution as for the rest of the female contestants. Knowing the mean and standard deviation of the population, we can compute the sampling distribution of the mean for a sample of size n. This is the null hypothesis distribution. The mean time for our sample of young women should be plausible under this sampling distribution. If it is not plausible, it suggests that the null hypothesis is false. This lends credence to our alternate hypothesis (that young women are faster). PSYC 6130, PROF. J. ELDER

How do we judge the plausibility of the null hypothesis? The sample mean should be plausible under the sampling distribution of the mean. Implausible Fairly plausible Highly plausible PSYC 6130, PROF. J. ELDER

Plausibility of the null hypothesis The plausibility of the null hypothesis is judged by computing the probability p of observing a sample mean that is at least as deviant from the population mean as the value we have observed. PSYC 6130, PROF. J. ELDER

Plausibility of the null hypothesis This computation is simplified by converting to z-scores. Under the assumption of normality, we can determine this probability from a standard normal table. PSYC 6130, PROF. J. ELDER

Results for 2005 Toronto Marathon PSYC 6130, PROF. J. ELDER

Results for Random Sample of Women Under 25 PSYC 6130, PROF. J. ELDER

Statistical Decisions We now know the probability that an observation like ours could have been drawn from the general female contestant population, i.e. that our ‘treatment of youth’ had no effect. This probability is pretty small. Should we reject the null hypothesis? This is the process of turning a continuous probability (a real number) into a binary decision (yes or no). If we reject the null hypothesis, there is a chance we will be wrong. We have to decide what chance we are willing to take, i.e. the maximum p-value we will accept as grounds for rejecting the null hypothesis. We call this probability threshold the alpha (a) level. A typical value is .05. The a-level must be decided prior to the experiment. PSYC 6130, PROF. J. ELDER

Type I and Type II Errors Type I Error: the null hypothesis is true and we reject it. Type II Error: the null hypothesis is false and we fail to reject it. Actual Situation Researcher’s Decision Null Hypothesis is True Null Hypothesis is False Accept the Null Hypothesis Reject the Null Hypothesis PSYC 6130, PROF. J. ELDER

Type I and Type II Errors Which is more serious? Type I can be bad, as rejecting the null hypothesis (e.g., ‘This stuff really works’), may cause actions to be taken that have no value. Type II may not be so bad, if it is understood that the treatment may still have an effect (we fail to reject the null hypothesis, but we do not reject the alternate hypothesis). But Type II may be bad if it leads to inaction when action would have produced good results (e.g., a cure for cancer). PSYC 6130, PROF. J. ELDER

One-Tailed vs Two-Tailed Tests Our marathon hypothesis was one-tailed, because we made a specific prediction about the direction of the effect (young women are faster). Suppose we had simply hypothesized that young women are different. PSYC 6130, PROF. J. ELDER

Two-Tailed Test PSYC 6130, PROF. J. ELDER

One-Tailed vs Two-Tailed Tests Use a one-tailed test when you have a specific reason to believe the effect will be in a particular direction, and you do not care if the effect is in the opposite direction. Otherwise, use a two-tailed test. One-tailed tests will always result in smaller p values, and hence a greater chance of reaching significance for your directional hypothesis. The decision of whether to perform one-tailed or two-tailed tests must be made prior to data collection. PSYC 6130, PROF. J. ELDER

Basic Procedure for Statistical Inference State the hypothesis Select the statistical test and significance level Select the sample and collect the data Find the region of rejection Calculate the test statistic Make the statistical decision PSYC 6130, PROF. J. ELDER

Step 1. State the Hypothesis PSYC 6130, PROF. J. ELDER

Step 2. Select the Statistical Test and the Significance Level We are comparing a sample mean to a population with known mean and standard deviation  z-test p=.05 is probably appropriate. PSYC 6130, PROF. J. ELDER

Step 3. Select the Sample and Collect the Data Ideally, we would randomly assign the treatment to a random sample of the population (Toronto Marathon women). Is this possible? Instead, we randomly sample female contestants under 25. PSYC 6130, PROF. J. ELDER

Step 4. Find the Region of Rejection The z value defining the rejection region is called the critical value for your test, and is a function of the selected α-level. For this reason, we often denote the critical value as zα PSYC 6130, PROF. J. ELDER

Step 5. Calculate the Test Statistic PSYC 6130, PROF. J. ELDER

Step 6. Make the Statistical Decision p<a: Reject null hypothesis. p>a: Fail to reject null hypothesis. PSYC 6130, PROF. J. ELDER

Example: Height of Female Psychology Graduate Students Sample: Female students enrolled in PSYC 6130C 2008-09 PSYC 6130, PROF. J. ELDER

Assumptions Underlying One-Sample z Test Random sampling Variable is normal CLT: Deviations from normality ok as long as sample is large. Dispersion of sampled population is the same as for the comparison population e.g. suppose means are the same, but dispersion of sampled population is greater than dispersion of comparison population. PSYC 6130, PROF. J. ELDER

Limitations of the One-Sample Test Strongly depends on random sampling. Better to have two groups of subjects: test (treatment) group and control group. Problem of random sampling reduces to problem of random assignment to two groups: much easier! PSYC 6130, PROF. J. ELDER

Reporting your results Express your result in evocative English, then include the required numbers. Follow APA style. Example: Young female runners were not found to be significantly faster than the general female contestant population, z=-1.31, p=0.095, one-tailed. PSYC 6130, PROF. J. ELDER

More on Type I and Type II Errors Total number of significant results Consistent use of a fixed alpha-level determines the proportion of null experiments that generate significant results. Don’t have enough information to know how many reported results are errors, because: Don’t know the relative proportion of cases where H0 is true and H0 is false. Don’t know the power of effective experiments. Typically only significant results are reported (publication bias). PSYC 6130, PROF. J. ELDER