Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.

Slides:



Advertisements
Similar presentations
1 Hypothesis Testing William P. Wattles, Ph.D. Psychology 302.
Advertisements

CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Hypothesis testing Week 10 Lecture 2.
Chapter 8 Hypothesis Testing I. Significant Differences  Hypothesis testing is designed to detect significant differences: differences that did not occur.
Behavioural Science II Week 1, Semester 2, 2002
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Hypothesis testing & Inferential Statistics
Statistics for the Social Sciences Psychology 340 Fall 2006 Review For Exam 1.
Chapter Sampling Distributions and Hypothesis Testing.
Ch 15 - Chi-square Nonparametric Methods: Chi-Square Applications
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 6 Chicago School of Professional Psychology.
PSY 307 – Statistics for the Behavioral Sciences
Today Concepts underlying inferential statistics
Chapter 12 Inferring from the Data. Inferring from Data Estimation and Significance testing.
Probability Population:
Inferential Statistics
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Chapter 5For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Suppose we wish to know whether children who grow up in homes without access to.
Inferential Statistics
Inferential Statistics
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Testing Hypotheses.
Chapter Ten Introduction to Hypothesis Testing. Copyright © Houghton Mifflin Company. All rights reserved.Chapter New Statistical Notation The.
AM Recitation 2/10/11.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Overview of Statistical Hypothesis Testing: The z-Test
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Descriptive statistics Inferential statistics
Introduction to Hypothesis Testing for μ Research Problem: Infant Touch Intervention Designed to increase child growth/weight Weight at age 2: Known population:
Tuesday, September 10, 2013 Introduction to hypothesis testing.
Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 8 Introduction to Hypothesis Testing
Hypothesis Testing.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
1 Statistical Inference Greg C Elvers. 2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
The Argument for Using Statistics Weighing the Evidence Statistical Inference: An Overview Applying Statistical Inference: An Example Going Beyond Testing.
Chapter 8 Introduction to Hypothesis Testing
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Hypothesis Testing A procedure for determining which of two (or more) mutually exclusive statements is more likely true We classify hypothesis tests in.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Chapter 6 USING PROBABILITY TO MAKE DECISIONS ABOUT DATA.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Smith/Davis (c) 2005 Prentice Hall Chapter Nine Probability, the Normal Curve, and Sampling PowerPoint Presentation created by Dr. Susan R. Burns Morningside.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Chapter 3: Statistical Significance Testing Warner (2007). Applied statistics: From bivariate through multivariate. Sage Publications, Inc.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
1 URBDP 591 A Lecture 12: Statistical Inference Objectives Sampling Distribution Principles of Hypothesis Testing Statistical Significance.
© Copyright McGraw-Hill 2004
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Course Overview Collecting Data Exploring Data Probability Intro. Inference Comparing Variables Relationships between Variables Means/Variances Proportions.
Psych 230 Psychological Measurement and Statistics Pedro Wolf October 21, 2009.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Statistical principles: the normal distribution and methods of testing Or, “Explaining the arrangement of things”
Inferential Statistics Introduction to Hypothesis Testing.
Introduction To Hypothesis Testing
Presentation transcript:

Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010

Hypotheses Hypothesis: specific prediction about the outcome of a study Remember directional and non-directional hypotheses? Both kinds of hypotheses are called the alternative hypothesis (H 1 ): a prediction that there will be some change or difference among scores There is a relation between X and Y in the sample Treatment and control groups are different The sample scores are distinct from the population scores The other option is called the null hypothesis (H 0 ): negation of the alternative hypothesis There is NO relation between X and Y in the sample Treatment and control groups are NOT different The sample scores are NOT distinct from the population scores

Deciding which one is right  The alternative hypothesis (H 1 ) and the null hypothesis (H 0 ) can be envisioned as two separate probabilistic distributions that represent two distinct sets of scores  The alternative hypothesis (H 1 ) represents the scores in which there is a significant change or difference in the sample of scores compared to the population  The null hypothesis (H 0 ) represents the scores in which there is NO significant change or difference in the sample of scores compared to the population  Significance tests determine the probability that the null hypotheses (H 0 ) is true (nothing happening with data)

Deciding which one is right Before conducting a significance test, we can look for certain clues to decide which is correct… Is there overlap between the H 0 and H 1 distributions? If the two distributions overlap a lot, it is likely that there IS NOT a meaningful difference and we should maintain the null hypothesis (H o ) (nothing happening)  Which means reject the alternative hypothesis (H 1 ) If the two distributions do not overlap much, it is likely that there IS a meaningful difference and we should reject the null hypothesis (H o ) (something happening)  Which means maintain the alternative hypothesis (H 1 )

Which has more overlap?

How do we decide whether the difference is meaningful?  In order to judge the two distributions as significantly different, we should look at how close the two Means are to each other  If the means are close likely to maintain H 0 (again, the distributions overlap)  If the means are far, likely to reject H 0 (again, the distributions don’t overlap)  We use a certain probability, p, or critical value as a cut-off point to decide whether the overlap is significant or not  p corresponds to an area underneath the distributions curve  The consensus in psychology is if p ≤.05, reject H 0 (reasonable to assume that if a score is beyond 95% of scores in a distribution, it is not part of that distribution)  To be very cautious, researchers sometimes use p ≤.01

An example…  A high school uses tracking to divide students into “Advanced Placement” and “College Prep” classes. They believe College Prep students represent a general student population while AP students represent a “gifted” population and will have higher PSAT scores.  Assume a directional alternative hypothesis (H 1 )  AP students will score higher than general students on the PSAT  What would the null hypothesis (H 0 ) be?  No real difference between AP and general students’ PSAT scores  General school PSAT data  Mean = 500 and Std. Dev. = 200  Advanced Placement PSAT data  Mean = 650 and Std. Dev. = 100  Assume p ≤.05

Where to locate p, the critical value  Directional alternative hypothesis (H 1 ): likelihood that a score will fall in either extreme of the distribution or that change will be either positive or negative  One-tailed  All of p at one extreme (in predicted direction)  Because we predicted AP students’ PSAT scores would be higher, we drew p at the positive end of the distribution  Non-directional alternative hypothesis (H 1 ): equal likelihood that a score could fall in either extreme of the distribution or that change could be positive or negative (two-tailed)  Two-tailed  Divide p by 2 for both extremes of distribution  p of.05 would mean.025 at each extreme

H 1 is non- directional p ≤.05 Two-tailed 2.5% in each extreme H 1 is directional p ≤.05 One-tailed 5% in one extreme Where to locate p, the critical value

Errors in our decisions about H 1 and H 0  Type I error: H 0 is rejected when it is actually true and should be maintained; OVERCONFIDENCE - WORSE  Area of Type I error: α  “Confidence” is the probability of retaining null when alternative is false (correct decision)  Area of confidence: 1- α  Type II error: H 0 is maintained when it is actually false and should be rejected; BEING TOO CAUTIOUS  Area of Type II: ß  “Power” is the probability of rejecting null when alternative is true (correct decision)  Area of power: 1- ß

Power Confidence Making too much of results (worse error!) Overlooking significant results

Understanding these errors  One can reduce the risk of committing a Type I or Type II error by using a larger sample size  To minimize the chances of committing the worse, Type I error, researchers use smaller or more strict p values (I.e., select.01 instead of.05)  But one can never totally eliminate the risk of making either type of error  It is always possible that concluding there is a significant difference or change (rejecting H 0 ) is due to random error in the sample data  This is good reminder of how scientific knowledge remains fallible!

Approaches to data analysis  Descriptive statistics  Describe or summarize data; characterize sample  Organize responses to show trends in data  Options:  Frequency distributions  Measures of Central Tendency  Measures of Variability  Inferential statistics  Draw inferences about population from sample  Capture impact of random error on responses  Options:  Parametric  Non-parametric

Within inferential statistics…  Parametric tests deal with parameters (statistics that describe the population) and try to infer whether characteristics of the sample match the population  Parametric tests are powerful but have many assumptions (e.g., normal distributions)  Non-parametric tests have fewer assumptions and do not depend on population parameters, though they are less powerful

Non-parametric: The Chi-Square Test  The Chi-square test is a statistical test used to examine differences in nominal level variables (categories)  Is there an interesting or meaningful pattern in the responses or is the distribution of responses simply due to chance?  H 0 : no interesting pattern, responses just different by chance  H 1 : meaningful difference or relationship in responses  The Chi-square test allows us to check for meaningful differences among data for which we can’t compute the mean or standard deviation (e.g., Republican or Demoncrat)  Can be univariate (one variable) or bivariate (two variables)  Univariate example: types of music preferred  Bivariate example: types of music preferred by region of country

Non-parametric: The Chi-Square Test  The outcome of the Chi-square test is a comparison of expected frequencies of responses (theoretically predicted) vs. observed frequencies of responses (actually obtained) for any given variable  Sometimes expected frequency is known (e.g., in gambling, we can predict how many times a 6 should be rolled in 100 die rolls)  More often in psychology, we do NOT know the expected frequency for a given variable so we define the expected frequency as the number of observations predicted for any category if H 0 is true  We use the observed vs. expected frequencies to calculate a  2 value and compare it to a critical  2 value in a hypothetical distribution of  2 scores

Chi-Square Test Example: Music preferences of college students (N = 80) CountryR&BAlternativePop The H 0 would state that there is no difference in types of music preferred by college students so expected frequency, f E, would be 20 for each category Univariate, Observed frequencies: The  2 value for these scores (14) surpasses the critical  2 value at p =.05 (7.8) so we would reject H 0 and conclude that there is a significant difference in the music preference of students surveyed.

Chi-Square Test Example: Music preferences of college students (N = 80) The H 0 would state that there is no difference in type of music preferred by college students by region of the country so expected frequency, f E, would be 6.7 for each category Bivariate, Observed frequencies: The  2 value for these scores (32.9) surpasses the critical  2 value at p =.05 (12.6) so we would reject H 0 and conclude that the observed differences are not due to chance. Rather, music preference and region of the country are meaningfully related. CountryR&BAlternativePop N.East West M.West