Behavioural Science II Week 1, Semester 2, 2002


Hypothesis testing

Hypothesis testing
The null hypothesis is that there is no systematic relationship between the independent variables (IVs) and the dependent variables (DVs). The research hypothesis is that any relationship observed in the data is real.

Hypothesis testing
Whereas the research hypothesis tends to be imprecise about numerical differences between groups (e.g., a difference in reaction times), the null hypothesis states very specifically that the difference should be zero.

Null hypothesis versus alternative hypothesis
The null hypothesis assumes that scores for different levels of the IV are random samples from the same population. The alternative hypothesis is that the samples come from different populations.

Null hypothesis versus alternative hypothesis
For any single experiment, we are bound to see some difference between group means, just as we see a difference between the means of any two random samples in a distribution of sample means. If the null hypothesis is true, then such differences arise only because the groups are two random samples from the same population.

Testing the null hypothesis
A statistical test assesses the probability of obtaining a given sample or samples of scores, assuming the null hypothesis is correct.

Testing the null hypothesis
If the probability is low enough (e.g., p < .05), then the null hypothesis is rejected in favour of the alternative (research) hypothesis, and the IV is deemed to have a systematic effect.
If the probability is not sufficiently low (e.g., p > .05), then the null hypothesis is not rejected but retained, and the IV is treated as having no demonstrated effect (i.e., the observed differences are attributed to chance).

Statistical significance
Statistical significance refers to the probability of the data obtained, given that the null hypothesis is true. A statistically significant result does not mean that the null hypothesis is improbable. There is an ongoing gap between statistical significance and substantive significance.

Hypothesis testing and sampling distributions
The decision to reject or not reject the null hypothesis is usually made with reference to the sampling distribution of a statistic of some kind (e.g., the z-distribution or the t-distribution).

Example of hypothesis testing using z-distribution
Null hypothesis population parameters: μ = 100, σ = 15
Random sample statistics: Mean = 110, N = 9

Applying formulae
Given that a z-score beyond ±1.96 corresponds to p < .05 (two-tailed), we would reject the null hypothesis.
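As a minimal sketch of the calculation behind this slide, the Python below computes the z-score for the example. It assumes the null population has mean 100 and standard deviation 15; the mean is an assumption borrowed from the t-distribution example, since the slide's own symbols did not survive. The function name `z_test` is illustrative only.

```python
from math import sqrt
from statistics import NormalDist

def z_test(sample_mean, mu0, sigma, n):
    """Return (z, two-tailed p) for a one-sample z-test."""
    se = sigma / sqrt(n)                      # standard error of the mean
    z = (sample_mean - mu0) / se              # distance from null mean in SE units
    p = 2 * (1 - NormalDist().cdf(abs(z)))    # area in both tails beyond |z|
    return z, p

# Assumed null parameters: mu0 = 100, sigma = 15 (see lead-in).
z, p = z_test(sample_mean=110, mu0=100, sigma=15, n=9)
reject = abs(z) > 1.96                        # two-tailed cut-off from the slide
print(f"z = {z:.2f}, p = {p:.4f}, reject null: {reject}")
```

Under these assumed values the standard error is 15/√9 = 5, so the sample mean lies two standard errors above the null mean.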

Example of hypothesis testing using t-distribution
Null hypothesis population parameters: μ = 100
Random sample statistics: Mean = 110, N = 9, Σx² = 960

Applying formulae
Given that a t-score beyond ±2.306 (df = 8) corresponds to p < .05 (two-tailed), we would reject the null hypothesis.
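The t calculation can be sketched in the same way. This sketch assumes that Σx² = 960 is the sum of squared deviations from the sample mean (raw scores with a mean of 110 and N = 9 could not have squares summing to only 960). The critical value 2.306 is the one quoted on the slide; the function name is hypothetical.

```python
from math import sqrt

def one_sample_t(sample_mean, mu0, sum_sq_dev, n):
    """Return the t statistic given the sum of squared deviations."""
    variance = sum_sq_dev / (n - 1)   # unbiased estimate of population variance
    se = sqrt(variance / n)           # estimated standard error of the mean
    return (sample_mean - mu0) / se

T_CRIT = 2.306                        # two-tailed .05 cut-off for df = 8 (from the slide)
t = one_sample_t(sample_mean=110, mu0=100, sum_sq_dev=960, n=9)
reject = abs(t) > T_CRIT
print(f"t = {t:.3f}, reject null: {reject}")
```

With these numbers the estimated variance is 960/8 = 120, and the resulting t exceeds the cut-off.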

Hypothesis testing using confidence intervals
We reject the null hypothesis when the null population mean lies outside the confidence interval. We infer that the alternative population mean is higher than the null population mean if the lower limit of the confidence interval is to the right of the null population mean, and lower if the upper limit of the confidence interval is to the left of the null population mean.
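The decision rule above can be sketched with the numbers from the t-distribution example (mean 110, N = 9, Σx² = 960 read as a sum of squared deviations, null mean 100, cut-off 2.306). The helper names `confidence_interval` and `decide` are hypothetical.

```python
from math import sqrt

def confidence_interval(sample_mean, sum_sq_dev, n, t_crit):
    """Return (lower, upper) limits of the interval around the sample mean."""
    se = sqrt(sum_sq_dev / (n - 1) / n)   # estimated standard error of the mean
    margin = t_crit * se
    return sample_mean - margin, sample_mean + margin

def decide(mu0, lower, upper):
    """Apply the slide's rule: reject only if mu0 lies outside the interval."""
    if lower > mu0:
        return "reject: alternative mean is higher"
    if upper < mu0:
        return "reject: alternative mean is lower"
    return "retain"

lower, upper = confidence_interval(110, 960, 9, t_crit=2.306)
decision = decide(100, lower, upper)
print(f"95% CI = [{lower:.2f}, {upper:.2f}] -> {decision}")
```

Because the whole interval sits to the right of 100, the rule gives the same decision as the t-test.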

Errors in hypothesis testing
Given the gap between statistical and substantive significance, a decision based on probability to retain or reject the null hypothesis can be wrong.

When null hypothesis is true (Type I error)
When the null hypothesis is true and it is rejected, this decision is called a Type I error. The probability of making such an error is designated alpha (α) and is equivalent to the significance level (e.g., α = .05).

When null hypothesis is true (Type I error)
If the null hypothesis is true and the alpha level is set at .05, then the null hypothesis will be rejected 5% of the time even though it is true. One way to safeguard against a Type I error is to set a more stringent alpha level (e.g., α = .01).
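A small simulation can illustrate the 5% figure. This is a sketch under assumed values (a normal null population with mean 100 and standard deviation 15, samples of N = 9, and the standard two-tailed cut-offs 1.96 and 2.576); none of the names come from the slides.

```python
import random
from math import sqrt

def type_i_error_rate(z_crit, mu0=100.0, sigma=15.0, n=9,
                      experiments=20_000, seed=1):
    """Proportion of experiments that reject a TRUE null hypothesis."""
    rng = random.Random(seed)             # local generator for repeatability
    se = sigma / sqrt(n)
    rejections = 0
    for _ in range(experiments):
        # Every sample really does come from the null population.
        sample = [rng.gauss(mu0, sigma) for _ in range(n)]
        z = (sum(sample) / n - mu0) / se
        if abs(z) > z_crit:
            rejections += 1
    return rejections / experiments

rate_05 = type_i_error_rate(z_crit=1.96)    # alpha = .05, two-tailed
rate_01 = type_i_error_rate(z_crit=2.576)   # alpha = .01, two-tailed
print(f"alpha .05 -> {rate_05:.3f}, alpha .01 -> {rate_01:.3f}")
```

The simulated rejection rates should sit close to the nominal alpha levels, and the more stringent cut-off yields fewer false rejections.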

When null hypothesis is false (Type II or III errors)
When the alternative hypothesis is true and the statistic (mean) from the alternative distribution falls within the cut-off points (i.e., p > .05), the null hypothesis would be retained.

Type II error
Retaining the null hypothesis when the alternative hypothesis is true is called a Type II error. The probability of making a Type II error is usually symbolized as beta (β). The size of beta depends on how much the alternative hypothesis sampling distribution overlaps the retention region of the null hypothesis sampling distribution.

Type III error
It is also possible to make a Type III error, by rejecting a null hypothesis but inferring the incorrect alternative hypothesis. The probability of making a Type III error is usually symbolized as gamma (γ) and is equivalent to the proportion of the alternative sampling distribution that falls in the rejection region at the far end of the null hypothesis distribution. The probability of making a Type III error is usually quite small.
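The overlap described for the Type II and Type III errors can be sketched numerically. The values are illustrative assumptions, not taken from the slides: a null mean of 100, a hypothetical true alternative mean of 110, σ = 15, N = 9 and a two-tailed cut-off of 1.96. The function name `error_rates` is also hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def error_rates(mu0, mu1, sigma, n, z_crit=1.96):
    """Return (beta, gamma, correct_rejection) for a two-tailed z-test."""
    se = sigma / sqrt(n)
    alt = NormalDist(mu1, se)             # sampling distribution under the alternative
    lower_cut = mu0 - z_crit * se         # cut-off points of the null distribution
    upper_cut = mu0 + z_crit * se
    p_below = alt.cdf(lower_cut)          # alternative means landing in the lower tail
    p_above = 1 - alt.cdf(upper_cut)      # alternative means landing in the upper tail
    beta = 1 - p_below - p_above          # retained although the alternative is true
    if mu1 > mu0:
        gamma, correct = p_below, p_above # wrong-direction vs right-direction rejection
    else:
        gamma, correct = p_above, p_below
    return beta, gamma, correct

beta, gamma, correct = error_rates(mu0=100, mu1=110, sigma=15, n=9)
print(f"beta = {beta:.4f}, gamma = {gamma:.6f}, correct rejection = {correct:.4f}")
```

In this assumed scenario gamma is tiny compared with beta, in line with the remark that Type III errors are usually rare.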

The power of a test
The power of a test is the probability of rejecting a false null hypothesis and correctly inferring the position or direction of the alternative hypothesis with respect to the null hypothesis.
Factors affecting power and error rates

Power is affected by significance (alpha) level
Setting a less stringent significance level (e.g., .05 rather than .01) enlarges the rejection region and so increases the power of the test, as long as the alternative hypothesis is true.

Power is affected by magnitude of difference between sample means
Increasing the difference between the means at different levels of the IV increases the power of the test.

Power is affected by sample size
An increase in sample size increases the power of the test, if the alternative hypothesis is true. This is because as sample size increases, the standard error of the mean decreases, thus reducing the overlap between the null and alternative sampling distributions.
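The three influences on power (alpha level, size of the difference, and sample size) can be compared in one short sketch. It assumes a two-tailed z-test with known σ and treats power as the probability of rejecting in the correct direction; the specific means, σ and sample sizes are illustrative assumptions.

```python
from math import sqrt
from statistics import NormalDist

def power(mu0, mu1, sigma, n, alpha=0.05):
    """Probability of rejecting the null in the correct direction."""
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)   # two-tailed cut-off
    delta = abs(mu1 - mu0) / (sigma / sqrt(n))     # difference in standard-error units
    return 1 - NormalDist().cdf(z_crit - delta)

base = power(100, 110, 15, n=9)                    # reference case
stricter_alpha = power(100, 110, 15, n=9, alpha=0.01)
bigger_difference = power(100, 115, 15, n=9)
bigger_sample = power(100, 110, 15, n=36)

print(f"base = {base:.3f}")
print(f"alpha .01 = {stricter_alpha:.3f}")
print(f"larger difference = {bigger_difference:.3f}")
print(f"larger sample = {bigger_sample:.3f}")
```

Only one factor is changed at a time, so each printed value can be compared directly with the base case.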

Effect size
In order to gauge the effect of the IV, it makes sense to contrast the difference between the population mean for the null hypothesis and the population mean for the alternative hypothesis.

Effect size formula
d = (μ₁ − μ₀) / σ
where μ₀ and μ₁ are the population means under the null and alternative hypotheses, and σ is the standard deviation of the population of dependent measure scores.

Judging effect sizes
According to Cohen (1988):
.20 = small effect size
.50 = medium effect size
.80 = large effect size
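A minimal sketch of computing and labelling an effect size follows. It assumes the standard form of Cohen's d, the difference between the two population means divided by the population standard deviation, and it treats Cohen's three values as lower cut-offs for each label, which is a simplifying convention rather than something stated on the slide. The example values (100, 110, 15) are illustrative.

```python
def cohens_d(mu0, mu1, sigma):
    """Standardised difference between the alternative and null means."""
    return (mu1 - mu0) / sigma

def label_effect(d):
    """Label |d| using Cohen's benchmarks as lower cut-offs (an assumption)."""
    size = abs(d)
    if size >= 0.80:
        return "large"
    if size >= 0.50:
        return "medium"
    if size >= 0.20:
        return "small"
    return "below small"

d = cohens_d(mu0=100, mu1=110, sigma=15)
label = label_effect(d)
print(f"d = {d:.2f} ({label})")
```

Unlike the z or t statistic, d does not change with sample size, which is why it is useful for gauging the size of an effect rather than its statistical significance.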

Do we really need the null hypothesis?
A significant test of the null hypothesis does not mean the data are not a product of chance. The significant result may simply be a Type I error (falsely rejecting the null hypothesis).

Do we really need the null hypothesis?
It is better to test the research hypothesis directly, if the size and direction of the effect are known. Better still, report a combination of outcome values (e.g., effect sizes, confidence intervals, strength of relationship).

One-tailed versus two-tailed tests
Conventionally, the null hypothesis is rejected if the obtained z-score or t-score falls beyond certain values in either tail of the relevant sampling distribution (i.e., a two-tailed test). In specific contexts, a one-tailed test might seem appropriate (e.g., rejecting the null hypothesis only if the test statistic falls in the 5% left-hand tail of the distribution).

One-tailed versus two-tailed tests
Generally, two-tailed tests are preferred to one-tailed tests, because the IV may have an effect in the opposite direction to the one predicted.
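The contrast between the two kinds of test can be sketched with z-scores. The example below assumes a z-distribution and α = .05, and uses a left-tailed test as in the example above; the function names and the two test values (−1.8 and 2.5) are illustrative.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()   # standard normal (z) distribution

def rejects_two_tailed(z, alpha=0.05):
    """Reject if z falls in either tail, with alpha split across both."""
    crit = STD_NORMAL.inv_cdf(1 - alpha / 2)
    return abs(z) > crit

def rejects_left_tailed(z, alpha=0.05):
    """Reject only if z falls in the left-hand tail holding all of alpha."""
    crit = STD_NORMAL.inv_cdf(alpha)
    return z < crit

for z in (-1.8, 2.5):
    print(f"z = {z:+.1f}: two-tailed reject = {rejects_two_tailed(z)}, "
          f"left-tailed reject = {rejects_left_tailed(z)}")
```

The second value shows the risk mentioned above: a sizeable effect in the unpredicted direction is detected by the two-tailed test but missed by the one-tailed test.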