Lab 9: Two Group Comparisons. Today’s Activities - Evaluating and interpreting differences across groups – Effect sizes Gender differences examples Class.

Slides:

Advertisements

Similar presentations

Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.

Advertisements

Lab 3: Correlation and Retest Reliability. Today’s Activities Review the correlation coefficient Descriptive statistics for Big Five at Time 1 and Time.

Research Methods for Counselors COUN 597 University of Saint Joseph Class # 8 Copyright © 2015 by R. Halstead. All rights reserved.

Data Analysis Statistics. Inferential statistics.

The Two Sample t Review significance testing Review t distribution

Chapter 8 The t Test for Independent Means Part 1: March 6, 2008.

PSY 307 – Statistics for the Behavioral Sciences

T-Test (difference of means test) T-Test = used to compare means between two groups. Level of measurement: –DV (Interval/Ratio) –IV (Nominal—groups)

Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Interactions in Regression.

Independent Samples and Paired Samples t-tests PSY440 June 24, 2008.

Lecture 9: One Way ANOVA Between Subjects

Two Groups Too Many? Try Analysis of Variance (ANOVA)

S519: Evaluation of Information Systems

 What is t test  Types of t test  TTEST function  T-test ToolPak 2.

Data Analysis Statistics. Inferential statistics.

Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.

Today Concepts underlying inferential statistics

Using Statistics in Research Psych 231: Research Methods in Psychology.

Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.

The t Tests Independent Samples.

Point Biserial Correlation Example

Statistics for the Social Sciences

Statistical Analyses & Threats to Validity

Hypothesis Testing:.

Chapter 13 – 1 Chapter 12: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Errors Testing the difference between two.

Week 9 Chapter 9 - Hypothesis Testing II: The Two-Sample Case.

© 2011 Pearson Prentice Hall, Salkind. Introducing Inferential Statistics.

Hypothesis Testing II The Two-Sample Case.

Lecture 14 Testing a Hypothesis about Two Independent Means.

Comparing Means From Two Sets of Data

Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,

Comparing Two Population Means

T tests comparing two means t tests comparing two means.

RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.

The t Tests Independent Samples. The t Test for Independent Samples Observations in each sample are independent (not from the same population) each other.

Chapter 9: Testing Hypotheses

1 Why do we need statistics? A.To confuse students B.To torture students C.To put the fear of the almighty in them D.To ruin their GPA, so that they don’t.

January 31 and February 3,  Some formulae are presented in this lecture to provide the general mathematical background to the topic or to demonstrate.

COURSE: JUST 3900 TIPS FOR APLIA Developed By: Ethan Cooper (Lead Tutor) John Lohman Michael Mattocks Aubrey Urwick Chapter : 10 Independent Samples t.

One-sample In the previous cases we had one sample and were comparing its mean to a hypothesized population mean However in many situations we will use.

Hypothesis testing Intermediate Food Security Analysis Training Rome, July 2010.

SPSS Basics and Applications Workshop: Introduction to Statistics Using SPSS.

Education 795 Class Notes Data Analyst Pitfalls Difference Scores Effects Sizes Note set 12.

6/4/2016Slide 1 The one sample t-test compares two values for the population mean of a single variable. The two-sample t-test of population means (aka.

DIRECTIONAL HYPOTHESIS The 1-tailed test: –Instead of dividing alpha by 2, you are looking for unlikely outcomes on only 1 side of the distribution –No.

Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.

Review Hints for Final. Descriptive Statistics: Describing a data set.

Chapter 10 The t Test for Two Independent Samples

Chapter Eight: Using Statistics to Answer Questions.

Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.

CHAPTER OVERVIEW Say Hello to Inferential Statistics The Idea of Statistical Significance Significance Versus Meaningfulness Meta-analysis.

Tuesday, September 24, 2013 Independent samples t-test.

Handout Six: Sample Size, Effect Size, Power, and Assumptions of ANOVA EPSE 592 Experimental Designs and Analysis in Educational Research Instructor: Dr.

Statistics (cont.) Psych 231: Research Methods in Psychology.

Hypothesis test flow chart

Jump to first page Inferring Sample Findings to the Population and Testing for Differences.

HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.

Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.

Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.

Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.

 What is Hypothesis Testing?  Testing for the population mean  One-tailed testing  Two-tailed testing  Tests Concerning Proportions  Types of Errors.

Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.

Introduction to Power and Effect Size  More to life than statistical significance  Reporting effect size  Assessing power.

CHAPTER 15: THE NUTS AND BOLTS OF USING STATISTICS.

Dependent-Samples t-Test

Statistics for the Social Sciences

UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE

Psych 231: Research Methods in Psychology

Chapter Nine: Using Statistics to Answer Questions

Presentation transcript:

Lab 9: Two Group Comparisons

Today’s Activities - Evaluating and interpreting differences across groups – Effect sizes Gender differences examples Class example with data Point biserial correlations – Null Hypothesis testing What does “significant” mean? – Comparing more than 2 groups – Homework 9

Why compare 2 groups?

Research Questions Lots of research questions center on whether there are differences between groups – Does an intervention work? – Do different groups show different levels of a construct of interest? – Etc. Need methods to evaluate when differences are meaningful. Science provides guidelines to aid interpretation. (Still requires judgment!) – Effect sizes – Null Hypothesis testing

2 Group Examples: Gender Differences

Basic Questions Are women and men basically different or the same when it comes to personality and other variables? Have these differences been exaggerated due to gender stereotypes?

Two Positions Minimalist Position – Sex differences are small and inconsequential. – Great variability within each sex – Most sex differences are actually quite small – Sex differences don’t matter for everyday life Maximalist – Sex differences exist and they matter. – Small effect sizes can have large practical consequences – Sex differences are comparable in size to many other effects that psychologists think are important – There is a range of variability in the sex differences

Thinking Meta-Analytically Lots of studies compare men and women on the average level of Variable X Goal of a Meta-Analysis: Collect and summarize the results of a body of literature. Compute an effect size using the d metric d = Mean of Group 1 – Mean of Group 2 Pooled Standard Deviation

Cohen’s Rule of Thumb for d Small: Around.20 or -.20 Medium: Around.50 or -.50 Large: Around.80 and above or -.80 and below Comparisons: – Throwing distance: d = 2.0 (Men Longer) – Arm Strength: d = 1.25 (Men Stronger) – Height: d = 1.75 (Men Taller)

Sex Differences in Personality Source: Feingold (1994)

Big Five Facet Differences Anxiety (N): d = -.28 (Women higher) Openness to Ideas (O): d =.03 Assertiveness (E): d =.50 (Men higher) Activity (E): d =.09 Trust (A): d = -.25 (Women higher) Tender-Mindedness (A): d = -.97 (Women higher) Order (C): d = -.13 (Women higher)

Other Differences Smiling: d = -.60 (Women smile more) Self-Reports of Attractiveness: d =.24 (Men higher) Actual Ratings of Attractiveness: d = -.20 (Men lower) Body Image/Satisfaction: d =.58 (Men higher) Aggression – Fantasy Measures: d =.80 (Men higher) – Peer Reports: d =.63 (Men higher) – Self-Reports: d =.40 (Men higher) – Worldwide Men commit about 90% of the homicides

Data Example – Use file on class website

Are men and women different? Use class data for an example. Are there gender differences on the sexual attitudes scale (sexsc)? Compute an effect size using the d metric d = Mean of Women – Mean of Men Pooled Standard Deviation

Calculate effect size for sexsc IN SPSS: – Get the pooled SD Analyze, Descriptives, Descriptives, sexsc – Get the mean scores for men and women Data, Split file Select ‘Organize output by groups’, select gender, OK Descriptive Statistics, Descriptives, sexsc Calculate by hand using the formula for d d = Mean of Group 1 – Mean of Group 2 Pooled Standard Deviation

Group differences – recast as point biserial correlations

Point Biserial Correlations Can also look at the same information in the context of the now-familiar correlation Use 0/1 variable to indicate group – E.g. Male group = 0, Female group = 1 Run Correlation in SPSS – Be sure that group variable is coded correctly. Interpretation is similar to a traditional correlation coefficient. – The point biserial correlation is positive when small values of X are associated with group=0 and large values of X are associated with group=1.

Null Hypothesis Testing What do people mean by “significant”?

Testing for Group Differences Effect sizes describe the magnitude of difference between groups. Another way of interpreting the data is called null hypothesis testing – Set up a specific hypothesis – Set a decision criteria (specific threshold for rejecting or failing to reject the hypothesis) – Collect sample data – Evaluate hypothesis for “significance” Statistical significance depends on sample size, so often what really matters is the effect size. (Is the difference meaningful?)

Hypothesis testing with the t-test T-test is used when we’re interested in the mean difference between two sets of data Want to know if the sample data support rejecting equal means between groups in the population. If this support is obtained, then we can conclude that the mean of one group is significantly different from the mean of another group.

Independent Samples T-Test – Hypothesis Testing Step 1: Hypotheses and  – Ho:  1 –  2 = 0 (no difference between population means) – H1:  1 –  2  0 OR equivalently H1:  1   2 (there is a mean difference between groups) –  =.05 and a two-tailed test will be used. Step 2: Set decision criteria and locate critical regions. – Example: To reject Ho, the obtained t-statistic must be Step 3: Collect sample data and calculate the t-statistic. Step 4: Evaluate Ho. – Example: The obtained t-statistic is greater than (We reject Ho.)

T-test example in SPSS Example research hypothesis: Are there mean differences in grade point average reported by men and women in PSY395? – Identify: Hypotheses Decision criteria Calculate t in SPSS – Analyze, Compare Means, Independent samples t-test – Enter test variables (GPA), and Grouping variables (gender) – OK Evaluate hypothesis and make decision (reject or fail to reject null hypothesis)

What if there are more than 2 groups?

More than 2 groups Example research hypotheses: – Is birth order (only child, 1 st, 2 nd, 3 rd born) associated with different amounts of conscientiousness? – Is there a difference in the efficacy of different treatments for depression (individual psychotherapy, medication, group therapy)? Regardless of the number of groups, want to get a sense of the effect size. – How much do the groups differ? Often use theory to determine what comparisons are needed and meaningful.

Effect Sizes for Multiple Groups Imagine we are interested in studying how mood impacts performance on a task. – Create 3 experimental groups Happy mood Neutral mood Sad mood – Examine differences of theoretical interest. E.g., calculate an effect size comparing performance in the happy group with performance in the sad group.

Hypothesis testing with more than 2 groups Analysis of Variance (ANOVA) – Used to test for mean differences in dependent variable between groups/levels – Can use when there are more than two groups (which is the same as more than two levels of your IV) – Indicates if there is a mean level difference between groups. (Post hoc tests for details) – ANOVA is significant if F-statistic is greater than the critical values of F

Repeated Measures When we want to compare mean differences in the DV for the same group of people measured at two or more times – Does psychotherapy decrease depression? (where the same people would complete a depression measure both before and after receiving therapy) Can calculate effect sizes or use hypothesis testing. – Effect size (Mean at time 1 compared to mean at time 2) – Repeated measures T-test (2 groups) – Repeated measures ANOVA (more than 2 levels of I.V.)

What you need to turn in for HW#9 Use the class data to evaluate whether there are gender differences in displaced aggression (displace). – State the null hypothesis and the alternative hypothesis. (2 pts) – Calculate the d-metric effect size of this difference. Provide an interpretation of this estimate using Cohen’s guidelines. (2 pts) – Perform the same comparison using a point-biserial correlation. Provide the interpretation for this estimate. (2 pts) – Use SPSS to perform an independent samples t-test to compare the means of displaced aggression for men and women. What conclusion did you come to and why? (2 pts)

What you need to turn in for HW#9 Imagine a researcher is interested in whether self-esteem varies based on year in college. State the null and alternative hypotheses the researcher would want to test. Identify what analysis could best determine whether there are mean level differences in self-esteem across 1 st, 2 nd, 3 rd, and 4 th year college students. (2 pts)