Planned Contrast: Execution (Conceptual) 1. Must predict pattern of interaction before gathering data. Predict that Democratic women will be most opposed.

Slides:



Advertisements
Similar presentations
Statistics for Linguistics Students Michaelmas 2004 Week 7 Bettina Braun
Advertisements

Hypothesis testing 5th - 9th December 2011, Rome.
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
Chapter 6: Correlational Research Examine whether variables are related to one another (whether they vary together). Correlation coefficient: statistic.
By Wendiann Sethi Spring  The second stages of using SPSS is data analysis. We will review descriptive statistics and then move onto other methods.
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
Research methods and statistics
Descriptive Statistics Primer
Lecture 19: Tues., Nov. 11th R-squared (8.6.1) Review
Chapter Eighteen MEASURES OF ASSOCIATION
Data Analysis Statistics. Inferential statistics.
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Today Concepts underlying inferential statistics
Measures of Association Deepak Khazanchi Chapter 18.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Chapter 14 Inferential Data Analysis
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
T-tests and ANOVA Statistical analysis of group differences.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Inferential Statistics
Leedy and Ormrod Ch. 11 Gray Ch. 14
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Inferential Statistics: SPSS
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Repeated Measures ANOVA
ASSOCIATION BETWEEN INTERVAL-RATIO VARIABLES
N318b Winter 2002 Nursing Statistics Specific statistical tests: Correlation Lecture 10.
SPSS Series 1: ANOVA and Factorial ANOVA
Descriptive Statistics e.g.,frequencies, percentiles, mean, median, mode, ranges, inter-quartile ranges, sds, Zs Describe data Inferential Statistics e.g.,
By Hui Bian Office for Faculty Excellence 1. K-group between-subjects MANOVA with SPSS Factorial between-subjects MANOVA with SPSS How to interpret SPSS.
Statistical Power 1. First: Effect Size The size of the distance between two means in standardized units (not inferential). A measure of the impact of.
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
Experimental Research Methods in Language Learning Chapter 11 Correlational Analysis.
Planned Contrasts and Data Management Class 19. QUIZ 3 ON THURSDAY, DEC. 5 Covers: Two-way ANOVA through Moderated Multiple Regression.
Moderated Multiple Regression Class 18. Functions of Regression 1. Establishing relations between variables Do frustration and aggression co-occur? 2.
Regression Analyses. Multiple IVs Single DV (continuous) Generalization of simple linear regression Y’ = b 0 + b 1 X 1 + b 2 X 2 + b 3 X 3...b k X k Where.
Examining Relationships in Quantitative Research
Hypothesis testing Intermediate Food Security Analysis Training Rome, July 2010.
Intro: “BASIC” STATS CPSY 501 Advanced stats requires successful completion of a first course in psych stats (a grade of C+ or above) as a prerequisite.
Descriptive Research: Quantitative Method Descriptive Analysis –Limits generalization to the particular group of individuals observed. –No conclusions.
SPSS Basics and Applications Workshop: Introduction to Statistics Using SPSS.
 Muhamad Jantan & T. Ramayah School of Management, Universiti Sains Malaysia Data Analysis Using SPSS.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Within Subjects Analysis of Variance PowerPoint.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Review Hints for Final. Descriptive Statistics: Describing a data set.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
ONE-WAY BETWEEN-GROUPS ANOVA Psyc 301-SPSS Spring 2014.
Chapter 10 Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law:
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Analysis of Variance ANOVA. Example Suppose that we have five educational levels in our population denoted by 1, 2, 3, 4, 5 We measure the hours per week.
Handout Twelve: Design & Analysis of Covariance
Planned Contrasts and Data Management
Handout Eight: Two-Way Between- Subjects Design with Interaction- Assumptions, & Analyses EPSE 592 Experimental Designs and Analysis in Educational Research.
Lecture 7 Data Analysis.  Developing coding scheme  Data processing  Data entry  Data cleaning & transformation  Data analysis  Interpretation of.
Power Point Slides by Ronald J. Shope in collaboration with John W. Creswell Chapter 7 Analyzing and Interpreting Quantitative Data.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
STATISTICAL TESTS USING SPSS Dimitrios Tselios/ Example tests “Discovering statistics using SPSS”, Andy Field.
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
Chapter 11: Test for Comparing Group Means: Part I.
Chapter 12 REGRESSION DIAGNOSTICS AND CANONICAL CORRELATION.
Statistics & Evidence-Based Practice
Psych 706: stats II Class #4.
Chapter 10 CORRELATION.
Chapter 13 Created by Bethany Stubbe and Stephan Kogitz.
Parametric versus Nonparametric (Chi-square)
Understanding Statistical Inferences
RES 500 Academic Writing and Research Skills
Exercise 1 Use Transform  Compute variable to calculate weight lost by each person Calculate the overall mean weight lost Calculate the means and standard.
Presentation transcript:

Planned Contrast: Execution (Conceptual) 1. Must predict pattern of interaction before gathering data. Predict that Democratic women will be most opposed to gun instruction in school, compared to Democratic men, Republican men, and Republican women.

Post Hoc Tests Do female democrats differ from other groups? 1= Male/Republican5.00 2=Male/Democrat4.50 3=Female/Republican4.75 4=Female/Democrat2.75 Conduct six t tests? NO. Why not?Will capitalizes on chance. Solution: Post hoc tests of multiple comparisons. Post hoc tests consider the inflated likelihood of Type I error Kent's favorite—Tukey test of multiple comparisons, which is the most generous. NOTE: Post hoc tests can be done on any multiple set of means, not only on planned contrasts.

Conducting Post Hoc Tests 1. Recode data from multiple factors into single factor, as per planned contrast. 2. Run oneway ANOVA statistic 3. Select "posthoc tests" option. ONEWAY gunctrl BY genparty /CONTRAST= /STATISTICS DESCRIPTIVES /MISSING ANALYSIS /POSTHOC = TUKEY ALPHA(.05). Selected post- hoc test Note: Not necessary to conduct planned contrast to conduct post-hoc test

Post hoc Tests, Page 1

Post Hoc Tests, Page 2

Data Management Issues Setting up data file Checking accuracy of data Disposition of data Why obsess on these details? Murphy's Law If something can go wrong, it will go wrong, and at the worst possible time. Errars Happin!

Creating a Coding Master 1. Get survey copy 2. Assign variable names 3. Assign variable values 4. Assign missing values 5. Proof master for accuracy 6. Make spare copy, keep in file drawer

Coding Master variable names variable values Note: Var. values not needed for scales

Cleaning Data Set 1. Exercise in delay of gratification 2. Purpose: Reduce random error 3. Improve power of inferential stats.

Complete Data Set Note: Are any cases missing data?

Are any “Minimums” too low? Are any “Maximums” too high? Do N s indicate missing data? Do SDs indicate extreme outliers? Checking Descriptives

Do variables correlate in the expected manner? Checking Correlations Between Variables

Using Cross Tabs to Check for Missing or Erroneous Data Entry Case A: Expect equal cell sizes Gender OldestYoungestOnly Child Males10 20 Females TOTAL Case B: Impossible outcome Number of Siblings OldestYoungestOnly Child None 43 6 One340 More than one34 2 TOTAL10 8

Storing Data Raw Data 1. Hold raw data in secure place 2. File raw data by ID # 3. Hold raw date for at least 5 years post publication, per APA Automated Data 1. One pristine source, one working file, one syntax file 2. Back up, Back up, Back up `3. Use external hard drive as back-up for PC

File Raw Data Records By ID Number

COMMENT SYNTAX FILE GUN CONTROL STUDY SPRING 2007 COMMENT DATA MANAGEMENT IF (gender = 1 & party = 1) genparty = 1. EXECUTE. IF (gender = 1 & party = 2) genparty = 2. EXECUTE. IF (gender = 2 & party = 1) genparty = 3. EXECUTE. IF (gender = 2 & party = 2) genparty = 4. EXECUTE. COMMENT ANALYSES UNIANOVA gunctrl BY gender party /METHOD = SSTYPE(3) /INTERCEPT = INCLUDE /PRINT = DESCRIPTIVE /CRITERIA = ALPHA(.05) /DESIGN = gender party gender*party. ONEWAY gunctrl BY genparty /CONTRAST= /STATISTICS DESCRIPTIVES /MISSING ANALYSIS /POSTHOC = TUKEY ALPHA(.05). Save Syntax File!!!

Research Project Notebook Purpose : All-in-one handy summary of research project Content: 1. Administrative (timeline, list of staff, etc.) 2. Overview of Research 3. Experiment Materials * Surveys * Consents, debriefings * Manipulations * Procedures summary/instructions 4. IRB materials * Application * Approval 5. Data * Coding forms * Syntax file * Primary outcomes

Correlation Class 20

Today's Class Covers What and why of measures of association Covariation Pearson's r correlation coefficient Partial Correlation Comparing two correlations Non-Parametric correlations

Do Variables Relate to One Another? Is teacher pay related to performance? Is exercise related to illness? Is CO 2 related to global warming? Is platoon cohesion related to PTSD? Is TV viewing related to shoe size? Positive Negative Positive Negative Zero

Exercise and Illness 1. How many times a week do you exercise? _____ 2. How many days have you missed school this term due to illness? _____ 3. How many hours of sleep do you get each night? ____

Interpreting Correlations [C] Sleep Hours [A] Exercise [B] Illness A --> B Exercise reduces illness B --> C Illness reduces exercise C --> (A & B) Third variable (sleep) affects exercise and illness simultaneously

Exercise and Illness Data (fabricated) subjectexerise.dayssleep.hourssick.days

Description of Data

Scatterplot: Exercise and Days Sick Regression Line

Co-variation exercise days sick days Subject Number # Days

Covariation Formula cov (x,y) = Σ ( X i – X ) ( Y i – Y ) N – 1 cov (exercise, sickness) = (-3.32) + (0.40) + (-0.46) …+ (-1.02) 14-1 = -23/13 = -1.77

Problem with Covariation "To all health and exercise researchers: Please send us your exercise and health covariations." Team 1: exercise = days per week exercise, covariation = Team 2: exercise = hours per week exercise, covariation = What if we all we have are the covariations? How do we compare them? How would we know, in this case, whether Team 1 showed a larger, smaller, or equal covariation than did Team 2?

Pearson Correlation Coefficient r = cov xy sxsysxsy r = Σ ( X i – X ) ( Y i – Y ) (N – 1) sxsysxsy Pearson r (“rho”): to

Using R 2 to Interpret Correlation R 2 = r 2 = amount of variance shared between correlated variables. Correl: exercise.hours, sick.days =.613 R 2 = =.376 “About 38% of variability in sick days is explained by variability in exercise hours.”

Variation in Sick Days Explained by Exercise Hours Exercise hours =.376% Number of Sick Days Last Term R 2 = =.376

Partial Correlation Issue: How much does Variable 1 explain Variable 2, AFTER accounting for the influence of Variable 3? Sickness and Exercise Study: How much does exercise explain days sick, AFTER accounting for the influence of nightly hours of sleep? Partial Correlation answers this question.

Partial Correlation Sick Days Exercise Days Sleep Hours var. explained =.376 var. explained =.277 var. explained by exercise alone (.04) var. explained by sleep alone (.17) var. explained by exercise + sleep (.21)

Partial Correlations in SPSS PARTIAL CORR /VARIABLES= sleep.hours sick.days by exercise.days /SIGNIFICANCE=TWOTAIL /MISSING=LISTWISE. PARTIAL CORR /VARIABLES= sleep.hours exercise.days by sick.days /SIGNIFICANCE=TWOTAIL /MISSING=LISTWISE.

Non-Parametric Correlations Assumptions of Correlations 1. Normally distributed data 2. Homogeneity of variance 3. Interval data (at least) What if Assumptions Not Met? Spearman's rho: Data are ordinal. Kendall's tau: Data are ordinal, but small sample, and many scores have the same ranking

Parametric Correlations Assumptions of Correlations 1. Normally distributed data 2. Homogeneity of variance 3. Interval data (at least) Var. AVar. B Watch TV 1 hr 2 hr 3 hr 4 hr 5 hr Eat Fast Food 1 day 2 day 3 day 4 day 5 day

Non-Parametric Correlations Var. AVar. B Watch TV Never Daily Weekly Monthly Yearly Eat Fast Food Never Daily Weekends Holidays Leap Years What if Assumptions Not Met? Spearman's rho: Data are ordinal. Kendall's tau: Data are ordinal, but small sample, and many scores have the same ranking.

Comparing Correlations Issue: How do we know if one correlation is different from another? Example: Is the nightly-sleep / sick days correl. different from the TV hours /sick days correl?

Difference Between Correlations Link to calculator for two ind. samples correlations Diff. Between 2 Independent correlations Diff. Between 2 dependent = correlations tdifference = (r xy - r zy ) √ (n-3) (1 + r xz ) 2 (1-r 2 xy -r 2 xz - r 2 zy + 2r xy r xz r zy ) z = zr 1 - zr 2 1 n n 2 - 3

Note: Assumes independent samples