CPSY 501: Advanced Statistics 11 Sep 2009 Instructor: Dr. Sean Ho TA: Marci Wagner cpsy501.seanho.com No food/drink in the computer lab, please! Please.

Slides:



Advertisements
Similar presentations
CPSY 501: Class 2 Outline Please log-in and download the lec-2 data set from the class web-site. Statistical significance Effect size Combining effect.
Advertisements

Lecture 3: Chi-Sqaure, correlation and your dissertation proposal Non-parametric data: the Chi-Square test Statistical correlation and regression: parametric.
Lecture 11 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
1 Practicals, Methodology & Statistics II Laura McAvinue School of Psychology Trinity College Dublin.
Lecture 9: One Way ANOVA Between Subjects
Linear Regression and Correlation Analysis
Stat 217 – Week 10. Outline Exam 2 Lab 7 Questions on Chi-square, ANOVA, Regression  HW 7  Lab 8 Notes for Thursday’s lab Notes for final exam Notes.
Introduction to Probability and Statistics Linear Regression and Correlation.
Today Concepts underlying inferential statistics
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Correlation and Regression Analysis
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Inferential Statistics
Practical statistics for Neuroscience miniprojects Steven Kiddle Slides & data :
Difference Two Groups 1. Content Experimental Research Methods: Prospective Randomization, Manipulation Control Research designs Validity Construct Internal.
Selecting the Correct Statistical Test
Chapter 4 Hypothesis Testing, Power, and Control: A Review of the Basics.
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Copyright © 2008 by Pearson Education, Inc. Upper Saddle River, New Jersey All rights reserved. John W. Creswell Educational Research: Planning,
Simple Covariation Focus is still on ‘Understanding the Variability” With Group Difference approaches, issue has been: Can group membership (based on ‘levels.
N318b Winter 2002 Nursing Statistics Specific statistical tests: Correlation Lecture 10.
CPSY 501: Advanced Statistics Instructor: Dr. Sean Ho Teaching Assistant: Jessica Nee Reminder: We are not allowed to have food or drink near the computers,
Chapter 15 Correlation and Regression
Choosing and using statistics to test ecological hypotheses
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
User Study Evaluation Human-Computer Interaction.
Experimental Research Methods in Language Learning Chapter 11 Correlational Analysis.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
1 rules of engagement no computer or no power → no lesson no SPSS → no lesson no homework done → no lesson GE 5 Tutorial 5.
Hypothesis testing Intermediate Food Security Analysis Training Rome, July 2010.
Intro: “BASIC” STATS CPSY 501 Advanced stats requires successful completion of a first course in psych stats (a grade of C+ or above) as a prerequisite.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Ch14: Linear Correlation and Regression 13 Mar 2012 Dr. Sean Ho busi275.seanho.com Please download: 09-Regression.xls 09-Regression.xls HW6 this week Projects.
C M Clarke-Hill1 Analysing Quantitative Data Forming the Hypothesis Inferential Methods - an overview Research Methods.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
Review Hints for Final. Descriptive Statistics: Describing a data set.
Mixed-Design ANOVA 5 Nov 2010 CPSY501 Dr. Sean Ho Trinity Western University Please download: treatment5.sav.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Experimental Design and Statistics. Scientific Method
Academic Research Academic Research Dr Kishor Bhanushali M
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
Lecture 10: Correlation and Regression Model.
The “Big 4” – Significance, Effect Size, Sample Size, and Power 18 Sep 2009 Dr. Sean Ho CPSY501 cpsy501.seanho.com.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Comparing Two Means Chapter 9. Experiments Simple experiments – One IV that’s categorical (two levels!) – One DV that’s interval/ratio/continuous – For.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Ch14: Correlation and Linear Regression 27 Oct 2011 BUSI275 Dr. Sean Ho HW6 due today Please download: 15-Trucks.xls 15-Trucks.xls.
1 Week 3 Association and correlation handout & additional course notes available at Trevor Thompson.
Power Analysis: Sample Size, Effect Size, Significance, and Power 16 Sep 2011 Dr. Sean Ho CPSY501 cpsy501.seanho.com Please download: SpiderRM.sav.
Chapter 13 Understanding research results: statistical inference.
Power Point Slides by Ronald J. Shope in collaboration with John W. Creswell Chapter 7 Analyzing and Interpreting Quantitative Data.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
BUSI 275: Business Statistics 8 Sep 2011 Instructor: Dr. Sean Ho TA: Anne Schober busi275.seanho.com No food/drink in the computer lab, please! Please.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Independent Samples ANOVA. Outline of Today’s Discussion 1.Independent Samples ANOVA: A Conceptual Introduction 2.The Equal Variance Assumption 3.Cumulative.
Mixed-Design ANOVA 13 Nov 2009 CPSY501 Dr. Sean Ho Trinity Western University Please download: treatment5.sav.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
PSY 325 AID Education Expert/psy325aid.com FOR MORE CLASSES VISIT
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
PS Research Methods I with Kimberly Maring Unit 9 – Experimental Research Chapter 6 of our text: Zechmeister, J. S., Zechmeister, E. B., & Shaughnessy,
Appendix I A Refresher on some Statistical Terms and Tests.
Some Terminology experiment vs. correlational study IV vs. DV descriptive vs. inferential statistics sample vs. population statistic vs. parameter H 0.
CPSY 501: Advanced Statistics
Dependent-Samples t-Test
Inferential statistics,
COMPARING VARIABLES OF ORDINAL OR DICHOTOMOUS SCALES: SPEARMAN RANK- ORDER, POINT-BISERIAL, AND BISERIAL CORRELATIONS.
Presentation transcript:

CPSY 501: Advanced Statistics 11 Sep 2009 Instructor: Dr. Sean Ho TA: Marci Wagner cpsy501.seanho.com No food/drink in the computer lab, please! Please pick-up: Syllabus HW 1 Project Handout Please fill out sign-in sheet

11 Sep 2009CPSY501: Intro2 Outline for today Welcome, devotional, introductions Administrative details: syllabus, schedule MyCourses, SPSS, textbook Stats review: Purpose, research questions Linear models Correlation, Spearman's ρ, χ 2, t-test Data Analysis Project, HW Assignment 1 Marci's SPSS tutorials

11 Sep 2009CPSY501: Intro3 Stats review: purpose What is the purpose of statistical analysis in counselling psychology research? → Research questions! Statistics allows us to (1) pose new questions and (2) answer them – decision-making tool Is an effect/relationship real? How strong? Possible limitations, assumptions: Danger of extreme reductionism Neutrality of observation, objectivity Looking at groups, not individuals

11 Sep 2009CPSY501: Intro4 Cycles in statistical analysis Formulate research question Data prep: input errors/typos, missing data, univariate outliers Explore variables: IV, DV, descriptives Model building: choose a model based on RQ Model testing: are assumptions met? If not, either clean data or change model May need to modify RQ! Run final model and present results

11 Sep 2009CPSY501: Intro5 Research question: example RQ: are men taller than women? Is this relationship real? How strong is it? What are the variables? IV/DV? Level of meas? IV: gender (dichot), DV: height (scale) Levels of measurement: categorical, ordinal, scale (interval, ratio) What type of test should we use? Independent samples: t-test Limitations/assumptions of this test?

11 Sep 2009CPSY501: Intro6 Model-building process Operationally define a phenomenon: variables Measure it (collect data) Build a model: verify data meet assumptions and input data into model Draw conclusions and/or predictions about the phenomenon in the “real world” population e.g., if child A holds 2 apples, B:6 apples, and C:1, how many apples is a child most likely to have? Individual vs. group

11 Sep 2009CPSY501: Intro7 Statistical model: example RQ: does self-esteem correlate with school performance? Measure: questionnaire and marks Choose model: correlation Assumptions! Measures, procedures, model Make conclusions: based on assumptions Objectivity, individual vs. group, Linearity is a big assumption!

11 Sep 2009CPSY501: Intro8 Linear modelling A linear model is a straight “line” that best fits the observed data Minimizes error (least-squares) of model Use analytic techniques to derive the equation of the linear model directly from the data, or Use optimization techniques to try to find the line that maximizes the goodness of fit between model and data: Test statistic = (variance due to model) / (due to error)

11 Sep 2009CPSY501: Intro9 Linear modelling: summary Statistics are used to build models of psychological phenomena out of observations gathered from specific samples of individuals The most common type of statistical model is linear – straight “line” (or plane, or hyperplane, …) that minimizes distance from model to data The adequacy of the model to explain the data can be calculated through test statistics If there is a poor fit, the model may need to be revised, or to consider additional confounding variables

11 Sep 2009CPSY501: Intro10 Linear modelling: limitations What if vars are not related in a linear way? Many common procedures (some ANOVA, some regression) depend strongly on linearity If linearity is violated, results are only very approximate Even non-parametric models are often approximations using group patterns Reifying models: “correct” the data to better fit the assumptions of the model! Examples of psychological phenomena vars that are related non-linearly?

11 Sep 2009CPSY501: Intro11 Linear Correlation A measure of the strength of the linear relationship between two variables Relies on measuring covariance between vars When one var deviates from mean, does the other var also deviate? Does it deviate in the same direction? Correlation is a value between -1 and +1 Close to +1: positive relationship Close to -1: negative relationship Close to 0: no relationship

11 Sep 2009CPSY501: Intro12 Measuring correl: Pearson's r The most common way to measure correlation is Pearson's product-moment correlation coefficient, named r: Requires parametric data Indep obs, scale level, normally distrib! Example: ExamAnxiety.sav Measured anxiety before exam, time spent reviewing before exam, and exam performance (% score)

11 Sep 2009CPSY501: Intro13 Pearson's correlation coeff Name of Correlation Statistic Each variable is perfectly correlated with itself! Significance Value (p)

Spearman's Rho ( ρ or r s ) Another way of calculating correlation Non-parametric: can be used when data violate parametricity assumptions No free lunch: loses information about data Spearman's works by first ranking the data, then applying Pearson's to those ranks Example (grades.sav): grade on a national math exam (GCSE) grade in a univ. stats course (STATS) coded by “letter” (A=1, B=2, C=3,...)

Spearman's Rho ( ρ or r s ): ex Sample Size The correlation is positive Name of Correlation Statistic

Chi-Square test ( χ 2 ) Evaluates whether there is a relationship between 2 categorical variables The Pearson chi-square statistic tests whether the 2 variables are independent If the significance is small enough (p< α, usually α =.05), we reject the null hypothesis that the two variables are independent (unrelated) i.e., we think that they are in some way related.

t-Tests: comparing two means Moving beyond correlational research… We often want to look at the effect of one variable on another by systematically changing some aspect of that variable That is, we want to manipulate one variable to observe its effect on another variable. t-tests are for comparing two means Two types of application of t-tests: Related/dependent measures Independent groups

Related/dependent t-tests A repeated measures experiment that has 2 conditions (levels of the IV) the same subjects participate in both conditions We expect that a person’s behaviour will be the same in both conditions external factors – age, gender, IQ, motivation,... – should be same in both conditions Experimental Manipulation: we do something different in Condition 1 than what we do in Condition 2 (so the only difference between conditions is the manipulation the experimenter made) e.g., Control vs. test

Independent samples t-tests We still have 2 conditions (levels of the IV), but different subjects participate in each condition. So, differences between the two group means can possibly reflect: The manipulation (i.e., systematic variation) Differences between characteristics of the people allotted to each group (i.e., unsystematic variation) Question: what is one way we can try to keep the ‘noise’ in an experiment to a minimum?

t-Tests t-tests work by identifying sources of systematic and unsystematic variation, and then comparing them. The comparison lets us see whether the experiment created considerably more variation than we would have got if we had just tested the participants w/o the experimental manipulation.

Example: dependent samples “Paired” samples t-test 12 ‘spider phobes’ exposed to a picture of a spider (picture), and on a separate occasion, a real live tarantula (real) Their anxiety was measured at each time (i.e., in each condition).

Paired samples t-test

Example: paired t-Tests Standard Deviation of the pairwise difference Standard error of the differences b/w subjects’ scores in each condition Degrees of Freedom (in a repeated measures design, it's N-1) SPSS uses df to calculate the exact probability that the value of the ‘t’ obtained could occur by chance The probability that ‘t’ occurred by chance is reflected here

Example: indep samples t-test Used in situations where there are 2 experimental conditions – and different participants are used in each condition Example: SpiderBG.sav 12 spider phobes exposed to a picture of a spider (picture); 12 different spider phobes exposed to a real-life tarantula Anxiety was measured in each condition

Summary Statistics for the 2 experimental conditions Parametric tests (e.g., t- tests) assume variances in the experimental conditions are ‘roughly’ equal. If Levene’s test is sig., the assumption of homogeneity of variance has been violated (N1 + N2) - 2 = 22 Significance (p-value): > α =.05, so there is no significant difference between the means of the 2 samples

Data Analysis Project Half of this course is your semester-long data analysis project: Find suitable existing data Propose a new statistical analysis of it Get approval by Research Ethics Board Go through “spiral” of statistical analysis Write it up in an APA-style manuscript Groups of up to 3 people Can also be done individually Let me know when you have your group

Project step 1: Finding data It must be existing data – you are not allowed to collect data for this course! (nor do you have time to) No simulated (made-up) data Minimum sample size: 50 Minimum of 3 variables (2 IV, 1 DV) Analysis: multiple regression or ANOVA Non-parametric alternatives w/permission Possible sources: your own data, faculty members, CPSY dept thesis data, publicly available / government data (WHO, NIH, etc.)

Dataset description: due 2Oct Written description of the dataset you will be using and the particular variables you consider Preliminary explorations of the data Descriptives, histograms, boxplots, etc. Upload annotated SPSS output *.spv APA manuscript style not needed, but please format it neatly in a document e.g., not a short or Facebook msg! Upload (1) write-up, and (2) SPSS output to myCourses

Project step 2: Proposal/meeting Written proposal of the particular analysis you plan to do on the dataset Old data, but new analysis State specific research questions Check sample size is sufficient (GPower3) Anticipate possible problems, plan Book an appointment with me (Neu 5) by 9Oct All team members there Send me your proposal >24hrs before Bring/ your dataset on USB stick

Project step 3: REB (due 16Oct) Approval by TWU Research Ethics Board is required before any new analysis may be done! Cursory exploration as in dataset description and proposal is okay You are not allowed to start your new analysis until you get REB approval Use the “Analysis of Existing Data” form You need written permission from the original owner of the data For CPSY theses, the faculty supervisor None needed for publicly available data

Project step 4: Manuscript The focus is to demonstrate your statistical knowledge, not to deal with the subject area in question It's okay if you don't find groundbreaking results for all of counselling psychology Methodology and statistics will be more detailed than a “real” research paper Full APA manuscript format is required! Include tables/figures Max length 15 pages + annotated SPSS output

HW Assignment 1: Stats review Four homework assignments over the semester will give you practice on the concepts in lecture HW assignment 1 (due 25Sep, in two weeks): Review of undergrad statistics Practice with SPSS Download from our website: “Assignments” Assignment: HW1-Review.doc SPSS Dataset: AttnDefDis-1.sav

Practice reading for next week For practice, try reading this journal article, focusing on their statistical methods: see how much you can understand Missirlian, et al., “Emotional Arousal, Client Perceptual Processing, and the Working Alliance in Experiential Psychotherapy for Depression”, Journal of Consulting and Clinical Psychology, Vol. 73, No. 5, pp. 861–871, Download from website, under today's lecture

For discussion next time: What research questions do the authors state that they are addressing? What analytical strategy was used, and how appropriate is it for addressing their questions? What were their main conclusions, and are these conclusions warranted from the actual results /statistics /analyses that were reported? What, if any, changes/additions need to be made to the methods to give a more complete picture of the phenomenon of interest (e.g., sampling, description of analysis process, effect sizes, dealing with multiple comparisons, etc.)?

SPSS tutorials by Marci Our TA, Marci, has graciously agreed to do tutorials on getting started with SPSS After class today, in Wong Centre  Just tag along with her Next Wed, see Marci for time