Repeated measures ANOVA in SPSS Cross tabulations Survival analysis.

Slides:



Advertisements
Similar presentations
A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Advertisements

CHI-SQUARE(X2) DISTRIBUTION
Introduction to Survival Analysis October 19, 2004 Brian F. Gage, MD, MSc with thanks to Bing Ho, MD, MPH Division of General Medical Sciences.
The analysis of cross-tabulations. Today Chi-square test Fisher’s exact test Odds and odds ratios McNemars test.
Statistics for Linguistics Students Michaelmas 2004 Week 6 Bettina Braun
Lecture 3 Survival analysis. Problem Do patients survive longer after treatment A than after treatment B? Possible solutions: –ANOVA on mean survival.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Ch. 28 Chi-square test Used when the data are frequencies (counts) or proportions for 2 or more groups. Example 1.
Final Review Session.
Lecture 2 The analysis of cross-tabulations. Cross-tabulations Tables of countable entities or frequencies Made to analyze the association, relationship,
Chapter 2 Simple Comparative Experiments
Chapter 26: Comparing Counts. To analyze categorical data, we construct two-way tables and examine the counts of percents of the explanatory and response.
Analysis of Variance & Multivariate Analysis of Variance
PSY 307 – Statistics for the Behavioral Sciences Chapter 19 – Chi-Square Test for Qualitative Data Chapter 21 – Deciding Which Test to Use.
5-3 Inference on the Means of Two Populations, Variances Unknown
Review for Exam 2 Some important themes from Chapters 6-9 Chap. 6. Significance Tests Chap. 7: Comparing Two Groups Chap. 8: Contingency Tables (Categorical.
Survival analysis. First example of the day Small cell lungcanser Meadian survival time: 8-10 months 2-year survival is 10% New treatment showed median.
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
The Chi-Square Test Used when both outcome and exposure variables are binary (dichotomous) or even multichotomous Allows the researcher to calculate a.
AS 737 Categorical Data Analysis For Multivariate
Analysis of Categorical Data
CHP400: Community Health Program - lI Research Methodology. Data analysis Hypothesis testing Statistical Inference test t-test and 22 Test of Significance.
Statistics for clinical research An introductory course.
Amsterdam Rehabilitation Research Center | Reade Testing significance - categorical data Martin van der Esch, PhD.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
HSRP 734: Advanced Statistical Methods July 10, 2008.
Lecture 3 Survival analysis.
Experimental Design STAT E-150 Statistical Methods.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 13: Nominal Variables: The Chi-Square and Binomial Distributions.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
SIMPLE TWO GROUP TESTS Prof Peter T Donnan Prof Peter T Donnan.
TAUCHI – Tampere Unit for Computer-Human Interaction ERIT 2015: Data analysis and interpretation (1 & 2) Hanna Venesvirta Tampere Unit for Computer-Human.
X Treatment population Control population 0 Examples: Drug vs. Placebo, Drugs vs. Surgery, New Tx vs. Standard Tx  Let X =  cholesterol level (mg/dL);
B AD 6243: Applied Univariate Statistics Repeated Measures ANOVA Professor Laku Chidambaram Price College of Business University of Oklahoma.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Correlation Patterns.
Repeated Measures  The term repeated measures refers to data sets with multiple measurements of a response variable on the same experimental unit or subject.
11 Comparison of Several Multivariate Means Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute.
Chapter 12 A Primer for Inferential Statistics What Does Statistically Significant Mean? It’s the probability that an observed difference or association.
Linear correlation and linear regression + summary of tests
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Contingency tables Brian Healy, PhD. Types of analysis-independent samples OutcomeExplanatoryAnalysis ContinuousDichotomous t-test, Wilcoxon test ContinuousCategorical.
The binomial applied: absolute and relative risks, chi-square.
Confidence intervals and hypothesis testing Petter Mostad
Statistical Inference for more than two groups Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Repeated-measures ANOVA When subjects are tested on two or more occasions “Repeated measures” or “Within subjects” design Repeated Measures ANOVA But,
Nonparametric Tests: Chi Square   Lesson 16. Parametric vs. Nonparametric Tests n Parametric hypothesis test about population parameter (  or  2.
Essential Statistics Chapter 161 Review Part III_A_Chi Z-procedure Vs t-procedure.
Chapter 11 Chi- Square Test for Homogeneity Target Goal: I can use a chi-square test to compare 3 or more proportions. I can use a chi-square test for.
1 ANALYSIS OF VARIANCE (ANOVA) Heibatollah Baghi, and Mastee Badii.
Non-parametric Tests e.g., Chi-Square. When to use various statistics n Parametric n Interval or ratio data n Name parametric tests we covered Tuesday.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
WINKS 7 Tutorial 3 Analyzing Summary Data (Using Student’s t-test) Permission granted for use for instruction and for personal use. ©
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 11 Analyzing the Association Between Categorical Variables Section 11.2 Testing Categorical.
Week 13a Making Inferences, Part III t and chi-square tests.
Review of Probability Concepts Prepared by Vera Tabakova, East Carolina University.
Analysis of Variance STAT E-150 Statistical Methods.
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
 Naïve Bayes  Data import – Delimited, Fixed, SAS, SPSS, OBDC  Variable creation & transformation  Recode variables  Factor variables  Missing.
Hypothesis Testing Review
Chapter 2 Simple Comparative Experiments
Statistical Inference for more than two groups
Comparing k Populations
Chapter 10 Analyzing the Association Between Categorical Variables
Analyzing the Association Between Categorical Variables
Inference for Two Way Tables
Where are we?.
Analysis of two-way tables
Presentation transcript:

Repeated measures ANOVA in SPSS Cross tabulations Survival analysis

Repeated measures ANOVA in SPSS 2

The data from Chen 2004 The number of errors are recorded for the same test repeated 4 times The degree of anxiety was different between the subjects. Stuff we would like to know: Is there a difference in the performance between the anxiety groups? Is there a reduction in errors over time – a learning effect? Does one of the groups learn faster than the other? 3

What to do? A mediocre suggestion and a really bad one Is there a difference in the performance between the anxiety groups? We could calculate the mean (or sum, time-to-max, min, max, etc.) for each subject and do a t-test between the anxiety groups Is there a reduction in errors over time – a learning effect? We could make 6 paired t-tests to test the difference between Does one of the groups learn faster than the other? ??? 4

Repeated measures ANOVA Time is a within subject factor (4 levels) Anxiety is a between subjects factor (2 levels) 5

Repeated measures ANOVA Plotting the data 6

Repeated measures ANOVA – between subjects effects 7

Repeated measures ANOVA – within subjects effects Is there a time effect? Univariate or Multivariate? Univariate assumes sphericity of the covariance matrix: 8 Residual SSCP Matrix Trial1Trial2Trail3Trial4 Trial14,5672,4171,3831,017 Trial22,4176,2835,1335,167 Trail31,3835,1336,4176,083 Trial41,0175,1676,0837,617 Based on Type III Sum of Squares

Repeated measures ANOVA – within subjects effects Univariate Mauchly's Test of Sphericity b Measure:MEASURE_1 Within Subjects Effect Mauchly's W Approx. Chi- SquaredfSig. dimension1 time,3379,4965,093 Tests the null hypothesis that the error covariance matrix of the orthonormalized transformed dependent variables is proportional to an identity matrix. b. Design: Intercept + Anxiety Within Subjects Design: time 9 Residual SSCP Matrix Trial1Trial2Trail3Trial4 Trial14,5672,4171,3831,017 Trial22,4176,2835,1335,167 Trail31,3835,1336,4176,083 Trial41,0175,1676,0837,617 Based on Type III Sum of Squares

Repeated measures ANOVA – within subjects effects Univariate 10 Greenhouse-Geisser is newer than Huynh-Feldt methods Lower-bound is the most konservative Alternatively use Multible ANOVA

Repeated measures ANOVA – within subjects effects Multivariate 11 The covariance matrices must be equal for all levels of the between groups Tested by Box’s test

Repeated measures ANOVA – within subjects effects Multivariate 12 If the 4 methods disagree use Wilk’s Lambda Multivariate and Univariate renders the same conclusion in this case Which method to choose?

Repeated measures ANOVA – within subjects effects Which method to choose? 13

Repeated measures ANOVA – within subjects effects Pairwise comparisons 14 Does all these comparisons make sense?

Repeated measures ANOVA – within subjects effects Contrasts? 15

Cross tabulations 16

Delivery and housing tenure Housing tenurePretermTerm Owner-occupier50849 Council tentant29229 Private tentant11164 Lives with parents666 Other336

Cross-tabulations Tables of countable entities or frequencies Made to analyze the association, relationship, or connection between two variables This association is difficult to describe statistically Null- Hypothesis: “There is no association between the two variables” can be tested Analysis of cross-tabulations with larges samples

Delivery and housing tenure Housing tenurePretermTermTotal Owner-occupier Council tentant Private tentant Lives with parents66672 Other33639 Total

Delivery and housing tenure Expected number without any association between delivery and housing tenure Housing tenurePreTermTotal Owner-occupier899 Council tenant258 Private tenant175 Lives with parents72 Other39 Total

Delivery and housing tenure If the null-hypothesis is true 899/1443 = 62.3% are house owners. 62.3% of the Pre-terms should be house owners: 99*899/1443 = 61.7 Housing tenurePreTermTotal Owner-occupier899 Council tenant258 Private tenant175 Lives with parents72 Other39 Total

Delivery and housing tenure If the null-hypothesis is true 899/1443 = 62.3% are house owners. 62.3% of the ‘Term’s should be house owners: 1344*899/1443 = Housing tenurePreTermTotal Owner-occupier Council tenant258 Private tenant175 Lives with parents72 Other39 Total

Delivery and housing tenure If the null-hypothesis is true 258/1443 = 17.9% are council tenant. 17.9% of the ‘preterm’s should be council tenant: 99*258/1443 = 17.7 Housing tenurePreTermTotal Owner-occupier Council tenant258 Private tenant175 Lives with parents72 Other39 Total

Delivery and housing tenure If the null-hypothesis is true In general Housing tenurePreTermTotal Owner-occupier Council tenant Private tenant Lives with parents Other Total row total * column total grand total

Delivery and housing tenure If the null-hypothesis is true Housing tenurePreTermTotal Owner-occupier50(61.7)849(837.3)899 Council tenant29(17.7)229(240.3)258 Private tenant11(12.0)164(163.0)175 Lives with parents6(4.9)66(67.1)72 Other3(2.7)36(36.3)39 Total

Delivery and housing tenure test for association If the numbers are large this will be chi-square distributed. The degree of freedom is (r-1)(c-1) = 4 From Table 13.3 there is a 1 - 5% probability that delivery and housing tenure is not associated

Chi Squared Table

Delivery and housing tenure If the null-hypothesis is true It is difficult to say anything about the nature of the association. Housing tenurePreTermTotal Owner-occupier50(61.7)849(837.3)899 Council tenant29(17.7)229(240.3)258 Private tenant11(12.0)164(163.0)175 Lives with parents6(4.9)66(67.1)72 Other3(2.7)36(36.3)39 Total

Chi-squared test for small samples Expected valued > 80% >5 All >1 StreptomycinControlTotal Improvement13 (8.4)5 (9.6)18 Deterioration2 (4.2)7 (4.8)9 Death0 (2.3)5 (2.7)5 Total151732

Chi-squared test for small samples Expected valued > 80% >5 All >1 StreptomycinControlTotal Improvement13 (8.4)5 (9.6)18 Deterioration and death 2 (6.6)12 (7.4)14 Total151732

Fisher’s exact test An example SDT A314 B SDT A404 B SDT A134 B SDT A224 B

Fisher’s exact test Survivors: a, b, c, d, e Deaths: f, g, h Table 1 can be made in 5 ways Table 2: 30 Table 3: 30 Table 4: 5 70 ways in total The properties of finding table 2 or a more extreme is: SDT A314 B SDT A404 B SDT A134 B SDT A224 B

Fisher’s exact test SDT A f 11 f 12 r1r1 B f 21 f 22 r2r2 c1c1 c2c2 n SDT A314 B SDT A f 11 f 12 r1r1 B f 21 f 22 r2r2 c1c1 c2c2 n SDT A404 B

Odds and odds ratios Odds, p is the probability of an event Log odds / logit

Odds The probability of coughs in kids with history of bronchitis. p = 26/273 = o = 26/247 = The probability of coughs in kids with history without bronchitis. p = 44/1046 = o = 44/1002 = BronchitisNo bronchitisTotal Cough26 (a)44 (b)70 No Cough247 (c)1002 (d)1249 Total

Odds ratio The odds ratio; the ratio of odds for experiencing coughs in kids with and kids without a history of bronchitis. BronchitisNo bronchitisTotal Cough26; (a)44; (b)70 No Cough247; 9.50 (c)1002; 22.8 (d)1249 Total

Is the odds ratio different form 1? BronchitisNo bronchitisTotal Cough26 (a)44 (b)70 No Cough247 (c)1002 (d)1249 Total We could take ln to the odds ratio. Is ln(or) different from zero? 95% confidence (assumuing normailty)

Confidence interval of the Odds ratio ln (or) ± 1.96*SE(ln(or)) = 0.37 to 1.38 Returning to the odds ratio itself: e to e = 1.45 to 3.97 The interval does not contain 1, indicating a statistically significant difference BronchitisNo bronchitisTotal Cough26 (a)44 (b)70 No Cough247 (c)1002 (d)1249 Total

McNemar’s test Colds at 14 Total YesNo Colds at 12 Yes No Total

Survival analysis 40

First example of the day

Problem Do patients survive longer after treatment 1 than after treatment 2? Possible solutions: ANOVA on mean survival time? ANOVA on median survival time?

Progressively censored observations Current life table Completed dataset Cohort life table Analysis “on the fly”

Actuarial / life table anelysis Treatment for lung cancer

Actuarial / life table anelysis A sub-set of 13 patients undergoing the same treatment

Kaplan-Meier Simple example with only 2 ”terminal-events”.

Confidence interval of the Kaplan-Meier method Fx after 32 months

Confidence interval of the Kaplan-Meier method Survival plot for all data on treatment 1 Are there differences between the treatments?

Comparing Two Survival Curves One could use the confidence intervals… But what if the confidence intervals are not overlapping only at some points? Logrank-stats

Comparing Two Survival Curves The logrank statistics Aka Mantel-logrank statistics Aka Cox-Mantel-logrank statistics

Comparing Two Survival Curves Five steps to the logrank statistics table 1.Divide the data into intervals (eg. 10 months) 2.Count the number of patients at risk in the groups and in total 3.Count the number of terminal events in the groups and in total 4.Calculate the expected numbers of terminal events e.g. (31-40) 44 in grp1 and 46 in grp2, 4 terminal events. expected terminal events 4x(44/90) and 4x(46/90) 5.Calculate the total

Comparing Two Survival Curves Smells like Chi-Square statistics

Comparing Two Survival Curves