Multivariate Statistical Methods

Slides:



Advertisements
Similar presentations
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
Advertisements

A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Topic 6: Introduction to Hypothesis Testing
T-Tests.
t-Tests Overview of t-Tests How a t-Test Works How a t-Test Works Single-Sample t Single-Sample t Independent Samples t Independent Samples t Paired.
T-Tests.
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 9-1 Introduction to Statistics Chapter 10 Estimation and Hypothesis.
Lecture 9: One Way ANOVA Between Subjects
Chapter 11 Multiple Regression.
Independent Sample T-test Often used with experimental designs N subjects are randomly assigned to two groups (Control * Treatment). After treatment, the.
Chapter 2 Simple Comparative Experiments
Chapter 11: Inference for Distributions
Data Analysis Statistics. Inferential statistics.
Analysis of Variance & Multivariate Analysis of Variance
5-3 Inference on the Means of Two Populations, Variances Unknown
Inferential Statistics: SPSS
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
5-1 Introduction 5-2 Inference on the Means of Two Populations, Variances Known Assumptions.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Comparing Three or More Means 13.
Profile Analysis. Definition Let X 1, X 2, …, X p denote p jointly distributed variables under study Let  1,  2, …,  p denote the means of these variables.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
Correlation.
Two Sample Tests Nutan S. Mishra Department of Mathematics and Statistics University of South Alabama.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 10-1 Chapter 2c Two-Sample Tests.
11 Comparison of Several Multivariate Means Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Chapter 22: Comparing Two Proportions. Yet Another Standard Deviation (YASD) Standard deviation of the sampling distribution The variance of the sum or.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 9 Inferences Based on Two Samples Confidence Intervals and Tests of Hypotheses.
Section 12.2: Tests for Homogeneity and Independence in a Two-Way Table.
Paired Samples Lecture 39 Section 11.3 Tue, Nov 15, 2005.
Inference for the mean vector. Univariate Inference Let x 1, x 2, …, x n denote a sample of n from the normal distribution with mean  and variance 
AP Statistics. Chap 13-1 Chapter 13 Estimation and Hypothesis Testing for Two Population Parameters.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Université d’Ottawa / University of Ottawa 1999 Bio 8100s Multivariate biostatistics L5.1 Two sample comparisons l Univariate 2-sample comparisons l The.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Introduction For inference on the difference between the means of two populations, we need samples from both populations. The basic assumptions.
Differences Among Group Means: Multifactorial Analysis of Variance
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Math 4030 – 10b Inferences Concerning Variances: Hypothesis Testing
Estimation & Hypothesis Testing for Two Population Parameters
Virtual COMSATS Inferential Statistics Lecture-26
CHAPTER 10 Comparing Two Populations or Groups
Comparing Three or More Means
Inference for the mean vector
Chapter 2 Simple Comparative Experiments
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Chapter 8: Inference for Proportions
CHAPTER 12 ANALYSIS OF VARIANCE
Chapter 10 Two-Sample Tests and One-Way ANOVA.
Elementary Statistics
CHAPTER 10 Comparing Two Populations or Groups
Multivariate Statistical Methods
Multivariate Statistical Methods
CHAPTER 10 Comparing Two Populations or Groups
Hypothesis Testing: The Difference Between Two Population Means
Chapter 24 Comparing Two Means.
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Chapter 9 Estimation: Additional Topics
Inference Concepts 1-Sample Z-Tests.
Introductory Statistics
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

Multivariate Statistical Methods Tests of Hypotheses of Means by Jen-pei Liu, PhD Division of Biometry, Department of Agronomy, National Taiwan University and Division of Biostatistics and Bioinformatics National Health Research Institutes 2019/2/27 Copyright by Jen-pei Liu, PhD

Tests of Hypotheses of Means Introduction Review of Univariate One-sample t Test Multivariate One-sample T2 Test Review of Univariate Two-sample t Test Multivariate Two-sample T2 Test Profile Analysis Summary 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Introduction Wechsler Adult Intelligence Scale scores of 101 elderly men and women (60-64, years old, Morrison, 2005) Item Mean Variance Covariance verbal 55.24 210.54 126.99 Performance 34.97 119.68 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Introduction The mean scores for the population between 18 and 59 are 60 and 50 for verbal and performance items Univariate question: Is the mean score of verbal item for elderly population different from that of the population between 18 and 59? Multivariate question: Is the Wechsler Adult Intelligence Scale scores for elderly population different from that of the population between 18 and 59? 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Introduction Survivors and non-survivors for Bumpus’s female sparrows Survivors Nonsurvivors Variables mean variance Mean Variance Total length 157.38 11.05 158.43 15.07 Alar extent 241.00 17.50 241.57 32.55 Length of bead and head 31.43 0.53 31.48 0.73 Length of humerus 18.50 0.18 18.45 0.43 Length of keel of sterum 20.81 0.58 20.84 1.32 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Introduction Univariate questions: Does difference exist in each of the five morphological measurements between the survivors and non-survivors? Multivariate questions: Is there any difference in morphology between the survivors and nonsurvivors? 2019/2/27 Copyright by Jen-pei Liu, PhD

Review of Univariate One-sample t test Hypothesis of mean:H0: v = o vs. Ha: v  o 2019/2/27 Copyright by Jen-pei Liu, PhD

Review of Univariate One-sample t test Wechsler Adult Intelligence Scale (WAIS) verbal scores 2019/2/27 Copyright by Jen-pei Liu, PhD

Review of Univariate One-sample t test Univaraite t2 follow a F distribution with 1 and n-1 d.f. t and t2 are unitless Example of verbal scores t2 = (-4.76)2/(210.54/101) = (-4.76)(210.54/101)-1(-4.76) = 10.8695 > F0.01,1,100 = 6.76 2019/2/27 Copyright by Jen-pei Liu, PhD

Review of Univariate One-sample test Properties It is the uniformly most powerful unbiased test for mean H0: v = o vs. Ha: v  o under the normal assumption when the variance is unknown in sense the probability of rejecting H0 is less than or equal to  if v = o and is at least  if v  o 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test Multivariate Hypothesis of One-sample Means 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test Union-intersection principle a’X ~ N1(a’, a’a) Univariate hypothesis H0(a): a’ = a’0 vs. Ha(a): a’  a’0 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test Test Statistic Hotelling T2 Statistic: a generalization of univariate t statistic by union-intersection principle 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test Confidence Region and Simultaneous CIs 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test Wechsler Adult Intelligence Scale (WAIS) scores 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test Wechsler Adult Intelligence Scale verbal scores 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate One-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Review of Univariate Two-sample t Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Review of Univariate Two-sample t Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Review of Univariate Two-sample t Test Example: The total length of female sparrows between survivors and nonsuvivors Statistics Survivors Nonsurvivors Sample size 21 28 Mean 157.38 158.43 Variance 11.05 15.07 2019/2/27 Copyright by Jen-pei Liu, PhD

Review of Univariate Two-sample t Test Example: The total length of female sparrows between survivors and nonsuvivors 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test Multivariate Hypothesis of Two-sample Means 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test The Paired T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test The Paired T2 Test Take the difference in the corresponding vectors of measurements within each object between the two conditions Obtain the sample mean difference vector and sample covariance matrix Apply one-sample T2 test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test The Paired T2 Test – Example Dataset: Measurement of uraic acid (X1) and total cholesterol level (X2) before and after the treatment for six patients Before After Difference Pat. No. X11 x21 X12 X22 d1 d2 12.5 220 6.5 190 -6.0 -30 14.2 260 7.8 250 -6.4 -10 10.8 180 6.0 190 -4.8 10 13.4 200 7.2 220 -6.2 20 11.9 280 6.8 240 -5.1 -40 12.0 170 5.9 180 -6.1 -10 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Two-sample T2 Test 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis Forty-nine elderly men were classified into “senile factor present” (SFP, n1=12) and “no senile factor” (NSF, n2=37). The Wechsler Adult Intelligence Scale (WAIS) was administered to all subjects. The WAIS consists of four domains: information, similarity, arithmetic and picture completion. The sample means are given below (Morrison, 2005): Group NSF SFP Domain n1 = 37 n2 = 12 information 12.57 8.75 similarity 9.57 5.33 arithmetic 11.49 8.50 picture completion 7.97 4.75 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis The two-sample Hotelling T2 statistic is 22.13 > F0.05, 4,44 = 5.18 The 95% simultaneous CI Domain CI Information ( 0.12, 7.52) Similarity ( 0.18, 8.30) Arithmetic (-0.77, 6.75) Picture Completion ( 0.56, 5.88) 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis Assumptions: The same battery of psychological tests Measurements of continuous random variables The responses are commensurable or in comparable units 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis Three questions (in terms of priority): Are the population mean profiles similar, in the sense that the line segments of adjacent tests are parallel? (Test for parallelism: response-by-group interaction) If the two population profiles are indeed parallel, are they also at the same level? (equal group effects) Again assuming parallelism, are the population means of the tests different? 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis Parallelism: (response-by-group interaction) The slopes of the population profile segments are the same under each condition 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis If the null hypothesis of parallelism is not rejected at the  significance level, we can test the hypothesis of the same level 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis If the null hypothesis of parallelism is not rejected at the  significance level, we can test the hypothesis of the equal response 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis Example: WAIS Dataset Hypothesis of parallelism 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis Example: WAIS Dataset Hypothesis of the same level 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis Example: WAIS Dataset Hypothesis of the equal mean responses 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Profile Analysis Example: WAIS Dataset Hypothesis of the equal mean responses 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Assumptions Multivariate normality Equal covariance matrices Independent samples 2019/2/27 Copyright by Jen-pei Liu, PhD

Homogeneity of Covariance Matrix Hypothesis of Equal Covariance Matrices of m p-dimensional multivariate normal Distributions Ho: 1 = …= m 2019/2/27 Copyright by Jen-pei Liu, PhD

Homogeneity of Covariance Matrix 2019/2/27 Copyright by Jen-pei Liu, PhD

Homogeneity of Covariance Matrix 2019/2/27 Copyright by Jen-pei Liu, PhD

Homogeneity of Covariance Matrix Reaction Times 32 male and 32 female normal subjects reacted to visual stimuli preceded by warning intervals of different lengths in 0.5 and 0.15 seconds 2019/2/27 Copyright by Jen-pei Liu, PhD

Homogeneity of Covariance Matrix 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Normality Calculate squared Mahalanobis distance (MD2) of each observed vector of p variables from the sample mean vector Order the MD2 from the largest to the largest, MD2(1) < MD2(2) <…< MD2(n) For each ordered MD2, compute the (i-0.5)/n percentile where i is the ith order observed vector 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Normality The 2 values for the percentiles are obtained from the 2 with p d.f., which can be computed by CINV function in SAS Plot MD2 vs. 2 (similar to Q-Q plot) The plot should be linear Deviation from linearity indicates non-normality 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Normality Several tests available, see Seber (1984). However, methods are ad-hoc and are not implemented in most of statistical software Transformations Counts – square-root Proportions – logit Skewed and positive - logarithm 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Outliers 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Outliers 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Outliers Hotelling T2 is invariant under any full- rank linear transformation. The joint distribution of {T2i,i=1,…,n} is independent of parameter  and  Liu and Weng (1991, SIM) proposed a test procedure to identify multiple multivariate outliers 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Outliers Let T2(1) ,…, T2(n) be the ordered statistics of T21 ,…, T2n and H0(1) ,…, H0(n) be the corresponding sub-hypothesis based on T2(i). Let {W21 ,…, W2n} be a vector of n Hotelling T2 statistics computed from a sample of size n from a p-dimensional normal with mean 0 and covariance matrix Ip 2019/2/27 Copyright by Jen-pei Liu, PhD

Multivariate Outliers 2019/2/27 Copyright by Jen-pei Liu, PhD

Copyright by Jen-pei Liu, PhD Summary Hotelling T2 statistics One sample Two independent samples (unpaired) Paired samples Confidence regions Simultaneous confidence interval Profile analysis 2019/2/27 Copyright by Jen-pei Liu, PhD