Marginal and Conditional distributions
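Theorem: (Marginal distributions for the Multivariate Normal distribution) Let $\mathbf{x} = \begin{bmatrix} \mathbf{x}_1 \\ \mathbf{x}_2 \end{bmatrix}$, where $\mathbf{x}_1$ is $q \times 1$ and $\mathbf{x}_2$ is $(p - q) \times 1$, have a p-variate Normal distribution with mean vector $\boldsymbol{\mu} = \begin{bmatrix} \boldsymbol{\mu}_1 \\ \boldsymbol{\mu}_2 \end{bmatrix}$ and covariance matrix $\Sigma = \begin{bmatrix} \Sigma_{11} & \Sigma_{12} \\ \Sigma_{21} & \Sigma_{22} \end{bmatrix}$. Then the marginal distribution of $\mathbf{x}_i$ is $q_i$-variate Normal ($q_1 = q$, $q_2 = p - q$) with mean vector $\boldsymbol{\mu}_i$ and covariance matrix $\Sigma_{ii}$.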
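Theorem: (Conditional distributions for the Multivariate Normal distribution) Let $\mathbf{x} = \begin{bmatrix} \mathbf{x}_1 \\ \mathbf{x}_2 \end{bmatrix}$ have a p-variate Normal distribution with mean vector $\boldsymbol{\mu} = \begin{bmatrix} \boldsymbol{\mu}_1 \\ \boldsymbol{\mu}_2 \end{bmatrix}$ and covariance matrix $\Sigma = \begin{bmatrix} \Sigma_{11} & \Sigma_{12} \\ \Sigma_{21} & \Sigma_{22} \end{bmatrix}$. Then the conditional distribution of $\mathbf{x}_i$ given $\mathbf{x}_j$ ($i \ne j$) is $q_i$-variate Normal with mean vector
$$\boldsymbol{\mu}_{i \cdot j} = \boldsymbol{\mu}_i + \Sigma_{ij}\Sigma_{jj}^{-1}(\mathbf{x}_j - \boldsymbol{\mu}_j)$$
and covariance matrix
$$\Sigma_{ii \cdot j} = \Sigma_{ii} - \Sigma_{ij}\Sigma_{jj}^{-1}\Sigma_{ji}.$$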
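$\Sigma_{22 \cdot 1} = \Sigma_{22} - \Sigma_{21}\Sigma_{11}^{-1}\Sigma_{12}$ is called the matrix of partial variances and covariances.
Its $(i, j)$th element, $\sigma_{ij \cdot 1, \ldots, q}$, is called the partial covariance (variance if $i = j$) between $x_i$ and $x_j$ given $x_1, \ldots, x_q$.
$\rho_{ij \cdot 1, \ldots, q} = \dfrac{\sigma_{ij \cdot 1, \ldots, q}}{\sqrt{\sigma_{ii \cdot 1, \ldots, q}\,\sigma_{jj \cdot 1, \ldots, q}}}$ is called the partial correlation between $x_i$ and $x_j$ given $x_1, \ldots, x_q$.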
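$\Sigma_{21}\Sigma_{11}^{-1}$ is called the matrix of regression coefficients for predicting $x_{q+1}, x_{q+2}, \ldots, x_p$ from $x_1, \ldots, x_q$. The mean vector of $x_{q+1}, x_{q+2}, \ldots, x_p$ given $x_1, \ldots, x_q$ is:
$$\boldsymbol{\mu}_{2 \cdot 1} = \boldsymbol{\mu}_2 + \Sigma_{21}\Sigma_{11}^{-1}(\mathbf{x}_1 - \boldsymbol{\mu}_1)$$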
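The definitions above can be sketched numerically. The following is a minimal illustration, assuming NumPy is available; the function name `partial_quantities` and the 3 × 3 covariance matrix are made up for this sketch and do not come from the slides.

```python
import numpy as np

def partial_quantities(sigma, q):
    """Partition sigma after the first q variables and return
    (regression coefficients, partial covariance, partial correlation)."""
    sigma = np.asarray(sigma, dtype=float)
    s11 = sigma[:q, :q]
    s12 = sigma[:q, q:]
    s21 = sigma[q:, :q]
    s22 = sigma[q:, q:]
    # Regression coefficients B = Sigma_21 Sigma_11^{-1}.
    # Solve Sigma_11' B' = Sigma_21' instead of forming an explicit inverse.
    beta = np.linalg.solve(s11.T, s21.T).T
    # Partial covariance: Sigma_22 - Sigma_21 Sigma_11^{-1} Sigma_12
    partial_cov = s22 - beta @ s12
    # Partial correlation: scale by the partial standard deviations
    d = np.sqrt(np.diag(partial_cov))
    partial_corr = partial_cov / np.outer(d, d)
    return beta, partial_cov, partial_corr

# Hypothetical covariance matrix for (x1, x2, x3); illustration only
sigma = np.array([[4.0, 2.0, 2.0],
                  [2.0, 3.0, 1.0],
                  [2.0, 1.0, 2.0]])

beta, partial_cov, partial_corr = partial_quantities(sigma, q=1)
print(beta)          # regression coefficients of x2, x3 on x1
print(partial_cov)   # partial variances and covariances given x1
print(partial_corr)  # partial correlations given x1
```

In this made-up matrix, x2 and x3 have a positive covariance, yet their partial covariance given x1 works out to zero, which hints at how bivariate and partial correlations can differ.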
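Example: Suppose that $\mathbf{x} = [x_1, x_2, x_3, x_4]'$ is 4-variate normal with mean vector $\boldsymbol{\mu}$ and covariance matrix $\Sigma$.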
The marginal distribution of is bivariate normal with The marginal distribution of is trivariate normal with
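Find the conditional distribution of $\mathbf{x}_2 = [x_3, x_4]'$ given $\mathbf{x}_1 = [x_1, x_2]'$, using the blocks $\Sigma_{11}$, $\Sigma_{12}$, $\Sigma_{21}$ and $\Sigma_{22}$ of the covariance matrix and the blocks $\boldsymbol{\mu}_1$, $\boldsymbol{\mu}_2$ of the mean vector for this partition.
The matrix of regression coefficients for predicting $x_3, x_4$ from $x_1, x_2$ is $\Sigma_{21}\Sigma_{11}^{-1}$.
Thus the conditional distribution of $\mathbf{x}_2$ given $\mathbf{x}_1$ is bivariate Normal with mean vector $\boldsymbol{\mu}_2 + \Sigma_{21}\Sigma_{11}^{-1}(\mathbf{x}_1 - \boldsymbol{\mu}_1)$ and partial covariance matrix $\Sigma_{22 \cdot 1} = \Sigma_{22} - \Sigma_{21}\Sigma_{11}^{-1}\Sigma_{12}$.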
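The steps of this kind of example can be sketched in a few lines. The numerical values from the original example are not reproduced here; the mean vector, covariance matrix, observed values and the helper name `conditional_normal` below are all hypothetical and serve only to show the calculation (NumPy assumed).

```python
import numpy as np

def conditional_normal(mu, sigma, q, x1):
    """Conditional distribution of the last p - q components
    given that the first q components equal x1."""
    mu = np.asarray(mu, dtype=float)
    sigma = np.asarray(sigma, dtype=float)
    x1 = np.asarray(x1, dtype=float)
    mu1, mu2 = mu[:q], mu[q:]
    s11, s12 = sigma[:q, :q], sigma[:q, q:]
    s21, s22 = sigma[q:, :q], sigma[q:, q:]
    # Regression coefficients Sigma_21 Sigma_11^{-1}
    beta = np.linalg.solve(s11.T, s21.T).T
    # Conditional mean: mu_2 + Sigma_21 Sigma_11^{-1} (x1 - mu_1)
    cond_mean = mu2 + beta @ (x1 - mu1)
    # Partial covariance: Sigma_22 - Sigma_21 Sigma_11^{-1} Sigma_12
    cond_cov = s22 - beta @ s12
    return beta, cond_mean, cond_cov

# Hypothetical 4-variate normal parameters (not the values from the slides)
mu = np.array([1.0, 2.0, 3.0, 4.0])
sigma = np.array([[2.0, 0.0, 1.0, 0.0],
                  [0.0, 2.0, 0.0, 1.0],
                  [1.0, 0.0, 2.0, 0.0],
                  [0.0, 1.0, 0.0, 2.0]])

# Condition (x3, x4) on hypothetical observed values x1 = 3, x2 = 0
beta, cond_mean, cond_cov = conditional_normal(mu, sigma, q=2, x1=[3.0, 0.0])
print(beta)       # regression coefficients for x3, x4 on x1, x2
print(cond_mean)  # conditional mean vector
print(cond_cov)   # partial covariance matrix
```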
Using SPSS
Note: The use of another statistical package such as Minitab is similar to using SPSS.
The first step is to input the data. The data is usually contained in some type of file:
1. Text files
2. Excel files
3. Other types of files
After starting the SPSS program the following dialogue box appears:
If you select Opening an existing file and press OK, the following dialogue box appears:
Once you have selected the file and its type, the following dialogue box appears:
If the variable names are in the file, ask the program to read the names. If you do not specify the Range, the program will identify the Range.
Once you click OK, two windows will appear.
One window containing the output:
The other containing the data:
To perform any statistical analysis, select the Analyze menu:
To compute correlations, select Correlate then Bivariate.
To compute partial correlations, select Correlate then Partial.
For bivariate correlation the following dialogue appears:
The output for bivariate correlation:
For partial correlation the following dialogue appears:
The output for partial correlation:

- - -  P A R T I A L   C O R R E L A T I O N   C O E F F I C I E N T S

Controlling for..   AGE   HT   WT

           CHL        ALB        CA         UA
CHL      (   0)     ( 178)     ( 178)     ( 178)
         P= .       P= .082    P= .000    P= .002
ALB      ( 178)     (   0)     ( 178)     ( 178)
         P= .082    P= .       P= .000    P= .101
CA       ( 178)     ( 178)     (   0)     ( 178)
         P= .000    P= .000    P= .       P= .020
UA       ( 178)     ( 178)     ( 178)     (   0)
         P= .002    P= .101    P= .020    P= .

(Coefficient / (D.F.) / 2-tailed Significance)
" . " is printed if a coefficient cannot be computed
Compare these with the bivariate correlation:
[Tables: Partial Correlations and Bivariate Correlations, each for CHL, ALB, CA, UA]
In the last example the bivariate and partial correlations were roughly in agreement. This is not necessarily the case in all situations.
An Example: The following data was collected on the following three variables:
1. Age
2. Calcium Intake in diet (CAI)
3. Bone Mass density (BMI)
The data
Bivariate correlations
Partial correlations
Scatter plot CAI vs BMI (r = )
3D Plot Age, CAI and BMI
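How a bivariate and a partial correlation can disagree may be sketched with simulated data. The data set from this example is not reproduced here; the sketch below assumes NumPy and generates hypothetical Age, CAI and BMI values in which CAI and BMI each depend on Age but are otherwise unrelated. The helper `partial_corr_3` uses the standard first-order partial correlation formula.

```python
import numpy as np

def partial_corr_3(r, i, j, k):
    """First-order partial correlation between variables i and j given k,
    computed from a correlation matrix r."""
    num = r[i, j] - r[i, k] * r[j, k]
    den = np.sqrt((1.0 - r[i, k] ** 2) * (1.0 - r[j, k] ** 2))
    return num / den

# Simulated (hypothetical) data: both CAI and BMI decline with Age
rng = np.random.default_rng(0)
n = 5000
age = rng.normal(50.0, 10.0, n)
cai = 1000.0 - 8.0 * age + rng.normal(0.0, 60.0, n)
bmi = 1.5 - 0.01 * age + rng.normal(0.0, 0.08, n)

# Correlation matrix with variables ordered (CAI, BMI, Age)
r = np.corrcoef(np.vstack([cai, bmi, age]))

r_bivariate = r[0, 1]                    # CAI vs BMI, ignoring Age
r_partial = partial_corr_3(r, 0, 1, 2)   # CAI vs BMI, controlling for Age
print(r_bivariate, r_partial)
```

In this simulation the bivariate correlation between CAI and BMI is clearly positive while the partial correlation given Age is close to zero, because the association arises only through Age.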
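Transformations
Theorem Let $x_1, x_2, \ldots, x_n$ denote random variables with joint probability density function $f(x_1, x_2, \ldots, x_n)$. Let
$$u_1 = h_1(x_1, x_2, \ldots, x_n),\quad u_2 = h_2(x_1, x_2, \ldots, x_n),\quad \ldots,\quad u_n = h_n(x_1, x_2, \ldots, x_n)$$
define an invertible transformation from the x's to the u's.
Then the joint probability density function of $u_1, u_2, \ldots, u_n$ is given by:
$$g(u_1, \ldots, u_n) = f(x_1, \ldots, x_n)\,|J|$$
where each $x_i$ is expressed in terms of $u_1, \ldots, u_n$ through the inverse transformation, and
$$J = \frac{\partial(x_1, \ldots, x_n)}{\partial(u_1, \ldots, u_n)} = \det\left[\frac{\partial x_i}{\partial u_j}\right]$$
is the Jacobian of the transformation.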
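Example Suppose that $x_1, x_2$ are independent with density functions $f_1(x_1)$ and $f_2(x_2)$. Find the distribution of
$$u_1 = x_1 + x_2,\qquad u_2 = x_1 - x_2.$$
Solving for $x_1$ and $x_2$ we get the inverse transformation
$$x_1 = \frac{u_1 + u_2}{2},\qquad x_2 = \frac{u_1 - u_2}{2}.$$
The Jacobian of the transformation is
$$J = \det\begin{bmatrix} \frac{\partial x_1}{\partial u_1} & \frac{\partial x_1}{\partial u_2} \\ \frac{\partial x_2}{\partial u_1} & \frac{\partial x_2}{\partial u_2} \end{bmatrix} = \det\begin{bmatrix} \frac12 & \frac12 \\ \frac12 & -\frac12 \end{bmatrix} = -\frac14 - \frac14 = -\frac12.$$
The joint density of $x_1, x_2$ is $f(x_1, x_2) = f_1(x_1) f_2(x_2)$. Hence the joint density of $u_1$ and $u_2$ is:
$$g(u_1, u_2) = f_1\!\left(\frac{u_1 + u_2}{2}\right) f_2\!\left(\frac{u_1 - u_2}{2}\right) |J| = \frac12\, f_1\!\left(\frac{u_1 + u_2}{2}\right) f_2\!\left(\frac{u_1 - u_2}{2}\right).$$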
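As a quick numerical check of this result, one can take both densities to be standard normal (an assumption made only for this sketch; the example itself leaves the two densities general). In that case $u_1$ and $u_2$ are known to be independent N(0, 2), so the joint density obtained from the Jacobian formula should equal the product of two N(0, 2) densities. The function names below are hypothetical.

```python
import math

def std_normal_pdf(x):
    """Density of N(0, 1)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def normal_pdf(x, var):
    """Density of N(0, var)."""
    return math.exp(-x * x / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

def joint_density_u(u1, u2, f1=std_normal_pdf, f2=std_normal_pdf):
    """Joint density of u1 = x1 + x2, u2 = x1 - x2 via the Jacobian formula."""
    x1 = (u1 + u2) / 2.0   # inverse transformation
    x2 = (u1 - u2) / 2.0
    abs_jacobian = 0.5     # |J| = |-1/2|
    return f1(x1) * f2(x2) * abs_jacobian

# Compare at an arbitrary point
u1, u2 = 0.7, -1.3
lhs = joint_density_u(u1, u2)
rhs = normal_pdf(u1, 2.0) * normal_pdf(u2, 2.0)
print(lhs, rhs)
```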
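Theorem Let $x_1, x_2, \ldots, x_n$ denote random variables with joint probability density function $f(x_1, x_2, \ldots, x_n)$. Let
$$\begin{aligned} u_1 &= a_{11} x_1 + a_{12} x_2 + \cdots + a_{1n} x_n + c_1 \\ u_2 &= a_{21} x_1 + a_{22} x_2 + \cdots + a_{2n} x_n + c_2 \\ &\;\;\vdots \\ u_n &= a_{n1} x_1 + a_{n2} x_2 + \cdots + a_{nn} x_n + c_n \end{aligned}$$
define an invertible linear transformation from the x's to the u's, that is, $\mathbf{u} = A\mathbf{x} + \mathbf{c}$.
Then the joint probability density function of $u_1, u_2, \ldots, u_n$ is given by:
$$g(u_1, \ldots, u_n) = \frac{1}{|\det A|}\, f(x_1, \ldots, x_n),\qquad \mathbf{x} = A^{-1}(\mathbf{u} - \mathbf{c}),$$
where $A = [a_{ij}]$ is the $n \times n$ matrix of coefficients and $\mathbf{c} = [c_1, \ldots, c_n]'$.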
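Theorem Suppose that the random vector $\mathbf{x} = [x_1, x_2, \ldots, x_p]'$ has a p-variate normal distribution with mean vector $\boldsymbol{\mu}$ and covariance matrix $\Sigma$. Then $\mathbf{y} = A\mathbf{x} + \mathbf{c}$, with $A$ an invertible $p \times p$ matrix, has a p-variate normal distribution with mean vector $A\boldsymbol{\mu} + \mathbf{c}$ and covariance matrix $A\Sigma A'$.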
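Proof The density of $\mathbf{x}$ is
$$f(\mathbf{x}) = \frac{1}{(2\pi)^{p/2}|\Sigma|^{1/2}} \exp\left\{-\tfrac12 (\mathbf{x} - \boldsymbol{\mu})'\Sigma^{-1}(\mathbf{x} - \boldsymbol{\mu})\right\},$$
then the density of $\mathbf{y} = A\mathbf{x} + \mathbf{c}$ is
$$g(\mathbf{y}) = \frac{1}{|\det A|}\, f\!\left(A^{-1}(\mathbf{y} - \mathbf{c})\right) = \frac{1}{(2\pi)^{p/2}|A\Sigma A'|^{1/2}} \exp\left\{-\tfrac12 \left(A^{-1}(\mathbf{y} - \mathbf{c}) - \boldsymbol{\mu}\right)'\Sigma^{-1}\left(A^{-1}(\mathbf{y} - \mathbf{c}) - \boldsymbol{\mu}\right)\right\}$$
since $|A\Sigma A'|^{1/2} = |\det A|\,|\Sigma|^{1/2}$. Also
$$A^{-1}(\mathbf{y} - \mathbf{c}) - \boldsymbol{\mu} = A^{-1}\left(\mathbf{y} - (A\boldsymbol{\mu} + \mathbf{c})\right)$$
and $(A^{-1})'\Sigma^{-1}A^{-1} = (A\Sigma A')^{-1}$, hence
$$g(\mathbf{y}) = \frac{1}{(2\pi)^{p/2}|A\Sigma A'|^{1/2}} \exp\left\{-\tfrac12 \left(\mathbf{y} - (A\boldsymbol{\mu} + \mathbf{c})\right)'(A\Sigma A')^{-1}\left(\mathbf{y} - (A\boldsymbol{\mu} + \mathbf{c})\right)\right\},$$
which is the p-variate normal density with mean vector $A\boldsymbol{\mu} + \mathbf{c}$ and covariance matrix $A\Sigma A'$. QED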
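The change-of-variables argument for an invertible linear map $\mathbf{y} = A\mathbf{x} + \mathbf{c}$ of a normal vector can be checked numerically at a single point. The sketch below assumes NumPy; the parameter values and the helper `mvn_pdf` are hypothetical. It compares the density obtained through the Jacobian factor $1/|\det A|$ with the normal density that has mean $A\boldsymbol{\mu} + \mathbf{c}$ and covariance $A\Sigma A'$.

```python
import numpy as np

def mvn_pdf(x, mu, sigma):
    """Density of a p-variate normal distribution evaluated at x."""
    x = np.asarray(x, dtype=float)
    mu = np.asarray(mu, dtype=float)
    sigma = np.asarray(sigma, dtype=float)
    p = mu.size
    dev = x - mu
    quad = dev @ np.linalg.solve(sigma, dev)   # (x - mu)' Sigma^{-1} (x - mu)
    const = (2.0 * np.pi) ** (-p / 2.0) * np.linalg.det(sigma) ** (-0.5)
    return const * np.exp(-0.5 * quad)

# Hypothetical parameters, for illustration only
mu = np.array([1.0, -1.0])
sigma = np.array([[2.0, 0.5],
                  [0.5, 1.0]])
A = np.array([[1.0, 2.0],
              [0.0, 3.0]])      # invertible 2 x 2 matrix
c = np.array([0.5, -2.0])
y = np.array([-0.2, -4.0])      # point at which to compare densities

# Density of y through the transformation theorem
x = np.linalg.solve(A, y - c)   # x = A^{-1}(y - c)
via_jacobian = mvn_pdf(x, mu, sigma) / abs(np.linalg.det(A))

# Density of y as a normal with mean A mu + c and covariance A Sigma A'
direct = mvn_pdf(y, A @ mu + c, A @ sigma @ A.T)
print(via_jacobian, direct)
```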
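Theorem Suppose that the random vector $\mathbf{x}$ has a p-variate normal distribution with mean vector $\boldsymbol{\mu}$ and covariance matrix $\Sigma$. Let $A$ be a $q \times p$ matrix of rank $q \le p$. Then $A\mathbf{x}$ has a q-variate normal distribution with mean vector $A\boldsymbol{\mu}$ and covariance matrix $A\Sigma A'$.
Proof Let $B$ be a $(p - q) \times p$ matrix so that $C = \begin{bmatrix} A \\ B \end{bmatrix}$ is invertible. Then $C\mathbf{x} = \begin{bmatrix} A\mathbf{x} \\ B\mathbf{x} \end{bmatrix}$ is p-variate normal with mean vector $C\boldsymbol{\mu} = \begin{bmatrix} A\boldsymbol{\mu} \\ B\boldsymbol{\mu} \end{bmatrix}$ and covariance matrix $C\Sigma C' = \begin{bmatrix} A\Sigma A' & A\Sigma B' \\ B\Sigma A' & B\Sigma B' \end{bmatrix}$.
Thus the marginal distribution of $A\mathbf{x}$ is q-variate normal with mean vector $A\boldsymbol{\mu}$ and covariance matrix $A\Sigma A'$.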
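The construction used in this proof can be sketched numerically: complete a full-rank $q \times p$ matrix $A$ with extra rows $B$ so that the stacked matrix is invertible, then read off the leading block of the transformed mean and covariance. The sketch assumes NumPy; taking the rows of $B$ from the null space of $A$ (via the SVD) is just one convenient choice, and the function name and numbers are hypothetical.

```python
import numpy as np

def complete_to_invertible(A):
    """Return C = [A; B], where the rows of B span the null space of A.
    C is invertible when A has full row rank q."""
    q, p = A.shape
    _, _, vt = np.linalg.svd(A)   # vt is p x p; its last p - q rows span null(A)
    B = vt[q:]
    return np.vstack([A, B])

# Hypothetical 3-variate normal parameters and a 2 x 3 matrix of rank 2
mu = np.array([1.0, 2.0, 3.0])
sigma = np.array([[4.0, 2.0, 2.0],
                  [2.0, 3.0, 1.0],
                  [2.0, 1.0, 2.0]])
A = np.array([[1.0, 1.0, 0.0],
              [0.0, 1.0, -1.0]])
q = A.shape[0]

C = complete_to_invertible(A)
full_mean = C @ mu              # mean vector of Cx
full_cov = C @ sigma @ C.T      # covariance matrix of Cx

# The marginal of the first q components of Cx is the distribution of Ax
marginal_mean = full_mean[:q]
marginal_cov = full_cov[:q, :q]
print(marginal_mean)
print(marginal_cov)
```

The leading block agrees with $A\boldsymbol{\mu}$ and $A\Sigma A'$ whatever completion $B$ is chosen, which is why the choice of $B$ does not appear in the statement of the theorem.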