Anthony Greene1 Correlation The Association Between Variables
Anthony Greene2 When to Use t-test or ANOVA When the independent variable is Categorical
Anthony Greene3 When to Use Correlation & Regression When the independent variable is Ratio or Interval
Anthony Greene4 ScatterPlots
Anthony Greene5 ScatterPlots
Times and costs for five word- processing jobs
Anthony Greene7 Four data points
Anthony Greene8 Direct Relationship: Positive Slope
Anthony Greene9 Age and price data for a sample of 11 used cars
Anthony Greene10 Scatter diagram for the age and price data of used cars
Anthony Greene11 Inverse Relationship: Negative Slope
Anthony Greene12 Various degrees of linear correlation (Slide 1 of 3)
Anthony Greene13 Various degrees of linear correlation (Slide 3 of 3)
Anthony Greene14 Various degrees of linear correlation (Slide 2 of 3)
Anthony Greene15 Examples of positive and negative relationships
16 The Simple Idea If the corresponding x and y z-scores are always in agreement, r will be high. If they are sometimes in agreement r will be moderate If they are generally different, r will be near zero If they are in agreement, but in opposite directions, r will be negative
Anthony Greene17 The Basic Idea ZxZx ZyZy ZxZx ZyZy ZxZx ZyZy
Anthony Greene18 We define SS x, SS p and SS y by Notation Used in Regression and Correlation
Anthony Greene19 Obtaining the three sums of squares for the used car data using the computational formulas
Anthony Greene20 The linear correlation coefficient, r, of n data points is defined by or by the computational formula Linear Correlation Coefficient
Anthony Greene21 Linear Correlation Coefficient
Anthony Greene22 Coefficient of Determination The coefficient of determination, r 2, is the proportion of variation in the observed values of the response variable that is explained by the regression: The coefficient of the determination always lies between 0 and 1 and is a descriptive measure of of the utility of the regression equation for making predictions. Values of r 2 near 0 indicate that the regression equation is not useful for making predictions, whereas values near 1 indicate that the regression equation is extremely useful for making predictions.
Anthony Greene23 t-Distribution for a Correlation Test For samples of size n, the variable has the t-distribution with df = n – 2 if the null hypothesis ρ = 0 ρ or rho is pronounced “row”
Anthony Greene24 The t-test for correlation (Slide 1 of 3) With df = n-2 use table B.6
Anthony Greene25 The t-test for correlation (Slide 2 of 3)
Anthony Greene26 The t-test for correlation (Slide 3 of 3) Step 4 Compute the test statistic r. Table B.6 allows a direct lookup. Alternatively, r has a t distribution and Table B.2 will yield an identical conclusion Step 5 If the value of the test statistic falls in the rejection region, reject the null hypothesis. Step 6 State the conclusion in words
Anthony Greene27 Criterion for deciding whether or not to reject the null hypothesis
Anthony Greene28 Correlation Matrix abcd a1.00 b c d
Anthony Greene29 Computer printouts for correlations