Presentation is loading. Please wait.

Presentation is loading. Please wait.

Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association.

Similar presentations


Presentation on theme: "Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association."— Presentation transcript:

1 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association

2 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter Overview In many circumstances, the purpose of research is to investigate relationships between variables rather than to compare group differences. Introduce the methods and statistical analysis of research into relationships or correlations between variables. The type of data dictates the appropriate statistical approach, and the statistical approach affects how results are interpreted. Recognize and understand the relationships between variables. There are limitations to the conclusions that can be appropriately drawn from statistical results. Probability estimates can be, and often are, provided in association with point estimates of association. The interpretation of p-values requires some elaboration.

3 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Purpose and Methods Knowledge of the association between variables can inform health care practice: –About the association between characteristics that individuals possess –About the association between external events and individual characteristics –The relationship between events and population values Several Considerations Related to the Research Methods Investigating Association Measurement error and data validity: The less precise a measurement is, the lower the association between that variable and any other variable, unless the direction and magnitude of error are similar for each. It is possible to have a strong relationship between two variables but the strength of the relationship is underestimated due to measurement imprecision.

4 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Measurement Error and Data Validity If a measurement is reliable and fairly precise, a strong association may be found. The interpretation of the association may be misleading, however. The investigator and research consumer must consider the following when interpreting and applying the results of research into the association between variables : –Timing and a complete description and careful consideration of the sample tested. –The more homogeneous a sample, the more likely the strength of the association between variables will be underestimated. –Consider whether the sample studied truly represents the range of the population of interest. –Generalization of results into clinical practice is a judgment of the clinician, but the investigator must provide sufficient description.

5 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Data Analysis and Regression Analysis Data Analysis When analyzing data to estimate the strength of an association between variables, first determine the type of data at hand. Regression Analysis The fundamental statistic for the analysis of interval and ratio data to estimate the strength of the association between variables. ANOVA is a special case of regression. Simple linear regression is the simplest form of regression.

6 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Regression Simple linear regression, or Pearson Product Moment Correlation (PPMC), designated as a Pearson r or simply an r value, is used to measure the strength of the association between pairs of data that are interval or ordinal. Correlation coefficients can be quickly calculated with commercial software, but the formula is: r = N∑ XY – (∑X)(∑Y) / √ N∑X 2 – (∑X) 2 √N∑Y 2 – (∑Y) 2 –where N = the number of pairs of data, X = the values of one variable, and Y = the values of the second. It is possible to estimate the association between multiple variables through multiple regression. This is conceptually similar to adding variables or levels of variables and performing ANOVA rather than a t-test. The use of statistical packages allows for more data and more complex designs to be managed in a rapid fashion.

7 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Regression (continued) When multiple regression is performed, the process yields an R 2 value. An R 2 value is the variance in the criterion variable (Y) explained by a predictor variable (X) or predictor variables. Multiple regression: R 2 values PPMC: r values The product of (r) 2 = R 2 or when PPMC is squared, the value is equal to what was obtained through multiple regression. The relationship between r and R 2 confirms that the PPMC is a special case of multiple regression. It is possible to explore the relationships between variables while controlling for the influence of other variables through the calculation of partial and semipartial correlations.

8 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Interpreting R and R 2 A p-value is an indication that the differences observed when studying a sample are reflective of true population differences. The p-values found in association with r and R 2 relate to hypothesis testing and are of little value when considering the meaningfulness of an association between variables. If a p-value is not informative, then the research consumer is most concerned with the R 2 value or the portion of variance in Y explained by the predictor variable(s). The usefulness of the R 2 value is revealed when one considers the interpretation of r values.

9 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Interpreting R and R 2 Investigators and statisticians have provided some guidelines for interpreting r values..00 to.25.00 to –.25Little, if any correlation.26 to.49–.26 to –.49 Low correlation.50 to.69–.50 to –.69Moderate correlation.70 to.89–.70 to –.89 High correlation.90 to 1.0–.90 to –1.0Very high correlation

10 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Measures of Association Measures of association do not imply cause and effect. Interpreting the strength and importance of measures of association and teasing out cause and effect relationships requires knowledge of the subject matter and careful critical appraisal rather than reference to a published scale, which can potentially mislead. Although measures of association cannot confirm cause and effect, these analyses can build useful predictive models. The values of the predictor variables in fact predict the value of the criterion value. The prediction model does not imply cause and effect but is rather a means of predicting an outcome in one measure, based on the values of predictor measures.

11 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Managing Ordinal and Nominal Data The analysis used most often for ordinal data is a Spearman rank order correlation. The result is designated as a Spearman r. The formula for a Spearman rank order correlation is as follows: Spearman r = 1 – ((6∑d 2 ) / n(n 2 – 1)) d = the difference in ranks between pairs of data, and n = the number of subjects. The analysis of the association between ordinal values is more complicated than those for ordinal, interval, or ratio data. The most common approach is a Cramers’ V correlation, which is based on Chi square (X 2 ) values. One application of an X 2 analysis is to test the hypothesis that two nominal variables are independent. If the null hypothesis that two variables are independent is rejected, a Cramer’s V can be calculated to measure the strength of the relationship.

12 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Association vs. Cause Cramer’s V analysis can identify a relationship, but it may have no suggestion of cause and effect. In some cases, two variables may appear highly associated due to correlation with a third variable. It requires more than measures of association to establish cause and effect. In some cases, experimental designs can test hypotheses of cause and effect derived from studies of association. In other cases, cause and effect is established through evidence. It is important to note that because Spearman r and Cramer’s V are nonparametric statistics, the square of these values is meaningless. The interpretation of these statistics requires knowledge of the literature on the question being addressed and the nature of the data in making informed decisions regarding studies that address the association between variables.

13 Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter Summary and Key Points Knowledge of the association between variables can inform health care practice. The less precise a measurement, the lower the association between that variable and any other variable. Validity of data may or may not affect the strength of the relationship between two measurements. Generalization of results into clinical practice is a judgment of the clinician and requires thoughtful decisions on the part of the investigator. Regression analysis is the fundamental statistic for the analysis of interval and ratio data to estimate the strength of the association between variables. P-values relate to statistical differences but offer little to understanding clinical meaningfulness. Because Spearman r and Cramer’s V are nonparametric statistics, the square of these values is meaningless.


Download ppt "Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association."

Similar presentations


Ads by Google