Measures of Association: Correlation Analysis Assistant Prof. Özgür Tosun.

Slides:



Advertisements
Similar presentations
CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
Advertisements

Chapter 16: Correlation.
Describing Relationships Using Correlation and Regression
Correlation Chapter 9.
Lecture 3: Chi-Sqaure, correlation and your dissertation proposal Non-parametric data: the Chi-Square test Statistical correlation and regression: parametric.
Cal State Northridge  320 Andrew Ainsworth PhD Correlation.
Chapter Seven The Correlation Coefficient. Copyright © Houghton Mifflin Company. All rights reserved.Chapter More Statistical Notation Correlational.
Describing Relationships: Scatterplots and Correlation
Chapter 7 Correlational Research Gay, Mills, and Airasian
Summary of Quantitative Analysis Neuman and Robson Ch. 11
8/10/2015Slide 1 The relationship between two quantitative variables is pictured with a scatterplot. The dependent variable is plotted on the vertical.
Regression and Correlation
Correlation vs. Causation What is the difference?.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
February  Study & Abstract StudyAbstract  Graphic presentation of data. Graphic presentation of data.  Statistical Analyses Statistical Analyses.
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
SHOWTIME! STATISTICAL TOOLS IN EVALUATION CORRELATION TECHNIQUE SIMPLE PREDICTION TESTS OF DIFFERENCE.
Correlation By Dr.Muthupandi,. Correlation Correlation is a statistical technique which can show whether and how strongly pairs of variables are related.
Bivariate Relationships Analyzing two variables at a time, usually the Independent & Dependent Variables Like one variable at a time, this can be done.
Exploratory Data Analysis: Two Variables
Is there a relationship between the lengths of body parts ?
Irkutsk State Medical University Department of Faculty Therapy Correlations Khamaeva A. A. Irkutsk, 2009.
1 Measuring Association The contents in this chapter are from Chapter 19 of the textbook. The crimjust.sav data will be used. cjsrate: RATE JOB DONE: CJ.
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
Statistics in Applied Science and Technology Chapter 13, Correlation and Regression Part I, Correlation (Measure of Association)
LECTURE 5 Correlation.
Experimental Research Methods in Language Learning Chapter 11 Correlational Analysis.
Hypothesis of Association: Correlation
1 Further Maths Chapter 4 Displaying and describing relationships between two variables.
The Correlational Research Strategy
Examining Relationships in Quantitative Research
By: Amani Albraikan.  Pearson r  Spearman rho  Linearity  Range restrictions  Outliers  Beware of spurious correlations….take care in interpretation.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 16 Data Analysis: Testing for Associations.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Correlation and Linear Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Correlation. Correlation Analysis Correlations tell us to the degree that two variables are similar or associated with each other. It is a measure of.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 12 Testing for Relationships Tests of linear relationships –Correlation 2 continuous.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
The basic task of most research = Bivariate Analysis
The Correlational Research Strategy Chapter 12. Correlational Research The goal of correlational research is to describe the relationship between variables.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Correlation They go together like salt and pepper… like oil and vinegar… like bread and butter… etc.
Chapter 16: Correlation. So far… We’ve focused on hypothesis testing Is the relationship we observe between x and y in our sample true generally (i.e.
PART 2 SPSS (the Statistical Package for the Social Sciences)
Chapter 15: Correlation. Correlations: Measuring and Describing Relationships A correlation is a statistical method used to measure and describe the relationship.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Nonparametric Statistics
Statistics Correlation and regression. 2 Introduction Some methods involve one variable is Treatment A as effective in relieving arthritic pain as Treatment.
Paul Fryers Deputy Director, EMPHO Technical Advisor, APHO Introduction to Correlation and Regression Contributors Shelley Bradley, EMPHO Mark Dancox,
GOAL: I CAN USE TECHNOLOGY TO COMPUTE AND INTERPRET THE CORRELATION COEFFICIENT OF A LINEAR FIT. (S-ID.8) Data Analysis Correlation Coefficient.
Interpretation of Common Statistical Tests Mary Burke, PhD, RN, CNE.
Dr.Rehab F.M. Gwada. Measures of Central Tendency the average or a typical, middle observed value of a variable in a data set. There are three commonly.
Principles of Biostatistics Chapter 17 Correlation 宇传华 网上免费统计资源(八)
Determining and Interpreting Associations between Variables Cross-Tabs Chi-Square Correlation.
Theme 5. Association 1. Introduction. 2. Bivariate tables and graphs.
T-test calculations.
Correlation analysis is undertaken to define the strength an direction of a linear relationship between two variables Two measurements are use to assess.
Theme 7 Correlation.
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Correlation and the Pearson r
15.1 The Role of Statistics in the Research Process
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Correlation & Regression
COMPARING VARIABLES OF ORDINAL OR DICHOTOMOUS SCALES: SPEARMAN RANK- ORDER, POINT-BISERIAL, AND BISERIAL CORRELATIONS.
Presentation transcript:

Measures of Association: Correlation Analysis Assistant Prof. Özgür Tosun

But First: Numbers for Prostate Cancer America In the US, many men choose to be screened for prostate-specific antigens (PSA) which can be an indicator of the disease. England In the UK, it's more common for men to get checked only after they start experiencing problems.

Question is: In the UK, man are diagnosed with prostate cancer later, and are less likely to survive for five years before dying Does that mean more men die of prostate cancer in the UK compared to US?

Additional Info Many men have "non-progressive" prostate cancer that will never kill them While screened American men in this situation are marked as having "survived" cancer, unscreened British men aren’t Five-year survival rates of prostate cancer are much higher in the US than in the UK (99% rather than 81%)

Harding Center's diagrams shows that the risk of death is the same whether men are screened for prostate cancer or not

What numbers tell? The numbers of deaths from prostate cancer every year per 100,000 men are almost the same (23 in the US, 24 in the UK) Likewise in 1999, there were reports about Britain's survival rate for colon cancer (at the time 35%) being half that of the US (60%), experts again ignored the fact that that the mortality rate was about the same

Former New York mayor Rudy Giuliani declared in 2007 that someone's chance of surviving prostate cancer in the US was twice that of someone using the "socialized medicine" of Britain's National Health Service, he was wrong.

Doctors understand the numbers??? Research shows just how confused doctors often are about survival and mortality rates In a survey of 412 doctors in the US it was found that 75% of physicians mistakenly believed that higher survival rates meant more lives were saved

BACK TO THEORY

A Scatterplot Showing the Existence of a Relationship Between the Two Variables

Correlation Coefficient A correlation coefficient is the descriptive statistic that summarizes and describes the important characteristics of a relationship in mathematical terms, a correlation coefficient provides a measure of the strength and direction of the relationship between two variables

Drawing Conclusions The term correlation is synonymous with relationship (association) However, the fact there is a relationship between two variables does not mean that changes in one variable cause the changes in the other variable

Drawing Conclusions For example, there is a relation between the number of alarms in a fire and the extent of the damage. However, the fire alarms themselves did not cause the damage, rather the fire did. Therefore, although a relationship may exist, other factors also may affect the variables under study. Icecream consumption versus drawning in the sea

Types of Relationships

Linear Relationships In a linear relationship, as the X scores increase, the Y scores tend to change in only one direction In a positive linear relationship, as the scores on the X variable increase, the scores on the Y variable also tend to increase In a negative linear relationship, as the scores on the X variable increase, the scores on the Y variable tend to decrease

A Scatterplot of a Positive Linear Relationship

A Scatterplot of a Negative Linear Relationship

Data and Scatter Plot Reflecting No Relationship

Nonlinear Relationships In a nonlinear, or curvilinear, relationship, as the X scores change, the Y scores do not tend to only increase or only decrease: At some point, the Y scores change their direction of change.

A Scatterplot of a Nonlinear Relationship

Strength of the Relationship

Correlation Coefficients The Pearson and Spearman correlation coefficients, which are denoted by r, provide a number that indicates both the strength and the direction of the relationship between the two values Correlation coefficients may range between -1 and +1. The closer to 1 (-1 or +1) the coefficient is, the stronger the relationship; the closer to 0 the coefficient is, the weaker the relationship.

r When r equals − 1, it indicates a perfect negative or inverse relationship when r equals 0, it indicates no relationship when r equals +1, it indicates a perfect positive relationship.

Strength The strength of a relationship is the extent to which one value of Y is consistently paired with one and only one value of X The absolute value of the correlation coefficient indicates the strength of the relationship The sign of the correlation coefficient indicates the direction of a linear relationship (either positive or negative)

r Strenght 0.90 to 1.00Perfect Correlation 0.70 to 0.89High Correlation 0.50 to 0.69Moderate Correlation 0.30 to 0.49Low Correlation 0.00 to 0.29No or Weak Correlation However, in order to mention about a statistically important correlation, p value must be evaluated. If p value of a correlation is lower than the α, then this correlation is statistically significant (no matter the value of r) STRENGHT OF THE RELATIONSHIP

Methods for Measures of Association  Methods to define the strenght and direction of the association (Correlation analysis)  Methods to define the functional structure of the association (Regression analysis)

Correlation Coefficients Pearson Correlation Spearman Correlation Phi Coefficient Contingency Coefficient Quantitative Quantitative/Qualitative Qualitative 2x2 Qualitative 2x3 etc Qualitative (paired) nxn Kappa (Chance corrected agreement)

Pearson Correlation (r) For two continuous variables, this analysis provides information about the strenght and the direction of the linear association -1<= r <=+1

Pearson Correlation (r) It is the parametric correlation analysis Distributions for both variables must be normal Number of subjects must be adequate (usually at least 10)

Strenght increases 0 +1 Strenght decreases

Height (cm)Age (Month) Duration of exercise Performance

Perfect Positive Correlation r=0.97 p<0.001

r=-0.96 p<0.001 Perfect Negative Correlation

Low Correlation

No Correlation r=0.027 p=0.887

Correlation Matrix

Spearman Rank Correlation (r s ) When one or both of the continuous variables are non-normal distributed When one or both variables are ordinal Nonparametric correlation analysis Alternative for Pearson Correlation

Example

In this specific example: Income is continous BMI is ordinal

Another Example with Ordinal Variables

Measures of Association for Crosstables

Phi ( φ) Coefficient  For 2 x 2 crosstabs designs  For dichotomous variables such as (good-bad), (male- female), (diseased- nondiseased)…  Similar to Pearson (r)  Phi coefficient is significant if the Chi Square is significant

Contingency Coefficient This is a nonparametric technique that can be used to measure the relationship between two nominal-level variables. The variables need not be dichotomous but may have two or more categories. Ranges from 0 to <1 depending on the number of rows and columns with 1 indicating a high relationship and 0 indicating no relationship

Kappa ( κ) Chance corrected agreement  In clinical trials and medical research, we often have a situation where two different measures/assessments are performed on the same sample, same patient, same image,… the agreement needs to be calculated as a summary statistics  is a statistical measure of inter-rater agreement or inter-annotator agreement for qualitative (categorical) items

 is a measure of agreement between two sources, which is measured on a binary scale (i.e. condition present/absent).  κ statistic can take values between 0 and 1. Poor agreement: κ < 0.20 Fair agreement: κ = 0.20 to 0.39 Moderate agreement: κ = 0.40 to 0.59 Good agreement: κ = 0.60 to 0.79 Very good agreement: κ =0.80 to 1.00 Kappa ( κ) Chance corrected agreement

Another Example for Inter-Rater Agreement Between Two Doctors