Data analysis is largely a search for patterns – that is, for meaningful relations among various items observed - K. Godfrey.

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Correlation and regression Dr. Ghada Abo-Zaid
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
SIMPLE LINEAR REGRESSION
Linear Regression and Correlation Analysis
Chapter 13 Introduction to Linear Regression and Correlation Analysis
REGRESSION AND CORRELATION
SIMPLE LINEAR REGRESSION
Correlation and Regression Analysis
Correlation and Regression 1. Bivariate data When measurements on two characteristics are to be studied simultaneously because of their interdependence,
Correlation & Regression Math 137 Fresno State Burger.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Lecture 5 Correlation and Regression
Regression and Correlation Methods Judy Zhong Ph.D.
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Correlation Scatter Plots Correlation Coefficients Significance Test.
Correlation By Dr.Muthupandi,. Correlation Correlation is a statistical technique which can show whether and how strongly pairs of variables are related.
Correlation and regression 1: Correlation Coefficient
Simple Linear Regression
Data Collection & Processing Hand Grip Strength P textbook.
Simple Covariation Focus is still on ‘Understanding the Variability” With Group Difference approaches, issue has been: Can group membership (based on ‘levels.
Covariance and correlation
Correlation.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Regression and Correlation. Bivariate Analysis Can we say if there is a relationship between the number of hours spent in Facebook and the number of friends.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Statistics in Applied Science and Technology Chapter 13, Correlation and Regression Part I, Correlation (Measure of Association)
Correlation and Regression SCATTER DIAGRAM The simplest method to assess relationship between two quantitative variables is to draw a scatter diagram.
Introduction to Linear Regression
1 G Lect 8b G Lecture 8b Correlation: quantifying linear association between random variables Example: Okazaki’s inferences from a survey.
Design and Data Analysis in Psychology I Salvador Chacón Moscoso Susana Sanduvete Chaves School of Psychology Dpt. Experimental Psychology 1.
Correlation Correlation is used to measure strength of the relationship between two variables.
Examining Relationships in Quantitative Research
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Topic 10 - Linear Regression Least squares principle - pages 301 – – 309 Hypothesis tests/confidence intervals/prediction intervals for regression.
Regression MBA/510 Week 5. Objectives Describe the use of correlation in making business decisions Apply linear regression and correlation analysis. Interpret.
CORRELATION: Correlation analysis Correlation analysis is used to measure the strength of association (linear relationship) between two quantitative variables.
Chapter 16 Data Analysis: Testing for Associations.
Describing Relationships Using Correlations. 2 More Statistical Notation Correlational analysis requires scores from two variables. X stands for the scores.
CORRELATION. Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson’s coefficient of correlation.
Lecture 10: Correlation and Regression Model.
Examining Relationships in Quantitative Research
Correlation. Correlation Analysis Correlations tell us to the degree that two variables are similar or associated with each other. It is a measure of.
Simple linear regression Tron Anders Moger
Chapter 12: Correlation and Linear Regression 1.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Lecture 29 Dr. MUMTAZ AHMED MTH 161: Introduction To Statistics.
SIMPLE LINEAR REGRESSION AND CORRELLATION
…. a linear regression coefficient indicates the impact of each independent variable on the outcome in the context of (or “adjusting for”) all other variables.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
CORRELATION ANALYSIS.
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
Determining and Interpreting Associations between Variables Cross-Tabs Chi-Square Correlation.
Correlation & Linear Regression Using a TI-Nspire.
Correlation & Simple Linear Regression Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU 1.
Chapter 12: Correlation and Linear Regression 1.
Simple Linear Correlation
Correlation & Regression
Correlation – Regression
Chapter 5 STATISTICS (PART 4).
Basic Statistics Overview
Understanding Standards Event Higher Statistics Award
NURS 790: Methods for Research and Evidence Based Practice
CORRELATION ANALYSIS.
بسم الله الرحمن الرحيم. Correlation & Regression Dr. Moataza Mahmoud Abdel Wahab Lecturer of Biostatistics High Institute of Public Health University.
Correlation & Regression
Presentation transcript:

Data analysis is largely a search for patterns – that is, for meaningful relations among various items observed - K. Godfrey

Objectives of the study The starting point of any investigation must be to define clearly its objectives, since these will determine the appropriate study design and the type of data needed. Objectives might be categorized into one of the three main types as below. Estimation of certain features of a population for e.g., what percentage of children has received a full course of vaccination by their birthday? Investigation of the relationship or association between a factor of interest and a particular outcome For eg. (i) Is age related to blood pressure during adulthood? (ii) are respiratory infections more common among children whose parents smoke? Evaluation of a drug or therapy or of an intervention aimed at reducing the incidence (or severity) of disease fro e.g., Does the administration of BCG reduce the risk of tuberculosis?

Many biomedical studies are designed to explore relationship between two variables and specifically to determine whether these variables and specifically to determine whether these variables are independent or dependent. Examples: 1. Obesity and Blood pressure: Are obesity and Blood Pressure independent or Do overweight men tend to have high blood pressure?

Purposes of studying relationship between two continuous variables: To assess whether two variables are associated. i.e. if the values of one variable tend to be higher (or lower) for higher values of other variable To enable value of one variable to be predicted from any known value of other variable The method of assessing the association between two continuous variables is known as Correlation and of prediction of one continuous variable from another is by Linear Regression

What is the difference between association and correlation? Customarily association and correlation are roughly means the same, but in statistical sense Association is the term used for assessing relationship between categorical variable. Correlation is the term used for assessing relationship between continuous variables.

Correlation analysis describes linear association between two continuous variables that vary over their respective range of values (i.e. when two variables are considered to be related when a change in one is likely to be accompanied by a change in other). Simple way to depict correlation is by means of graphical representation called “scatter plot”.

More linear and diagonal pattern stronger the relationship BPBP BW o o oo o o o o o

Example Positive correlation: Gestation age (GA) against birth weight of baby (BW) Negative correlation: in relation to prognosis of myocardial infarction (MI) relationship between left ventricular ejection fraction and QRS score traced from ECG Also the correlation can be quantified by means of various indices

r = 0 r = 0.7 r = 1 r = Perfectly + ve Perfectly - ve

Different indices of Correlation Coefficient Pearson’s product moment correlation coefficient (r) - used to assess two normally distributed continuous variables Spearman’s rank correlation coefficient (p) (rho) – used to assess two continuous variables and at least one of which is not normally distributed Kendall’s Rank Correlation Coefficient (t) (tan)– used to assess agreement between two continuous variable which, are measured on the same scale. Intra class correlation coefficient (ICC) – used to assess agreement between two continuous variable which, are measured on the same scale.

How correlation coefficient is reported or interpreted? Correlation Coefficient always ranges between -1 to +1 that measures the strength and direction of relationship. Correlation of 1 refers to 100% positive relation and -1 refers to negative correlation. Correlation value of zero means no linear relation.

Calculation of Pearson’s product moment correlation coefficient ® r = Cov(X,Y) = 1 SD x * SD Y n∑ (X-X) (y-y) √1/n ∑ (x-x) 2 *√1/n∑(y-y) 2 Where Cov (X, Y) – Covariance of X and Y SDx, SDy – Standard deviation of x and y respectively.

Example: Body weight (BW) and systolic blood pressure (BP) in 10 newborn babies. Find is there any association. BW BP

Scatter plot BPBP BW o o oo o o o o o

Calculation of Correlation Coefficient Sr. No 7 (1) BW8 (2) BP (3) X-mean ((1)-2.69)) (4) Y-Mean ((2)) – 76.9) (3)* (4) X-Mean * (Y-Mean) (3) 2 (X – Mean) 2 (4) 2 (Y-Mean) Tot Mean Var ∑x= ∑y= SD

r = Cov (X, Y)/SD x * SD Y = /.684*16.92 = 0.92

95% Confidence Interval of Correlation Coefficient (r) Z = 1 1n (1+r) SE (Z) = 1 2 (1-r) n-3 Z1 = Z-1.96 * SE (Z), Z2 = Z * SE (Z) 95% CI for (r) = e 2Z1 – 1 /e 2Z1 +1 to e 2Z 2 - 1/e 2Z 2 + 1

Misuses of Correlation Coefficient Concluding no correlation from zero correlation while in fact a strong linear relationship may exist. Unwarranted conclusion from spurious correlation Concluding cause and effect relationship while just be an indirect relationship Concluding an agreement between pairs of measurement while they may not have same values at all points