Correlation and Regression. O UTLINE Introduction  10-1 Scatter plots.  10-2 Correlation.  10-3 Correlation Coefficient.  10-4 Regression.

Slides:



Advertisements
Similar presentations
Correlation and regression Dr. Ghada Abo-Zaid
Advertisements

Correlation & Regression Chapter 10. Outline Section 10-1Introduction Section 10-2Scatter Plots Section 10-3Correlation Section 10-4Regression Section.
Correlation and Regression
Correlation and Regression
© The McGraw-Hill Companies, Inc., 2000 CorrelationandRegression Further Mathematics - CORE.
Elementary Statistics Larson Farber 9 Correlation and Regression.
Correlation and Regression
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
SIMPLE LINEAR REGRESSION
Linear Regression and Correlation Analysis
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Chapter 9: Correlation and Regression
SIMPLE LINEAR REGRESSION
Correlation & Regression Math 137 Fresno State Burger.
Correlation and Linear Regression
STATISTICS ELEMENTARY C.M. Pascual
SIMPLE LINEAR REGRESSION
Correlation and Regression
Chapter Correlation and Regression 1 of 84 9 © 2012 Pearson Education, Inc. All rights reserved.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
© The McGraw-Hill Companies, Inc., 2000 Business and Finance College Principles of Statistics Lecture 10 aaed EL Rabai week
Unit 3 Section : Correlation  Correlation – statistical method used to determine whether a relationship between variables exists.  The correlation.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.
Regression Section 10.2 Bluman, Chapter 101. Example 10-1: Car Rental Companies Construct a scatter plot for the data shown for car rental companies in.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
Correlation and Regression
Elementary Statistics Correlation and Regression.
Correlation Analysis. A measure of association between two or more numerical variables. For examples height & weight relationship price and demand relationship.
Correlation & Regression
Unit 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 11 Correlation and Simple Linear Regression Statistics for Business (Econ) 1.
Multiple Correlation and Regression
Correlation and Regression. fourth lecture We will learn in this lecture: Correlation and Regression 1- Linear Correlation Coefficient of Pearson 2- Simple.
Chapter 7 Calculation of Pearson Coefficient of Correlation, r and testing its significance.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
CORRELATION ANALYSIS.
Free Powerpoint Templates ROHANA BINTI ABDUL HAMID INSTITUT E FOR ENGINEERING MATHEMATICS (IMK) UNIVERSITI MALAYSIA PERLIS.
Unit 3 Section : Regression  Regression – statistical method used to describe the nature of the relationship between variables.  Positive.
Go to Table of Content Correlation Go to Table of Content Mr.V.K Malhotra, the marketing manager of SP pickles pvt ltd was wondering about the reasons.
Video Conference 1 AS 2013/2012 Chapters 10 – Correlation and Regression 15 December am – 11 am Puan Hasmawati Binti Hassan
Unit 3 Sections 10-1 & What we will be able to do throughout this chapter…  Determine relationships between two or more variables  Determine strengths.
CHAPTER 10 & 13 Correlation and Regression Instructor: Alaa saud Note: This PowerPoint is only a summary and your main source should be the book.
CHAPTER 10 & 13 Correlation and Regression. Regression.
Chapter Correlation and Regression 1 of 84 9 © 2012 Pearson Education, Inc. All rights reserved.
Correlation and Regression Note: This PowerPoint is only a summary and your main source should be the book.
Correlation and Regression Lecturer : FATEN AL-HUSSAIN Note: This PowerPoint is only a summary and your main source should be the book.
CHAPTER 10 & 13 Correlation and Regression
Correlation & Regression
CHAPTER 10 & 13 Correlation and Regression
CHAPTER 10 & 13 Correlation and Regression
10.2 Regression If the value of the correlation coefficient is significant, the next step is to determine the equation of the regression line which is.
Correlation and Regression
Chapter 5 STATISTICS (PART 4).
SIMPLE LINEAR REGRESSION MODEL
Correlation and Regression
Correlation and Regression
CHAPTER 10 Correlation and Regression (Objectives)
Correlation and Regression
M248: Analyzing data Block D.
Lecture Notes The Relation between Two Variables Q Q
CORRELATION ANALYSIS.
SIMPLE LINEAR REGRESSION
بسم الله الرحمن الرحيم. Correlation & Regression Dr. Moataza Mahmoud Abdel Wahab Lecturer of Biostatistics High Institute of Public Health University.
Correlation & Regression
CORRELATION & REGRESSION compiled by Dr Kunal Pathak
Presentation transcript:

Correlation and Regression

O UTLINE Introduction  10-1 Scatter plots.  10-2 Correlation.  10-3 Correlation Coefficient.  10-4 Regression.

Correlation and Regression numerical or quantitative Correlation and Regression are inferential statistics involves determining whether a relationship between two or more numerical or quantitative variables exists. Examples: Is the number of hours a student studies is related to the student’s score on a particular exam? Is caffeine related to heart damage? Is there a relationship between a person’s age and his or her blood pressure?

I NTRODUCTION  Correlation  Correlation is a statistical method used to determine whether a relationship between variables exists.  Regression  Regression is a statistical method used to describe the nature of the relationship between variables (positive or negative, linear or nonlinear).

1. Are two or more variables related? 2. If so, what is the strength of the relationship? 3. What type of relationship exists? 4. What kind of predictions can be made from the relationship? S TATISTICAL Q UESTIONS

There are two types of relationships simple multiple there are two types of variables: independent variable  independent variable (explanatory or predictor variable)  dependent variable  dependent variable (outcome or response variable). there are two or more independent variables that are used to predict one dependent variable.

Simple relationship (independent + dependent) Positive relationship Negative relationship both variables increase or decrease at the same time. Example: a person’s height and perfect weight. one variable increases, the other variable decreases and vice versa. Example: the strength of people over 60 years of age.

Example : Is there a relationship between a person’s age and his or her blood pressure? The type of relationship: independent variable(s): The independent variable(s): The dependent variable:

Example: Is there a relationship between a students final score in math and factors such as the number of hours a student studies, the number of absences, and the IQ score. The type of relationship: independent variable(s): The independent variable(s): The dependent variable:

Scatter Plots

scatter plot  A scatter plot is a graph of the ordered pairs (x, y) of numbers consisting of the independent variable x and the dependent variable y. + linear relationship - linear relationship No linear relationship

Construct a scatter plot for the data obtained in a study on the number of absences and the final grades of seven randomly selected students from a statistics class. Example 10-2: StudentNumber of absencesFinal grade A682 B286 C1543 D974 E1258 F590 G878

From the following scatter plot, the relationship can be described as: a) Positive b) Strong positive c) Strong negative d) No relationship

Correlation

 The correlation coefficient computed from the sample data measures the strength and direction of a linear relationship between two variables.  The range of the correlation coefficient is from  1 to  1. symbol for the sample r symbol for the population 

 If there is a strong positive linear relationship between the variables, the value of r will be close to  1.  If there is a strong negative linear relationship between the variables, the value of r will be close to  1.

MeaningCorrelation Coefficient value Complete Positive Linear Relationship+1 Strong Positive Linear Relationship من 0.70 إلى 0.99 Moderate Positive Linear Relationship من 0.50 إلى 0.69 Weak Positive Linear Relationship من 0.01 إلى 0.49 No Linear Relationship0 Weak Negative Linear Relationship من إلى Moderate Negative Linear Relationship من إلى Strong Negative Linear Relationship من إلى Complete Negative Linear Relationship

Which of the following values does not represent a correlation coefficient: a) r= 1.5 b) r= 0.90 c) r= 0 d) r= -1

If the correlation coefficient between 2-variable equal (-0.096), this means the relationship between 2-variable is: a) Weak positive b) Weak negative c) Strong positive d) Strong negative

Pearson Ch (10) Spearman Rank Ch (13) -Denoted by ( r ) -Only Used when Two variables are quantitative. -Denoted by ( r s ) -Used when Two variables are Quantitative or Qualitative. Correlation Coefficient

Pearson linear correlation coefficient is where n is the number of data pairs (sample size).

Example 10-5: Compute the correlation coefficient for the data in Example 10–2. Student Number of absences x Final grade y xyx2x2 y2y2 A682 B286 C1543 D974 E1258 F590 G878

Compute the correlation coefficient for the data in Example 10–1. Example 10-4: compa ny Cars x Income y xyx2x2 y2y2 A B C D E F Σx = 153.8Σy = 18.7 Σxy = Σx 2 = Σy 2 = 80.67

Solution :

Rank Correlation Coefficient

Spearman rank correlation coefficient,  Other types of correlation coefficients. Is called the Spearman rank correlation coefficient, can be used when the data are ranked. Where d = difference in ranks. n = number of data pairs.

EX:EX: To study the correlation estimates of the students in the subject of statistics and mathematics in their estimates we chose 5 students and their estimates were as follows: BDCAFStat (X) AFBCDMath (Y)

Example 13-7 P(698): Two students were asked to rate eight different textbooks for a specific course on an ascending scale from 0 to 20 points. Compute the correlation coefficient for the data: Textbook.Student 1 Student 2 Rank(X 1 )Rank(X 2 )d=X 1 – X 2 d² ABCDEFGHABCDEFGH Total

Example 13-7 P(698): Textbook.Student 1 Student 2 Rank(X 1 )Rank(X 2 )d=X 1 – X 2 d² ABCDEFGHABCDEFGH Total30

If the different between the ranks of two variables are (-1, 0, 0, -1, 4, -2), Find the value of the Spearman correlation coefficient? a) r s = 1.5 b) r s = 0.90 c) r s = 0.371

Regression

regression line  If the value of the correlation coefficient is significant, the next step is to determine the equation of the regression line which is the data’s line of best fit.

  Best fit means that the sum of the squares of the vertical distance from each point to the line is at a minimum.

Example 10-9: Find the equation of the regression line for the data in Example 10–4, and graph the line on the scatter plot. Σx = 153.8, Σy = 18.7, Σxy = , Σx 2 = , Σy 2 = 80.67, n = 6

If age (x), number of sick days (y) And the equation of the regression line is: y`= x Answer questions: 1- Predict the number of sick days if age is 40 years. = For each increase of age, the number of sick days: a) Decreases by on average. b) Increases by on average The correlation coefficient (relationship) between two variables: a) Positive b) Negative c) d) What is the slope of the regression line? a) 12.5 b) c)

Predictions Predictions are made in all areas and daily. Examples include weather forecasting, stock market analyses, sales predictions, crop predictions, gasoline price predictions, and sports predictions. Some predictions are more accurate than others, due to the strength of the relationship. That is, the stronger the relationship is between variables, the more accurate the prediction is.

 Find two points to sketch the graph of the regression line. Use any x values between 10 and 60. For example, let x equal 15 and 40. Substitute in the equation and find the corresponding y value. Plot (15,1.986) and (40,4.636), and sketch the resulting line.

For Example: If Absences (x) and Final Grade (y) :b= , which means: for each increase of 1 absences (x), the value of (y) changes unit (the final grade decrease scores) on average.

*R EMARK : o The sign of the correlation coefficient and the sign of the slope of the regression line will always be the same. r (positive) ↔ b (positive) r (negative) ↔ b (negative) Car Rental Companies: r=0.982,b=0.106 Absences and Final Grade: r= , b= The regression line will always pass through the poin t (, ӯ ).

*R EMARK : The magnitude of the change in one variable when the other variable changes exactly 1 unit is called a marginal change. The value of slope b of the regression line equation represent the marginal change. For Example: Car Rental Companies: b= 0.106, which means for each increase of 10,000 cars, the value of y changes unit (the annual income increase $106 million) on average.

Example 10-11: Use the equation of the regression line to predict the income of a car rental agency that has 200,000 automobiles. x = 20 corresponds to 200,000 automobiles. Hence, when a rental agency has 200,000 automobiles, its revenue will be approximately $2.516 billion.

If (x) is the temperature degree, and (y) is the number of emergency calls, Choose the right equation of the regression line. When the temperature increases by one degree, then the emergency calls decreases by on average. a) y`= x b) y`= x c) y`= x d) y`= 0.99 – 0.025x

When we study the relationship between the number of hours a person exercises and his weight, the correlation coefficient may be: a) b) 0.88 c) 0.90 d) -0.01

The correlation coefficient and the slope of the regression are the same in: a) Only sign. b) Only value c) Sign and value If the both variables (x,y) are quantitative data, then the appropriate correlation coefficient is: a) Spearman b) Rank c) Pearson

To determine the relationship between variables exists is to use the ….. a) Correlation b) Scatter plot c) Regression d) a and c The range of correlation coefficient (r) is: a) -1 ≤ r ≤ 1 b) -1 < r <1 c) -1 < r ≤ 1 d) -1 ≤ r < 1

If the value of correlation coefficient (r) will be close to  1, then the relationship between variables is: a) Positive linear b) Strong positive linear c) Weak positive linear d) Complete positive linear

*** Remark *** The sign of the slope of the regression line indicates the direction of the relationship, Positive slope  Positive relationship Negative slope  Negative relationship