Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.

Slides:



Advertisements
Similar presentations
Chapter 10 Correlation and Regression
Advertisements

Sections 10-1 and 10-2 Review and Preview and Correlation.
Correlation & Regression Chapter 10. Outline Section 10-1Introduction Section 10-2Scatter Plots Section 10-3Correlation Section 10-4Regression Section.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Correlation and Regression
1 Objective Investigate how two variables (x and y) are related (i.e. correlated). That is, how much they depend on each other. Section 10.2 Correlation.
Chapter 12 Simple Regression
SIMPLE LINEAR REGRESSION
Correlation A correlation exists between two variables when one of them is related to the other in some way. A scatterplot is a graph in which the paired.
Regression Chapter 10 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
10-2 Correlation A correlation exists between two variables when the values of one are somehow associated with the values of the other in some way. A.
SIMPLE LINEAR REGRESSION
Correlation and Regression Analysis
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
STATISTICS ELEMENTARY C.M. Pascual
Descriptive Methods in Regression and Correlation
Copyright © 2011 Pearson Education, Inc. Multiple Regression Chapter 23.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Chapter 10 Correlation and Regression
SIMPLE LINEAR REGRESSION
Correlation and Regression
Introduction to Linear Regression and Correlation Analysis
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
Relationship of two variables
Chapter 13: Inference in Regression
Correlation.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Sections 9-1 and 9-2 Overview Correlation. PAIRED DATA Is there a relationship? If so, what is the equation? Use that equation for prediction. In this.
September In Chapter 14: 14.1 Data 14.2 Scatterplots 14.3 Correlation 14.4 Regression.
Is there a relationship between the lengths of body parts ?
1 Chapter 9. Section 9-1 and 9-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-1 Review and Preview.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Probabilistic and Statistical Techniques 1 Lecture 24 Eng. Ismail Zakaria El Daour 2010.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Chapter 10 Correlation and Regression
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
Statistics Class 7 2/11/2013. It’s all relative. Create a box and whisker diagram for the following data(hint: you need to find the 5 number summary):
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Slide Slide 1 Warm Up Page 536; #16 and #18 For each number, answer the question in the book but also: 1)Prove whether or not there is a linear correlation.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 13 Multiple Regression
Chapter 10 Correlation and Regression Lecture 1 Sections: 10.1 – 10.2.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 MVS 250: V. Katch S TATISTICS Chapter 5 Correlation/Regression.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Chapter Correlation and Regression 1 of 84 9 © 2012 Pearson Education, Inc. All rights reserved.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-1 Overview Overview 10-2 Correlation 10-3 Regression-3 Regression.
1 Objective Given two linearly correlated variables (x and y), find the linear function (equation) that best describes the trend. Section 10.3 Regression.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Lecture Slides Elementary Statistics Twelfth Edition
Review and Preview and Correlation
Is there a relationship between the lengths of body parts?
CHS 221 Biostatistics Dr. wajed Hatamleh
Chapter 5 STATISTICS (PART 4).
Elementary Statistics
Lecture Slides Elementary Statistics Twelfth Edition
Lecture Slides Elementary Statistics Thirteenth Edition
Correlation and Regression
Chapter 10 Correlation and Regression
Lecture Slides Elementary Statistics Eleventh Edition
Lecture Slides Elementary Statistics Tenth Edition
Correlation and Regression Lecture 1 Sections: 10.1 – 10.2
Lecture Slides Elementary Statistics Eleventh Edition
Chapter 9 Correlation and Regression
Presentation transcript:

Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple Regression 10-6 Modeling

Slide Slide 2 Section 10-1 Overview

Slide Slide 3 Overview This chapter introduces important methods for making inferences about a correlation (or relationship) between two variables, and describing such a relationship with an equation that can be used for predicting the value of one variable given the value of the other variable. We consider sample data that come in pairs.

Slide Slide 4 Section 10-2 Correlation

Slide Slide 5 Context of data: Important! 1)What is the key issue if this data is test grades of subjects before and after formal instruction? 2)What is the key issue if these data are reasoning tests for sample of men (x) and a separate sample of women (y)? 3)What is the key issue if each pair represents a math reasoning score (x) and a starting salary (y) in thousands of dollars? XY

Slide Slide 6 Activity Partner up! 1)Measure the length of your thumb; measure your partner’s thumb. 2)Text the following messages to you partner and record the time it takes for them to text the message to you (must be exactly the same): Ms. OLoughlin is the best statistics teacher ever! 3)Record your results on the board 4)Plot your data; which one is x and which one is y? 5)Do you see a relationship between the two variables? Describe it as best you can. Length of thumb Speed of text

Slide Slide 7 Key Concept This section introduces the linear correlation coefficient r, which is a numerical measure of the strength of the relationship between two variables representing quantitative data. Because technology can be used to find the value of r, it is important to focus on the concepts in this section, without becoming overly involved with tedious arithmetic calculations. Use r to conclude whether or not there is a relationship between two variables.

Slide Slide 8 Definitions A correlation exists between two variables when one of them is related to the other in some way. Part 1: Basic Concepts of Correlation The linear correlation coefficient r measures the strength of the linear relationship between paired x- and y- quantitative values in a sample.

Slide Slide 9 Exploring the Data We can often see a relationship between two variables by constructing a scatterplot. Figure 10-2 following shows scatterplots with different characteristics. You should study the “overall” pattern of plotted points! Beware of outliers!

Slide Slide 10 Scatterplots of Paired Data Figure 10-2

Slide Slide 11 Scatterplots of Paired Data Figure 10-2

Slide Slide 12 Requirements 1. The sample of paired ( x, y ) data is a random sample of independent quantitative data. 2. Visual examination of the scatterplot must confirm that the points approximate a straight-line pattern. 3. The outliers must be removed if they are known to be errors. The effects of any other outliers should be considered by calculating r with and without the outliers included.

Slide Slide 13 Notation for the Linear Correlation Coefficient n represents the number of pairs of data present.  denotes the addition of the items indicated.  x denotes the sum of all x- values.  x 2 indicates that each x - value should be squared and then those squares added. (  x ) 2 indicates that the x- values should be added and the total then squared.  xy indicates that each x -value should be first multiplied by its corresponding y- value. After obtaining all such products, find their sum. r represents linear correlation coefficient for a sample.  represents linear correlation coefficient for a population.

Slide Slide 14 Formula 10-1 n  xy – (  x)(  y) n(  x 2 ) – (  x) 2 n(  y 2 ) – (  y) 2 r =r = The linear correlation coefficient r measures the strength of a linear relationship between the paired values in a sample. Calculators can compute r Formula

Slide Slide 15 Interpreting r Using Table A-6: If the absolute value of the computed value of r exceeds the value in Table A-6, conclude that there is a linear correlation. Otherwise, there is not sufficient evidence to support the conclusion of a linear correlation. Using Software: If the computed P-value is less than or equal to the significance level, conclude that there is a linear correlation. Otherwise, there is not sufficient evidence to support the conclusion of a linear correlation.

Slide Slide 16  Round to three decimal places so that it can be compared to critical values in Table A-6.  Use calculator or computer if possible. Rounding the Linear Correlation Coefficient r

Slide Slide Data x y You! Calculate r… Using the simple random sample of data below, find the value of r. 1) Use Table A-6 to tell if there is a linear correlation at.01 and.05. 2) Use your calculator to see if there is a linear correlation at.01 and.05.

Slide Slide 18 Activity Partner up! 1)Measure the length of your thumb; measure your partner’s thumb. 2)Text the following messages to you partner and record the time it takes for them to text the message to you (must be exactly the same): Ms. OLoughlin is the best statistics teacher ever! 3)Record your results on the board 4)Plot your data; which one is x and which one is y? 5)Do you see a relationship between the two variables? Describe it as best you can. 6) Use Table A-6 to tell if there is a linear correlation at.01 and.05. 7) Use your calculator to see if there is a linear correlation at.01 and.05 (LinRegTTest). Length of thumb Speed of text

Slide Slide 19 Properties of the Linear Correlation Coefficient r 1. –1  r  1 2. The value of r does not change if all values of either variable are converted to a different scale. 3. The value of r is not affected by the choice of x and y. Interchange all x- and y- values and the value of r will not change. 4. r measures strength of a linear relationship.

Slide Slide 20 Interpreting r : Explained Variation The value of r 2 is the proportion of the variation in y that is explained by the linear relationship between x and y.

Slide Slide 21 Using the duration/interval data in Table 10-1, we have found that the value of the linear correlation coefficient r = What proportion of the variation of the interval after eruption times can be explained by the variation in the duration times? With r = 0.926, we get r 2 = We conclude that (or about 86%) of the variation in interval after eruption times can be explained by the linear relationship between the duration times and the interval after eruption times. This implies that 14% of the variation in interval after eruption times cannot be explained by the duration times. Example: Old Faithful

Slide Slide 22 Common Errors Involving Correlation 1. Causation: It is wrong to conclude that correlation implies causality. 2. Averages: Averages suppress individual variation and may inflate the correlation coefficient. 3. Linearity: There may be some relationship between x and y even when there is no linear correlation.

Slide Slide 23 Always graph your data! 1)Find r, r-squared 2)Comment on the form, direction, and strength. Should we use r for this data? X Y

Slide Slide 24 Extra Example Many of us have heard that a tip should be 15% of the bill. The accompanying table lists some sample data collected from some students. 1)Find r, r-squared. Interpret r-squared. 2)Comment on the form, direction, and strength. 3) Is there sufficient evidence to conclude that there is a relationship between the amount of the bill and the amount of the tip? (Use Table A-6 and LinRegTTest) Bill ($) Tip ($)