Data Analysis Module: Correlation and Regression

Slides:



Advertisements
Similar presentations
Lesson 10: Linear Regression and Correlation
Advertisements

Forecasting Using the Simple Linear Regression Model and Correlation
Correlation and Linear Regression.
Scatter Diagrams and Linear Correlation
Correlation and Regression Analysis
Linear Regression and Correlation
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Correlation and Regression. Correlation What type of relationship exists between the two variables and is the correlation significant? x y Cigarettes.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Correlation 1. Correlation - degree to which variables are associated or covary. (Changes in the value of one tends to be associated with changes in the.
Multiple Regression Research Methods and Statistics.
Relationships Among Variables
Correlational Research Strategy. Recall 5 basic Research Strategies Experimental Nonexperimental Quasi-experimental Correlational Descriptive.
Correlation & Regression
MATH 1107 Elementary Statistics Lecture 6 Scatterplots, Association and Correlation.
Correlation and Regression
Relationship of two variables
Correlation and Linear Regression
Anthony Greene1 Correlation The Association Between Variables.
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
Lecture 6 Correlation and Regression STAT 3120 Statistical Methods I.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 12-1 Correlation and Regression.
UNDERSTANDING RESEARCH RESULTS: DESCRIPTION AND CORRELATION © 2012 The McGraw-Hill Companies, Inc.
Correlation and Linear Regression INCM Correlation  Correlation coefficients assess strength of linear relationship between two quantitative variables.
Statistical Methods Statistical Methods Descriptive Inferential
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 4 Summary Scatter diagrams of data pairs (x, y) are useful in helping us determine visually if there is any relation between x and y values and,
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Chapter 13 Linear Regression and Correlation. Our Objectives  Draw a scatter diagram.  Understand and interpret the terms dependent and independent.
Correlation and Linear Regression
Thursday, May 12, 2016 Report at 11:30 to Prairieview
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
Statistics 200 Lecture #6 Thursday, September 8, 2016
Correlation and Linear Regression
Simple Bivariate Regression
Regression Analysis AGEC 784.
Inference for Least Squares Lines
Correlation & Regression
EXPLORATORY DATA ANALYSIS and DESCRIPTIVE STATISTICS
STAT 4030 – Programming in R STATISTICS MODULE: Basic Data Analysis
REGRESSION (R2).
Correlation, Bivariate Regression, and Multiple Regression
A little VOCAB.
SCATTERPLOTS, ASSOCIATION AND RELATIONSHIPS
Correlation and Regression
STAT 4030 – Programming in R STATISTICS MODULE: Multiple Regression
The scatterplot shows the advertised prices (in thousands of dollars) plotted against ages (in years) for a random sample of Plymouth Voyagers on several.
Correlation and Regression Basics
STAT 4030 – Programming in R STATISTICS MODULE: Confidence Intervals
Correlation and Regression Basics
Understanding Research Results: Description and Correlation
HMI 7530– Programming in R STATISTICS MODULE: Multiple Regression
2. Find the equation of line of regression
HMI 7530– Programming in R STATISTICS MODULE: Confidence Intervals
Keller: Stats for Mgmt & Econ, 7th Ed
Lecture Notes The Relation between Two Variables Q Q
The Weather Turbulence
Introduction to Probability and Statistics Thirteenth Edition
STEM Fair Graphs.
Inferential Statistics and Probability a Holistic Approach
Correlation and Regression
Section 1.4 Curve Fitting with Linear Models
Correlation and Regression
An Introduction to Correlational Research
CORRELATION AND MULTIPLE REGRESSION ANALYSIS
Warsaw Summer School 2017, OSU Study Abroad Program
Linear Regression and Correlation
Presentation transcript:

Data Analysis Module: Correlation and Regression R Programming Data Analysis Module: Correlation and Regression

Data Analysis Module Basic Descriptive Statistics and Confidence Intervals Basic Visualizations Histograms Pie Charts Bar Charts Scatterplots Ttests/Bivariate testing One Sample Paired Independent Two Sample ANOVA Chi Square and Odds Regression Basics

Data Analysis Module: Correlation and Regression Dependent Variable Independent (predictor) Variable Statistical Test Comments Quantitative Categorical T-TEST (one, two or paired sample) Determines if categorical variable (factor) affects dependent variable; typically used for experimental or planned change studies Correlation/Regression Analysis Test establishes a regression model; used to explain, predict or control dependent variable Chi-Square Tests if variables are statistically independent (i.e. are they related or not?)

Data Analysis Module: Correlation and Regression Correlation coefficients assess strength of linear relationship between two quantitative variables. The correlation measure ranges from -1 to +1. A negative correlation means that X and Y are inversely related. A positive correlation means that X and Y are directly related. zero correlation means that X and Y are not linearly related. A correlation of +1 indicates X and Y are directly related and that all the points fall on the same straight line. A correlation of -1 indicates X and Y are inversely related and that all the points fall on the same straight line Plot Scatter Diagram of Each Predictor variable and Dependent Variable Look of Departures from Linearity Look for extreme data points (Outliers) Examine Partial Correlation Can’t determine causality, but isolate confounding variables

Data Analysis Module: Correlation and Regression For example, lets take two variables and evaluate their correlation…open the stats98 dataset in Excel… What would you expect the correlation of the Verbal SAT scores and the Math SAT scores to be? Why? What would you expect the correlation of the Math SAT scores and the percent taking the test to be? Why?

Data Analysis Module: Correlation and Regression What would you expect the correlation of the Verbal SAT scores and the Math SAT scores to be? Why?

Data Analysis Module: Correlation and Regression What would you expect the correlation of the Math SAT scores and the Percent of HS students that took the test? Why?

Data Analysis Module: Correlation and Regression

Data Analysis Module: Correlation and Regression From the previous slide, the “regression line” has been imposed onto the relationship between Price and Age of car. The equation of this line takes the general form of y=mx+b, where: Y is the dependent variable (Price) M is the slope of the line X is the independent variable (Age) B is the Y-intercept. When we discussion regression models, we transform this equation to be: Y = bo + b1x1 + …bnxn Where bo is the y-intercept and b1 is the slope of the line. The “slope” is also the effect of a one unit change of x on y.

Data Analysis Module: Correlation and Regression From the previous slide, the model equation is presented in the form of the equation of a line: y=-924x + 12320. From this, we would say: For every 1 year of a car’s age, there is a $924 decrease in theprice of the car. Every car “starts” at $12,320. If a car is 2 years old, the expected price is $10,472. That R2 value of .8937 is interpreted as “89.37% of the change in price of the cars can be explained by this linear model, where age is the only predictor”.