Introduction: Correlation and Regression The General Linear Model is a phrase used to indicate a class of statistical models which include simple linear.

Slides:



Advertisements
Similar presentations
Modeling of Data. Basic Bayes theorem Bayes theorem relates the conditional probabilities of two events A, and B: A might be a hypothesis and B might.
Advertisements

6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Introduction: The General Linear Model b b The General Linear Model is a phrase used to indicate a class of statistical models which include simple linear.
Chapter 10 Regression. Defining Regression Simple linear regression features one independent variable and one dependent variable, as in correlation the.
Sociology 601 Class 17: October 28, 2009 Review (linear regression) –new terms and concepts –assumptions –reading regression computer outputs Correlation.
Bivariate Regression Analysis
The Simple Linear Regression Model: Specification and Estimation
9. SIMPLE LINEAR REGESSION AND CORRELATION
WFM 5201: Data Management and Statistical Analysis © Dr. Akm Saiful IslamDr. Akm Saiful Islam WFM 5201: Data Management and Statistical Analysis Akm Saiful.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
The Simple Regression Model
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Introduction to Probability and Statistics Linear Regression and Correlation.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Linear Regression and Linear Prediction Predicting the score on one variable.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
1 Relationships We have examined how to measure relationships between two categorical variables (chi-square) one categorical variable and one measurement.
Correlation and Regression Analysis
Leon-Guerrero and Frankfort-Nachmias,
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Relationships Among Variables
Ordinary Least Squares
Linear Regression Analysis
Chapter 8: Bivariate Regression and Correlation
Regression and Correlation Methods Judy Zhong Ph.D.
Introduction to Linear Regression and Correlation Analysis
MAT 254 – Probability and Statistics Sections 1,2 & Spring.
EC339: Lecture 6 Chapter 5: Interpreting OLS Regression.
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
Chapter 6 & 7 Linear Regression & Correlation
Managerial Economics Demand Estimation. Scatter Diagram Regression Analysis.
Chapter 12 Examining Relationships in Quantitative Research Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
Introduction to Linear Regression
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
Statistical Methods Statistical Methods Descriptive Inferential
Examining Relationships in Quantitative Research
Go to Table of Content Single Variable Regression Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
Roger B. Hammer Assistant Professor Department of Sociology Oregon State University Conducting Social Research Ordinary Least Squares Regression.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
PS 225 Lecture 20 Linear Regression Equation and Prediction.
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 13-1 Introduction to Regression Analysis Regression analysis is used.
CORRELATION. Correlation key concepts: Types of correlation Methods of studying correlation a) Scatter diagram b) Karl pearson’s coefficient of correlation.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Basic Statistics Linear Regression. X Y Simple Linear Regression.
V. Rouillard  Introduction to measurement and statistical analysis CURVE FITTING In graphical form, drawing a line (curve) of best fit through.
Linear Regression Linear Regression. Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Purpose Understand Linear Regression. Use R functions.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 7: Regression.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Copyright © Cengage Learning. All rights reserved. 8 9 Correlation and Regression.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Correlation and Regression Basic Concepts. An Example We can hypothesize that the value of a house increases as its size increases. Said differently,
Introduction Many problems in Engineering, Management, Health Sciences and other Sciences involve exploring the relationships between two or more variables.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
Chapter 13 Simple Linear Regression
Regression Analysis AGEC 784.
Lecture #26 Thursday, November 17, 2016 Textbook: 14.1 and 14.3
Regression Analysis Module 3.
Linear Regression and Correlation Analysis
Linear Regression Prof. Andy Field.
Correlation and Regression
Simple Linear Regression
Regression Part II.
Presentation transcript:

Introduction: Correlation and Regression The General Linear Model is a phrase used to indicate a class of statistical models which include simple linear regression analysis. Regression is the predominant statistical tool used in the social sciences due to its simplicity and versatility. Also called Linear Regression Analysis. We will examine regression first, and then see how correlation is one portion of regression analysis.

Simple Linear Regression: The Basic Mathematical Model Regression is based on the concept of the simple proportional relationship - also known as the straight line. We can express this idea mathematically! Theoretical aside: All theoretical statements of relationship imply a mathematical theoretical structure. Just because it isn’t explicitly stated doesn’t mean that the math isn’t implicit in the language itself!

Simple Linear Relationships Alternate Mathematical Notation for the straight line - don’t ask why! 10th Grade Geometry Statistics Literature Econometrics Literature

Alternate Mathematical Notation for the Line These are all equivalent. We simply have to live with this inconsistency. We won’t use the geometric tradition, and so you just need to remember that B 0 and a are both the same thing.

Linear Regression: the Linguistic Interpretation In general terms, the linear model states that the dependent variable is directly proportional to the value of the independent variable. Thus if we state that some variable Y increases in direct proportion to some increase in X, we are stating a specific mathematical model of behavior - the linear model.

Linear Regression: A Graphic Interpretation

The linear model is represented by a simple picture

The Mathematical Interpretation of the Regression Parameters a = the intercept the point where the line crosses the Y-axis. (the value of the dependent variable when all of the independent variables = 0) b = the slope the increase in the dependent variable per unit change in the independent variable (also known as the 'rise over the run')

The Error Term Such models do not predict behavior perfectly. So we must add a component to adjust or compensate for the errors in prediction. Having fully described the linear model, there are now several courses to spend on the error term.

The 'Goal' of Ordinary Least Squares Ordinary Least Squares (OLS) is a method of finding the linear model which minimizes the sum of the squared errors. Such a model provides the best explanation/prediction of the data.

Why Least Squared error? Why not simply minimum error? It is similar to the problem with the average deviation The error’s about the line sum to 0.0! Minimum absolute deviation (error) models now exist, but they are mathematically cumbersome. Try algebra with | Absolute Value | signs! We square the error to get rid of the negative signs, and take the square root to get back to the “root mean squared error.” Which we don’t use very much Some feel that big errors should be more influential than small errors.

Other models are possible... Best parabola...? (i.e. nonlinear or curvilinear relationships) Best maximum likelihood model... ? Best expert system...? Complex Systems…? Chaos models Catastrophe models others

The Notion of Linear Change The linear aspect means that the same amount of increase unemployment will have the same effect on crime at both low and high unemployment. A nonlinear change would mean that as unemployment increased, its impact upon the crime rate might increase at higher unemployment levels.

Minimizing the Sum of Squared Errors Who put the Least in OLS In mathematical jargon we seek to minimize the Unexplained Sum of Squares (USS), where:

Decomposition of the error in LS

T-Tests Since we wish to make probability statements about our model, we must do tests of inference. Fortunately,

Measures of Goodness of fit The Correlation coefficient r-squared The F test

The correlation coefficient A measure of how close the residuals are to the regression line It ranges between -1.0 and +1.0 It is closely related to the slope.

Goodness of fit The correlation coefficient A measure of how close the residuals are to the regression line It ranges between -1.0 and +1.0 r 2 (r-square) The r-square (or R-square, or r 2 ) is also called the coefficient of determination Ranges between 0.0 and 1.0 Expresses the % of Y explained by X

Tests of Inference t-tests for coefficients F-test for entire model Since we are interested in how well the model performs at reducing error, we need to develop a means of assessing that error reduction. Since the mean of the dependent variable represents a good benchmark for comparing predictions, we calculate the improvement in the prediction of Yi relative to the mean of Y (the best guess of Y with no other information).