Summary of introduced statistical terms and concepts mean Variance & standard deviation covariance & correlation Describes/measures average conditions.

Slides:



Advertisements
Similar presentations
Multiple Regression Analysis
Advertisements

Kin 304 Regression Linear Regression Least Sum of Squares
Chapter 12 Simple Linear Regression
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Regression Analysis Module 3. Regression Regression is the attempt to explain the variation in a dependent variable using the variation in independent.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
Simple Linear Regression. G. Baker, Department of Statistics University of South Carolina; Slide 2 Relationship Between Two Quantitative Variables If.
Simple Linear Regression and Correlation
Objectives (BPS chapter 24)
Simple Linear Regression
A Short Introduction to Curve Fitting and Regression by Brad Morantz
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
REGRESSION What is Regression? What is the Regression Equation? What is the Least-Squares Solution? How is Regression Based on Correlation? What are the.
Chapter 13 Introduction to Linear Regression and Correlation Analysis
SIMPLE LINEAR REGRESSION
Chapter Topics Types of Regression Models
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 11 Notes Class notes for ISE 201 San Jose State University.
Pertemua 19 Regresi Linier
FIN357 Li1 The Simple Regression Model y =  0 +  1 x + u.
REGRESSION Predict future scores on Y based on measured scores on X Predictions are based on a correlation from a sample where both X and Y were measured.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
1 Chapter 17: Introduction to Regression. 2 Introduction to Linear Regression The Pearson correlation measures the degree to which a set of data points.
Introduction to Regression Analysis, Chapter 13,
Lecture 5 Correlation and Regression
Lecture 3-2 Summarizing Relationships among variables ©
Introduction to Linear Regression and Correlation Analysis
STATISTICS: BASICS Aswath Damodaran 1. 2 The role of statistics Aswath Damodaran 2  When you are given lots of data, and especially when that data is.
1 FORECASTING Regression Analysis Aslı Sencer Graduate Program in Business Information Systems.
Chapter 6 & 7 Linear Regression & Correlation
CHAPTER 15 Simple Linear Regression and Correlation
Managerial Economics Demand Estimation. Scatter Diagram Regression Analysis.
Business Research Methods William G. Zikmund Chapter 23 Bivariate Analysis: Measures of Associations.
Production Planning and Control. A correlation is a relationship between two variables. The data can be represented by the ordered pairs (x, y) where.
Regression. Population Covariance and Correlation.
1 Chapter 12 Simple Linear Regression. 2 Chapter Outline  Simple Linear Regression Model  Least Squares Method  Coefficient of Determination  Model.
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Simple Linear Regression. The term linear regression implies that  Y|x is linearly related to x by the population regression equation  Y|x =  +  x.
Objective: Understanding and using linear regression Answer the following questions: (c) If one house is larger in size than another, do you think it affects.
Political Science 30: Political Inquiry. Linear Regression II: Making Sense of Regression Results Interpreting SPSS regression output Coefficients for.
STA291 Statistical Methods Lecture LINEar Association o r measures “closeness” of data to the “best” line. What line is that? And best in what terms.
Environmental Modeling Basic Testing Methods - Statistics III.
LECTURE 9 Tuesday, 24 FEBRUARY STA291 Fall Administrative 4.2 Measures of Variation (Empirical Rule) 4.4 Measures of Linear Relationship Suggested.
AP STATISTICS LESSON 14 – 1 ( DAY 1 ) INFERENCE ABOUT THE MODEL.
CHAPTER 5 CORRELATION & LINEAR REGRESSION. GOAL : Understand and interpret the terms dependent variable and independent variable. Draw a scatter diagram.
Basic Statistics Linear Regression. X Y Simple Linear Regression.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
1 Simple Linear Regression and Correlation Least Squares Method The Model Estimating the Coefficients EXAMPLE 1: USED CAR SALES.
Regression Analysis Deterministic model No chance of an error in calculating y for a given x Probabilistic model chance of an error First order linear.
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
Chapter 14 Introduction to Regression Analysis. Objectives Regression Analysis Uses of Regression Analysis Method of Least Squares Difference between.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Describing Bivariate Relationships. Bivariate Relationships When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response.
Bivariate Regression. Bivariate Regression analyzes the relationship between two variables. Bivariate Regression analyzes the relationship between two.
The simple linear regression model and parameter estimation
Chapter 4: Basic Estimation Techniques
Regression and Correlation of Data Summary
Chapter 4 Basic Estimation Techniques
Lecture #26 Thursday, November 17, 2016 Textbook: 14.1 and 14.3
Econ 3790: Business and Economics Statistics
…Don’t be afraid of others, because they are bigger than you
Slope of the regression line:
No notecard for this quiz!!
Correlation and Regression
Simple Linear Regression
Chapter 14 Inference for Regression
Ch 4.1 & 4.2 Two dimensions concept
Introduction to Regression
Presentation transcript:

Summary of introduced statistical terms and concepts mean Variance & standard deviation covariance & correlation Describes/measures average conditions or the center of the sample points Describes/measures the spread of the sample points; deviations from the center of the sample points Describes/measures co-dependence of variations in samples of two random variables Slides 6, 17 updated

Summary of introduced statistical terms and concepts mean Variance & standard deviation covariance & correlation Calculated mean values: unbound, any real number (check your values: it must be within the minimum and maximum of the sample data) Variance: values are > or = 0 Standard deviation: > or = 0 (check your values: standard deviation should be less than the minimum- maximum sample range |max(x)-min(x)|) Covariance: any real number Correlation: between -1 and +1 (check your values: correlation should never exceed the range from -1 to 1]

Linear function: y= bx +a The value of y depends on the value of x Δy = b*Δx Δx Note: I corrected the notation of the equation, please check your notes b is the slope, a is the constant the intercept value. R-script class14.R was updated ( :30pm)

Linear function y= bx +a The value of y depends on the value of x Δy = b*Δx Δx b= Δy/Δx =2 a=-1

Linear function y= bx +a The value of y depends on the value of x a= Δy/Δx =2 y= 0x + a= a value of y does not depend on x

Note: updated slide to define y with random error

?

How to estimate the best fitting line?

Mathematically we formulate this as a minimization problem: Minimize the distance of the data points from the linear regression line.

How to estimate the best fitting line? Sum of Squared Errors (SSE) The deviations from the deterministic model line are interpreted as random errors (following a Gaussian distribution)

How to estimate the best fitting line? Sum of Squared Errors (SSE) Intercept Slope

How to estimate the best fitting line? Sum of Squared Errors (SSE) Note: Many textbooks and statisticians would prefer to distinguish the estimated values from the actual true (but unknown) parameter values using a different symbol. Or they use Greek letters for the true values, and Latin letters for the estimates. ^ ^ ^ Sample mean of x and y

How to estimate the best fitting line? Sum of Squared Errors (SSE) 1 n 1 n COV(x,y) VAR(x) b=

How to estimate the best fitting line? Sum of Squared Errors (SSE) 1 n 1 n Slope of the regression line: Correlation coefficient * standard deviation (y) / standard deviation (x)

How to estimate the best fitting line? Linear relationship with errors: y= bx +a + ε The value of y depends on the value of x plus a random error Estimated Regression line

Class exercises: download script class14.R: source(“class14.R”) (1) change the linear parameters to have steeper, or more flat slopes. (2) change the slope to negative (from top left to bottom right) (3) observe how the correlation coefficient changes (4) change the error variance and observe how it affects the correlation and fitting of the line (5) watch in case, where does the line intersect with the y-axis (6) change the intercept parameter (intersection with the y-axis). What is the effect on the correlation? (7) find a way to change the sample size of the scatter points and repeat your (1)-(6) (8) set the slope parameter closer to 0 and eventually to 0 change the variance of the errors. What happens to correlation?

5/30/2016 Note: In R-scripts the variable names have a slightly different notation: As you can see in class14.R we use ‘yobs’ for the variable y containing the random error ‘e’. The estimator for the slope ‘bfit’ is calculated by using for the correlation coefficient ‘cor(x,yobs)’ and the standard deviation ‘sd(yobs)’ and ‘sd(x)’ The equation on slide 14 includes the error