Chapter 2 Simple Linear Regression – What is it, Why do we do it?

Slides:



Advertisements
Similar presentations
Chapter 12: Basic option theory
Advertisements

 Population multiple regression model  Data for multiple regression  Multiple linear regression model  Confidence intervals and significance tests.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Correlation and regression Dr. Ghada Abo-Zaid
Chapter 4 Mathematical Expectation.
Key Stone Problem… Key Stone Problem… next Set 15 © 2007 Herbert I. Gross.
Simple Linear Regression
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Chapter 4 Multiple Regression.
The Asset Market, Money, and Prices
The Basics of Regression continued
Return, Risk, and the Security Market Line
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 8.1 Chapter 8 Continuous Probability Distributions.
1 Simple Linear Regression Chapter Introduction In this chapter we examine the relationship among interval variables via a mathematical equation.
Lecture 19 Simple linear regression (Review, 18.5, 18.8)
Simple Linear Regression Analysis
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Linear Regression Modeling with Data. The BIG Question Did you prepare for today? If you did, mark yes and estimate the amount of time you spent preparing.
4.4 Linear Inequalities in Two Variables
Lecture 3-2 Summarizing Relationships among variables ©
Introduction to Linear Regression and Correlation Analysis
Chapter 13: Inference in Regression
Chapter Writing Functions.
Algebra 1 Chapter 4 Section 3.
Chapter 1. Mathematical Model  A mathematical model is a graphical, verbal, numerical, or symbolic representation of a problem situation.
Inferences for Regression
Chapter 6 & 7 Linear Regression & Correlation
Lecture 3-3 Summarizing r relationships among variables © 1.
Exponential and Logarithm
CHAPTER 14 MULTIPLE REGRESSION
Definition of a Function A function is a set of ordered pairs in which no two ordered pairs of the form (x, y) have the same x-value with different y-values.
Chapter 1. Mathematical Model  A mathematical model is a graphical, verbal, numerical, or symbolic representation of a problem situation.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
Slide 1 Lesson 76 Graphing with Rates Chapter 14 Lesson 76 RR.7Understand that multiplication by rates and ratios can be used to transform an input into.
The Mystery of Calculating The Breakeven Point. What in the world is it? w It is the point at which a company does not make any money. w It is the calculation.
Ch4 Describing Relationships Between Variables. Section 4.1: Fitting a Line by Least Squares Often we want to fit a straight line to data. For example.
Copyright © 2010 Pearson Education, Inc. All rights reserved Sec
Correlation & Regression
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Simple Linear Regression. Deterministic Relationship If the value of y (dependent) is completely determined by the value of x (Independent variable) (Like.
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Lecture 7: What is Regression Analysis? BUEC 333 Summer 2009 Simon Woodcock.
Regression MBA/510 Week 5. Objectives Describe the use of correlation in making business decisions Apply linear regression and correlation analysis. Interpret.
Chapter 13 Multiple Regression
Copyright © 2009 Cengage Learning 18.1 Chapter 20 Model Building.
Direct and Inverse Variation. Direct Variation Two functions are said to vary directly if as the magnitude of one increases, the magnitude of the other.
Chapter 1 Introduction n Introduction: Problem Solving and Decision Making n Quantitative Analysis and Decision Making n Quantitative Analysis n Model.
Chapter 1 Linear Functions and Mathematical Modeling Section 1.3.
Chapter 8 Continuous Probability Distributions Sir Naseer Shahzada.
Scatter Diagrams scatter plot scatter diagram A scatter plot is a graph that may be used to represent the relationship between two variables. Also referred.
© Buddy Freeman, 2015 Let X and Y be two normally distributed random variables satisfying the equality of variance assumption both ways. For clarity let.
Chapter 12: Correlation and Linear Regression 1.
Basic Statistics Linear Regression. X Y Simple Linear Regression.
Algebra Problems… Solutions Algebra Problems… Solutions © 2007 Herbert I. Gross Set 17 part 2 By Herbert I. Gross and Richard A. Medeiros next.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.
Multiplication of Common Fractions © Math As A Second Language All Rights Reserved next #6 Taking the Fear out of Math 1 3 ×1 3 Applying.
INTRODUCTORY MATHEMATICAL ANALYSIS For Business, Economics, and the Life and Social Sciences  2011 Pearson Education, Inc. Chapter 1 Applications and.
MTH108 Business Math I Lecture 8. Chapter 4 Mathematical Functions.
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Chapter 15 Random Variables. Introduction Insurance companies make bets. They bet that you are going to live a long life. You bet that you are going to.
Multivariate Statistics Introduction W. M. van der Veld University of Amsterdam.
Regression Chapter 5 January 24 – Part II.
Objective: Solve systems of linear equations. Interpret the solution to a system of linear equations. Problem 1 A friend of yours interviewed for two different.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
The simple linear regression model and parameter estimation
Simple Linear Regression
Simple Linear Regression - Introduction
BASIC REGRESSION CONCEPTS
Multiple Linear Regression
Presentation transcript:

Chapter 2 Simple Linear Regression – What is it, Why do we do it?

Remember Statistics is an applied branch of mathematics. When you apply mathematics to describe the world we live in, we call this mathematical modeling. Up to this point in your mathematical studies you have been learning about the language of algebra, which involves learning the about the objects – like functions – that make up this language, its characteristics, syntax, and grammar. The big concept in algebra is the understanding of relationships between variables. The simplest mathematical relationship being linear relationships. After a while we learn about a special type of relationship called a function, symbolized by f(x) = y.

The types of relationships/functions we learn about in algebra are called deterministic. What is deterministic? This means that we have a perfect relationship between the variables in question. If we know a particular value of one variable, then knowing what the relationship is – meaning knowing the equation- gives us complete knowledge of what is the corresponding value of the other variable. Here is a simple example. You work at a job that pays you $10.00 per hour. Let h equal the number of hours you work, d equal the amount of dollars you earn and f(h) = d equal the symbolic representation of the relationship between h and d. So, if on your first week on the job you work 20 hours your gross pay will be $ without question, f(20) = $ The relationship is exact!

If the following week, you work 20 hours again your pay will be $ again. This is different from a probabilistic model. In a probabilistic model there is a relationship between two variables but that relationship is not perfect. For the value of a particular variable we have an expectation of a value for the other variable in the relationship, but we can not expect to get that exact value. Here is a simple example. You are a vendor at an outdoor market. Let h represent the number of hours you work at the outdoor market, and let d equal the net amount of money you earn from selling your products. If you work for 8 hours you will earn some amount of money, but you can not predict from day to day what that amount will be. You have an expectation of what you will earn, otherwise you would not be involved in this endeavor.

But suppose that you worked for 8 hours, for many days at your stall in the outdoor market. Eventually you would have gathered enough data to create a distribution of sales and would begin to see a pattern emerge. Soon you would recognize that this pattern repeats itself, except that you can not predict exactly how much money you will earn in any one week. But depending on the time of the season, or month you can expect a certain return for your work. Lets say that sales always seem to be higher at the end of the month. So there is a relationship between the two variables, except that it is not deterministic- exact - it is probabilistic.

So here is where we begin talking about the topic of chapter 2, simple linear regression. The attempt here is to understand what relationship exists between the two variables under a probabilistic model. Before we start let us cover a couple of preliminary concepts. As was mentioned, we are dealing with a probabilistic model, which means our functions will also be probabilistic. More on what this means in a moment. An important fact is that we will be dealing with linear equations/functions only to make our task easier.

Now, what do we mean by a probabilistic linear function? Let x be our independent variable, which we will call, most of the time, the explanatory variable. In algebra the letter y represent the dependent variable, which we now call the response variable (more on the name change later). Using algebra symbols, f is the name of the linear function, and f(x) = y. But since we are dealing with a probabilistic model, that is x and y are not perfectly related it seems that we have the following situation: f(3) = ?. What exactly is the output of a probabilistic equation that defines the relationship between two variables?

Alright, here it is. Let the variable h, the explanatory variable be the number of hours you work at the market selling your product. Let us say you work 8 hours every Saturday. The variable d represents the gross amount of dollars earned for working 8 hours on Saturday. The function f represents the mathematical relationship between the two; so f(8) equals the expected (mean) gross amount of dollars earned. In other words f(8) = the mean for the given situation. Thus, f(8) represents the mean of some distribution.

We will make some assumptions about our distribution; f(8) is the mean dollar value of the sales. Secondly, we will assume a normal distribution.

We realize that there is also then a value associated for any value of h, number of hours worked. Thus f(4) = has some value, and f(5) = has some value, and f(3.5) = has some value and so on. The output is always the mean number of sales in dollars associated with that number of hours worked, and we will assume that the distributions are all normal. Also all the normal distributions have the same standard deviation.

Ok, this is understandable so far. But what exactly is simple linear regression? What is chapter 2 about?

We want to find the equation f(x) = y or in this example f(h) = d. What is this equation that explains the relationship between the variable x and the variable y, or in our example h and d, which represents the mean of a normal distribution. We will only be considering situation when f(x) is a linear equation, linear relationship. So one task is to determine if our relationship is linear, how good is the linear relationship, and if it will it provide any good information We will assume that for each value of x, f(x) is the mean of a normal distribution.