Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03

Slides:



Advertisements
Similar presentations
AP Statistics.  Least Squares regression is a way of finding a line that summarizes the relationship between two variables.
Advertisements

Chapter 6: Exploring Data: Relationships Lesson Plan
CHAPTER 3 Describing Relationships
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
^ y = a + bx Stats Chapter 5 - Least Squares Regression
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
SWBAT: Calculate and interpret the equation of the least-squares regression line Do Now: If data set A of (x, y) data has correlation r = 0.65, and a second.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
CHAPTER 3 Describing Relationships
Chapter 4.2 Notes LSRL.
LEAST – SQUARES REGRESSION
Statistics 101 Chapter 3 Section 3.
CHAPTER 3 Describing Relationships
Linear Regression Special Topics.
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Least-Squares Regression
Chapter 6: Exploring Data: Relationships Lesson Plan
Chapter 5 LSRL.
Chapter 3.2 LSRL.
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
Daniela Stan Raicu School of CTI, DePaul University
Chapter 6: Exploring Data: Relationships Lesson Plan
AP Stats: 3.3 Least-Squares Regression Line
Least Squares Regression Line LSRL Chapter 7-continued
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
^ y = a + bx Stats Chapter 5 - Least Squares Regression
CHAPTER 3 Describing Relationships
Unit 4 Vocabulary.
Least-Squares Regression
Chapter 3 Describing Relationships Section 3.2
Least-Squares Regression
Daniela Stan Raicu School of CTI, DePaul University
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 5 LSRL.
Least-Squares Regression
Basic Practice of Statistics - 3rd Edition Regression
Chapter 3: Describing Relationships
Least-Squares Regression
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Warmup A study was done comparing the number of registered automatic weapons (in thousands) along with the murder rate (in murders per 100,000) for 8.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
3.2 – Least Squares Regression
CHAPTER 3 Describing Relationships
4.2 Cautions about Correlation and Regression
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Warm-up: Pg 197 #79-80 Get ready for homework questions
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Least Squares Regression Chapter 3.2
9/27/ A Least-Squares Regression.
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Presentation transcript:

Data Analysis and Statistical Software I (323-21-403) Quarter: Autumn 02/03 Daniela Stan, PhD Course homepage: http://facweb.cs.depaul.edu/Dstan/csc323 Office hours: (No appointment needed) M, 3:00pm - 3:45pm at LOOP, CST 471 W, 3:00pm - 3:45pm at LOOP, CST 471 11/10/2018 Daniela Stan - CSC323

Outline Chapter 2: Looking at Data – Relationships between two or more variables Least - Squares Regression Cautions about Regression and Correlation 11/10/2018 Daniela Stan - CSC323

Regression A regression line is a straight line that describes how a response variable y changes as an explanatory variable x changes. 11/10/2018 Daniela Stan - CSC323

Regression y = a + b*x b = slope ~ rate of change a = intercept (x=0) Fitting a line to data means drawing a line that comes as close as possible to the points: y = a + b*x b = slope ~ rate of change a = intercept (x=0) Height= a + b*age 11/10/2018 Daniela Stan - CSC323

Prediction Use of Regression: to predict the value of y for any value of x by substituting this x into the equation of the regression line. Extrapolation is the use of regression line for prediction outside the range values of the explanatory variable x that you used to obtain the line. Such predictions are often not accurate. Example: Problem 2.35 11/10/2018 Daniela Stan - CSC323

The regression line makes the prediction errors as small as possible. Least - Squares Regression How do we fit a line to a scatterplot? The regression line makes the prediction errors as small as possible. 11/10/2018 Daniela Stan - CSC323

Least - Squares Regression (cont.) The least - squares regression line of y on x is the line that makes the sum of the squares of the vertical distances of the data points from the line as small as possible. How to calculate the least – squares regression line? = predicted value Where: r = correlation, Sx,Sy = standard deviations = means 11/10/2018 Daniela Stan - CSC323

Correlation and Regression The correlation r is the slope of the least-squares regression line when we measure both x and y in standardized units. The square of the correlation, r2 11/10/2018 Daniela Stan - CSC323

Cautions about regression and correlation Residuals = the difference between the observed and predicted values of y. A residual plot is a scatterplot of the regression residuals against the explanatory variable; it helps us to asses the fit of the regression line. Lurking variable is a variable that is not among the explanatory or response variables in a study and yet may influence the interpretation of the relationships among those variables. 11/10/2018 Daniela Stan - CSC323

Data Mining Exploring really large data bases in the hope of finding useful patterns is called data mining. Domain Understanding Data Selection Cleaning & Preprocessing Knowledge Evaluation & Interpretation Discovering patterns The entire process is iterative and interactive. 11/10/2018 Daniela Stan - CSC323