Data Handling & Analysis BD7054 Scatter Plots Andrew Jackson

Slides:



Advertisements
Similar presentations
Correlation and regression
Advertisements

Chapter 12 Simple Linear Regression
Forecasting Using the Simple Linear Regression Model and Correlation
Chapter 27 Inferences for Regression This is just for one sample We want to talk about the relation between waist size and %body fat for the complete population.
Copyright © 2010 Pearson Education, Inc. Slide
Inference for Regression
Chapter 10 Regression. Defining Regression Simple linear regression features one independent variable and one dependent variable, as in correlation the.
Objectives (BPS chapter 24)
Correlation and Linear Regression
Chapter 15 (Ch. 13 in 2nd Can.) Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression.
Inference for Regression 1Section 13.3, Page 284.
5-7: Scatter Plots & Lines of Best Fit. What is a scatter plot?  A graph in which two sets of data are plotted as ordered pairs  When looking at the.
Data Handling & Analysis ZO4030 Linear Regression Andrew Jackson
Lecture 4: Correlation and Regression Laura McAvinue School of Psychology Trinity College Dublin.
Quantitative Business Analysis for Decision Making Simple Linear Regression.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Correlation and Regression Analysis
1 1 Slide Simple Linear Regression Chapter 14 BA 303 – Spring 2011.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS & Updated by SPIROS VELIANITIS.
Lecture 5 Correlation and Regression
Correlation & Regression
Linear Regression.
Inference for regression - Simple linear regression
Simple Linear Regression Models
CORRELATION & REGRESSION
12b. Regression Analysis, Part 2 CSCI N207 Data Analysis Using Spreadsheet Lingma Acheson Department of Computer and Information Science,
Researchers, such as anthropologists, are often interested in how two measurements are related. The statistical study of the relationship between variables.
Trend lines and Lines of best fit.  Statisticians gather data to determine correlations (relationships) between events.  Scatter plots will often show.
Inferences for Regression
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Correlation and Regression PS397 Testing and Measurement January 16, 2007 Thanh-Thanh Tieu.
Examining Relationships in Quantitative Research
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Regression Regression relationship = trend + scatter
Multiple Regression BPS chapter 28 © 2006 W.H. Freeman and Company.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
1 Regression Analysis The contents in this chapter are from Chapters of the textbook. The cntry15.sav data will be used. The data collected 15 countries’
Section 2.6 – Draw Scatter Plots and Best Fitting Lines A scatterplot is a graph of a set of data pairs (x, y). If y tends to increase as x increases,
Simple linear regression Tron Anders Moger
Essential Statistics Chapter 51 Least Squares Regression Line u Regression line equation: y = a + bx ^ –x is the value of the explanatory variable –“y-hat”
2.5 Using Linear Models A scatter plot is a graph that relates two sets of data by plotting the data as ordered pairs. You can use a scatter plot to determine.
1 Data Analysis Linear Regression Data Analysis Linear Regression Ernesto A. Diaz Department of Mathematics Redwood High School.
ANOVA, Regression and Multiple Regression March
Inference for Regression
AP Statistics Section 15 A. The Regression Model When a scatterplot shows a linear relationship between a quantitative explanatory variable x and a quantitative.
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
BUSINESS MATHEMATICS & STATISTICS. Module 6 Correlation ( Lecture 28-29) Line Fitting ( Lectures 30-31) Time Series and Exponential Smoothing ( Lectures.
Department of Mathematics
Inference for Regression (Chapter 14) A.P. Stats Review Topic #3
Regression.
AP Statistics Chapter 14 Section 1.
Inference for Regression
Scatter Plots Line of Best Fit
(Residuals and
CHAPTER 29: Multiple Regression*
No notecard for this quiz!!
Regression.
Regression.
Scatter Plots Line of Best Fit
Regression.
Relationship between two continuous variables: correlations and linear regression both continuous. Correlation – larger values of one variable correspond.
Regression Chapter 8.
Regression.
Chapter 14 Inference for Regression
Ch 4.1 & 4.2 Two dimensions concept
Inferences for Regression
Algebra Review The equation of a straight line y = mx + b
Inference for Regression
Correlation and Simple Linear Regression
Presentation transcript:

Data Handling & Analysis BD7054 Scatter Plots Andrew Jackson

Scatter plot data How are two measures related? Are they correlated? Does one cause an effect in the other? What is the relationship?

Develop a hypothesis What is the hypothesis about these data? What is the null hypothesis?

Covariance and Correlation Both the x and y data vary in some way Question is do they co-vary? – Are large x values associated with large y values? (positive covariance) – Or large x with small y (negative covariance) Calculate a statistic called the “correlation coefficient” (r) which takes values -1 >= r <= +1 Test r against a statistical distribution

Lets ask a different question Instead of... Is there a relationship between x and y? I want to know... What is the relationship between x and y? – Fitting a mathematical line to the data will tell us what the relationship likely is

The equation of a line Mathematicians use – Y = mX + c Statisticians use – y = b 1 x + b 0 – b 1 is the slope of the line – b 0 is the intercept (the value of y when x=0) To calculate the coefficients use – b 1 = (y 2 -y 1 )/(x 2 -x 1 ) – b 0 = y-b 1 x

To calculate the coefficients b 1 = (y 2 -y 1 )/(x 2 -x 1 ) b 0 = y-b 1 x NB b 0 can often be estimated visually from the graph

Different slopes Y = b 1 X + b 0 b 1 > 0 b 1 = 0 b 1 < 0

Different intercepts Y = b 1 X + b 0 Parallel lines

Sample data

Return to Interaction Strengths The predicted y value The observed y value

Residuals Informative as it tells us which data are larger than predicted, and which are lower Should ideally be normally distributed around the line – Test this with visual plots like histograms or q-q plots Should be evenly spread around the line with no obvious trend

Regression model assumptions Inherently assume a straight line relationship The residuals, or errors are assumed to be normally distributed – Need to test this – And make sure they are evenly spread above and below the line along its length

Computer Session