Notes Bivariate Data Chapters 7 - 9. Bivariate Data Explores relationships between two quantitative variables.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

AP Statistics Section 3.2 B Residuals
Residuals.
Chapter 6: Exploring Data: Relationships Lesson Plan
Scatter Diagrams and Linear Correlation
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
The Practice of Statistics
Looking at Data-Relationships 2.1 –Scatter plots.
CHAPTER 3 Describing Relationships
Ch 2 and 9.1 Relationships Between 2 Variables
C HAPTER 2 S CATTER PLOTS, C ORRELATION, L INEAR R EGRESSION, I NFERENCES FOR R EGRESSION By: Tasha Carr, Lyndsay Gentile, Darya Rosikhina, Stacey Zarko.
Haroon Alam, Mitchell Sanders, Chuck McAllister- Ashley, and Arjun Patel.
Regression, Residuals, and Coefficient of Determination Section 3.2.
Chapter 5 Regression. Chapter 51 u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We.
Descriptive Methods in Regression and Correlation
Relationship of two variables
Correlation with a Non - Linear Emphasis Day 2.  Correlation measures the strength of the linear association between 2 quantitative variables.  Before.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
Chapter 6: Exploring Data: Relationships Chi-Kwong Li Displaying Relationships: Scatterplots Regression Lines Correlation Least-Squares Regression Interpreting.
1 Chapter 3: Examining Relationships 3.1Scatterplots 3.2Correlation 3.3Least-Squares Regression.
Chapter 6: Exploring Data: Relationships Lesson Plan Displaying Relationships: Scatterplots Making Predictions: Regression Line Correlation Least-Squares.
Chapter 6: Exploring Data: Relationships Lesson Plan Displaying Relationships: Scatterplots Making Predictions: Regression Lines Correlation Least-Squares.
Ch 3 – Examining Relationships YMS – 3.1
Chapter 3 concepts/objectives Define and describe density curves Measure position using percentiles Measure position using z-scores Describe Normal distributions.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
AP Statistics Chapter 8 & 9 Day 3
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Verbal SAT vs Math SAT V: mean=596.3 st.dev=99.5 M: mean=612.2 st.dev=96.1 r = Write the equation of the LSRL Interpret the slope of this line Interpret.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.2 Least-Squares.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 5 Regression. u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We can then predict.
Chapter 3-Examining Relationships Scatterplots and Correlation Least-squares Regression.
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Business Statistics for Managerial Decision Making
Residuals Recall that the vertical distances from the points to the least-squares regression line are as small as possible.  Because those vertical distances.
^ y = a + bx Stats Chapter 5 - Least Squares Regression
LEAST-SQUARES REGRESSION 3.2 Least Squares Regression Line and Residuals.
CHAPTER 3 Describing Relationships
Notes Chapter 7 Bivariate Data. Relationships between two (or more) variables. The response variable measures an outcome of a study. The explanatory variable.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
Response Variable: measures the outcome of a study (aka Dependent Variable) Explanatory Variable: helps explain or influences the change in the response.
Lecture 4 Chapter 3. Bivariate Associations. Objectives (PSLS Chapter 3) Relationships: Scatterplots and correlation  Bivariate data  Scatterplots (2.
Chapter 3: Describing Relationships
CHAPTER 5: Regression ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Part II Exploring Relationships Between Variables.
1. Analyzing patterns in scatterplots 2. Correlation and linearity 3. Least-squares regression line 4. Residual plots, outliers, and influential points.
CHAPTER 3 Describing Relationships
SCATTERPLOTS, ASSOCIATION AND RELATIONSHIPS
Chapter 6: Exploring Data: Relationships Lesson Plan
Cautions about Correlation and Regression
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
Regression and Residual Plots
Chapter 6: Exploring Data: Relationships Lesson Plan
Chapter 7 Part 1 Scatterplots, Association, and Correlation
AP Stats: 3.3 Least-Squares Regression Line
^ y = a + bx Stats Chapter 5 - Least Squares Regression
Unit 4 Vocabulary.
Least Squares Regression
Review of Chapter 3 Examining Relationships
Least-Squares Regression
CHAPTER 3 Describing Relationships
Chapter 3 Vocabulary Linear Regression.
Chapters Important Concepts and Terms
Honors Statistics Review Chapters 7 & 8
Review of Chapter 3 Examining Relationships
Presentation transcript:

Notes Bivariate Data Chapters 7 - 9

Bivariate Data Explores relationships between two quantitative variables.

The explanatory variable attempts to explain the observed outcomes. (In algebra this is your independent variable – “x”)

The response variable measures an outcome of a study. (In algebra this is your dependent variable – “y”)

 When we gather data, we usually have in mind which variables are which.  Beware! – this explanatory/response relationship suggests a cause and effect relationship that may not exist in all data sets. Use common sense!!

Displaying the Variables  We always graph our data right?  You use a scatterplot to graph the relationship between 2 quantitative variables. Each point represents an individual.

 Remember that not all relationships are linear!!! We will talk about non-linear in the next unit.

Interpret a Scatterplot  Here is what we look for:  1) direction (positive, negative) D  2) form ( linear, or not linear) S  3) strength ( correlation, r) S  4) deviations from the pattern (outliers) U SUDS!!

Remember on outlier is an individual observation that falls outside the overall pattern of the graph.  There is no outlier test for bivariate data. It’s a judgment call

 Categorical variables can be added to scatterplots by changing the symbols in the plot. (See P. 199 for examples)  Visual inspection is often not a good judge of how strong a linear relationship is. Changing the plotting scales or the amount of white space around a cloud of points can be deceptive. So….

Facts about Correlation:  1) positive r – positive association (positive slope) negative r – negative association (negative slope)  2) r must fall between –1 and 1 inclusive.  3) r values close to –1 or 1 indicate that the points lie close to a straight line.  4) r values close to 0 indicate a weak linear relationship.  5) r values of –1 or 1 indicate a perfect linear relationship.  6) correlation only measures the strength in linear relationships (not curves).  7) correlation can be strongly affected by extreme values (outliers).

Least-Squares Regression Line  The least-squares regression line (LSRL) is a mathematical model for the data.  This line is also known as the line of best fit or the regression line.

Formal definition…  The least-squares regression line of y on x is the line that makes the sum of the squares of the vertical distances of the data points from the line as small as possible.

The form…

Some new formulas…

Why do we do regression?  The purpose of regression is to determine a model that we can use for making predictions.

Communication is always the goal!!!  When we write the equation for a LSRL we do not use x & y, we use the variable names themselves…  For example:  Predicted score = (hours studied)

Another measure of strength…  The coefficient of determination, r 2, is the fraction of the variation in the value of y that is explained by the linear model.  When we explain r 2 then we say… ___% of the variability in ___(y) can be explained by this linear model.

Deviations for single points  A residual is the vertical difference between an actual point and the LSRL at one specific value of x. That is, Residual = observed y – predicted y or Residual = y –  The mean of the residuals is always zero.

A new plot…  A residual plot plots the residuals on the vertical axis against the explanatory variables on the horizontal axis.  Such a plot magnifies the residuals and makes patterns easier to see.

Why do I need a residual plot?  Remember that all data is not linear in shape!!! The residual plot clearly shows if linear is appropriate.  A residual plot show good linear fit when the points are randomly scattered about y = 0 with no obvious patterns.

To create a residual plot on the calculator:  1)You must have done a linear regression with the data you wish to use.  2) From the Stat-Plot, Plot # menu choose scatterplot and leave the x list with the x values.  3) Change the y-list to “RESID” chosen from the list menu.  4) Zoom – 9

 In scatterplots we can have points that are outliers or influential points or both.  An observation can be an outlier in the x direction, the y direction, or in both directions.  An observation is influential if removing it or adding it) would markedly change the position of the regression line.

 Extrapolation is the use of a regression model for prediction outside the domain of values of the explanatory variable x.  Such predictions cannot be trusted.

Association vs. Causation  A strong association between two variables is NOT enough to draw conclusions about cause & effect.

Association vs Causation  Strong association between two variables x and y can reflect:  A) Causation – Change in x causes change in y  B) Common response – Both x and y are Responding to some other unobserved factor  C) Confounding – the effect on y of the explanatory variable x is hopelessly mixed up with the effects on y of other variables.

 A Lurking Variable is a variable that has an important effect on the relationship among the variables in a study but is not included among the variables being studied.  Lurking variables can suggest a relationship when there isn’t one or can hide a relationship that exists.

Association vs Causation  Cause and Effect can only be determined from a well designed experiment.

 Data with no apparent linear relationship can also be examined in two ways to see if a relationship still exists:  1) Check to see if breaking the data down into subsets or groups makes a difference.  2) If the data is curved in some way and not linear, a relationship still exists. We will explore that in the next chapter.