Bivariate EDA. Quantitative Bivariate EDASlide #2 Bivariate EDA –Graphically –Numerically –Model Describe the relationship between pairs of variables.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

Correlation and Linear Regression
Scatter Diagrams and Linear Correlation
AP Statistics Chapters 3 & 4 Measuring Relationships Between 2 Variables.
CHAPTER 4: Scatterplots and Correlation. Chapter 4 Concepts 2  Explanatory and Response Variables  Displaying Relationships: Scatterplots  Interpreting.
CHAPTER 4: Scatterplots and Correlation
+ Scatterplots and Correlation Displaying Relationships: ScatterplotsThe most useful graph for displaying the relationship between two quantitative variables.
Describing the Relation Between Two Variables
Scatterplots By Wendy Knight. Review of Scatterplots  Scatterplots – Show the relationship between 2 quantitative variables measured on the same individual.
CHAPTER 3.1 AP STAT By Chris Raiola Emily Passalaqua Lauren Kelly.
Describing Relationships: Scatterplots and Correlation
Linear Regression Analysis
Correlation & Regression
Lecture 3: Bivariate Data & Linear Regression 1.Introduction 2.Bivariate Data 3.Linear Analysis of Data a)Freehand Linear Fit b)Least Squares Fit c)Interpolation/Extrapolation.
Linear Regression and Correlation
Descriptive Methods in Regression and Correlation
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Relationship of two variables
Warm-up with Scatterplot Notes
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
A P STATISTICS LESSON 3 – 2 CORRELATION.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Warm-Up A trucking company determines that its fleet of trucks averages a mean of 12.4 miles per gallon with a standard deviation of 1.2 miles per gallon.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Lesson Scatterplots and Correlation. Knowledge Objectives Explain the difference between an explanatory variable and a response variable Explain.
Chapter 10 Correlation and Regression
Example 1: page 161 #5 Example 2: page 160 #1 Explanatory Variable - Response Variable - independent variable dependent variable.
BIOL 582 Lecture Set 11 Bivariate Data Correlation Regression.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
The Practice of Statistics
Chapter 4: Describing the relation between two variables Univariate data: Only one variable is measured per a subject. Example: height. Bivariate data:
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Chapter 7 Scatterplots, Association, and Correlation.
Describing Relationships: Scatterplots and Correlation.
Correlation MEASURING ASSOCIATION Establishing a degree of association between two or more variables gets at the central objective of the scientific enterprise.
4.2 Correlation The Correlation Coefficient r Properties of r 1.
Correlation.
Lesson Scatter Diagrams and Correlation. Objectives Draw and interpret scatter diagrams Understand the properties of the linear correlation coefficient.
1 Association  Variables –Response – an outcome variable whose values exhibit variability. –Explanatory – a variable that we use to try to explain the.
We would expect the ENTER score to depend on the average number of hours of study per week. So we take the average hours of study as the independent.
Scatter Diagram of Bivariate Measurement Data. Bivariate Measurement Data Example of Bivariate Measurement:
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
Relationships Scatterplots and Correlation.  Explanatory and response variables  Displaying relationships: scatterplots  Interpreting scatterplots.
Chapter 16: Correlation. So far… We’ve focused on hypothesis testing Is the relationship we observe between x and y in our sample true generally (i.e.
What Do You See?. A scatterplot is a graphic tool used to display the relationship between two quantitative variables. How to Read a Scatterplot A scatterplot.
UNIT 4 Bivariate Data Scatter Plots and Regression.
Quantitative Bivariate EDASlide #1 Bivariate EDA Describe the relationship between pairs of variables Four characteristics to describe Association (Direction)
Notes Chapter 7 Bivariate Data. Relationships between two (or more) variables. The response variable measures an outcome of a study. The explanatory variable.
Chapter 14 STA 200 Summer I Scatter Plots A scatter plot is a graph that shows the relationship between two quantitative variables measured on the.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
CORRELATION ANALYSIS.
Scatter Plots. Definitions  Scatter Plot: A graph in which the values of two variables are plotted along two axes, the pattern of the resulting points.
Lecture 8 Sections Objectives: Bivariate and Multivariate Data and Distributions − Scatter Plots − Form, Direction, Strength − Correlation − Properties.
Chapter 4 Scatterplots – Descriptions. Scatterplots Graphical display of two quantitative variables We plot the explanatory (independent) variable on.
Chapter 9 Scatter Plots and Data Analysis LESSON 1 SCATTER PLOTS AND ASSOCIATION.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. C ORRELATION Section 4.1.
CCSS.Math.Content.8.SP.A.1 Construct and interpret scatter plots for bivariate measurement data to investigate patterns of association between two quantities.
Part II Exploring Relationships Between Variables.
Topic 13: Quantitative-Quantitative Association Part 1:
Two Quantitative Variables
CHS 221 Biostatistics Dr. wajed Hatamleh
Examining Relationships
Examining Relationships
Correlation & Trend Lines
Statistics 101 CORRELATION Section 3.2.
Presentation transcript:

Bivariate EDA

Quantitative Bivariate EDASlide #2 Bivariate EDA –Graphically –Numerically –Model Describe the relationship between pairs of variables

1.What is the name of this plot? 2.What type of variable is latitude? 3.Which variable is considered the response variable? 4.What is the approximate percentage females at a latitude of 55? at 45? at 35? Quantitative Bivariate EDASlide #3 Figure 1. Plot of the percent female kingfishers observed at different latitudes during the Christmas Bird Count, 1992.

Quantitative Bivariate EDASlide #4 Variables & Axes Response (dependent) variable –variability is being explained or values predicted –y-axis Explanatory (independent, predictor) variable –used to explain variability or to make predictions –x-axis

Quantitative Bivariate EDASlide #5 Bivariate EDA -- Description Association/Direction – what words are used? Positive Negative None What four things are described in a bivariate EDA for quantitative data?

Quantitative Bivariate EDASlide #6 What Type of Association? X Y Negative

Quantitative Bivariate EDASlide #7 What Type of Association? X Y Positive

Quantitative Bivariate EDASlide #8 What Type of Association? X Y None

Quantitative Bivariate EDASlide #9 Items to Describe in a Bivariate EDA Association/Direction Form – what two forms will we consider? Linear Non-linear

Quantitative Bivariate EDASlide #10 Items to Describe in a Bivariate EDA Association/Direction Form Outliers X Y

Quantitative Bivariate EDASlide #11 Items to Describe in a Bivariate EDA Association/Direction Form Outliers Strength -- how closely the points cluster to the form

Strength? Quantitative Bivariate EDASlide # X Y

Which is More Strong? Quantitative Bivariate EDASlide #13

Quantitative Bivariate EDASlide #14 Correlation Coefficient 1. Standardize both X and Y 2. Product paired standardized values 3. Sum products 4. Divide by n-1 1n  s yy y i          s xx x i          r       n 1i      *

Quantitative Bivariate EDASlide #15 A measure of association/direction Correlation Coefficient

Quantitative Bivariate EDASlide #16 r for positive Association? X Y Standardize both X and Y

Quantitative Bivariate EDASlide #17 r for positive Association? X Y Product paired standardized values

Quantitative Bivariate EDASlide #18 r for positive Association? X Y Sum products Positive

Quantitative Bivariate EDASlide #19 r for positive Association? X Y Divide by n Positive

Quantitative Bivariate EDASlide #20 r for Positive Association? X Y Thus, r is Positive

Quantitative Bivariate EDASlide #21 r for Negative Association? X Y Thus, r is Negative

Quantitative Bivariate EDASlide #22 A measure of association and strength Correlation Coefficient +1 0 WeakestStrongest

Quantitative Bivariate EDASlide #23 A measure of association and strength of a linear relationship with no outliers Moral … PLOT YOUR DATA!! Correlation Coefficient r = 0.817

Quantitative Bivariate EDASlide #24 Correlation Review Variables must be quantitative Form must be linear without outliers -- i.e., PLOT -1 < r < 1 No distinction between which variable is on x and which is on y (though, response variable should always be y) r does not depend on units of x and y Correlation is not causation We won’t compute r - must interpret and identify strength

Perform a bivariate EDA from Figure 1. Quantitative Bivariate EDASlide #25 Figure 1. Plot of the percent female kingfishers observed at different latitudes during the Christmas Bird Count, r =

Perform a bivariate EDA from Figure 2. Quantitative Bivariate EDASlide #26 Figure 2. Plot of the maximum temperature versus herbage yield for grassland headfires in west Texas. r=0.798

Perform a bivariate EDA from Figure 3. Quantitative Bivariate EDASlide #27 Figure 3. Plot of the number of pupae per gallery and the density of attacks for the beetle Ips cembrae r=-0.612

Quantitative Bivariate EDASlide #28 Examine handout – plot() – cor() Quantitative Bivariate EDA in R