Chapter 3 Describing Bivariate Data General Objectives: Sometimes the data that are collected consist of observations for two variables on the same experimental.

Slides:



Advertisements
Similar presentations
Copyright ©2005 Brooks/Cole A division of Thomson Learning, Inc. Introduction to Probability and Statistics Twelfth Edition Robert J. Beaver Barbara M.
Advertisements

Chapter 12 Simple Linear Regression
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Chapter 4 The Relation between Two Variables
Business Statistics - QBM117 Scatter diagrams and measures of association.
Describing the Relation Between Two Variables
PSY 307 – Statistics for the Behavioral Sciences
Statistics for the Behavioral Sciences (5th ed.) Gravetter & Wallnau
Correlation & Regression Math 137 Fresno State Burger.
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Lecture 3: Bivariate Data & Linear Regression 1.Introduction 2.Bivariate Data 3.Linear Analysis of Data a)Freehand Linear Fit b)Least Squares Fit c)Interpolation/Extrapolation.
Linear Regression and Correlation
Descriptive Methods in Regression and Correlation
Linear Regression.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved Section 10-3 Regression.
Relationship of two variables
Correlation By Dr.Muthupandi,. Correlation Correlation is a statistical technique which can show whether and how strongly pairs of variables are related.
Correlation and regression 1: Correlation Coefficient
Copyright ©2011 Nelson Education Limited Describing Bivariate Data CHAPTER 3.
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Prior Knowledge Linear and non linear relationships x and y coordinates Linear graphs are straight line graphs Non-linear graphs do not have a straight.
Probabilistic and Statistical Techniques 1 Lecture 24 Eng. Ismail Zakaria El Daour 2010.
Bivariate Data When two variables are measured on a single experimental unit, the resulting data are called bivariate data. You can describe each variable.
Chapter 1 Review MDM 4U Mr. Lieff. 1.1 Displaying Data Visually Types of data Quantitative Discrete – only whole numbers are possible Continuous – decimals/fractions.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 1 Section 13-6 Regression and Correlation.
© The McGraw-Hill Companies, Inc., Chapter 11 Correlation and Regression.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
Correlation is a statistical technique that describes the degree of relationship between two variables when you have bivariate data. A bivariate distribution.
1.6 Linear Regression & the Correlation Coefficient.
Section 5.2: Linear Regression: Fitting a Line to Bivariate Data.
Chapter 20 Linear Regression. What if… We believe that an important relation between two measures exists? For example, we ask 5 people about their salary.
Chapter 10 Correlation and Regression
BIOL 582 Lecture Set 11 Bivariate Data Correlation Regression.
 Graph of a set of data points  Used to evaluate the correlation between two variables.
Summarizing Bivariate Data
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Regression Regression relationship = trend + scatter
1.1 example these are prices for Internet service packages find the mean, median and mode determine what type of data this is create a suitable frequency.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
April 1 st, Bellringer-April 1 st, 2015 Video Link Worksheet Link
Chapter 9: Correlation and Regression Analysis. Correlation Correlation is a numerical way to measure the strength and direction of a linear association.
Chapter 2 Examining Relationships.  Response variable measures outcome of a study (dependent variable)  Explanatory variable explains or influences.
1 Data Analysis Linear Regression Data Analysis Linear Regression Ernesto A. Diaz Department of Mathematics Redwood High School.
Chapter 9 Scatter Plots and Data Analysis LESSON 1 SCATTER PLOTS AND ASSOCIATION.
REGRESSION MODELS OF BEST FIT Assess the fit of a function model for bivariate (2 variables) data by plotting and analyzing residuals.
Scatter Plots. Standard: 8.SP.1 I can construct and interpret scatterplots.
Chapter 1-2 Review MDM 4U Mr. Lieff. Ch1 Learning Goals Classify data as Quantitative (and continous or discrete) or Qualitatitive Identify the population,
Unit 3 – Association: Contingency, Correlation, and Regression Lesson 3-2 Quantitative Associations.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
CCSS.Math.Content.8.SP.A.1 Construct and interpret scatter plots for bivariate measurement data to investigate patterns of association between two quantities.
Part II Exploring Relationships Between Variables.
Department of Mathematics
Scatterplots Chapter 6.1 Notes.
CHAPTER 7 LINEAR RELATIONSHIPS
Two Quantitative Variables
CHS 221 Biostatistics Dr. wajed Hatamleh
Chapter 5 STATISTICS (PART 4).
SIMPLE LINEAR REGRESSION MODEL
CHAPTER 10 Correlation and Regression (Objectives)
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
2. Find the equation of line of regression
Introduction to Probability and Statistics Thirteenth Edition
STEM Fair Graphs.
Correlation and Regression
Correlation.
7.1 Draw Scatter Plots & Best-Fitting Lines
Presentation transcript:

Chapter 3 Describing Bivariate Data General Objectives: Sometimes the data that are collected consist of observations for two variables on the same experimental unit. Special techniques that can be used in describing these variables will help you identify possible relationships between them. © 1998 Brooks/Cole Publishing/ITP

Specific Topics 1. Bivariate data 2. Side-by-side pie charts, comparative line charts 3. Side-by-side bar charts, stacked bar charts 4. Scatterplots for two quantitative variables 5. Covariance and the correlation coefficient 6. The best-fitting line © 1998 Brooks/Cole Publishing/ITP

3.1 and 3.2 Bivariate Data and Graphs for Qualitative Variables n When two variables are measured on a single experimental unit, the resulting data are called bivariate data. n You may want to explore the relationship between the two variables. n If you have one qualitative and one quantitative variable, you can use two side-by-side charts or a bar chart in which the bars for the two populations are placed side by side. n Another option is to use a stacked bar chart, where the bars for each category are stacked on top of each other. n See Figures 3.1, 3.2, and 3.3 for examples of techniques. © 1998 Brooks/Cole Publishing/ITP

Figure 3.1 Comparative bar charts

© 1998 Brooks/Cole Publishing/ITP Figure 3.2 Comparative pie charts

© 1998 Brooks/Cole Publishing/ITP Figure 3.3 Stacked bar chart

3.3 Scatterplots for Two Quantitative Variables n Describing patterns shown in a scatterplot: - What type of pattern do you see? Upward or downward? Curved? None or random? - How strong is the pattern? All points follow it? Only weakly visible? - Are there any unusual observations? Outliers? Clusters? Explanation for groupings? n See Figures 3.4 and 3.5 for examples of the use of a scatterplot. © 1998 Brooks/Cole Publishing/ITP

Figure 3.4

© 1998 Brooks/Cole Publishing/ITP Figure 3.5

3.4 Numerical Measures for Quantitative Bivariate Data n Sometimes two variables exhibit a linear relationship, e.g., the relationship between height and weight in humans. © 1998 Brooks/Cole Publishing/ITP

n A simple measure of the relationship between two variables x and y is the correlation coefficient, r : where s x and s y are the standard deviations for the variables x and y. n The new quantity s xy is called the covariance between x and y and is defined as: n A computing formula for the covariance: where  x i y i is the sum of the products x i y i for each of the n pairs of measurements. © 1998 Brooks/Cole Publishing/ITP

n If the points in the x vs y plot tend to run from lower left to upper right, then s xy and r will be positive. n If the points tend to run from upper left to lower right, then s xy and r will be negative. n If the points are scattered high and low and left and right, then s xy and r will be close to zero. n The value of y typically depends on the value of x ; y is called the dependent variable and x is called the independent variable. Sometimes it is possible to describe the relationship relating x to y using a straight line given by the equation y  ax  b. © 1998 Brooks/Cole Publishing/ITP

n The best-fitting line relating y to x, called the regression or least-squares line, is found by minimizing the sum of the squared differences between the data points and the line itself. n The formulas for computing b and a are n When r is positive, b is positive; when r is negative, b is negative; when r is zero, b is zero. © 1998 Brooks/Cole Publishing/ITP

Example An electrical engineer is interested in calibrating a volt meter to be used on a production line. For the test, 10 different flow rates are fixed and the corresponding meter readings observed. Measured Voltage Actual Voltage © 1998 Brooks/Cole Publishing/ITP

Compute the correlation coefficient between the observed and actual voltages. Solution © 1998 Brooks/Cole Publishing/ITP