Body Parts – Lab Write-up A hiker was walking in the woods and found a 54 cm fibula bone buried in the ground Forensic investigators were called to the.

Slides:



Advertisements
Similar presentations
MATH 2400 Chapter 5 Notes. Regression Line Uses data to create a linear equation in the form y = ax + b where “a” is the slope of the line (unit rate.
Advertisements

Unit 4: Linear Relations Minds On 1.Determine which variable is dependent and which is independent. 2.Graph the data. 3.Label and title the graph. 4.Is.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Chapter 3 Bivariate Data
Chapter 8 Linear Regression © 2010 Pearson Education 1.
CHAPTER 8: LINEAR REGRESSION
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Between Quantitative Variables Chapter 5.
Relationships Between Quantitative Variables
Describing the Relation Between Two Variables
Optical illusion ? Correlation ( r or R or  ) -- One-number summary of the strength of a relationship -- How to recognize -- How to compute Regressions.
Linear Models. Functions n function - a relationship describing how a dependent variable changes with respect to an independent variable n dependent variable.
Business Statistics - QBM117 Least squares regression.
Two Variable Analysis Joshua, Alvin, Nicholas, Abigail & Kendall.
Least Squares Regression Line (LSRL)
Making Science Graphs and Interpreting Data
1 Chapter 10 Correlation and Regression We deal with two variables, x and y. Main goal: Investigate how x and y are related, or correlated; how much they.
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
Relationship of two variables
Linear Regression and Correlation
 Once data is collected and organized, we need to analyze the strength of the relationship and formalize it with an equation  By understanding the strength.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Bivariate Data Pick up a formula sheet, Notes for Bivariate Data – Day 1, and a calculator.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Scatter Diagrams Joan Ridgway. When do we use scatter diagrams? When we look at data, we may want to investigate the relationship between two variables.
New Seats – Block 1. New Seats – Block 2 Warm-up with Scatterplot Notes 1) 2) 3) 4) 5)
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
CHAPTER 8 LEAST SQUARES LINE. IMPORTANT TERMS – WRITE THESE DOWN WE WILL WORK ON THE DEFINITIONS AS WE GO! Model Linear Model Predicted value Residuals.
1 Chapter 10 Correlation and Regression 10.2 Correlation 10.3 Regression.
1 What you will learn today 1. New vocabulary 2. How to determine if data points are related 3. How to develop a linear regression equation 4. How to graph.
Chapter 10 Correlation and Regression
Do Now 12/3/09 Take out HW from last night. -Text p. 328, #3-6, 8-12 evens, 16 & 17 (4 graphs) Copy HW in planner. - Text p. 338, #4-14 evens, 18 & 20.
Basic Concepts of Correlation. Definition A correlation exists between two variables when the values of one are somehow associated with the values of.
Bivariate Data and Scatter Plots Bivariate Data: The values of two different variables that are obtained from the same population element. While the variables.
Graph Some basic instructions
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Aim: Review for Exam Tomorrow. Independent VS. Dependent Variable Response Variables (DV) measures an outcome of a study Explanatory Variables (IV) explains.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Correlation and Regression: The Need to Knows Correlation is a statistical technique: tells you if scores on variable X are related to scores on variable.
MARE 250 Dr. Jason Turner Linear Regression. Linear regression investigates and models the linear relationship between a response (Y) and predictor(s)
Requirements of a good GRAPH. GRAPH  Title (usually “dependent” vs. “independent”)  Go Big (cover at least ½ the page in both directions) This increases.
 Find the Least Squares Regression Line and interpret its slope, y-intercept, and the coefficients of correlation and determination  Justify the regression.
SECTIONS B Chapter 2. Objectives To draw scatter plots. To find and use prediction equations. Using a graphing calculator to graph lines of regression.
Unit 3 Review. Data: – A variable is an attribute that can be measured – One Variable Data: measures only 1 attribute for each data point. – Two Variable.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-2 Correlation 10-3 Regression.
Wednesday Today you need: Whiteboard, Marker, Eraser Calculator 1 page handout.
Chemistry Lab Report & Graphing Basics. Formatting Your Paper  Typed  Double spaced  12-pt Times New Roman font  Each section should be titled  Name.
ContentDetail  Two variable statistics involves discovering if two variables are related or linked to each other in some way. e.g. - Does IQ determine.
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
Lesson 6-7 Scatter Plots and Lines of Best Fit. Scatter Plots A scatter plot is a graph that relates two different sets of data by plotting the data as.
M11.E Draw, find and/or write an equation for a line of best fit for a scatter plot.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Lecture Slides Elementary Statistics Twelfth Edition
Unit 4 LSRL.
Chapter 5 LSRL.
LSRL Least Squares Regression Line
Chapter 4 Correlation.
Scatterplots A way of displaying numeric data
Scatter Plots - Line of Best Fit
Graphing Review.
Line of best fit.
Line of Best Fit.
Chapter 5 LSRL.
Does age have a strong positive correlation with height? Explain.
Does age have a strong positive correlation with height? Explain.
Lesson 2.2 Linear Regression.
Honors Statistics Review Chapters 7 & 8
Presentation transcript:

Body Parts – Lab Write-up A hiker was walking in the woods and found a 54 cm fibula bone buried in the ground Forensic investigators were called to the scene and also found an old size 12 (12-inch) men’s shoe It is known that there were 12 missing persons in the area ranging from 5’3” to 6’8” Six of these were men and six were women Using this information and the data from your class, what, if anything, can the investigators conclude?

Making Charts What data would you plot? If you would like to plot this data what kind of chart would you make?

Chart of Radius Length vs. Height

Radius Length vs. Height – Best Fit Line

Radius Length vs. Height (Chart) Note the following key characteristics of the chart: 1.The chart has a title 2.The x-axis and y-axis are labeled 3.The labels have units (cm in this case) 4.The chart uses space well (not a lot of empty space (compare with the chart on the next page)

What’s Wrong with this Chart? (3 things)

Radius Length vs. Height (“Best-Fit” Line) Using Excel, I created a “best-fit” line for the data An estimated “best-fit” line is drawn so that approximately half of the data points are above it and half are below it (drawing a real best-fit line requires statistics, but Excel does this for you) This “best-fit” line is called the regression line Any line has a slope and an intercept: y = mx + b o m = slope = ∆y/∆x (“rise/run”) o b = intercept = y (when x = 0)

Radius Length vs. Height (Equation) Equation for this line: y = 2.4x +114 How could we find this equation from a paper graph (do exercise on board) If a person’s radius is 27cm, how tall would we predict him or her to be? Is this reasonable based on our chart? What if we know a person is 184 cm, how long would we expect his radius to be? CAUTION: You cannot use the equation to extrapolate values too far out of the range of points you have.

Radius Length vs. Height (Correlation) Correlation refers to how strongly two variables are related to each other Visually, the closer the data points are to a “best-fit” line the better the correlation The statistical term that measures how strongly two variables are correlated is R 2 (“R-squared”), which is expressed as a number between 0 and 1 The closer R 2 is to 1, the better the correlation

Radius Length vs. Height (R-Squared) Degree of correlation: o R 2 =1.0  Perfect correlation o 0.8<R 2 <1.0  Very strong correlation o 0.6<R 2 <0.8  Strong correlation o 0.4<R 2 <0.6  Moderate correlation o 0.2<R 2 <0.4  Weak correlation o 0.0<R 2 <0.2  Very weak correlation o R 2 =0.0  No correlation The R 2 for radius length vs. height is 0.6 The more data points, the better the R 2 – why?

Perfect Correlation (R 2 = 1)

No Correlation (R 2 = 0)

Perfect Correlation What do you think the equation is for the line on the previous slide? y = mx + b o m = ? o b = ? If a person is 185 cm, how many inches is he? If a person is 67 in, how many centimeters is she?

Some terminalogy Remember: y = mx + b When we say “x predicts y” or “y depends on x” o Independent variable: x o Dependent variable: y In the previous case, we can say “radius length predicts height,” in which case: o Independent variable: radius length o Dependent variable: height Some examples of dependent and independent variables are…

Outliers Outliers are points that are numerically distant from the rest of the data They are usually due to measurement error (either human error or instrument error) If you get data that look like outliers as you are doing the lab, the best thing to do is to take the measurement again o DO NOT erase the outliers from your original data o In your lab write-up explain that you are going to disregard outliers and proceed with the data analysis (see next page):

Chart with Outliers NY Yankees

Chart (Adjusted for Outliers)

MLB Payroll Question If a team spends $75 million on payroll, how many wins on average can that team expect to achieve? Why do you think the R-squared is so low for payroll vs. wins?