Download presentation
Presentation is loading. Please wait.
Published byDaniella Carter Modified over 9 years ago
1
Math 2: Unit 6 Day 1 How do we use scatter plots, correlation, and linear regression?
2
Scatter Plots A scatter plot is a graph of a set of data values (x, y) that shows the relationship between 2 quantitative variables. Ex: http://www.shodor.org/interactivate/activities/ScatterPlot/
3
Correlation Data has a positive correlation if y increases as x increases and has a negative correlation if y decreases as x increases.
4
Tell if the following show positive, negative, or no correlation: The amount of hours you study and your test scores. The speed you drive, and the amount of time it takes to get to your destination. The color of your eyes and your height. Positive – the more (↑) you study, the better (↑) your test score will be. Negative – the faster you drive (↑), the less time it takes. No correlation!
5
You decide: What type of relationship might you expect? The weight of a sirloin steak and the selling price. The number of problems assigned for homework and the amount of time spent doing homework. Athletic ability and musical ability. The number of days you are absent, and your grade in the class. The number of dogs in 30 California cities and the number of cats in 30 Texas cities. positive No correlation negative THINK: more ↑ absences means a worse ↓ grade.
6
Correlation Coefficient A correlation coefficient, denoted by r, is a number from -1 to 1 that measures how well a line fits a set of data pairs (x, y). If r is near 1, then the points lie close to a line with a positive slope. If r is near -1, then the points lie close to a line with a negative slope. If r is near 0, then the points do not lie close to any line. *See handout
7
Correlation Ex: Decide whether the data have a positive correlation, a negative correlation, or approximately no correlation. Then, tell whether the correlation coefficient is closest to -1, -0.5, 0, 0.5, or 1. 1.2. Positive correlation; 1No correlation; 0
8
Correlation Ex: Decide whether the data have a positive correlation, a negative correlation, or approximately no correlation. Then, tell whether the correlation coefficient is closest to -1, -0.5, 0, 0.5, or 1. 3.4. Negative correlation; -1Positive correlation; 0.5
9
An outlier is a value that is outside the clustered majority of points on a graph. Ex: outlier
10
Association Positive slope indicates a positive association and a negative slope indicates a negative association. negative positive
11
To clarify… Statistically, correlation and association are not synonymous – they do not mean the same thing. Association describes the nature of the relationship between 2 variables, whereas correlation measures the direction and strength of the linear relationship between 2 variables. i.e. Correlation gives a numeric value and association does not. Correlation does not imply causation! *An action or occurrence can cause another (such as smoking causes lung cancer), or it can correlate with another (such as smoking is correlated with alcoholism). If one action causes another, then they are most certainly correlated.
12
Your turn. The table shows the number of absences and grades for 16 students. Absences 3609425839024611 Grade 85649041838878608250989085719387 1.Make a scatter plot for these data. 2.What type of relationship seems to exist between absences and grades? Strong negative correlation.
13
Before moving on, we need to review what the different types of graphs look like. Linear: Quadratic:
14
Cubic: Exponential: Absolute Value:
15
Ex: Which type of function could the data in the scatter plot below best be modeled by: quadratic, linear, logarithmic, or exponential? 3.4. linear quadratic
16
Ex: Draw a scatterplot of the following data to determine which model would best describe the data: linear, exponential, absolute value, or quadratic. 5. Year19981999200020012002 Carbon Dioxide Emissions (million metric tons) 1083.31107.11170.41163.31193.9 linear
17
Ex: Draw a scatterplot of the following data to determine which model would best describe the data: linear, exponential, absolute value, or quadratic. 6. Year, t1234567 Scrap parts, p (in thousands) 9.99.79.59.49.810.210.4 quadratic
18
Line of Best Fit The line of best fit is the line that lies as close as possible to all the data points. Linear regression is a method for finding the equation of the regression line,.
19
Ex 7: The ordered pairs (x, y) give the height y in feet of a young tree x years after 2000. Approximate the best fitting line for the data. (0,5.1), (1,6.4), (2,7.7), (3,9), (4,10.3), (5,11.6), (6,12.9) Use the points (0, 5.1) and (1, 6.4) to find the slope. Estimate the y- intercept from the graph.
20
Ex 8: The table below gives the number of people y who attended each of the first seven football games x of the season. Approximate the best-fitting line for the data. x1234567 y722763772826815857897
21
Ex 9: The table gives the average class score y on each unit test for the first 6 units of Math II. Approximate the best fitting line for the data. x123456 y848386888790 y = 1.3x + 81.9
22
Ex 10: Find the equation of the line of best fit.
23
Ex 11: Approximate the linear regression line for the data.
24
HOMEWORK Unit 6 Day 1 Handout
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.