Trail Mix Investigation

Slides:



Advertisements
Similar presentations
AP Statistics Section 3.2 C Coefficient of Determination
Advertisements

Regression, Residuals, and Coefficient of Determination Section 3.2.
Linear Regression and Correlation
Relationship of two variables
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
Least-Squares Regression Section 3.3. Why Create a Model? There are two reasons to create a mathematical model for a set of bivariate data. To predict.
Bivariate Distributions Overview. I. Exploring Data Describing patterns and departures from patterns (20%-30%) Exploring analysis of data makes use of.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Regression Regression relationship = trend + scatter
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Examining Bivariate Data Unit 3 – Statistics. Some Vocabulary Response aka Dependent Variable –Measures an outcome of a study Explanatory aka Independent.
Creating a Residual Plot and Investigating the Correlation Coefficient.
Scatter Diagram of Bivariate Measurement Data. Bivariate Measurement Data Example of Bivariate Measurement:
Residuals Recall that the vertical distances from the points to the least-squares regression line are as small as possible.  Because those vertical distances.
STATISTICS 12.0 Correlation and Linear Regression “Correlation and Linear Regression -”Causal Forecasting Method.
Section 1.6 Fitting Linear Functions to Data. Consider the set of points {(3,1), (4,3), (6,6), (8,12)} Plot these points on a graph –This is called a.
Simple Linear Regression The Coefficients of Correlation and Determination Two Quantitative Variables x variable – independent variable or explanatory.
Simple Linear Regression In many scientific investigations, one is interested to find how something is related with something else. For example the distance.
Warm-up Get a sheet of computer paper/construction paper from the front of the room, and create your very own paper airplane. Try to create planes with.
Linear Regression Essentials Line Basics y = mx + b vs. Definitions
Sections 3.3 & 3.4 Quiz tomorrow.
Regression Analysis.
Describing Relationships
Correlation & Regression
Sections Review.
Scatter plots Recall: Coordinate plane- A two-dimensional surface on which points are plotted and located by their x and y coordinates (x,y). This point.
distance prediction observed y value predicted value zero
Regression and Correlation
Warm-up: This table shows a person’s reported income and years of education for 10 participants. The correlation is .79. State the meaning of this correlation.
Unit 4 Lesson 4 (5.4) Summarizing Bivariate Data
Chapter 5 Lesson 5.3 Summarizing Bivariate Data
The following data represents the amount of Profit (in thousands of $) made by a trucking company dependent on gas prices. Gas $
Objectives Fit scatter plot data using linear models with and without technology. Use linear models to make predictions.
SCATTER PLOTS & LINES OF BEST FIT
Make Predictions from Scatter Plots
Energy in Food - Design Practical
Scatterplots A way of displaying numeric data
CHAPTER fourteen Correlation and Regression Analysis
Suppose the maximum number of hours of study among students in your sample is 6. If you used the equation to predict the test score of a student who studied.
1) A residual: a) is the amount of variation explained by the LSRL of y on x b) is how much an observed y-value differs from a predicted y-value c) predicts.
Chapter 8 Part 2 Linear Regression
Describing Bivariate Relationships
r = The correlation between price and calories is 0. 79
Simple Linear Regression
Ice Cream Sales vs Temperature
EQ: How well does the line fit the data?
Chapter 3: Describing Relationships
^ y = a + bx Stats Chapter 5 - Least Squares Regression
The Weather Turbulence
GET OUT p.161 HW!.
Examining Relationships
Residuals and Residual Plots
Introduction to Regression
Review of Chapter 3 Examining Relationships
Warm-up: This table shows a person’s reported income and years of education for 10 participants. The correlation is .79. State the meaning of this correlation.
Do now Take out worksheet from yesterday and check your answers with your groups Please grab a chromebook for your groups.
11C Line of Best Fit By Eye, 11D Linear Regression
CHAPTER 3 Describing Relationships
Examining Relationships
Section 3.2: Least Squares Regressions
Algebra Review The equation of a straight line y = mx + b
Chapter 3 Vocabulary Linear Regression.
9/27/ A Least-Squares Regression.
Relations P.O.D. #37 March
Lesson – Teacher Notes Standard:
Chapter 3: Describing Relationships
Review of Chapter 3 Examining Relationships
Presentation transcript:

Trail Mix Investigation

Technology You will need access to Desmos for this lesson. Go to www.desmos.com/calculator

Ask students what they think makes trail mix good. Bad. Expensive Ask students what they think makes trail mix good? Bad? Expensive? Cheap? Healthy? Unhealthy? Focus the discussion around healthy/unhealthy and frame the guiding questions.

Questions in Focus Which nutritional components of trail mix are the best predictor of the calorie content? Does the amount of fat in the trail mix reliably predict the calorie content? Does the amount of carbs? How could we figure this out? Why would someone care about this relationship? Discussion time … then explain that we chose to focus on fat vs. calories

Narrowing Our Investigation We will focus on this question today: Can the amount of fat in a trail mix reliably predict the calorie content?

Displaying Bivariate Data Can we begin to answer our question by looking at the relationship between the grams of fat and calories in a ... dot plot? scatter plot? frequency table? What is most appropriate for this investigation and why?

Displaying Bivariate Quantitative Data Use technology to construct a scatter plot for the fat and calories from the nutrition labels from your trail mix. Consider and discuss the following questions: How will you set up and scale your axes? What do you notice about the data? What is the “center” of this data? How do we measure the variability of this data?

Scatter Plot Here is a table and graph of the fat and calorie content of several different trail mixes. [Link to Handout]

Focus Question: Can the amount of fat in a trail mix reliably predict the calorie content? How does the scatter plot help us to answer this question? What else do we need to do to answer this question? Guiding Questions: Are fat and calories associated? How can we describe the association between fat and calories? Can we estimate the linear fit (with a line of best fit?)? How would we model this without guessing?

Line of Best Fit Estimate the line of best fit for the Trail Mix data.

Testing Your Line of Best Fit Variability in bivariate quantitative data can be quantified by calculating the residuals and creating a residual plot. Residuals: the vertical distances from the actual y-value of each data point to the corresponding y-value generated by a linear model; actual - predicted Residual Plot: a scatter plot created by plotting the x-values on the horizontal axis, and the corresponding residuals on the vertical axis

Variability On your scatter plot or in a spreadsheet, find the residuals for your line of fit. Create a residual plot by hand. Pay attention to patterns if any emerge … this may indicate that a linear model is not appropriate for this data set. Things to note: Patterns in residual plots indicate something Squaring and summing the residuals may lead to a conceptual understanding of the LSRL

Variability Go back to your scatter plot on which you drew the residuals. Sketch and shade in squares generated from the residuals. Compare your squares to some of your classmates. Who has the smallest total area of squares? What do you think that means?

Line of Best Fit → LSRL The Least Squares Regression Line (LSRL) will give us the true line of best fit. It does this by minimizing the sum of the squares created by the residuals to generate the equation for the best line of fit. You can play with this idea here: https://www.desmos.com/calculator/lywhybetzt

Line of Best Fit → LSRL Use desmos.com to find the LSRL of the trail mixes’ fat vs. calories data by typing in y1 ~ mx1+b Link: https://www.desmos.com/calculator/jx4arhqoy8

How Good Is Our Model? Correlation Coefficient, r The correlation coefficient, r, is a measure of how much or how little data is scattered around the LSRL. If you have already plotted the residuals and decided that a linear model is a good fit, the correlation coefficient, r, is a measure of the strength of a linear association, and is a value from +1 (positive, perfectly linear association) to -1 (negative, perfectly linear association).

How Good Is Our Model? r-Squared: The correlation coefficient squared (r2) gives us a percentage that helps us understand the variability in the data. r2 % of the variability in the dependent variable can be explained by a linear relationship with the independent variable. Lower r-squared % → higher chance that other variables are affecting the dependent variable (in this case, perhaps carbs, protein, etc. are affecting the calorie content in the trail mixes).

Answering Our Guiding Question Can the amount of fat in the trail mix reliably predict the calorie content? Why or why not?