AP STATS: Warm-Up Do Math SAT scores help to predict Verbal SAT scores. Make a scatter plot. Find the least squares regression and r and r-squared. Also.

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships Lindsey Van Cleave AP Statistics September 24, 2006.
Advertisements

AP Statistics Section 3.2 C Coefficient of Determination
The Role of r2 in Regression Target Goal: I can use r2 to explain the variation of y that is explained by the LSRL. D4: 3.2b Hw: pg 191 – 43, 46, 48,
Chapter 3 Examining Relationships
Regression Analysis Simple Regression. y = mx + b y = a + bx.
Chapter 4 The Relation between Two Variables
Chapter 3 Bivariate Data
The Practice of Statistics
Chapter 3: Examining Relationships
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Ch 2 and 9.1 Relationships Between 2 Variables
C HAPTER 3: E XAMINING R ELATIONSHIPS. S ECTION 3.3: L EAST -S QUARES R EGRESSION Correlation measures the strength and direction of the linear relationship.
Simple Linear Regression 1. 2 I want to start this section with a story. Imagine we take everyone in the class and line them up from shortest to tallest.
Correlation Scatter Plots Correlation Coefficients Significance Test.
Warm-up with 3.3 Notes on Correlation The fat and calorie contents of 5oz of 3 kinds of pizza are represented by the data points (9, 305), (11, 309)
EC339: Lecture 6 Chapter 5: Interpreting OLS Regression.
Ch 3 – Examining Relationships YMS – 3.1
AP STATISTICS LESSON 3 – 3 LEAST – SQUARES REGRESSION.
Chapter 3 Section 3.1 Examining Relationships. Continue to ask the preliminary questions familiar from Chapter 1 and 2 What individuals do the data describe?
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 4 Section 2 – Slide 1 of 20 Chapter 4 Section 2 Least-Squares Regression.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Section 3.2C. The regression line can be found using the calculator Put the data in L1 and L2. Press Stat – Calc - #8 (or 4) - enter To get the correlation.
The correlation coefficient, r, tells us about strength (scatter) and direction of the linear relationship between two quantitative variables. In addition,
STA291 Statistical Methods Lecture LINEar Association o r measures “closeness” of data to the “best” line. What line is that? And best in what terms.
CHAPTER 5 Regression BPS - 5TH ED.CHAPTER 5 1. PREDICTION VIA REGRESSION LINE NUMBER OF NEW BIRDS AND PERCENT RETURNING BPS - 5TH ED.CHAPTER 5 2.
Chapter 5 Regression. u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We can then predict.
Chapter 4 – Correlation and Regression before: examined relationship among 1 variable (test grades, metabolism, trip time to work, etc.) now: will examine.
The correlation coefficient, r, tells us about strength (scatter) and direction of the linear relationship between two quantitative variables. In addition,
POD 09/19/ B #5P a)Describe the relationship between speed and pulse as shown in the scatterplot to the right. b)The correlation coefficient, r,
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.3 Predicting the Outcome.
Residuals Recall that the vertical distances from the points to the least-squares regression line are as small as possible.  Because those vertical distances.
CHAPTER 3 Describing Relationships
Lecturer’s desk Physics- atmospheric Sciences (PAS) - Room 201 s c r e e n Row A Row B Row C Row D Row E Row F Row G Row H Row A
Chapter 8 Linear Regression. Fat Versus Protein: An Example 30 items on the Burger King menu:
The correlation coefficient, r, tells us about strength (scatter) and direction of the linear relationship between two quantitative variables. In addition,
Scatterplots and Correlation Section 3.1 Part 2 of 2 Reference Text: The Practice of Statistics, Fourth Edition. Starnes, Yates, Moore.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
AP STATISTICS LESSON 3 – 3 (DAY 2) The role of r 2 in regression.
Warm-up Get a sheet of computer paper/construction paper from the front of the room, and create your very own paper airplane. Try to create planes with.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Warm-up: This table shows a person’s reported income and years of education for 10 participants. The correlation is .79. State the meaning of this correlation.
(Residuals and
Coefficient of Determination
EQ: How well does the line fit the data?
AP STATISTICS LESSON 3 – 3 (DAY 2)
Least-Squares Regression
Chapter 3: Describing Relationships
Scatterplots and Correlation
Chapter 3: Describing Relationships
Warm-up: This table shows a person’s reported income and years of education for 10 participants. The correlation is .79. State the meaning of this correlation.
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Least-Squares Regression
Chapter 3: Describing Relationships
Warmup A study was done comparing the number of registered automatic weapons (in thousands) along with the murder rate (in murders per 100,000) for 8.
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3 Vocabulary Linear Regression.
Chapter 3: Describing Relationships
AP Stats Agenda Text book swap 2nd edition to 3rd Frappy – YAY
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Presentation transcript:

AP STATS: Warm-Up Do Math SAT scores help to predict Verbal SAT scores. Make a scatter plot. Find the least squares regression and r and r-squared. Also graph the residuals. Use L3 (Y1(L1)) and L4 (L2-L3) to do so. Plot L1 vs. L4. What is a residual? What is the residual for the SAT math score of 680?

Agenda: Today: R-Squared Wednesday: Quiz on 3.2 and intro to 3.3 Thursday: 3.3 Lurking Variables/Review Friday: No Class (half day) Monday: Quiz on 3.3 – and Review for the Chapter 3 Test Tuesday (11/5): Chapter 3 Test Wednesday: DROP Thursday (11/7): Project 2 is Due Friday (11/8): No Class (Parent-Teacher Conferences)

Idea of the Day: Regression towards the mean. Named after Sir Francis Galton. He found that the kids of taller-than-average parents tend to be taller on average, but not as tall their parents. Hmmm… I will post a chapter from a book called “Thinking Fast and Slow” by Daniel Kahneman about this. It’s a really interesting idea. This explains why athletes having a spectacular year tend to do poorly the next year and why sick people tend to get better (regardless of the type of treatment).

Some Notes on the Project: -Be formal and scientific in your writing (for the most part). i.e. don’t say “I have no clue.” Try to avoid being colloquial. -Summarize your descriptive stats (mean, median, middle 50%) in the results (if it is relevant). Actually restate the numbers here that are relevant and meaningful. -Print and edit. It’s the only way to catch blunders. -Be explicit in your method. Who did you ask, what was your question, how did you conduct the study? -No need to comment on which graph looks best (i.e. a histogram versus a boxplot). By choosing the graph, the reader can assume that you chose wisely. -Choose graphs wisely. What are you trying to show? What type of display shows this result best? -Make subheadings for each part of your paper (intro, method, results, conclusion). -Be careful in drawing conclusions. Just because you see a difference doesn’t mean that it’s conclusive evidence (we need more formal ways to analyze data before we can make those claims sometimes).

The role of r2 in regression r2 – also known as the coefficient of determination. -It is true that r2 is the square of r, but there is more to the story. The big idea of r2: How much better is the least squares line at predicting responses (y) than if we just used y-bar (the mean) as our prediction of every point. **r measures the strength of a linear relationship and r2 tells you how much better the linear model is at predicting y-values than simply using y-bar (the mean of y).

Saying it in words. Say that the r2 for a car’s age versus the value of the car is 45%. This means that 45% of the variation in a car’s value is explained by the least squares regression line relating car age to car value. SIMPLY PUT: it is the percentage of the response variable variation that is explained by a linear model.

Formula for r2 SST = SSE = Total sum of Squares Sum of Squared Errors (i.e. the sum of the residuals squared) The fraction of the variation in the values of y that is explained by the least squared regression on the other variable.

Facts about Least Squares Regressions The distinction between explanatory and response variable is essential in regression. You will get a different regression line if you reverse the variables. Recall that the least square regression line always passes through (x-bar, y-bar)

You Try! 3.44) A study of class attendance and grades among first year students at a state university showed that in general, students who attended a higher proportion of their classes earned higher grades. Class attendance explained 16% of the variation in grade index among the students. What is the numerical value of the correlation between between percent of classes attended and grade index?

Classwork/Homework 23 Read the section 3.2 review Complete 3.43, 3.53, 3.55. Optional: 3.58 (a bit tricky).