AP Statistics. Issues Interpreting Correlation and Regression  Limitations for r, r 2, and LSRL :  Can only be used to describe linear relationships.

Slides:



Advertisements
Similar presentations
Section 4.2. Correlation and Regression Describe only linear relationship. Strongly influenced by extremes in data. Always plot data first. Extrapolation.
Advertisements

The Question of Causation YMS3e 4.3:Establishing Causation AP Statistics Mr. Molesky.
Chapter 4 Review: More About Relationship Between Two Variables
Chapter 4: More on Two- Variable Data.  Correlation and Regression Describe only linear relationships Are not resistant  One influential observation.
Aim: How do we establish causation?
Review for the chapter 6 test 6. 1 Scatter plots & Correlation 6
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Between Quantitative Variables Chapter 5.
Chapter 2: Looking at Data - Relationships /true-fact-the-lack-of-pirates-is-causing-global-warming/
Relationships Between Quantitative Variables
More on Two-Variable Data. Chapter Objectives Identify settings in which a transformation might be necessary in order to achieve linearity. Use transformations.
Chapter 5 Regression. Chapter 51 u Objective: To quantify the linear relationship between an explanatory variable (x) and response variable (y). u We.
Chapter 5 Regression. Chapter outline The least-squares regression line Facts about least-squares regression Residuals Influential observations Cautions.
The Question of Causation
HW#9: read Chapter 2.6 pages On page 159 #2.122, page 160#2.124,
1 10. Causality and Correlation ECON 251 Research Methods.
Describing relationships …
C HAPTER 4: M ORE ON T WO V ARIABLE D ATA Sec. 4.2 – Cautions about Correlation and Regression.
2.4: Cautions about Regression and Correlation. Cautions: Regression & Correlation Correlation measures only linear association. Extrapolation often produces.
The Practice of Statistics Third Edition Chapter 4: More about Relationships between Two Variables Copyright © 2008 by W. H. Freeman & Company Daniel S.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Examining Relationships YMS3e Chapter 3 3.3: Correlation and Regression Extras Mr. Molesky Regression Facts.
2 Types of Studies Observational Study Experimental Study.
Warm Up I. Pilots or Senators: Which team has played better? vs. East vs. West vs. East vs. West WINSLOSSES PCT. WINS LOSSES PCT. WINSLOSSES PCT. WINS.
Chapter 15 Describing Relationships: Regression, Prediction, and Causation Chapter 151.
4.3: Establishing Causation Both correlation and regression are very useful in describing the relationship between two variables; however, they are first.
Notes Bivariate Data Chapters Bivariate Data Explores relationships between two quantitative variables.
Topic 10 - Linear Regression Least squares principle - pages 301 – – 309 Hypothesis tests/confidence intervals/prediction intervals for regression.
Chapter 4 More on Two-Variable Data “Each of us is a statistical impossibility around which hover a million other lives that were never destined to be.
Chapter 4 More on Two-Variable Data YMS 4.1 Transforming Relationships.
AP STATISTICS LESSON 4 – 2 ( DAY 1 ) Cautions About Correlation and Regression.
Chapter 4 – Correlation and Regression before: examined relationship among 1 variable (test grades, metabolism, trip time to work, etc.) now: will examine.
Lesson Correlation and Regression Wisdom. Knowledge Objectives Recall the three limitations on the use of correlation and regression. Explain what.
Business Statistics for Managerial Decision Making
Chapter 4 Day Six Establishing Causation. Beware the post-hoc fallacy “Post hoc, ergo propter hoc.” To avoid falling for the post-hoc fallacy, assuming.
1. Plot the data. What kind of growth does it exhibit? (plot by hand but you may use calculators to confirm answers.) 2. Use logs to transform the data.
UNIT 4 Bivariate Data Scatter Plots and Regression.
Cautions About Correlation and Regression Section 4.2.
The coefficient of determination, r 2, is The fraction of the variation in the value of y that is explained by the regression line and the explanatory.
THE SAT ESSAY: IS LONGER BETTER?  In March of 2005, Dr. Perelmen from MIT reported, “It appeared to me that regardless of what a student wrote, the longer.
Scatterplots and Correlation Section 3.1 Part 2 of 2 Reference Text: The Practice of Statistics, Fourth Edition. Starnes, Yates, Moore.
Chapters 8 Linear Regression. Correlation and Regression Correlation = linear relationship between two variables. Summarize relationship with line. Called.
Section Causation AP Statistics ww.toddfadoir.com/apstats.
Two-Variable Data Analysis
The Question of Causation 4.2:Establishing Causation AP Statistics.
Chapter 3 Unusual points and cautions in regression.
Statistics 200 Lecture #6 Thursday, September 8, 2016
2.7 The Question of Causation
Cautions About Correlation and Regression Section 4.2
Cautions About Correlation and Regression
Chapter 4 Correlation.
Correlation A Lecture for the Intro Stat Course
Section 4.3 Types of Association
Chapter 2 Looking at Data— Relationships
Examining Relationships
EQ: How well does the line fit the data?
Chapter 8 Linear Regression
residual = observed y – predicted y residual = y - ŷ
Cautions about Correlation and Regression
The SAT essay: Is Longer Better?
The Question of Causation
Examining Relationships
Scatterplots and Correlation
Least-Squares Regression
Chapter 5 Correlation.
EQ: What gets in the way of a good model?
Examining Relationships Chapter 7
4.2 Cautions about Correlation and Regression
3.3 Cautions Correlation and Regression Wisdom Correlation and regression describe ONLY LINEAR relationships Extrapolations (using data to.
Section 6.2 Establishing Causation
Chapter 4: More on Two-Variable Data
Presentation transcript:

AP Statistics

Issues Interpreting Correlation and Regression  Limitations for r, r 2, and LSRL :  Can only be used to describe linear relationships.  r, r 2, and LSRL can be influenced by extremes.  Extrapolation: Predicting outside the domain of x. Such predictions can’t be trusted.  Lurking Variables: Sometimes a relationship between x and y can be influenced by other variables without our knowledge.  Using Averaged Data (combining info from many individuals): These correlations are usually too high as the usual variation from individuals is lost.

Why Associations Happen  Causation: Changes in x cause changes in y.  Example: A drop in temperature causes increased power consumption.

Why Associations Happen  Common Response: Both x and y respond to changes in some observed variable or variables.

There is an almost perfect linear relationship between x and y. (r= ) x = # Methodist Ministers in New England y = # of Barrels of Rum Imported to Boston CORRELATION DOES NOT IMPLY CAUSATION! x = # Methodist Ministers in New England y = # of Barrels of Rum Imported to Boston CORRELATION DOES NOT IMPLY CAUSATION! Example of Common Response

 Methodist Ministers vs Barrels of Rum Imported to Boston.  r =  What is the variable driving the growth in the number of ministers and the number of barrels of rum being imported?

Why Associations Happen  Confounding: The effect on y of the explanatory variable x is hopelessly mixed up with the effects on y by other variables.

Example of Confounding  We think that the amount of calories consumed each day is related a person’s body mass index.  Body mass index may be based on genetics, too.

Association doesn’t imply causation!  An association between x and y, even if strong, is not good evidence that changes in x cause changes in y.  The best way to get evidence that x causes y is to do an experiment.  Experiments control lurking variables, so that there isn’t any other explanation for y changing as x changes.

Confounding vs. Lurking Variables  There are any number of variables in the environment during a study or an experiment. These are lurking variables.  Once any one of these variables add unwanted variation to our results, they become confounding variables.

Examples 1) There is a high positive correlation: nations with many TV sets have higher life expectancies. Could we lengthen the life of people in Rwanda by shipping them TVs? 2) People who use artificial sweeteners in place of sugar tend to be heavier than people who use sugar. Does artificial sweetener use cause weight gain? 3) Women who work in the production of computer chips have abnormally high numbers of miscarriages. The union claimed chemicals cause the miscarriages. Another explanation may be the fact these workers spend a lot of time on their feet.

4) People with two cars tend to live longer than people who own only one car. Owning three cars is even better, and so on. What might explain the association? 5) Children who watch many hours of TV get lower grades on average than those who watch less TV. Why does this fact not show that watching TV causes low grades?

6) Data show that married men (and men who are divorced or widowed) earn more than men who have never been married. If you want to make more money, should you get married? 7) High school students who take the SAT, enroll in an SAT coaching course, and take the SAT again raise their mathematics score from an average of 521 to 561. Can this increase be attributed entirely to taking the course?

Homework  Textbook: 4.33, 4.37, 4.38, 4.42, 4.44, 4.76