Download presentation
Presentation is loading. Please wait.
Published byKevin Oliver Modified over 9 years ago
1
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition
2
Preliminary results Questions on the Chapter 4 Moodle Quiz? What just happened in Spreadsheet Assignment 4? How does this connect with SA5?
3
Most of research is… (finding variance) Explaining variance (prediction, correlation) Explaining what is causing the variance (causation)
4
Article
5
What’s wrong?
9
F3LIVARR 1 = Lives by him/herself 2 = Lives in parent/guardian’s home 3 = Not in parents’ home; lives w/ spouse 4 = Not in parents’ home; lives w/ partner 5 = Not in parents’ home; lives w/ children 6 = Not in parents’ home; lives w/ sibling 7 = Not in parents’ home; lives w/ roommate/friend 8 = Other living arrangement
10
What’s wrong?
11
Let’s examine our data Which variables have the lowest means? The highest means? Which variables have the lowest standard deviation? The highest standard deviation? Which pairs of variables have the strongest correlations? (positive or negative) The weakest correlations? Which pairs of variables provide an interesting question to ask? What are the limitations of our data collection?
12
Starter Question We hear about U.S. being a “violent” place to live, but how does it compare to the rest of the developed world in terms of serial killings?serial killings
13
Let’s find and interpret the regression line for your spreadsheets
14
Regression line
15
Influential Observations An observation is influential for a statistical calculation if removing it would markedly change the result of the calculation. Results are questionable if they depend strongly on a few influential observations.
16
Figure 5.5, The Basic Practice of Statistics, © 2015 W. H. Freeman Chapter 5, #6: From a graph in Tania Singer et al., “Empathy for pain involves the affective but not sensory components of pain,” Science, 303 (2004), pp. 1157-1162.
17
Outliers and influential points Empathy score and brain activity From all of the data r 2 = 51.5% r 2 = 33.1% After removing observation 16
18
Multiple Regression Let’s take a shot at predicting your future salary (with some important caveats!) By putting other variables into the model, we increase our overall predictive power (R 2 ) and we can “control” for variables to get a better sense of the unique relationship between two variables.
19
Least-squares regression
20
Evidence of causation A properly conducted experiment may establish causation. Other considerations when we cannot do an experiment: The association is strong and consistent. Control for lurking variables. Higher doses are associated with stronger responses. Alleged cause precedes the effect in time. Alleged cause is plausible (reasonable explanation).
21
Cautions about correlation and regression Correlation and regression lines describe only linear relationships. Correlation and least-squares regression lines are not resistant. Beware ecological correlation, or correlation based on averages rather than individuals. Beware of extrapolation—predicting outside of the range of x. Beware of lurking variables—these have an important effect on the relationship among the variables in a study, but are not included in the study. Correlation does not imply causation!
22
Least Squares Regression Line Why is the trendline through a scatterplot called a “least squares regression line”?
23
Regression line Example: Predict the gain in fat (in kg) based on the change in Non-Exercise Activity (NEA change, in calories). If the NEA change is 400 calories, what is the expected fat gain? A regression line is a straight line that describes how a response variable y changes as an explanatory variable x changes. This regression line describes the overall pattern of the relationship
24
How can we explain differences in accuracy?
25
Basketball Regression
26
The least-squares regression line LEAST-SQUARES REGRESSION LINE The least-squares regression line of y on x is the line that makes the sum of the squares of the vertical distances of the data points from the line as small as possible.least-squares regression line
27
Entry Slip Question What’s it called when we predict a y-value for an x-value that is far outside of our range? extrapolation (Example: Trying to predict salary from age. We studied people between ages 25 and 65, but now attempt to predict the salary of a 100-year old woman using our same regression line.)
28
The least-squares regression line
29
Prediction via regression line Suppose we know someone has an increase of 400 calories of NEA. What would we predict for fat gain? This is the predicted response for someone with an of 400 calories of NEA
30
What calculations should you know? Definitely know these Mean, median Z-scores (and conversions for standard normal) Interpret and use the linear regression line No need to memorize How to calculate standard deviation or variance How to calculate correlation from data How to calculate the linear regression line
31
The $1,300 homework finding Remember that our regression found an average difference in salary of $5,000 between students who rarely completed homework and those who nearly always did. Based on some (questionable) calculations, this could be interpreted as an additional $1,300 per night. What should we be careful of?
32
Even very strong correlations may not correspond to a real causal relationship (changes in x actually causing changes in y). Correlation may be explained by a lurking variable Correlation does not imply causation Social Relationships and Health House, J., Landis, K., and Umberson, D. “Social Relationships and Health,” Science, Vol. 241 (1988), pp 540-545. Does lack of social relationships cause people to become ill? (There was a strong correlation.) Or, are unhealthy people less likely to establish and maintain social relationships? (reversed relationship) Or, is there some other factor that predisposes people both to have lower social activity and become ill? Social Relationships and Health House, J., Landis, K., and Umberson, D. “Social Relationships and Health,” Science, Vol. 241 (1988), pp 540-545. Does lack of social relationships cause people to become ill? (There was a strong correlation.) Or, are unhealthy people less likely to establish and maintain social relationships? (reversed relationship) Or, is there some other factor that predisposes people both to have lower social activity and become ill?
33
Caution: beware of extrapolation Sarah’s height was plotted against her age. Can you predict her height at age 42 months? Can you predict her height at age 30 years (360 months)?
34
Caution: b eware of extrapolation
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.