Class 1: Sept. 9 About instructor: Dylan Small, Assistant Professor, Department of Statistics. How I got interested in statistics?

Slides:



Advertisements
Similar presentations
Chapter 3 Examining Relationships
Advertisements

Chapter 6: Exploring Data: Relationships Lesson Plan
Chapter 4 Scatterplots and Correlation. Rating Cereal: 0 to = unhealthy 100 = very nutritious.
CHAPTER 4: Scatterplots and Correlation. Chapter 4 Concepts 2  Explanatory and Response Variables  Displaying Relationships: Scatterplots  Interpreting.
+ Scatterplots and Correlation Displaying Relationships: ScatterplotsThe most useful graph for displaying the relationship between two quantitative variables.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
AP STATISTICS LESSON 3 – 1 EXAMINING RELATIONSHIPS SCATTER PLOTS.
Examining Relationships Prob. And Stat. CH.2.1 Scatterplots.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.1 Scatterplots and Correlation.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.1 Scatterplots and Correlation.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Warm-Up A trucking company determines that its fleet of trucks averages a mean of 12.4 miles per gallon with a standard deviation of 1.2 miles per gallon.
Chapter 6: Exploring Data: Relationships Lesson Plan Displaying Relationships: Scatterplots Making Predictions: Regression Line Correlation Least-Squares.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
CHAPTER 4: Scatterplots and Correlation ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
Relationships If we are doing a study which involves more than one variable, how can we tell if there is a relationship between two (or more) of the.
Chapter 7 Scatterplots, Association, and Correlation.
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Unit 3: Describing Relationships
Chapter 3 Examining Relationships. Introduction We have looked at only one-variable statistics: Quantitative & Categorical data We have looked at only.
Business Statistics for Managerial Decision Making
4.1 Scatterplots  Explanatory and Response Variables  Scatterplots  Interpreting Scatterplots  Categorical Variables in Scatterplots 1.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 3: Describing Relationships Section 3.1 Scatterplots and Correlation.
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 7 LINEAR RELATIONSHIPS
Variables Dependent variable: measures an outcome of a study
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Variables Dependent variable: measures an outcome of a study
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Unit 4 Vocabulary.
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3 Scatterplots and Correlation.
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
September 25, 2013 Chapter 3: Describing Relationships Section 3.1
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
AP Stats Agenda Text book swap 2nd edition to 3rd Frappy – YAY
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Presentation transcript:

Class 1: Sept. 9 About instructor: Dylan Small, Assistant Professor, Department of Statistics. How I got interested in statistics?

My Current research Statistical methods for comparing treatments/policies when a perfectly controlled randomized experiment cannot be done using the method of “instrumental variables.” Applications to: –Treatment of depression among the elderly in primary care practices –Food policy in developing countries Statistical methods for panel studies, studies that survey same people repeatedly over time. – Prediction of child morbidity/mortality in Pakistan using previous height and weight measurements.

Course Objectives To learn how to use two important statistical tools to analyze data: Regression and Analysis of Variance To get hands on experience analyzing data and computing with data (using JMP) To gain experience in interpreting the results of a statistical analysis and communicating the results to others

Course requirements Responsible for both material covered in the lecture and reading associated with the lecture. Weekly homework, typically handed out on Thursday, due following Thursday at beginning of class. Late homework will be given at most half credit. Project: Analysis of data set of interest to you using regression. Work in groups of 2-3 people. Final report, class presentation. More details in October. Midterm: Tuesday, October 21, 3:00 pm-4:20pm Final: Tuesday, December 21, 8:30am-10:30am

Grading Grades will be based on –20% Homework –30% Project –20% Midterm –30% Final

Web site/Textbooks Web site: stat.wharton.upenn.edu/~dsmall/stat112-f04http://www- stat.wharton.upenn.edu/~dsmall/stat112-f04 Can be reached by going to stat.wharton.upenn.edu, clicking on courses and clicking on Stat stat.wharton.upenn.edu Textbooks: –Moore and McCabe, Introduction to the Practice of Statistics, 4 th edition (Required). We will be covering Chapter 2, part of Chapter 3 and Chapters –JMP version 5 with handbook. Highly recommended. If you do not own it, you need to sign up for a Wharton account and use it in the Wharton labs. –JMP manual for Introduction to the Practice of Statistics. Recommended.

Instructor Accessibility address: My Office hours (office: 464 Huntsman Hall): –Tuesdays and Thursdays after class, 4:30-5:30. –By appointment. I will be happy to meet with you if you send me an to arrange a time. I encourage you to come see me at least once during the semester to chat about your background, interests, concerns about the class and future plans. TA: Lie Wang, office hours TBA Stat Lab: Monday-Thursday, 9-3; Friday, 11-5

Class 1 Reading: Introduction to Chapter 2, Chapter 2.1 Topic: Relationships between variables measured on same unit. Unit could be an individual, a state, a company, a year, etc. Data set: Penn Alcohol data set. Penn Alcohol dataset (pennalcohol.JMP under datasets on website). Survey given to 123 Penn undergraduates. Alcohol use: Number of days per month on which person drinks.

Association Two variables measured on the same unit are associated if some values of one variable tend to occur more often with some values of the second variable than with other values of that variable. Two variables are positively associated when above average values of one tend to accompany above average values of the other and below- average values also tend to occur together. Two variables are negatively associated when above-average values of one accompany below- average values of the other, and vice versa.

Strength of association Strength of the association: Measure of how strong is the positive or negative association. Statistical associations are overall tendencies, not ironclad rules. If there is a strong association between two variables, then knowing one helps a lot in predicting the other. But when there is a weak association, information about one variable does not help much in guessing the other.

Association does not have to be linear or unidirectional Relationship between gas mileage per gallon and speed at which a car is driven:

Response and Explanatory Variable Response variable (Y) measures outcome of study. Explanatory variable (X) explains or causes change in the response variable. Y=gas mileage per gallon, X=speed at which car is driven. Response and explanatory variables in alcohol study?

Scatterplots A scatterplot shows the relationship between two quantitative variables measured on the same units. The values of one variable appear on the horizontal axis, and the values of the other variable appear on the vertical axis. Each unit in the data appears as the point in the plot fixed by the values of both variables for that unit. Always plot the explanatory variable, if there is one, on the horizontal axis (the x axis of the scatterplot).

Scatterplots in JMP Click Analyze, Fit Y by X. Left click the response variable (so that it is highlighted) and then left click the Y, response button (so that it appears in the Y, response box). Similarly left click the explanatory variable and then left click the X, factor button. Click OK.

Examining a scatterplot Look for the overall pattern of the data and for striking deviations from that pattern. The overall pattern of a scatterplot can be described by the form, direction and strength of the relationship. An important kind of deviation is an outlier in terms of the direction of the scatterplot, a point that falls outside the overall pattern of the relationship.

Brain size and body size in 96 mammals (mammalstudy.JMP)

Labeling points in JMP To label a point in a scatterplot in JMP, put cursor in column that you want to use to name the point (species in the mammal study), then click Cols and then click Label. Then put cursor on the row you want to label, then click Rows and then click Label.

Association is not causation An association between what we call the response variable and what we call the explanatory variable does not prove that changes in the explanatory variable cause changes in the response variable. The relationship between two variables can be strongly influenced by other variables that are lurking in the background (lurking variables)

Key Points from Lecture Association: Definition. Scatterplots: –How to examine them. –How to make them in JMP Association is not causation. Next class: 2.2 (correlation), begin 2.3 (least squares regression)