Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03

Slides:



Advertisements
Similar presentations
CHAPTER 4: Scatterplots and Correlation. Chapter 4 Concepts 2  Explanatory and Response Variables  Displaying Relationships: Scatterplots  Interpreting.
Advertisements

CHAPTER 4: Scatterplots and Correlation
Examining Relationships Prob. And Stat. CH.2.1 Scatterplots.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 3 Describing Relationships 3.1 Scatterplots.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
The Practice of Statistics
Chapter 4 - Scatterplots and Correlation Dealing with several variables within a group vs. the same variable for different groups. Response Variable:
Chapter 4 Scatterplots and Correlation. Chapter outline Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots.
Chapter 3 Examining Relationships. Introduction We have looked at only one-variable statistics: Quantitative & Categorical data We have looked at only.
Notes Chapter 7 Bivariate Data. Relationships between two (or more) variables. The response variable measures an outcome of a study. The explanatory variable.
Scatterplots Chapter 6.1 Notes.
CHAPTER 3 Describing Relationships
Statistics 101 Chapter 3 Section 3.
Basic Practice of Statistics - 3rd Edition
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 6: Exploring Data: Relationships Lesson Plan
Chapter 3: Describing Relationships
Daniela Stan Raicu School of CTI, DePaul University
Basic Practice of Statistics - 3rd Edition
Data Analysis and Statistical Software I ( ) Quarter: Autumn 02/03
Chapter 6: Exploring Data: Relationships Lesson Plan
Basic Practice of Statistics - 5th Edition
Chapter 3: Describing Relationships
Daniela Stan Raicu School of CTI, DePaul University
CHAPTER 4: Scatterplots and Correlation
Chapter 2 Looking at Data— Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Examining Relationships
Warmup In a study to determine whether surgery or chemotherapy results in higher survival rates for a certain type of cancer, whether or not the patient.
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 4 - Scatterplots and Correlation
Basic Practice of Statistics - 3rd Edition
Chapter 3 Scatterplots and Correlation.
3.1: Scatterplots & Correlation
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
September 25, 2013 Chapter 3: Describing Relationships Section 3.1
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
AP Stats Agenda Text book swap 2nd edition to 3rd Frappy – YAY
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3: Describing Relationships
Chapter 3 Examining Relationships
Bivariate Data Response Variable: measures the outcome of a study (aka Dependent Variable) Explanatory Variable: helps explain or influences the change.
Chapter 3: Describing Relationships
Basic Practice of Statistics - 3rd Edition
CHAPTER 3 Describing Relationships
Chapter 3: Describing Relationships
3.2 Correlation Pg
Presentation transcript:

Data Analysis and Statistical Software I (323-21-403) Quarter: Autumn 02/03 Daniela Stan, PhD Course homepage: http://facweb.cs.depaul.edu/Dstan/csc323 Office hours: (No appointment needed) M, 3:00pm - 3:45pm at LOOP, CST 471 W, 3:00pm - 3:45pm at LOOP, CST 471 9/18/2018 Daniela Stan - CSC323

Outline Chapter 2: Looking at Data – Relationships between two or more variables Scatterplots Correlation 9/18/2018 Daniela Stan - CSC323

Association Between Variables Two variables measured on the same individuals are associated if some values of one variable tend to occur more often with some values of the second variable than with other values of that variable. A response variable measures an outcome of a study. An explanatory variable explains or causes changes in the response variable. Explanatory variable ~ independent variable Response variable ~ dependent variable 9/18/2018 Daniela Stan - CSC323

Association Between Variables (cont.) Example 1: A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study is: A. Yield of crop B. Amount of yearly rainfall C. The experimenter D. Either bushels or inches of water Example 2: A researcher is interested in determining if one could predict the score on a statistics exam from the amount of time spent studying for the exam. In this study, the explanatory variable is: A. The researcher B. The amount of time spent studying for the exam C. The score on the exam D. The fact that this is a statistics exam 9/18/2018 Daniela Stan - CSC323

Scatterplots A scatterplot displays the relationship between two quantitative variables. The values of one variable appear on the horizontal axis and the values of the other variable appear on the vertical axis. Always plot the explanatory variable, if there is one, on the horizontal axis (the x axis) of a scatterplot. Each individual in the data appears as the point in the plot fixed by the values of both variables for that individual. 9/18/2018 Daniela Stan - CSC323

Scatterplots (cont.) Example: State mean SAT mathematics score plotted against the percent of high school seniors in each state who took the SAT exam. Cluster 2: - Higher scores Linear negative relationship Cluster 1: - Low scores 9/18/2018 Daniela Stan - CSC323

Interpreting Scatterplots The overall pattern of a scatterplot can be described by the: - form - direction - strength of the relationship. Form: - linear or non-linear (curved relationships or clusters) Direction: - positive or negative association Strength: how closely the points follow a clear form: - strong or weak 9/18/2018 Daniela Stan - CSC323

Adding categorical variables To add a categorical variable to a scatterplot, use a different plot color or symbol for each category. Example: The states are grouped in four regions: Region Midwest Northeast South West Categorical Variable 9/18/2018 Daniela Stan - CSC323

Examples on Scatterplots Problem 2.10 9/18/2018 Daniela Stan - CSC323

Correlation X Y x1 y1 x2 y2 … xn yn The correlation r measures the direction and strength of the linear relationship between two quantitative variables. Suppose we have the following data: X Y x1 y1 x2 y2 … xn yn Where sx, sy are the standard deviations for the two variables X and Y 9/18/2018 Daniela Stan - CSC323

Properties of the Correlation (cont.) Correlation ignores distinction between explanatory and response variables Correlation requires that both variables be quantitative Correlation is not affected by changes in the unit of measurement of either variable Positive correlation indicates positive association between the variables and negative correlation indicates negative association 9/18/2018 Daniela Stan - CSC323

Properties of the Correlation Correlation always satisfies the condition: -1 ≤ r ≤1 and indicates a strong association by how close it is to -1 or 1: Values of correlation r close to 1 indicates that the points lie close to a straight line with a positive slope. Values of correlation r close to -1 indicates that the points lie close to a straight line with a negative slope. Values of correlation r close to 0 indicates that there is no relationships between the points; correlation measures the strength of only linear relationships. 9/18/2018 Daniela Stan - CSC323

What correlation numbers correspond to the points from the 6 scatterplots from the right? r = 0 r = 0.5 r = 0.9 r = -0.99 r = -0.7 r = -0.3 9/18/2018 Daniela Stan - CSC323

More on Correlation Problem 2.22/ page 132 Correlation is not resistant measure, so outliers can greatly change the value of r. 9/18/2018 Daniela Stan - CSC323

Homework and Programming Assignment #2 Reading Assignments Chapter 1 Chapter 2 (Sections 2.1 and 2.2) Homework and Programming Assignment #2 To be posted today 9/18/2018 Daniela Stan - CSC323