Stat 350 Lab Session GSI: Yizao Wang Section 016 Mon 2pm30-4pm MH 444-D Section 043 Wed 2pm30-4pm MH 444-B.

Slides:



Advertisements
Similar presentations
Sta220 - Statistics Mr. Smith Room 310 Class #14.
Advertisements

Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.2 Graphical Summaries.
Shape Ogives and Timeplots
Histogram Most common graph of the distribution of one quantitative variable.
Stat 350 Lab Session GSI: Yizao Wang Section 016 Mon 2pm30-4pm MH 444-D Section 043 Wed 2pm30-4pm MH 444-B.
Qwizdom Review Questions Week 2. Example Exam Question, p.18 A new method of measuring phosphorus levels in soil is under consideration. A sample of 11.
Business Statistics - QBM117 Statistical inference for regression.
Experimental Statistics I.  We use data to answer research questions  What evidence does data provide?  How do I make sense of these numbers without.
CHAPTER 1: Picturing Distributions with Graphs
Normal Distribution Recall how we describe a distribution of quantitative (continuous) data: –plot the data (stemplot or histogram) –look for the overall.
Linear Regression Inference
 Multiple choice questions…grab handout!. Data Analysis: Displaying Quantitative Data.
STATS 250 Lab 2 Julie Ghekas September 15, 2014.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 PROBABILITIES FOR CONTINUOUS RANDOM VARIABLES THE NORMAL DISTRIBUTION CHAPTER 8_B.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 2 Modeling Distributions of Data 2.2 Density.
1 Chapter 3 Looking at Data: Distributions Introduction 3.1 Displaying Distributions with Graphs Chapter Three Looking At Data: Distributions.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
Ch. 2 – Modeling Distributions of Data Sec. 2.2 – Assessing Normality.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 2: Modeling Distributions of Data
Ch 2 The Normal Distribution 2.1 Density Curves and the Normal Distribution 2.2 Standard Normal Calculations.
Chapter 18 Sampling Distribution Models *For Means.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
Descriptive Statistics for one variable. Statistics has two major chapters: Descriptive Statistics Inferential statistics.
Bell Ringer You will need a new bell ringer sheet – write your answers in the Monday box. 3. Airport administrators take a sample of airline baggage and.
Density Curves & Normal Distributions Textbook Section 2.2.
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
Assessing Normality Are my data normally distributed?
Chapter 1.1 Displaying Distributions with graphs.
Probability plots.
Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Chapter 1: Exploring Data
Chapter 2: Describing Location in a Distribution
Good Afternoon! Agenda: Knight’s Charge-please wait for direction
Warm Up.
CHAPTER 2 Modeling Distributions of Data
Linear transformations
CHAPTER 1: Picturing Distributions with Graphs
Warm-up We are going to collect some data and determine if it is “normal” Roll each pair of dice 10 times and record the SUM of the two digits in your.
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Warmup Normal Distributions.
CHAPTER 2 Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
Do Now In BIG CLEAR numbers, please write your height in inches on the index card.
CHAPTER 1: Picturing Distributions with Graphs
CHAPTER 2 Modeling Distributions of Data
Chapter 1: Exploring Data
CHAPTER 2 Modeling Distributions of Data
Chapter 1: Exploring Data
CHAPTER 2 Modeling Distributions of Data
Chapter 1: Exploring Data
CHAPTER 2 Modeling Distributions of Data
Advanced Algebra Unit 1 Vocabulary
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 2 Modeling Distributions of Data
Statistics is... a collection of techniques for planning experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, interpreting,
Chapter 1: Exploring Data
CHAPTER 2 Modeling Distributions of Data
Chapter 1: Exploring Data
Presentation transcript:

Stat 350 Lab Session GSI: Yizao Wang Section 016 Mon 2pm30-4pm MH 444-D Section 043 Wed 2pm30-4pm MH 444-B

Outline Two main types of studies Time plots Module 3 Activity 1 QQ plots Module 3 Activity 2 Quizdom Question

Two Main Types of Studies Observational studies: simple observation Experiments: design, measuring the effect of manipulation on output of interest, comparison

Time Plots & QQ Plots In statistical inference, first we need to check some assumptions for the data before inference procedures. Ex1: We need random sample. Ex2: We need the population from which the sample is taken to be normally distributed.

Time plots: check randomness of quantitative data collected over time (identically distributed) QQ plots: check normality of population Time Plots & QQ Plots

Time/Sequence Plots

Time plots Horizontal axis: time (sequence number) Vertical axis: response or measured variable Distinguish time plots from histograms

Purpose: check the identically distributed aspect of a random sample How: looking for evidence of stability Stability = no patterns If the data is not stable, histograms and numerical summaries may not be meaningful. Time/Sequence Plots

Overall patterns: Trend: a persistent, long-term rise or fall Seasonal variation: a pattern that repeats itself at regular intervals of time A persistent, long-term increase (or decrease) in the variation of the observations. Time/Sequence Plots

Increasing trend

Time/Sequence Plots Seasonal variation

Time/Sequence Plots Increasing trend, seasonal variation and increase in the variation

Module 3 Activity 1 Background 1: death rate (number of deaths per 100 million miles driven) from 1960 to 2004 p14 or deathrate.sav Task: What do you see? Would it make sense to make a histogram of the death rates?

Module 3 Activity 1 Background 1: death rate (number of deaths per 100 million miles driven) from 1960 to 2004 p14 or deathrate.sav Task: What do you see? Would it make sense to make a histogram of the death rates? From the sequence plot we see that overall the death rate appears to be decreasing over time. It wouldn't make sense to make a histogram of the death rates because the time series is not stationary because of the decreasing trend.

Module 3 Activity 1 Background 2: observed passenger flow (number of passengers flying a given flight) of a certain airline company over many years airline.sav Task: What do you see? Is there any pattern?

Module 3 Activity 1 Background 2: observed passenger flow (number of passengers flying a given flight) of a certain airline company over many years airline.sav Task: What do you see? Is there any pattern? We see a seasonal variation here. We also see that both the overall mean and the overall variation appear to be increasing over time. One possible reason for this seasonal variation is that much more people go on vacation during the summer time than in any other season of a year.

Module 3 Activity 1 Interpretations-words should not be too strong. Not: The mean is increasing. Better: There is evidence of an increasing mean, based on the data. Not: The variance decreased. Better: The variability in the response appears to be decreasing over time. Time plots do not tell the shape of a distribution.

QQ Plots You need to know how to check normality using QQ plot You don’t need to know how to produce a QQ plot by yourself. (A plot of the percentiles of a standard normal distribution against the corresponding percentiles of the observed data.)

QQ Plots

Purpose: QQ plot is a graphic tool to check normality. (Why/when we need normality?) How to check: look at how the points fall… Approximately Normal: approximately along a straight line with a positive slope Not Normal: deviations from the line

QQ Plots

Module 3 Activity 2 Task 1: Use the iq.sav dataset and examine the distribution of iq by creating a histogram and QQ plot. Describe the shape of the IQ distribution.

Module 3 Activity 2 Task 1: Use the iq.sav dataset and examine the distribution of iq by creating a histogram and QQ plot. Describe the shape of the IQ distribution. From the histogram, we see that the distribution of IQ is unimodal, centered around It appears to be very slightly skewed left, but is basically symmetric. With the Q-Q plot, we do see points falling roughly along a straight line with a positive slope, which would indicate that the bell-curve normal distribution is a reasonable model.

Module 3 Activity 2 Task 2: Use the employee data.sav dataset and create the histogram for the variable salary (current salary). Then create a QQ plot for salary. Comments?

Module 3 Activity 2 Task 2: Use the employee data.sav dataset and create the histogram for the variable salary (current salary). Then create a QQ plot for salary. Comments?

Module 3 Activity 2 The histogram reveals that the distribution for current salary is definitely not symmetric. It is strongly skewed right. However, it is unimodal, with a peak at around The mean will be much higher than the median due to the right skewness. The QQ plot clearly shows that the distribution for current salary cannot be considered normal, since it does not follow a straight line.

Review of lab 2 Experiments and observational studies Time/sequence plots Randomness and stability QQ plots Normality of population Any questions?

Before we finish… Qwizdom questions