Understanding and Comparing Distributions

Slides:



Advertisements
Similar presentations
Copyright © 2010 Pearson Education, Inc. Slide
Advertisements

F IND THE S TANDARD DEVIATION FOR THE FOLLOWING D ATA OF GPA: 4, 3, 2, 3.5,
Understanding and Comparing Distributions
Understanding and Comparing Distributions 30 min.
CHAPTER 4 Displaying and Summarizing Quantitative Data Slice up the entire span of values in piles called bins (or classes) Then count the number of values.
Slide 3- 1 Copyright © 2010 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Business Statistics First Edition.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 4- 1.
Chapter 4: Displaying Quantitative Data
Chapter 5: Understanding and Comparing Distributions
AP Statistics Section 2.1 B
Understanding and Comparing Distributions
Understanding and Comparing Distributions
Copyright © 2010, 2007, 2004 Pearson Education, Inc.
Copyright © 2010 Pearson Education, Inc. Chapter 6 The Standard Deviation as a Ruler and the Normal Model.
Copyright © 2009 Pearson Education, Inc. Chapter 5 Understanding and Comparing Distributions.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Chapter 6 Displaying and Describing Quantitative Data © 2010 Pearson Education 1.
Slide 4-1 Copyright © 2004 Pearson Education, Inc. Dealing With a Lot of Numbers… Summarizing the data will help us when we look at large sets of quantitative.
Chapter 4 Displaying Quantitative Data. Quantitative variables Quantitative variables- record measurements or amounts of something. Must have units or.
Statistics Chapter 1: Exploring Data. 1.1 Displaying Distributions with Graphs Individuals Objects that are described by a set of data Variables Any characteristic.
MMSI – SATURDAY SESSION with Mr. Flynn. Describing patterns and departures from patterns (20%–30% of exam) Exploratory analysis of data makes use of graphical.
Chapter 5 Understanding and Comparing Distributions Math2200.
Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Understanding and Comparing Distributions.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Understanding and Comparing Distributions
Displaying Quantitative Data AP STATS NHS Mr. Unruh.
Chapter 5 The Standard Deviation as a Ruler and the Normal Model.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
Understanding & Comparing Distributions Chapter 5.
Copyright © 2009 Pearson Education, Inc. Slide 4- 1 Practice – Ch4 #26: A meteorologist preparing a talk about global warming compiled a list of weekly.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 6- 1.
AP Statistics. Chapter 1 Think – Where are you going, and why? Show – Calculate and display. Tell – What have you learned? Without this step, you’re never.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 4- 1.
Section 2.1 Visualizing Distributions: Shape, Center, and Spread.
Displaying Quantitative Data
Chapter 1: Exploring Data
Understanding and Comparing Distributions
Chapter 4 Understanding and Comparing Distributions.
Chapter 5 : Describing Distributions Numerically I
Describing Distributions Numerically
Sections 2.3 and 2.4.
Understanding and Comparing Distributions
Unit 4 Statistical Analysis Data Representations
Understanding and Comparing Distributions
Understanding and Comparing Distributions
Objective: Given a data set, compute measures of center and spread.
Describing Distributions Numerically
AP Statistics CH. 4 Displaying Quantitative Data
Displaying Quantitative Data
Bell Ringer Create a stem-and-leaf display using the Super Bowl data from yesterday’s example
Do-Now-Day 2 Section 2.2 Find the mean, median, mode, and IQR from the following set of data values: 60, 64, 69, 73, 76, 122 Mean- Median- Mode- InterQuartile.
The histograms represent the distribution of five different data sets, each containing 28 integers from 1 through 7. The horizontal and vertical scales.
Understanding and Comparing Distributions
Honors Statistics Chapter 4 Part 4
Histograms: Earthquake Magnitudes
Understanding and Comparing Distributions
Re-expressing Data:Get it Straight!
Probability & Statistics Describing Quantitative Data
Displaying and Summarizing Quantitative Data
Displaying and Summarizing Quantitative Data
Homework Check.
Shape of Distributions
Displaying and Summarizing Quantitative Data
Elementary Statistics: Looking at the Big Picture
Understanding and Comparing Distributions
Describing Distributions Numerically
Honors Statistics Review Chapters 4 - 5
Understanding and Comparing Distributions
Displaying and Summarizing Quantitative Data
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Presentation transcript:

Understanding and Comparing Distributions Chapter 4 Understanding and Comparing Distributions

The Big Picture We can answer interesting questions about variables when we compare distributions for different groups. Here is a histogram of the Average Wind Speed for each day in 1989. Where is the mean?

The Big Picture (cont.) Someone “CUSS”! Uni-modal and skewed right. This value may be an outlier. The median is 1.9 mph; the IQR is 1.78 mph.

Wind Speed: Making Boxplots (cont.) Histogram & boxplot for daily wind speeds: What do we see?

Someone compare these distributions (CUSS comparatively)! Comparing Groups Sometimes it is more interesting to compare groups (here: spring/summer & fall/winter). With histograms, we can note the shapes, centers, and spreads of the two distributions. Someone compare these distributions (CUSS comparatively)!

Comparing Groups In the Spring/Summer, the wind speeds are strongly skewed to the right. The Fall/Winter wind speeds are more uniform, with one outlier. Are some months windier than others? Are wind speeds equally variable from month to month? Do some months show more variation?

Comparing Groups (cont.) Boxplots offer a nice balance of information and simplicity, displaying overall summary information. We can plot them “side-by-side” for groups or categories that we wish to compare (in this case, by month). What do we see?

Comparing Groups (cont.) Why do we now see outliers? Because they are relative to the month. Example: A windy day in July wouldn’t stand out in November.

Comparing Groups (cont.) When do wind speeds tend to decrease? In the summer. When are winds the strongest and most variable? From November to April.

What About Outliers? Why do you think there could have been an outlier in November? Maybe the outlier in November was a rare tornado? Some research could only tell.

What About Outliers? If there are clear outliers and you want to report the mean and standard deviation, report them with the outliers present and with the outliers removed. The differences may be quite revealing.

Timeplots For some data sets, we are interested in how the data behave over time. In these cases, it may be helpful to construct time plots of the data. Here we are looking at calendar ‘year days’, starting on day 1. What do we see?

Timeplots: Order, Please! There are more calm periods in the summer, starting around day 150. Winds are more variable and stronger during the earlier and later parts of the year.

Comparing Groups Worksheet Classwork: Comparing Groups Worksheet

Re-expressing Skewed Data When data are strongly skewed it can be hard to summarize them using center and spread, and hard to decide whether extreme values are outliers or just part of a stretched out tail. How can we say anything useful about such data? What do we see?

OMG! There’s actually a use for logarithms!! Re-expressing Data One way to make a skewed distribution more symmetric is to re-express (transform) the data by applying some simple function (example: logarithm). Note the change in skewness from the raw data (previous slide) to the transformed data (right). OMG! There’s actually a use for logarithms!!

Here are both… Re-expression rescales the data; maintaining the relative magnitudes of the individual data points and giving us a better “look” at the distribution.

*Re-expressing Skewed Data to Improve Symmetry (cont.) We can now see that a typical log compensation is between about 6.5 and 7. What does this mean? This corresponds to $3,162,277.66 through $10,000,000. Do you remember how to do logs?

What Can Go Wrong? Avoid inconsistent scales, either within the display or when comparing two displays. Label clearly so the reader knows what the plot displays.

Classwork/Homework: Chapter 4 Review for the Quiz: Understanding and Comparing Distributions Chapter 4 Quiz Tomorrow