1 Chapter 4: Describing Distributions 4.1 Graphs: good and bad 4.2 Displaying distributions with graphs 4.3 Describing distributions with numbers

2 Dow Jones Industrial Average

3 Pie Graph

4 Definitions Types of variables Categorical E.g., gender, type of degree Quantitative E.g., time, mass, force, dollars The distribution of a variable tells us what values it takes and how often it takes these values.
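
A distribution can be tabulated directly: list each value and how often it occurs. The sketch below does this for a categorical variable (type of degree). The `degrees` list is made-up illustration data, not taken from the book.

```python
from collections import Counter

# Hypothetical responses for the categorical variable "type of degree".
degrees = ["BS", "BA", "BS", "MS", "BS", "BA"]

# The distribution: each value the variable takes and how often it takes it.
counts = Counter(degrees)
total = sum(counts.values())

for value, count in counts.most_common():
    print(f"{value}: {count} ({count / total:.0%})")
```

These counts (or percents) are what the heights of the bars in a bar graph represent.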

5 Bar graph showing a distribution

6 Exercises, pp

7 Bar graph for 4.1

8 Pie Chart for 4.1

9 Misleading Pictogram (p. 209) Worker Salary $2000/mo Manager Salary $4000/mo

10 Dow Jones Industrial Average: This is a line graph (p. 210)

11 Misleading Graphs?

12 Making good graphs (p. 213) Graphs must have labels, legends, and titles. Make the data stand out. Pay attention to what the eye sees. 3-D is really not necessary!

13 Exercises, pp through 4.8

14 Homework Problems, pp , to be done in Excel: 4.11, Excel file by class time on Monday Section 4.2 Reading, pp

15 Displaying Distributions with Graphs

16 Displaying distributions graphically The distribution of a variable tells us what values it takes and how often it takes these values. Ways to display distributions for quantitative variables: dotplots histograms stemplots See example on pp

17 Figure 4.15: A histogram

18 Figure 4.16: A stemplot

19 Histograms Most common graph of the distribution of a quantitative variable. How to make a histogram: Example 4.9, p. 224 Range: 5.7 to 17.6 Shoot for 6-15 classes (bars) Read paragraph on p. 226
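
One way to check a choice of classes is to build the class boundaries and count how many there are. The sketch below uses the range from this slide (5.7 to 17.6); the function names and every data value other than the minimum and maximum are assumptions made for illustration, not the Example 4.9 data.

```python
import math

def class_edges(low, high, width):
    """Equal-width class boundaries covering low..high (integer width assumed)."""
    start = math.floor(low / width) * width
    edges = [start]
    while edges[-1] <= high:
        edges.append(edges[-1] + width)
    return edges

def class_counts(values, edges):
    """Count the values in each class [edges[i], edges[i+1])."""
    counts = [0] * (len(edges) - 1)
    for x in values:
        for i in range(len(counts)):
            if edges[i] <= x < edges[i + 1]:
                counts[i] += 1
                break
    return counts

edges_w1 = class_edges(5.7, 17.6, 1)  # classes of width 1
edges_w2 = class_edges(5.7, 17.6, 2)  # classes of width 2

# Hypothetical data; only 5.7 and 17.6 come from the slide.
values = [5.7, 6.2, 8.9, 9.1, 9.4, 12.0, 17.6]
counts = class_counts(values, edges_w2)

# A text histogram: one row of stars per class.
for left, right, count in zip(edges_w2, edges_w2[1:], counts):
    print(f"{left:>2} to <{right:<2} {'*' * count}")
```

Under these assumptions, width 1 gives 13 classes and width 2 gives 7, so both fall inside the 6-15 guideline.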

20 Example 4.9, pp

21 Practice Problem: 4.18, p. 226

22 Exercise 4.18 Histogram By hand Using calculator Stemplot By hand

23 Interpreting the graphical displays Concentrate on the main features. Overall pattern (p. 230) Shape, center, spread Outliers Individual observations outside the overall pattern of the graph

24 Example 4.10, p. 230

25 Shape Symmetric or skewed (p. 231)? Is it unimodal (one hump) or bimodal (two humps)?

26 Homework Reading: pp

27 Stemplots Usually reserved for smaller data sets. Advantage: Actual (or rounded) data are provided. Possible drawback: Many people are not used to this type of plot, so the presenter/writer has to describe it.

28 How to make a stemplot, p. 236
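
A stemplot for two-digit whole numbers can be sketched in a few lines: the tens digit is the stem and the ones digit is the leaf. This is a minimal sketch, not the book's procedure from p. 236; the function name is an assumption, and the data are Mike's scores from Practice Problem 4.52 later in these slides.

```python
def stemplot(values):
    """Return stemplot rows for non-negative whole numbers (stem = tens, leaf = ones)."""
    data = sorted(values)
    rows = []
    # Include every stem between the smallest and largest, even if it has no leaves.
    for stem in range(data[0] // 10, data[-1] // 10 + 1):
        leaves = "".join(str(x % 10) for x in data if x // 10 == stem)
        rows.append(f"{stem} | {leaves}")
    return rows

scores = [59, 69, 71, 52, 65, 55, 72, 50, 75, 67, 51, 69, 68, 62, 69]
rows = stemplot(scores)
print("\n".join(rows))
```

Because the leaves are sorted within each stem, the actual data values can be read back from the plot.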

29 More problems Exercises: 4.24 and 4.25, p , p. 233

30 Practice Exercises 4.30, p. 239 and 4.32, p , p. 238

31 Wrapping up Section 4.2 … 4.28, p , p

32 Describing Distributions with Numbers Until now, we have described the center and spread of distributions in words. Now we will use numbers to describe these characteristics. The 5-number summary: Center: Median (p. 248) Spread: Quartiles, Q1 and Q3 (p. 250) Spread: Min and Max
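
The 5-number summary can be sketched as follows. This assumes the common textbook convention that Q1 and Q3 are the medians of the lower and upper halves of the ordered data, leaving out the overall median when the number of observations is odd; software and calculators may use slightly different quartile rules. The function name is an assumption, and the data are Mike's scores from Practice Problem 4.52 later in these slides.

```python
from statistics import median

def five_number_summary(values):
    """Return (min, Q1, median, Q3, max); needs at least two observations."""
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half]
    # With an odd count, the middle observation belongs to neither half.
    upper = data[half + 1:] if n % 2 else data[half:]
    return (data[0], median(lower), median(data), median(upper), data[-1])

scores = [59, 69, 71, 52, 65, 55, 72, 50, 75, 67, 51, 69, 68, 62, 69]
summary = five_number_summary(scores)
print(summary)
```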

33 Boxplots We can use this information to construct a boxplot:

34 Practice 4.46, p. 254 Enter data in the Stat Edit menu in your calculator, and order them.

35 Boxplot vs. Modified Boxplot The modified boxplot shows outliers … they are marked with a *. The lines extending from the quartiles go to the last number which is not an outlier. If there are no outliers, the modified boxplot and the regular boxplot are identical. Below are a boxplot (on the left) and modified boxplot (on the right) for Problem 4.39, p. 245.

36 Side-by-side boxplots (p. 252)

37 Practice Exercises: 4.50, p , p. 256

38 Testing for Outliers Find the Inter-Quartile Range: IQR = Q3 - Q1 Multiply: 1.5*IQR Outliers on low side: below Q1 - 1.5*IQR Outliers on high side: above Q3 + 1.5*IQR Are there any numbers outside of these values? If so, they are outliers, and are marked on boxplots with an asterisk. The tail is drawn to the highest (or lowest) value which is not an outlier.
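
The outlier test on this slide can be sketched as code. The data set is made up so that one value falls outside the fences; the function names are assumptions, and the quartiles use the median-of-each-half convention, which may differ slightly from a calculator's rule.

```python
from statistics import median

def quartiles(values):
    """Q1 and Q3 as medians of the lower and upper halves of the ordered data."""
    data = sorted(values)
    half = len(data) // 2
    lower = data[:half]
    upper = data[half + 1:] if len(data) % 2 else data[half:]
    return median(lower), median(upper)

def find_outliers(values):
    """Apply the 1.5*IQR rule; return the two fences and the outliers."""
    q1, q3 = quartiles(values)
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    outliers = [x for x in values if x < low_fence or x > high_fence]
    return low_fence, high_fence, outliers

data = [2, 4, 5, 6, 7, 8, 9, 10, 30]  # hypothetical data
low_fence, high_fence, outliers = find_outliers(data)

# The upper tail of a modified boxplot stops at the largest non-outlier.
upper_tail_end = max(x for x in data if x <= high_fence)
print(low_fence, high_fence, outliers, upper_tail_end)
```

Here Q1 = 4.5 and Q3 = 9.5, so IQR = 5 and the fences sit at -3 and 17; the value 30 is flagged and the upper tail ends at 10.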

39 Measures of Center and Spread Median and IQR Mean and Standard Deviation Mean is the arithmetic average. Standard deviation measures roughly the average distance of the observations from their mean. Variance is simply the squared standard deviation. All of these statistics can be calculated by hand, but today we use technology to compute them: 1-sample stats on our calculators, or a stats program.

40 Properties of standard deviation (p. 259) Use s as a measure of spread when you use the mean. If s=0, there is no spread. The larger the value for s, the larger the spread of the distribution.

41 Practice Problem 4.52, p. 263 Mike: 59,69,71,52,65,55,72,50,75,67,51,69,68,62,69
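
As a check on calculator work, Mike's scores can be run through Python's standard `statistics` module. This is only a sketch of the computation, not the book's worked solution; `stdev` and `variance` use the sample (n - 1) formulas.

```python
from statistics import mean, stdev, variance

mike = [59, 69, 71, 52, 65, 55, 72, 50, 75, 67, 51, 69, 68, 62, 69]

x_bar = mean(mike)    # arithmetic average
s = stdev(mike)       # sample standard deviation (divides by n - 1)
s2 = variance(mike)   # sample variance, the square of s

print(f"n = {len(mike)}, mean = {x_bar:.1f}, s = {s:.2f}, variance = {s2:.2f}")
```

The mean works out to 954/15 = 63.6 and the standard deviation to about 8.24.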

42 Practice Problem 4.55, p. 263

43 Example 4.21, p. 265

44 Choosing a summary The book has a section on which summary to use (mean and std. dev., or median with the quartiles). I like to report all of them. However, when writing about a distribution, or comparing distributions, we should think about which summary works best. See p Skewed, outliers … median and quartiles Symmetrical, no (or few) outliers … mean and std. dev. Mean and standard deviation are most common. One reason is that they support the more sophisticated calculations used in more advanced statistics.
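
A small experiment shows why the median is preferred when there are outliers. Both data sets below are made up for illustration: adding a single extreme value moves the mean a long way but barely moves the median.

```python
from statistics import mean, median

symmetric = [50, 52, 55, 58, 60]    # hypothetical, roughly symmetric
with_outlier = symmetric + [500]    # same data plus one extreme value

mean_before, median_before = mean(symmetric), median(symmetric)
mean_after, median_after = mean(with_outlier), median(with_outlier)

print(f"without outlier: mean = {mean_before}, median = {median_before}")
print(f"with outlier:    mean = {mean_after:.1f}, median = {median_after}")
```

Without the extreme value the mean and median are both 55; with it the mean jumps to about 129.2 while the median only moves to 56.5.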

45 More Practice … p. 271: 4.57, 4.58, 4.60