Statistics: Concepts and Controversies

Slides:



Advertisements
Similar presentations
DESCRIBING DISTRIBUTION NUMERICALLY
Advertisements

Describing Distributions With Numbers
CHAPTER 1 Exploring Data
Measures of Variation Sample range Sample variance Sample standard deviation Sample interquartile range.
Describing Quantitative Data with Numbers Part 2
Looking at data: distributions - Describing distributions with numbers IPS chapter 1.2 © 2006 W.H. Freeman and Company.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Basic Practice of Statistics - 3rd Edition
CHAPTER 2: Describing Distributions with Numbers
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Describing distributions with numbers
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Chapter 12: Describing Distributions with Numbers We create graphs to give us a picture of the data. We also need numbers to summarize the center and spread.
CHAPTER 2: Describing Distributions with Numbers ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Slide 3-2 Copyright © 2008 Pearson Education, Inc. Chapter 3 Descriptive Measures.
1 Stat 1510 Statistical Thinking & Concepts Describing Distributions with Numbers.
1.3: Describing Quantitative Data with Numbers
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 12, Part 2 STA 291 Summer I Mean and Standard Deviation The five-number summary is not the most common way to describe a distribution numerically.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Describing distributions with numbers
Chapter 5: Exploring Data: Distributions Lesson Plan Exploring Data Displaying Distributions: Histograms Interpreting Histograms Displaying Distributions:
Chapter 3 Looking at Data: Distributions Chapter Three
Describing Quantitative Data with Numbers Section 1.3.
Essential Statistics Chapter 21 Describing Distributions with Numbers.
Numerical descriptors BPS chapter 2 © 2006 W.H. Freeman and Company.
Review BPS chapter 1 Picturing Distributions with Graphs What is Statistics ? Individuals and variables Two types of data: categorical and quantitative.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Chapter 2 Describing Distributions with Numbers. Numerical Summaries u Center of the data –mean –median u Variation –range –quartiles (interquartile range)
Numerical descriptors BPS chapter 2 © 2006 W.H. Freeman and Company.
Chapter 5 Describing Distributions Numerically.
SWBAT: Measure standard deviation, identify outliers, and construct a boxplot using the five-number summary Do Now: Explain how this graph could be misleading.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
Chapter 1: Exploring Data, cont. 1.2 Describing Distributions with Numbers Measuring Center: The Mean Most common measure of center Arithmetic average,
BPS - 5th Ed.Chapter 21 Describing Distributions with Numbers.
IPS Chapter 1 © 2012 W.H. Freeman and Company  1.1: Displaying distributions with graphs  1.2: Describing distributions with numbers  1.3: Density Curves.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers.
CHAPTER 2: Describing Distributions with Numbers
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Warmup What is the shape of the distribution? Will the mean be smaller or larger than the median (don’t calculate) What is the median? Calculate the.
Quartile Measures DCOVA
CHAPTER 1 Exploring Data
Describing Quantitative Data with Numbers
Basic Practice of Statistics - 3rd Edition
Exploratory Data Analysis
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 2: Describing Distributions with Numbers
Essential Statistics Describing Distributions with Numbers
Basic Practice of Statistics - 3rd Edition
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Basic Practice of Statistics - 3rd Edition
Describing Distributions with Numbers
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Presentation transcript:

Statistics: Concepts and Controversies Chapter 12 Describing Distributions with Numbers Chapter 12 Chapter 12

Statistics: Concepts and Controversies Chapter 12 Chapter 12

Statistics: Concepts and Controversies Median Example 1 data: 2 4 6 Median (M) = 4 Example 2 data: 2 4 6 8 Median = 5 (avg. of 4 and 6) Example 3 data: 6 2 4 Median  2 (order the values: 2 4 6 , so Median = 4) Chapter 12 Chapter 12

Example: Finding the median Statistics: Concepts and Controversies Example: Finding the median Consider the following set of data: 16 19 24 25 25 33 33 34 34 37 37 40 42 45 45 46 46 49 73 There are 19 observations, hence the median is the one in the middle, the 10th one. Hence the median of these data is 37. Chapter 12 Chapter 12

Example: Finding the median Statistics: Concepts and Controversies Example: Finding the median Now consider the following data where there is an even number of observations: 3 9 9 22 29 32 32 33 39 39 42 49 52 58 65 70 The ones in the middle are 33 and 39, and hence the median is the average of these values, 36. Chapter 12 Chapter 12

Statistics: Concepts and Controversies Quartiles Three numbers that divide the ordered data into four equal-sized groups. Q1 has 25% of the data below it. Q2 has 50% of the data below it. (Median) Q3 has 75% of the data below it. Chapter 12 Chapter 12

Statistics: Concepts and Controversies Chapter 12 Chapter 12

Example: Finding the quartiles Statistics: Concepts and Controversies Example: Finding the quartiles Consider the following set of data: 16 19 24 25 25 33 33 34 34 37 37 40 42 45 45 46 46 49 73 We already know that the median is 37. To find the first quartile we consider the first half of this data, namely, 16 19 24 25 25 33 33 34 34 There are 9 observations here, so the median is the 5th one, namely, 25. This is the first quartile. To find the third quartile we consider the second half of the original data, that is, 37 40 42 45 45 46 46 49 73 There are again 9 observations and the median is the one in the middle which is 45, this is the third quartile. Chapter 12 Chapter 12

Example: Finding the quartiles Statistics: Concepts and Controversies Example: Finding the quartiles Next, consider the following set of data: 3 9 9 22 29 32 32 33 39 39 42 49 52 58 65 70 We already know that the median is 36. To find the first quartile we consider the first half of this data, namely, 3 9 9 22 29 32 32 33 There are 8 observations here, so the median is the average of the 4th and 5th one, namely, 25.5. This is the first quartile. To find the third quartile we consider the second half of the original data, that is, 39 39 42 49 52 58 65 70 There are again 8 observations and the median is again the average of 4th and 5th observations, that is 50.5, this is the third quartile. Chapter 12 Chapter 12

Statistics: Concepts and Controversies Chapter 12 Chapter 12

Statistics: Concepts and Controversies Chapter 12 Chapter 12

Example: Five-number summary and boxplot Statistics: Concepts and Controversies Example: Five-number summary and boxplot Here are Roger Maris’ home run counts for his 12 years in the Major Leagues, arranged in order: 5 8 9 13 14 16 23 26 28 33 39 61 Since there are an even number of observations, the median is the average of two middle values, that is, (16+23)/2 = 19.5 The first quartile is the median of 5 8 9 13 14 16 that is (9+13)/2 = 11 The third quartile is the median of the second half, 23 26 28 33 39 61 that is (28+33)/2 = 30.5 The minimum and maximum values are obviously 5 and 61, respectively. Hence a five-number summary of these data is 5 11 19.5 30.5 61 Chapter 12 Chapter 12

Example: Five-number summary and boxplot Statistics: Concepts and Controversies Example: Five-number summary and boxplot How does the boxplot of this distribution compare with those of the data corresponding to Barry Bonds and Mark McGwire given in the following figure: Chapter 12 Chapter 12

Example: Education and Income Statistics: Concepts and Controversies Example: Education and Income Interpret the following figure which contains the boxplots of income among adults with different levels of education. Chapter 12 Chapter 12

Statistics: Concepts and Controversies Chapter 12 Chapter 12

Statistics: Concepts and Controversies Average or Mean Variance Chapter 12 Chapter 12

Comparing the Mean & Median Statistics: Concepts and Controversies Comparing the Mean & Median The mean and median of data from a symmetric distribution should be close together. The actual (true) mean and median of a symmetric distribution are exactly the same. In a skewed distribution, the mean is farther out in the long tail than is the median [the mean is “pulled” in the direction of the possible outlier(s)]. Chapter 12 Chapter 12

Example: Computing the mean and standard deviation Statistics: Concepts and Controversies Example: Computing the mean and standard deviation Consider the following data 16 25 24 19 33 25 34 46 37 33 42 40 37 34 49 73 46 45 45 The mean is Chapter 12 Chapter 12

Example: Computing the mean and standard deviation Statistics: Concepts and Controversies Example: Computing the mean and standard deviation The previous figure shows Barry Bonds’s home run counts, with their mean and distance of one observation from the mean indicated. The idea behind the standard deviation is to average these 19 distances. To find the standard deviation we can use the following table Chapter 12 Chapter 12

Example: Computing the mean and standard deviation Statistics: Concepts and Controversies Example: Computing the mean and standard deviation The average of these distances (the variance) is: Notice that we “average” by dividing by one less than the number of observations. Finally, the standard deviation is computed by taking the square root of the variation: Chapter 12 Chapter 12

Statistics: Concepts and Controversies Chapter 12 Chapter 12

Statistics: Concepts and Controversies Chapter 12 Chapter 12

Statistics: Concepts and Controversies Key Concepts Numerical Summaries Center (mean, median) Spread (variance, standard deviation, quartiles) Five-number summary & Boxplots Choosing mean versus median Choosing standard deviation versus five-number summary Chapter 12 Chapter 12

Statistics: Concepts and Controversies Exercise 12.7 College tuitions. The following figure is a stemplot of the tuition charged by 121 colleges in Illinois. The stems are thousands of dollars and the leaves are hundreds of dollars. Find the five-number summary of Illinois college tuitions. Would the mean tuition be clearly smaller than the median, about the same, or larger? Chapter 12 Chapter 12

Figure 11. 7 Stemplot of the Illinois tuition and fee data Figure 11.7 Stemplot of the Illinois tuition and fee data. (Data from the Web site www.collegeillinois.com/en/collegefunding/costs.htm. This figure was created using the Minitab software package.)

Statistics: Concepts and Controversies Exercise 12.11 The richest 1%. The distribution of individual incomes in the U.S. is strongly skewed to the right. In 2004, the mean and median incomes of the top 1% of Americans were $315,000 and $1,259,000. Which of these numbers is the mean? Chapter 12 Chapter 12

Statistics: Concepts and Controversies Exercise 12.31 Raising Pay. A school system employs teachers at salaries between $30,000 and $60,000. The teachers’ union and the school board are negotiating the form of next year’s increase in the salary schedule. Suppose that every teacher is given a flat $1,000 raise. How much will the mean salary increase? The median salary? Will a flat $1,000 raise increase the spread as measured by the distance between the quartiles? Will a flat $1,000 raise increase the spread as measured by the standard deviation of the salaries? Chapter 12 Chapter 12