Presentation is loading. Please wait.

Presentation is loading. Please wait.

Chapter Two: Summarizing and Graphing Data 2.2: Frequency Distributions 2.3: ** Histograms **

Similar presentations


Presentation on theme: "Chapter Two: Summarizing and Graphing Data 2.2: Frequency Distributions 2.3: ** Histograms **"— Presentation transcript:

1 Chapter Two: Summarizing and Graphing Data 2.2: Frequency Distributions 2.3: ** Histograms **

2 Summarizing Data Human beings cannot interpret large amounts of raw data. Here are State Unemployment Rates (July 2012) from BLS: 8.39.379.89.6 7.76.46.16.64.4 8.37.599.18.4 7.38.95.89.67.2 10.78.29.136 8.35.37.2 5 8.56.36.44.95.9 6.88.348.78.5 5.67.6127.97.4 8.87.65.410.87.3 2

3 Summarizing Data It is crucial to organize, summarize, and display data in a way that… – …accurately reflects the overall characteristics of the data. – …does not overstate or underemphasize patterns or trends in the data. – …is easy for human beings to interpret. – …is useful for later statistical analysis. 3

4 Summarizing Data We will consider the following general features: Center: A “typical” or “average” value that represents the “middle” or the data. Variation: A measure of how data values change or vary for different individuals. Distribution: The overall pattern or “shape” of the data. (symmetric, skewed, “bell curve,” etc.) Outliers: Individual values that are “unusual” compared to the majority of the data set. 4

5 Frequency Distributions Instead of displaying a list of data values for all individual, we can summarize as follows: – Group the values into several categories (or classes) such that each individual belongs to exactly one category. – For each category, give the number of individuals with values in that category. This number is called the frequency of the category. Example: Rather than listing each student’s Gender, we can summarize as follows: Female: ____ Male: ____ 5

6 Example: State Unemployment For quantitative data (must be numerical), we often group nearby values together. Here is the July 2012 state unemployment data: Unemployment RateFrequency 2.0% - 3.9%1 4.0% - 5.9%9 6.0% - 7.9%19 8.0% - 9.9%19 10.0% - 11.9%2 12.0% -13.9%1 6

7 Relative Frequency Table Alternatively, we can express the frequency for each category as a percentage of the number of values in the data set: Unemployment RateRel. Frequency 2.0% - 3.9%2.0% 4.0% - 5.9%17.7% 6.0% - 7.9%37.3% 8.0% - 9.9%37.3% 10.0% - 11.9%3.9% 12.0% -13.9%2.0% 7

8 Cumulative Frequencies Less common is the cumulative frequency (or percent), where we count the number/percent of individual less than a certain value: Unemployment RateCumulative Frequency Cumulative Percent Less than 4.0%12.0% Less than 6.0%1019.6% Less than 8.0%2956.9% Less than 10.0%4894.1% Less than 12.0%5098.0% Less than 14.0%51100.0% 8

9 Section 2.3 Histograms

10 ** Histograms ** A histogram is a graphical representation of a frequency table. Here is the state unemployment data from earlier: Number of states Percent Unemployed 10

11 ** Histograms ** Here is the same data, using smaller (more narrow) classes: Number of states Percent Unemployed 11

12 Making Histograms The histograms in today’s slides were generated using the JMP software package. The numbers above each bar are there for your convenience (these do not appear in the textbook). You should not worry about making histograms (or even frequency tables) by hand. Software will do this for you! You should focus on how to read and interpret a histogram. This is a crucial skill! 12

13 13 Example: Exam 1 Scores Exam Score Count The histogram above shows the scores on Exam 1 from a previous semester of this course. JMP includes the left endpoint in each interval, but not the right endpoint. Classes are 10-19, 20-29, etc. What does this tell you about scores on Exam 1? 13

14 Interpreting Histograms Some questions about the Exam 1 scores: How many students scored 80 or better? How many students scored less than 60? How many students scored in the 60-79 range? Does the histogram show any “unusual” scores? How many students scored 75 or better? 14

15 Normal Distributions In many cases, we have a histogram with that has the following features: – Approximate “bell” shape. – Strong (but rarely perfect) left/right symmetry – A single “peak” in the middle, short “tails” on the left and right sides. The State Unemployment data had these features. The Exam 1 data did not. 15

16 Example: Approximately Normal State unemployment data, with the approximating “bell” in red: Number of states Percent Unemployed 16

17 Normal Distributions “Normal” refers to a very specific type of “bell-shaped” distribution. ** Normal distributions play a key role in inference methods later in the course ** We will give a few more specifics next time, when we discuss the ideas of center and variation of a distribution. 17


Download ppt "Chapter Two: Summarizing and Graphing Data 2.2: Frequency Distributions 2.3: ** Histograms **"

Similar presentations


Ads by Google