Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Organizing and Summarizing Data 2.

Slides:



Advertisements
Similar presentations
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Advertisements

Chapter Two Organizing and Summarizing Data
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 2 Exploring Data with Graphs and Numerical Summaries Section 2.2 Graphical Summaries.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Slide 1 Spring, 2005 by Dr. Lianfen Qian Lecture 2 Describing and Visualizing Data 2-1 Overview 2-2 Frequency Distributions 2-3 Visualizing Data.
B a c kn e x t h o m e Frequency Distributions frequency distribution A frequency distribution is a table used to organize data. The left column (called.
Chapter Organizing and Summarizing Data © 2010 Pearson Prentice Hall. All rights reserved 3 2.
Chapter 2 Summarizing and Graphing Data
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Chapter 2 Presenting Data in Tables and Charts
© 2010 Pearson Prentice Hall. All rights reserved Organizing and Summarizing Data Graphically.
CHAPTER 2 ORGANIZING AND GRAPHING DATA. Opening Example.
Chapter 2 Graphs, Charts, and Tables – Describing Your Data
Organizing Information Pictorially Using Charts and Graphs
Statistics-MAT 150 Chapter 2 Descriptive Statistics
Sexual Activity and the Lifespan of Male Fruitflies
Chapter 2: Organizing Data STP 226: Elements of Statistics Jenifer Boshes Arizona State University.
Ka-fu Wong © 2003 Chap 2-1 Dr. Ka-fu Wong ECON1003 Analysis of Economic Data.
Frequency Distributions and Graphs
Welcome to Data Analysis and Interpretation
STATISTICAL GRAPHS.
 FREQUENCY DISTRIBUTION TABLES  FREQUENCY DISTRIBUTION GRAPHS.
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
Chapter Organizing and Summarizing Data © 2010 Pearson Prentice Hall. All rights reserved 3 2.
Chapter 2 Summarizing and Graphing Data
DATA FROM A SAMPLE OF 25 STUDENTS ABBAB0 00BABB BB0A0 A000AB ABA0BA.
Slide 2-2 Copyright © 2008 Pearson Education, Inc. Chapter 2 Organizing Data.
2.1 Organizing Qualitative Data
Chapter Organizing and Summarizing Data © 2010 Pearson Prentice Hall. All rights reserved 3 2.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Organizing and Summarizing Data 2.
Slide 2-2 Copyright © 2012, 2008, 2005 Pearson Education, Inc. Chapter 2 Organizing Data.
Chapter Two Organizing and Summarizing Data 2.2 Organizing Quantitative Data I.
Organizing Data Section 2.1.
Chapter 2 Describing Data.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Bias in Sampling 1.5.
ORGANIZING AND GRAPHING DATA
2.2 Organizing Quantitative Data. Data O Consider the following data O We would like to compute the frequencies and the relative frequencies.
Chapter Two Organizing and Summarizing Data 2.3 Organizing Quantitative Data II.
Chapter Organizing and Summarizing Data © 2010 Pearson Prentice Hall. All rights reserved 3 2.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Organizing and Summarizing Data 2.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 3 Graphical Methods for Describing Data.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 2-2 Frequency Distributions.
When data is collected from a survey or designed experiment, they must be organized into a manageable form. Data that is not organized is referred to as.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 2-1 Chapter 2 Presenting Data in Tables and Charts Statistics For Managers 4 th.
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
Lesson Additional Displays of Quantitative Data.
2.2 ORGANIZING QUANTITATIVE DATA OBJECTIVE: GRAPH QUANTITATIVE DATA Chapter 2.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Two Organizing Data.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Organizing and Summarizing Data 2.
Copyright 2011 by W. H. Freeman and Company. All rights reserved.1 Introductory Statistics: A Problem-Solving Approach by Stephen Kokoska Chapter 2 Tables.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 2 Section 2 – Slide 1 of 37 Chapter 2 Section 2 Organizing Quantitative Data.
More Graphs — But What Type Are These?.  Divide the range of data into equal widths.  Every number can only be placed in one class (bar).  Using.
Descriptive Statistics
Descriptive Statistics: Tabular and Graphical Methods
Organizing Quantitative Data: The Popular Displays
Organizing and Summarizing Data
STATISTICS INFORMED DECISIONS USING DATA
Organizing and Summarizing Data
3 2 Chapter Organizing and Summarizing Data
STATISTICS INFORMED DECISIONS USING DATA
Chapter Two Organizing and Summarizing Data
3 2 Chapter Organizing and Summarizing Data
Organizing and Summarizing Data
Frequency Distributions
Organizing and Summarizing Data
Organizing, Displaying and Interpreting Data
Presentation transcript:

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Organizing and Summarizing Data 2

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Organizing Qualitative Data 2.1

2-3 When data is collected from a survey or experiment, they must be organized into a manageable form. Data that is not organized is referred to as raw data. Ways to Organize Data Tables (“Frequency Distributions”) Graphs (“Bar Graphs”, “Pie Charts”) Numerical Summaries (Chapter 3)

2-4 A frequency distribution is a Table that lists each category of data (Class) and the number of occurrences (frequency) in each Class of data.

2-5 EXAMPLE Organizing Qualitative Data into a Frequency Distribution The data on the next slide represent the color of M&M candies in a bag of plain M&Ms. Construct a frequency distribution of the color of plain M&Ms.

2-6 Frequency Distribution Class (Color) TallyFrequency Brown||||| ||||| ||12 Yellow||||| 10 Red||||| ||||9 Orange||||| |6 Blue|||3 Green|||||5

2-7 The relative frequency is the proportion (ratio) of observations within a class and is found using the formula: A relative frequency distribution lists each class of data and its relative frequency.

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Organizing Qualitative Data 2.1

2-9 ClassTallyFrequencyRelative Frequency Brown||||| ||||| ||1212/45 ≈ Yellow||||| Red||||| ||||90.2 Orange||||| | Blue||| Green||||| ~ 1.0

2-10 A bar graph is constructed by labeling each class of data on the horizontal axis and the frequency/relative frequency of that class on the vertical axis. Rectangles of equal width are drawn for each class. The height of each rectangle represents the class’s frequency/relative frequency.

2-11

2-12

2-13 “Pareto” Chart

2-14 EXAMPLEComparing Two Data Sets The following data represent the marital status of U.S. residents (millions) in 1990 and Draw a “side-by-side” relative frequency bar graph of the data. Marital Status Never married Married Widowed Divorced

2-15 Relative Frequency Marital Status Marital Status in 1990 vs

2-16 EXAMPLE Constructing a “Pie Chart” The following data represent the marital status of U.S. residents (M) in Draw a pie chart of the data. Marital StatusFrequency Never married55.3 Married127.7 Widowed13.9 Divorced M

2-17 EXAMPLEConstructing a Pie Chart

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Organizing Quantitative Data: The Popular Displays 2.2

2-19 The first step in summarizing quantitative data is to determine whether the data are discrete or continuous. If the data are discrete and there are relatively few different values of the variable, the classes of data will be the individual observations. If the data are discrete, but there are many different values of the variables, or if the data are continuous, the classes of data must be created using intervals of numbers (like Ages of Students).

2-20 The following data represent the number of available cars in a household based on a random sample of 50 households. Construct a frequency and relative frequency distribution. EXAMPLE Constructing Frequency /Relative Frequency Distribution from Discrete Data Data based on results reported by the United States Bureau of the Census.

2-21 # of Cars TallyFrequencyRelative Frequency 0||||44/50 = ||||| ||||| |||1313/50 = ||||| ||||| ||||| ||||| || ||||| || ||| |10.02

2-22 A Histogram (like a Bar Chart) is constructed by drawing rectangles for each class of data. The height of each rectangle is the frequency/relative frequency of the class. The width of each rectangle is the same and the rectangles touch each other.

2-23

2-24

2-25 When a data set consists of a large number of different discrete data values or when a data set consists of continuous data, we must create classes by using intervals of numbers.

2-26 The following data represents the number of persons aged who are currently work-disabled. The lower class limit (LCL) of a class is the smallest value within the class while the upper class limit (UCL) of a class is the largest value within the class. The class width (CW) is the difference between consecutive LCLs. The class width of the data given above is 35 – 25 = 10.

2-27 EXAMPLEOrganizing Continuous Data into a Frequency/Relative Frequency Distribution The following data represent the time (sec) between eruptions for a random sample of 45 eruptions at Yellowstone’s “ Old Faithful” geyser in Wyoming. Construct a frequency/relative frequency data distribution. Source: Ladonna Hansen, Park Curator

2-28 The smallest data value is 672 and the largest data value is 738, so the Range of the data is = 66 sec. We will create 7 classes and set the lower class limit of the first class to be 670. (66/7 = 9.4 so round up to 10 to get Class Width)

2-29 Time between Eruptions (seconds) TallyFreqRel Freq 670 – 679||22/45 = ||||| || ||||| |||| ||||| |||| ||||| ||||| | ||||| || ~ 1.0

2-30 Guidelines for Determining Class Width Decide on the number of classes. Generally, there should be between 5 and 20 classes. The smaller the data set, the fewer classes you should have. CW = Range / # Classes (rounded up to nearest Integer)

2-31 Constructing a Freq/Rel Freq Histogram for Continuous Data Using class width of 10

2-32

2-33 A stem-and-leaf plot uses digits to the left of the rightmost digit to form the stem. Each rightmost digit forms a leaf. For example, a data value of 147 would have 14 as the stem and 7 as the leaf. Displayed as: 14 | 7 So: 147, 147, and 149  14 | 779

2-34 EXAMPLEConstructing a Stem-and-Leaf Plot An individual is considered to be unemployed if they do not have a job, but are actively seeking employment. The following data represent the unemployment rate in each of the 50 States plus the District of Columbia in June, 2008.

2-35 StateUnemployment Rate % StateUnemployment Rate % StateUnemployment Rate % Alabama4.7Kentucky6.3North Dakota3.2 Alaska6.8Louisiana3.8Ohio6.6 Arizona4.8Maine5.3Oklahoma3.9 Arkansas5.0Maryland4.0Oregon5.5 California6.9Mass5.2Penn5.2 Colorado5.1Michigan8.5Rhode Island7.5 Conn5.4Minnesota5.3South Carolina6.2 Delaware4.2Mississippi6.9South Dakota2.8 Dist Col6.4Missouri5.7Tenn6.5 Florida5.5Montana4.1Texas4.4 Georgia5.7Nebraska3.3Utah3.2 Hawaii3.8Nevada6.4Vermont4.7 Idaho3.8New Hamp4.0Virginia4.0 Illinois6.8New Jersey5.3Washington5.5 Indiana5.8New Mexico3.9W. Virginia5.3 Iowa4.0New York5.3Wisconsin4.6 Kansas4.3North Carolina6.0Wyoming3.2

2-36 We let the stem represent the integer portion of the number and the leaf will be the decimal portion. For example, the stem of Alabama (4.7%) will be 4 and the leaf will be 7 or: 4|7

2-38 A split stem-and-leaf plot: This stem represents 3.0 – 3.4 This stem represents 3.5 – 3.9

2-39 Once a frequency distribution or histogram of continuous data is created, the raw data is no longer available, so some accuracy (true mean, etc.) may also be lost. However, the raw data can be retrieved from an accompanying stem-and-leaf plot. of Stem-and-Leaf Diagrams over Histograms Advantage of Stem-and-Leaf Diagrams over Histograms

2-40 DOT PLOT A dot plot is drawn by sorting each data point (observation) horizontally, in ascending (increasing) order, and then placing a dot above the observation each time the data point is observed. This is similar to a Stem & Leaf plot in that it displays the pattern of all the actual data points.

2-41

2-42 Shapes of Distributions Uniform distribution: the frequency of each value of the variable is constant. Bell-shaped (Normal) distribution: the highest frequency occurs in the middle. Frequencies tail off symmetrically to the left and right. Skewed right : the tail to the right of the peak is longer than the tail to the left. Skewed left : the tail to the left of the peak is longer than the tail to the right.

2-43

2-44 EXAMPLEIdentifying the Shape of the Distribution Identify the shape of the following histogram which represents the time between eruptions at Old Faithful.

Chap 245

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Additional Displays of Quantitative Data 2.3

2-47 A class midpoint (MP) is the sum of (LCL + UCL) / 2. A frequency polygon is a graph that uses points, connected by lines (connect the dots) to display the freq/rel freq for the classes. Plot a point above each class MP at a height equal to the class frequency, Next, draw straight lines between the points. Two additional lines are drawn connecting each end MP with the horizontal axis.

2-48 Time between Eruptions (seconds) Class Midpoint FrequencyRelative Frequency 670 – – – – – – –

2-49 Time (seconds) Frequency Polygon

2-50 Ogive A cumulative frequency distribution displays the aggregate freq/rel freq of the classses displayed so far. For discrete data, it displays the total number of observations less than/equal to that class. For continuous data, it displays the total number of observations less than/equal to the UCL of that class. The graph of this distribution is known as an “ogive” (read as “oh-jive”)

2-51 Time between Eruptions (seconds) Frequency Relative Frequency Cumulative Frequency Cumulative Relative Frequency 670 – – – – – – – ~ =n1=100%

2-52 Time (seconds) Frequency Ogive

2-53 Time (seconds) Relative Frequency Ogive

2-54 If the variable changes over time, the data are referred to as “time series data”. A “time-series plot” is obtained by plotting the time on the horizontal axis and the corresponding value of the variable on the vertical axis. Line segments are then drawn connecting the points.

2-55 YearClosing Value , , , , The data to the right shows the closing prices (31 Dec) of the Dow Jones Industrial Average (DJIA = stock market) for the years: 1990 – 2007.

2-56 Year Closing Value Time-Series Plot Year-End DJIA (1990 – 2007)

Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Section Graphical Misrepresentations of Data 2.4

2-58 Statistics “The only science that enables different experts to draw different conclusions using the same numbers. ” – Evan Esar

2-59 EXAMPLEMisrepresentation of Data The data in the following table represent the life expectancies (years) of residents of the United States. Year, xLife Expectancy, y Source: National Center for Health Statistics

2-60 EXAMPLEMisrepresentation of Data (a) (b)

2-61 EXAMPLEMisrepresentation of Data The National Survey of Student Engagement is a 2007 survey that asked US college freshman students how many hours they spend preparing for class each week.

2-62 EXAMPLEMisrepresentation of Data HoursRelative Frequency 00 1 – – – – – – –

2-63 (a)

2-64 (b)

65Chap 2 Key Concepts Always remember, that graphs and plots can distort the way you perceive what you are seeing. This happens because of the way the visual-cortex (your brain) receives and re-transmits inputs from the sensor (eye). Always remember, that graphs and plots can distort the way you perceive what you are seeing. This happens because of the way the visual-cortex (your brain) receives and re-transmits inputs from the sensor (eye). People (salesmen) who purposely want this distortion to occur know how to use this feature of brain processing to leave a false impression in your mind. People (salesmen) who purposely want this distortion to occur know how to use this feature of brain processing to leave a false impression in your mind.

66Chap 2 Here Endeth Chapter 2 (Applause !) Next, comes Midterm Exam #1 (boo, hiss…), so here are a few pointers…

Chap 267