Graphical Displays of Information 3.1 – Tools for Analyzing Data Learning Goal: Identify the shape of a histogram MSIP / Home Learning: p. 146 #1, 2, 4,

Slides:



Advertisements
Similar presentations
Chapter Four Making and Describing Graphs of Quantitative Variables
Advertisements

Unit 16: Statistics Sections 16AB Central Tendency/Measures of Spread.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
CHAPTER 4 Displaying and Summarizing Quantitative Data Slice up the entire span of values in piles called bins (or classes) Then count the number of values.
Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.
Frequency Distribution and Variation Prepared by E.G. Gascon.
Chapter 3 Describing Data Using Numerical Measures
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Unit 3 – One Variable Statistics
PSY 307 – Statistics for the Behavioral Sciences
Measures of Central Tendency Section 2.3 Statistics Mrs. Spitz Fall 2008.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Numerical Measures of Central Tendency. Central Tendency Measures of central tendency are used to display the idea of centralness for a data set. Most.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
Objective To understand measures of central tendency and use them to analyze data.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
MGQ 201 WEEK 4 VICTORIA LOJACONO. Help Me Solve This Tool.
 Multiple choice questions…grab handout!. Data Analysis: Displaying Quantitative Data.
Descriptive Statistics
Methods for Describing Sets of Data
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Statistical Tools in Evaluation Part I. Statistical Tools in Evaluation What are statistics? –Organization and analysis of numerical data –Methods used.
Measures of Spread Chapter 3.3 – Tools for Analyzing Data I can: calculate and interpret measures of spread MSIP/Home Learning: p. 168 #2b, 3b, 4, 6, 7,
Copyright © Cengage Learning. All rights reserved. 2 Descriptive Analysis and Presentation of Single-Variable Data.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Graphical Displays of Information Chapter 3.1 – Tools for Analyzing Data Mathematics of Data Management (Nelson) MDM 4U.
Warm up The following graphs show foot sizes of gongshowhockey.com users. What shape are the distributions? Calculate the mean, median and mode for one.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Measure of Central Tendency Measures of central tendency – used to organize and summarize data so that you can understand a set of data. There are three.
6, 5, 3, 6, 8 01/10/11 Mean, Median, Mode Warm Up
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
MDM4U Chapter 3 Review Normal Distribution Mr. Lieff.
Measures of Central Tendency Chapter 3.2 – Tools for Analyzing Data Mathematics of Data Management (Nelson) MDM 4U.
Univariate Descriptive Statistics Chapter 2. Lecture Overview Tabular and Graphical Techniques Distributions Measures of Central Tendency Measures of.
Measures of Spread Chapter 3.3 – Tools for Analyzing Data Mathematics of Data Management (Nelson) MDM 4U.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Chapter 4: Quantitative Data Part 1: Displaying Quant Data (Week 2, Wednesday) Part 2: Summarizing Quant Data (Week 2, Friday)
Chapter 3 Review MDM 4U Mr. Lieff. 3.1 Graphical Displays be able to effectively use a histogram recognize and be able to interpret the various types.
1 Review Sections 2.1, 2.2, 1.3, 1.4, 1.5, 1.6 in text.
Chapter 3 Review MDM 4U Mr. Lieff. 3.1 Graphical Displays be able to effectively use a histogram name and be able to interpret the various types of distributions.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Symbol Description It would be a good idea now to start looking at the symbols which will be part of your study of statistics.  The uppercase Greek letter.
Graphical Displays of Information
2.3 Measures of Central Tendency Coach Bridges NOTES.
Welcome to MDM4U (Mathematics of Data Management, University Preparation)
Statistics and Data Analysis
NOTES #9 CREATING DOT PLOTS & READING FREQUENCY TABLES.
Descriptive Statistics Unit 6. Variable Any characteristic (data) recorded for the subjects of a study ex. blood pressure, nesting orientation, phytoplankton.
Graphical Displays of Information Chapter 3.1 – Tools for Analyzing Data Mathematics of Data Management (Nelson) MDM 4U.
Data Description Chapter 3. The Focus of Chapter 3  Chapter 2 showed you how to organize and present data.  Chapter 3 will show you how to summarize.
Chapter 3 – Data Description Section 3.1 – Measures of Central Tendency.
Chapter 3 Review MDM 4U Mr. Lieff. 3.1 Graphical Displays be able to effectively use a histogram name and be able to interpret the various types of distributions.
Chapter 14 Statistics and Data Analysis. Data Analysis Chart Types Frequency Distribution.
MDM4U Chapter 3/5 Review Normal Distribution Mr. Lieff.
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
Minds on! Two students are being considered for a bursary. Sal’s marks are Val’s marks are Which student would you award the bursary.
Descriptive Statistics: Measures of Central Tendency Donnelly, 2 nd edition Chapter 3.
Interpreting Categorical and Quantitative Data. Center, Shape, Spread, and unusual occurrences When describing graphs of data, we use central tendencies.
Chapter 4: Measures of Central Tendency. Measures of central tendency are important descriptive measures that summarize a distribution of different categories.
3.3 Measures of Spread Chapter 3 - Tools for Analyzing Data Learning goal: calculate and interpret measures of spread Due now: p. 159 #4, 5, 6, 8,
3.1 Graphical Displays Name and be able to analyze the various types of distributions Symmetric: Uniform, U-shaped, Mound-shaped Asymmetric: Left/Right-skewed.
Chapter 5 : Describing Distributions Numerically I
Warm-Up 1..
Chapter 2: Methods for Describing Data Sets
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
An Introduction to Statistics
Describing Data Coordinate Algebra.
Analyze Data: IQR and Outliers
Presentation transcript:

Graphical Displays of Information 3.1 – Tools for Analyzing Data Learning Goal: Identify the shape of a histogram MSIP / Home Learning: p. 146 #1, 2, 4, 9, 11 (data in Excel file on wiki),13

Histograms Show how data is spread out Best choice for:  continuous data  discrete data with a large spread Data is divided into 5-6 intervals Bin width = width of each interval (same) Different bin widths can produce different shaped distributions

Histogram Example These histograms represent the same data One shows much less of the structure of the data Too many bins (bin width too small) is also a problem

Histogram Applet – Old Faithful m.html

Bin Width Calculation Bin width = (range) ÷ (number of intervals)  where range = (max) – (min)  Number of intervals is usually 5-6 Bins should not overlap  Incorrect: 0-10, 10-20, 20-30, 30-40, etc.  Correct: Discrete: 0 - 9, , , , etc. Concinuous: , , , , etc.

Shapes of Distributions Symmetric  Mound Shaped  U-Shaped  Uniform Unsymmetrical  Left-Skewed  Right-Skewed

Mound-shaped distribution Middle interval(s) have the greatest frequency / tallest bars Bars get shorter as you move away E.g. roll 2 dice, height, memory

U-shaped distribution  Lowest frequency in the centre, higher towards the outside  E.g. height of a combined grade 1 and 6 class

Uniform distribution  All bars are approximately the same height  e.g. roll a die 50 times

Symmetric distribution  A distribution that is the same on either side of the centre  U-Shaped, Uniform and Mound-shaped Distributions are symmetric

Skewed distribution (left or right) Highest frequencies at one end Left-skewed has higher bars on the right and drops off to the left  E.g. the years on a handful of quarters (left)  E.g. the years of cars on a classic car lot (right)

MSIP / Home Learning Define in your notes:  Frequency distribution (p )  Cumulative frequency (p. 148)  Relative frequency (p. 148) Complete p. 146 #1, 2, 4, 9, 11 (data in Excel file on wiki),13

Warm up - Class marks What shape is this distribution? Which of the following can you tell from the graph: mean? median? modal interval? Left-skewed Modal interval: 72 – 80 Median: (70 actual) Mean: 66

Minds On! Mr. Lieff recorded the following 20 quiz marks: Find the average mark 2 different ways.

Measures of Central Tendency (Mean, Median, Mode) Chapter 3.2 – Tools for Analyzing Data Learning Goal: Calculate the mean, median and mode for weighted / grouped data Due now: p. 146 #1, 2, 4, 9, 11 (data in Excel file on wiki),13 MSIP / Home Learning: p. 159 #4, 5, 6, 8, 10-13

Sigma Notation the sigma notation is used to compactly express a mathematical series ex: … + 15 this can be expressed: the variable k is called the index of summation. the number 1 is the lower limit and the number 15 is the upper limit we would say: “the sum of k for k = 1 to k = 15”

Example 1: write in expanded form: This is the sum of the term 2n+1 as n takes on the values from 4 to 7. = (2×4 + 1) + (2×5 + 1) + (2×6 + 1) + (2×7 + 1) = = 48 NOTE: any letter can be used for the index of summation, though a, n, i, j, k & x are the most common

Example 2: write the following in sigma notation

The Mean (Average) Found by dividing the sum of all the data points by the number of data points Affected greatly by outliers Deviation (3.5)  the distance of a data point from the mean  calculated by subtracting the mean from the value  i.e.

The Weighted Mean where x i represent the data points, w i represents the weight or the frequency “The sum of the products of each item and its weight divided by the sum of the weights” see examples on page 153 and 154 example: 7 students have a mark of 70 and 10 students have a mark of 80 mean = (70×7 + 80×10) ÷ (7+10) = 75.9

Means with grouped data for data that is already grouped into class intervals (assuming you do not have the original data), you must use the midpoint of each class to estimate the weighted mean see the example on page 154-5

Median the midpoint of the data calculated by placing all the values in order if there is an odd number of values, the median is the middle number  median = 6 if there are an even number of values, the median is the mean of the middle two numbers  median = 7 not affected greatly by outliers

Mode The number that occurs most often There may be no mode, one mode, two modes (bimodal), etc. Which distributions from yesterday have one mode? Mound-shaped, Left/Right-Skewed Two modes? U-Shaped, Mound-shaped (could), Uniform (could) Modes are appropriate for discrete data or non-numerical data  Shoe size, Number of siblings  Eye colour, Favourite subject

Distributions and Central Tendancy the relationship between the three measures changes depending on the spread of the data symmetric (mound shaped)  mean = median = mode right skewed  mean > median > mode left skewed  mean < median < mode

What Method is Most Appropriate? Outliers are data points that are quite different from the other points Outliers affect the mean the greatest The median is least affected by outliers Skewed data is best represented by the median If symmetric either median or mean If not numeric or if the frequency is the most critical measure, use the mode

Example 3 a) Find the mean, median and mode mean = [(1x2) + (2x8) + (3x14) + (4x3)] / 27 = 2.7 median = 3 (27 data points, so #14 falls in bin 3) mode = 3 b) which way is it skewed? Left Survey responses 1234 Frequency 28143

Example 4 Find the mean, median and mode mean = [(145.5 × 3) + (155.5 × 7) + (165.5 × 4)] ÷ 14 = median = or mode = or MSIP / Home Learning: p. 159 #4, 5, 6, 8, Height No. of Students 374

MSIP / Home Learning p. 159 #4, 5, 6, 8, 10-13

References Wikipedia (2004). Online Encyclopedia. Retrieved September 1, 2004 from