Working with one variable data. Spread Joaquin’s Tests Taran’s Tests: 76, 45, 83, 68, 64 67, 70, 70, 62, 62 What can you infer, justify and conclude about.

Slides:



Advertisements
Similar presentations
Statistical Reasoning for everyday life
Advertisements

Unit 16: Statistics Sections 16AB Central Tendency/Measures of Spread.
Measures of Central Tendency and Variation 11-5
Unit 1.1 Investigating Data 1. Frequency and Histograms CCSS: S.ID.1 Represent data with plots on the real number line (dot plots, histograms, and box.
Measures of Dispersion
Chapter 3 Describing Data Using Numerical Measures
DESCRIBING DATA: 2. Numerical summaries of data using measures of central tendency and dispersion.
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Slides by JOHN LOUCKS St. Edward’s University.
Measures of Dispersion
Unit 4 – Probability and Statistics
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Grouped Data Calculation
Unit 3 Section 3-4.
Measures of Central Tendency
Chapter 2 Describing Data with Numerical Measurements
CONFIDENTIAL 1 Grade 8 Algebra1 Data Distributions.
Chapter 6.
Recap All about measures of location Mean Median Mode
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
Chapter 3 - Part B Descriptive Statistics: Numerical Methods
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
What is a box and whisker plot? A box and whisker plot is a visual representation of how data is spread out and how much variation there is. It doesn’t.
Warm Up Solve for x 2) 2x + 80 The product of a number
WHAT IS AN INTEGER? Integers can be thought of as discrete, equally spaced points on an infinitely long number line. (Nonnegative integers (purple) and.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Descriptive Statistics Measures of Variation. Essentials: Measures of Variation (Variation – a must for statistical analysis.) Know the types of measures.
Review Measures of central tendency
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Table of Contents 1. Standard Deviation
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
By: Amani Albraikan 1. 2  Synonym for variability  Often called “spread” or “scatter”  Indicator of consistency among a data set  Indicates how close.
Describing distributions with numbers
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
Lecture 3 Describing Data Using Numerical Measures.
Objectives The student will be able to: find the variance of a data set. find the standard deviation of a data set.
STATISTICS “CALCULATING DESCRIPTIVE STATISTICS –Measures of Dispersion” 4.0 Measures of Dispersion.
Measures of Dispersion How far the data is spread out.
What is the MEAN? How do we find it? The mean is the numerical average of the data set. The mean is found by adding all the values in the set, then.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Bell Ringers Calculate the mean median and mode for the following sets of data. 1.15, 16, 19, 6, 16, 17, 19 Mean: Median: Mode: 2. 68, 74, 20, 45, 96,
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
Practice Page 65 –2.1 Positive Skew Note Slides online.
© 2010 Pearson Education, Inc. All rights reserved Data Analysis/Statistics: An Introduction Chapter 10.
Summary Statistics: Measures of Location and Dispersion.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
LIS 570 Summarising and presenting data - Univariate analysis.
Vocabulary to know: *statistics *data *outlier *mean *median *mode * range.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
Statistics and Data Analysis
Warm Up Find the median of the following data set. Car accidents on Main and First street during the past 7 years
Chapter 14 Statistics and Data Analysis. Data Analysis Chart Types Frequency Distribution.
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Statistics Review  Mode: the number that occurs most frequently in the data set (could have more than 1)  Median : the value when the data set is listed.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Statistics -Descriptive statistics 2013/09/30. Descriptive statistics Numerical measures of location, dispersion, shape, and association are also used.
Notes 13.2 Measures of Center & Spread
Statistics Unit Test Review
Shoe Sizes.
Measures of central tendency
Percentiles and Box-and- Whisker Plots
11.2 box and whisker plots.
Measures of central tendency
Presentation transcript:

Working with one variable data

Spread Joaquin’s Tests Taran’s Tests: 76, 45, 83, 68, 64 67, 70, 70, 62, 62 What can you infer, justify and conclude about the Joaquin’s and Taran’s tests scores? (Hint: Calculate the mean, median and mode for each. What do they tell you?) J.’s mean = T.’s mean = med = med = mode = none mode =

Spread Mean, median and mode are all good ways to find the centre of your data. This information is most useful when the sets of data being compared are similar. It is also important to find out how much your data is spread out. This gives a lot more insight to data sets that vary from each other.

Consider the following two data sets with identical mean and median values. Why is this information misleading? ( Mean = 5, Median = 5) Set A) 1, 2, 2, 3, 3, 4, 4, 4, 5, 5, 5, 5, 6, 6, 6, 7, 7, 8, 8, 9 Set B) 3, 3, 3, 4, 4, 4, 5, 5, 5, 6, 6, 6, 7, 7, 7 This information is misleading because one graph is bell-shaped and the other is uniform, but the calculations make them appear to be similar when really A and B are spread out quite differently.

Measures of Spread In analysing data, it is often important to know whether it is spread out, or whether it is clustered around the mean. Measures of spread are used to quantify the spread of the data. The measures of spread, or dispersion are: Range Quartiles Variance Standard deviation

Range The simplest measure of dispersion. Calculated by finding the difference between the greatest and the least values of the data. Useful since it is the easiest to understand. Affected by extreme data. The range of values 1, 2, 4, 6, 9, 11, 15, 25 is 25 – 1 = 24

Quartiles and Interquartile Ranges Quartiles divide a set of ordered data into four groups with equal numbers of values. Lowest Datum First Quartile Q 1 Median Q 2 Third Quartile Q 3 Highest Datum The three “dividing points” are the first quartile (Q 1 ), median, (sometimes called the second quartile, Q 2 ), and the third quartile (Q 3 )

Quartiles and Interquartile Ranges Lowest DatumQ1Q1 Median Q 2 Q3Q3 Highest Datum  The interquartile range is Q 1 – Q 3, which is the range of the middle of the data.  The semi-interquartile range is one half of the interquartile range.  Both these ranges indicate how closely the data are clustered around the median.

Box and Whisker Plot Illustrates the Quartiles The Box shows the interquartile range The whiskers represent the lowest and highest values A modified box and whisker plot shows outliers outside of the whiskers See Page 141 for illustrations

Standard Deviation A deviation is the difference between an individual value in a set of data and the mean for the data. Standard Deviation averages the square of the distance that each piece of data is from the mean. The smaller the standard deviation, the more compact the data set.

Standard Deviation – Population σ = Standard Deviation - Population ∑ = Sum μ = Mean N = Number of data in population

Standard Deviation – Sample s = Standard Deviation - Sample ∑ = Sum = Mean n = Number of data in sample

Variance The variance can be found by calculating the average squared difference ( or deviation ) of each value from the mean. PopulationSample Or square the standard deviation.

Standard Deviation – Group Data If you are working with grouped data, you can estimate the standard deviation using the following formula PopulationSample f i = the frequency for a given interval m i = the midpoint of the interval

Find the Measures of Spread Rachelle works part-time at a gas station. Her gross earnings for the past eight weeks are shown. $55$68$83$59$68$95$75$65 Calculate the range, variance, standard deviation, interquartile, and semi-interquartile ranges for her weekly earnings.

Find the Measures of Dispersion Range: The range of Rachelle’s earnings is $

Find the Measures of Dispersion Variance: Gross Earnings Total The variance of Rachelle’s earnings is $

Find the Measures of Dispersion Standard Deviation: The standard deviation of Rachelle’s earnings is $

Find the Measures of Spread Interquartile range: First, put the data into numerical order Interquartile range = Q 3 - Q 1 = 79 – 62 = 17

Find the Measures of Spread Semi-Interquartile range: Semi-Interquartile range = 17/2 = 8.5 Therefore the interquartile range is 17 and semi-interquartile range is 8.5.

Standard Deviation Group Data - Example The following table represents the number of hours per day of watching TV in a sample of 500 people. Number of hours Frequency

Interval Midpoint (m i ) Frequency f i ( )2 = x = x 6.76 = x 0.36 = x 1.96 = x = x = x = = 3.05 THEREFORE THE STANDARD DEVIATION IS APPROXIMATLY 3.05

Z-Scores The number of standard deviations away from the mean a data point is –Thus if our standard deviation is 8 then how many 8’s is a data point (13) away from the average or centre –It is found by dividing the deviation by the standard deviation If your values are below the mean their z score will be negative. Similarly if your value is above the mean your z score will be positive

Percentiles Similar to quartiles Percentiles divide the data into 100 intervals that have equal number of values. k percent of the data are less than or equal to k th percentile P k Which means that you are finding what percent of the data is below your specific value in question Often used for Standardized Tests

Homework Pg 148 #1-6, 14 I LOVE HOMEWORK