Lesson 1 - 2 Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website.

Slides:



Advertisements
Similar presentations
DESCRIBING DISTRIBUTION NUMERICALLY
Advertisements

Click the mouse button or press the Space Bar to display the answers.
Descriptive Measures MARE 250 Dr. Jason Turner.
Class Session #2 Numerically Summarizing Data
Numerically Summarizing Data
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
1 Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Range Standard Deviation Interquartile Range (IQR)
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Ways to Describe Data Sets
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Describing Data: Numerical
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Describing distributions with numbers
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Chapter 1 Exploring Data
Objectives 1.2 Describing distributions with numbers
1.1 Displaying Distributions with Graphs
Have out your calculator and your notes! The four C’s: Clear, Concise, Complete, Context.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Describing distributions with numbers
The Practice of Statistics Third Edition Chapter 1: Exploring Data 1.2 Describing Distributions with Numbers Copyright © 2008 by W. H. Freeman & Company.
Lecture 3 Describing Data Using Numerical Measures.
Lesson Describing Distributions with Numbers adapted from Mr. Molesky’s Statmonkey website.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
INVESTIGATION 1.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Exploring Data 1.2 Describing Distributions with Numbers YMS3e AP Stats at LSHS Mr. Molesky 1.2 Describing Distributions with Numbers YMS3e AP Stats at.
Chapter 3 Looking at Data: Distributions Chapter Three
Review BPS chapter 1 Picturing Distributions with Graphs What is Statistics ? Individuals and variables Two types of data: categorical and quantitative.
Organizing Data AP Stats Chapter 1. Organizing Data Categorical Categorical Dotplot (also used for quantitative) Dotplot (also used for quantitative)
Chapter 5 Describing Distributions Numerically.
LIS 570 Summarising and presenting data - Univariate analysis.
Chapter 1: Exploring Data, cont. 1.2 Describing Distributions with Numbers Measuring Center: The Mean Most common measure of center Arithmetic average,
Descriptive Statistics(Summary and Variability measures)
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Statistics -Descriptive statistics 2013/09/30. Descriptive statistics Numerical measures of location, dispersion, shape, and association are also used.
AP Statistics 5 Number Summary and Boxplots. Measures of Center and Distributions For a symmetrical distribution, the mean, median and the mode are the.
Descriptive Statistics ( )
UNIT ONE REVIEW Exploring Data.
Chapter 1: Exploring Data
Numerical descriptions of distributions
Chapter 3 Describing Data Using Numerical Measures
CHAPTER 2: Describing Distributions with Numbers
1st Semester Final Review Day 1: Exploratory Data Analysis
Averages and Variation
Description of Data (Summary and Variability measures)
Chapter 3 Describing Data Using Numerical Measures
CHAPTER 1 Exploring Data
Numerical Descriptive Measures
CHAPTER 1 Exploring Data
Descriptive Statistics
DAY 3 Sections 1.2 and 1.3.
Please take out Sec HW It is worth 20 points (2 pts
1.2 Describing Distributions with Numbers
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Organizing Data AP Stats Chapter 1.
Measure of Central Tendency
Exploratory Data Analysis
Chapter 1: Exploring Data
Honors Statistics Review Chapters 4 - 5
Measures of Center and Spread
Describing Distributions with Numbers
Presentation transcript:

Lesson Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website

Knowledge Objectives What is meant by a resistant measure? Two reasons why we use squared deviations rather just average deviations from the mean What is meant by degrees of freedom”

Construction Objectives Identify situations in which the mean is the most appropriate measure of center and situations in which the median is the most appropriate measure Given a data set: –Find the quartiles –Find the five-number summary –Compute the mean and median as measures of center –Compute the interquartile range (IQR) –Use the 1.5  IQR rule to identify outliers –Compute the standard deviation and variance as measures of spread

Construction Objectives cont Identify situations in which the standard deviation is the most appropriate measure of spread and situations in which the interquartile range is the most appropriate measure Explain the effect of a linear transformation of a data set on the mean, median, and standard deviation of the set Use numerical and graphical techniques to compare two or more data sets

Vocabulary Mean – the average value Median – the middle value (in an ordered list) Resistant measure – a measure (statistic or parameter) that is not sensitive to the influence of extreme observations Mode – the most frequent data value Range – difference between the largest and smallest observations P th percentile – p percent of the observations(in an ordered list) fall below at or below this number Quartile – multiples of 25 th percentile (Q1 – 25 th ; Q2 –50 th or median; Q3 – 75 th ) Five number summary – the minimum, Q1, Median, Q3, maximum

Vocabulary cont Boxplot – graphs the five number summary and any outliers Interquartile range (IQR) – where IQR = Q3 – Q1 Outlier – a data value that lies outside the interval [Q1 – 1.5  IQR, Q  IQR] Variance – the average of the squares of the deviations from the mean Standard Deviation – the square toot of the variance Degrees of freedom – the number of independent pieces of information that are included in your measurement Linear transformation – changes the data in the form of x new = a + bx

Measures of Center Numerical descriptions of distributions begin with a measure of its “center” If you could summarize the data with one number, what would it be? Mean: The “average” value of a dataset Median: The “middle” value of an ordered dataset Arrange observations in order min to max Locate the middle observation, average if needed.

Mean vs Median The mean and the median are the most common measures of center If a distribution is perfectly symmetric, the mean and the median are the same The mean is not resistant to outliers The mode, the data value that occurs the most often, is a common measure of center for categorical data You must decide which number is the most appropriate description of the center... Mean Median Applet meanmedian.html Use the mean on symmetric data and the median on skewed data or data with outliers

Distributions Parameters Skewed Left: (tail to the left) Mean substantially smaller than median (tail pulls mean toward it) Mean < Median < Mode Mode Median Mean

Distributions Parameters Symmetric: Mean roughly equal to median Mean ≈ Median ≈ Mode Mode Median Mean

Distributions Parameters Skewed Right: (tail to the right) Mean substantially greater than median (tail pulls mean toward it) Mean > Median > Mode Mode Median Mean

Central Measures Comparisons Measure of Central Tendency ComputationInterpretationWhen to use Mean μ = (∑x i ) / N x‾ = (∑x i ) / n Center of gravity Data are quantitative and frequency distribution is roughly symmetric Median Arrange data in ascending order and divide the data set into half Divides into bottom 50% and top 50% Data are quantitative and frequency distribution is skewed Mode Tally data to determine most frequent observation Most frequent observation Data are categorical or the most frequent observation is the desired measure of central tendency

Example 1 Which of the following measures of central tendency resistant? 1.Mean 2.Median 3.Mode Not resistant Resistant

Example 2 Given the following set of data: 70, 56, 48, 48, 53, 52, 66, 48, 36, 49, 28, 35, 58, 62, 45, 60, 38, 73, 45, 51, 56, 51, 46, 39, 56, 32, 44, 60, 51, 44, 63, 50, 46, 69, 53, 70, 33, 54, 55, 52 What is the mean? What is the median? What is the mode? What is the shape of the distribution? , 51, 56 Symmetric (tri-modal)

Example 3 Given the following types of data and sample sizes, list the measure of central tendency you would use and explain why? Sample of 50 Sample of 200 Hair color Height Weight Parent’s Income Number of Siblings Age Does sample size affect your decision? mode mean median mean Not in this case, but the larger the sample size, might allow use to use the mean vs the median

Sample Data Consider the following test scores for a small class: Plot the data and describe the SOCS: What number best describes the “center”? What number best describes the “spread’? Shape? Outliers? Center? Spread?

Day 1 Summary and Homework Summary –Three characteristics must be used to describe distributions (from histograms or similar charts) Shape (uniform, symmetric, bi-modal, etc) Center (mean, median, mode measures) Spread (variance – next lesson) –Median is resistant to outliers; mean is not! –Use Mean for symmetric data –Use Median for skewed data (or data with outliers) –Use Mode for categorical data Homework –pg 74 – 75: problems 27-31