STA 291 - Lecture 131 STA 291 Lecture 13, Chap. 6 Describing Quantitative Data – Measures of Central Location – Measures of Variability (spread)

Slides:



Advertisements
Similar presentations
DESCRIBING DISTRIBUTION NUMERICALLY
Advertisements

HS 67 - Intro Health Statistics Describing Distributions with Numbers
Descriptive Measures MARE 250 Dr. Jason Turner.
LECTURE 7 THURSDAY, 11 FEBRUARY STA291 Fall 2008.
Looking at data: distributions - Describing distributions with numbers IPS chapter 1.2 © 2006 W.H. Freeman and Company.
Chapter 3 Describing Data Using Numerical Measures
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
1 Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Range Standard Deviation Interquartile Range (IQR)
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
VARIABILITY. PREVIEW PREVIEW Figure 4.1 the statistical mode for defining abnormal behavior. The distribution of behavior scores for the entire population.
Unit 4 – Probability and Statistics
Chapter 5 – 1 Chapter 5: Measures of Variability The Importance of Measuring Variability The Range IQR (Inter-Quartile Range) Variance Standard Deviation.
Vocabulary for Box and Whisker Plots. Box and Whisker Plot: A diagram that summarizes data using the median, the upper and lowers quartiles, and the extreme.
Describing Data Using Numerical Measures
Describing distributions with numbers
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
Objectives 1.2 Describing distributions with numbers
STA Lecture 111 STA 291 Lecture 11 Describing Quantitative Data – Measures of Central Location Examples of mean and median –Review of Chapter 5.
Methods for Describing Sets of Data
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Section 3.1 Measures of Center. Mean (Average) Sample Mean.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Applied Quantitative Analysis and Practices LECTURE#08 By Dr. Osman Sadiq Paracha.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
The Practice of Statistics Third Edition Chapter 1: Exploring Data 1.2 Describing Distributions with Numbers Copyright © 2008 by W. H. Freeman & Company.
Lecture 3 Describing Data Using Numerical Measures.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Measures of Dispersion How far the data is spread out.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Agenda Descriptive Statistics Measures of Spread - Variability.
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
 The mean is typically what is meant by the word “average.” The mean is perhaps the most common measure of central tendency.  The sample mean is written.
Summary Statistics and Mean Absolute Deviation MM1D3a. Compare summary statistics (mean, median, quartiles, and interquartile range) from one sample data.
Created by: Tonya Jagoe. Measures of Central Tendency mean median mode.
Describing Data Descriptive Statistics: Central Tendency and Variation.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
LIS 570 Summarising and presenting data - Univariate analysis.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Created by: Tonya Jagoe. Measures of Central Tendency & Spread Input the data for these test scores into your calculator to find.
7 th Grade Math Vocabulary Word, Definition, Model Emery Unit 4.
Methods for Describing Sets of Data
Notes 13.2 Measures of Center & Spread
Summary Statistics 9/23/2018 Summary Statistics
Chapter 3 Describing Data Using Numerical Measures
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Describing Distributions Numerically
Quartile Measures DCOVA
Measuring Variation – The Five-Number Summary
Data Analysis and Statistical Software I Quarter: Spring 2003
Basic Practice of Statistics - 3rd Edition
Statistics and Data (Algebraic)
Basic Practice of Statistics - 3rd Edition
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
Basic Practice of Statistics - 3rd Edition
Presentation transcript:

STA Lecture 131 STA 291 Lecture 13, Chap. 6 Describing Quantitative Data – Measures of Central Location – Measures of Variability (spread)

STA Lecture 132 Summarizing Data Numerically Center of the data –Mean (average) –Median –Mode (…will not cover) Spread of the data –Variance, Standard deviation –Inter-quartile range –Range

STA Lecture 133 Mathematical Notation: Sample Mean Sample size n Observations x 1, x 2,…, x n Sample Mean “x-bar” --- a statistic

STA Lecture 134 Mathematical Notation: Population Mean for a finite population of size N Population size (finite) N Observations x 1, x 2,…, x N Population Mean “mu ” --- a Parameter

STA Lecture 135 Percentiles The pth percentile is a number such that p% of the observations take values below it, and (100-p)% take values above it 50 th percentile = median 25 th percentile = lower quartile 75 th percentile = upper quartile

STA Lecture 136 Quartiles 25 th percentile = lower quartile = Q1 75 th percentile = upper quartile = Q3 Interquartile range = Q3 - Q1 (a measurement of variability in the data)

STA Lecture 137 SAT Math scores Nationally (min = 210 max = 800 ) Q1 = 440 Median = Q2 = 520 Q3 = 610 ( -- you are better than 75% of all test takers) Mean = 518 (SD = 115 what is that?)

STA Lecture 138

9 Five-Number Summary Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software SAS output (Murder Rate Data) Quantile Estimate 100% Max % Q % Median % Q % Min 1.60

STA Lecture 1310 Five-Number Summary Maximum, Upper Quartile, Median, Lower Quartile, Minimum Example: The five-number summary for a data set is min=4, Q1=256, median=530, Q3=1105, max=320,000. What does this suggest about the shape of the distribution?

Box plot A box plot is a graphic representation of the five number summary --- provided the max is within 1.5 IQR of Q3 (min is within 1.5 IQR of Q1) STA Lecture 1311

Otherwise the max (min) is suspected as an outlier and treated differently. STA Lecture 1312

STA Lecture 1313

Box plot is most useful when compare several populations STA Lecture 1314

STA Lecture 1315 Measures of Variation Mean and Median only describe the central location, but not the spread of the data Two distributions may have the same mean, but different variability Statistics that describe variability are called measures of spread/variation

STA Lecture 1316 Measures of Variation Range: = max - min Difference between maximum and minimum value Variance: Standard Deviation: Inter-quartile Range: = Q3 – Q1 Difference between upper and lower quartile of the data

STA Lecture 1317 Deviations: Example Sample Data: 1, 7, 4, 3, 10 Mean (x-bar): ( )/5 =25/5=5 dataDeviation Dev. square 1(1 - 5)= (3 - 5)= (4 - 5) = (7 - 5) = (10 - 5) = 5 25 Sum=25Sum = 0 sum = 50

STA Lecture 1318 Sample Variance The variance of n observations is the sum of the squared deviations, divided by n-1.

STA Lecture 1319 Variance: Example ObservationMeanDeviationSquared Deviation Sum of the Squared Deviations50 n-15-1=4 Sum of the Squared Deviations / (n-1)50/4=12.5

So, sample variance of the data is 12.5 Sample standard deviation is 3.53 STA Lecture 1320

Variance/standard deviation is also more susceptible to extreme valued observations. We are using x-bar and variance/standard deviation mostly in the rest of this course. STA Lecture 1321

Population variance/standard deviation Notation for Population variance/standard deviation (usually obtain only after a census) Sigma-square / sigma STA Lecture 1322

standardization Describe a value in a sample by “how much standard deviation above/below the average” The value 6 is one standard deviation above mean -- the value 6 corresponds to a z-score of 1 May be negative (for below average) STA Lecture 1323

STA Lecture 1324 Attendance Survey Question On a 4”x6” index card –write down your name and section number –Question: Independent or not? –Gender of first child and second child from same couple.