Summary Statistics: Measures of Location and Dispersion.

Slides:



Advertisements
Similar presentations
C. D. Toliver AP Statistics
Advertisements

Chapter 2 Exploring Data with Graphs and Numerical Summaries
Descriptive Measures MARE 250 Dr. Jason Turner.
Grade 10 Mathematics Data handling.
Measures of Dispersion
Descriptive Statistics
Measures of Dispersion or Measures of Variability
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Slides by JOHN LOUCKS St. Edward’s University.
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
Chapter 2 Describing Data with Numerical Measurements
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
Methods for Describing Sets of Data
Modified by ARQ, from © 2002 Prentice-Hall.Chap 3-1 Numerical Descriptive Measures Chapter %20ppts/c3.ppt.
Review Measures of central tendency
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Percentiles and Box – and – Whisker Plots Measures of central tendency show us the spread of data. Mean and standard deviation are useful with every day.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Measures of Center.
Lecture 3 Describing Data Using Numerical Measures.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Numerical Statistics Given a set of data (numbers and a context) we are interested in how to describe the entire set without listing all the elements.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Chapter 2 Means to an End: Computing and Understanding Averages Part II  igma Freud & Descriptive Statistics.
Quantitative data. mean median mode range  average add all of the numbers and divide by the number of numbers you have  the middle number when the numbers.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
1 Measures of Center. 2 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely.
Summary Statistics and Mean Absolute Deviation MM1D3a. Compare summary statistics (mean, median, quartiles, and interquartile range) from one sample data.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data Descriptive Statistics: Central Tendency and Variation.
1 Descriptive Statistics Descriptive Statistics Ernesto Diaz Faculty – Mathematics Redwood High School.
Unit 3: Averages and Variations Week 6 Ms. Sanchez.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Using Measures of Position (rather than value) to Describe Spread? 1.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
LIS 570 Summarising and presenting data - Univariate analysis.
Unit 3: Averages and Variations Part 3 Statistics Mr. Evans.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Descriptive Statistics(Summary and Variability measures)
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Descriptive Statistics ( )
Chapter 3 Describing Data Using Numerical Measures
Chapter 5 : Describing Distributions Numerically I
Averages and Variation
Description of Data (Summary and Variability measures)
Chapter 3 Describing Data Using Numerical Measures
Numerical Descriptive Measures
Descriptive Statistics
Box and Whisker Plots Algebra 2.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Numerical Descriptive Measures
11.2 box and whisker plots.
Numerical Descriptive Measures
Numerical Descriptive Statistics
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Numerical Descriptive Measures
Presentation transcript:

Summary Statistics: Measures of Location and Dispersion

The sum of values,, can be denoted as.

 Select 4 students and ask “how many brothers and sisters do you have?” Data: 2, 3, 1, 3 Or we can write

 Solve the following:

 Measure of Central Tendency - Description of Average (Typical Value)  Sample Mean:

 number of siblings – Data: 2, 3, 1, 3  Suppose we had selected a 5 th person for our sample which had 10 siblings. New Data: 2, 3, 1, 3, 10  The sample mean is sensitive to extreme values and does not have to be a possible data value.

 rank data from smallest to largest  if n is odd, median is the middle score  if n is even, median is the mean of two middle scores

 number of siblings – Data: 2, 3, 1, 3  New Data: 2, 3, 1, 3, 10  Sample median is not sensitive to extreme scores  Half the data will fall above the sample median and half below the sample median

 The median is a better measure of central tendency if extreme scores exist.  If extreme scores are unlikely, the mean varies less from sample to sample than the median and is a better measure.

 If the distribution is right skewed  If the distribution is symmetric  If the distribution is left skewed

 sample mode: most frequent score  Example: number of siblings – Data: 2,3,1,3 Mode = 3  New Data: 2,3,1,3,10 Mode = 3  Mode does not always exist/can be more than one  Also, it is unstable  Should be used with qualitative data

 Example: number of siblings – Data: 2,3,1,3  Midrange =  New Data: 2,3,1,3,10  Midrange =  Midrange is totally dependent on extreme scores.

 Percentiles – gives the percentage below an observation  Quartiles – divide the data into four equally sized parts , First Quartile: 25 th percentile , Second Quartile ( ), 50 th percentile , Third Quartile, 75 th percentile

 Order the data from smallest to largest  Find. This is  is the median of the lower half of the data; that is, it is the median of the data falling below (not including )  is the median of the upper half of the data; (same as above)

 Interquartile range (IQR) = Q3 – Q1  Range of the middle 50% of the data  5 number summary – The low score, Q1, Q2, Q3, and the high score

StudentsFaculty

StudentsFaculty Low = 0Low = 10 Q1 = 1Q1 = 15 Q2 = 5Q2 = 25 Q3 = 7Q3 = 31 High = 10High = 73

 The box goes from Q1 to Q3 and represents IQR  The line through the box is Q2 ( )  Extreme values are identified by *’s  Lines, called whiskers, run from Q1 to the lowest value and from Q3 to the highest value (If the low or high are extreme then the whisker goes to the next value)

Distribution #1Distribution # Distribution #1Distribution #2 = 35 = 35 mode = 35 midrange =35midrange = 35

Example: Years of experience of faculty Data: 1, 30, 22, 10, 5  Range is sensitive to extreme scores (Based entirely on the high and low)  Range is easy to compute

 Large values of suggest large variability  It is difficult to interpret since it is in square units  Keep in mind it can never be negative

Example: Years of experience of faculty Data: 1, 30, 22, 10, 5 sample standard deviation – measures the average distance data points are from Standard deviation is in the same units as the data

Z-score – Gives the number of standard deviations an observation is above or below the mean Example: Test scores = 79, s = 9 If your score is 88%, what is your z-score? If your score is 63%, what is your z-score?

 Approximately 68% of the data fall within 1 standard deviation of the mean  Approximately 95% of the data fall within 2 standard deviations of the mean  Approximately 99.7% of the data fall within 3 standard deviations of the mean

Example: Suppose that the amount of liquid in “12 oz.” Pepsi cans is a mound shaped distribution with oz. and s = 0.1 oz.