Asian School of Business PG Programme in Management (2005-06) Course: Quantitative Methods in Management I Instructor: Chandan Mukherjee Session 2: Summarising.

Slides:



Advertisements
Similar presentations
Statistical Reasoning for everyday life
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Lesson Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website.
Measures of Dispersion
Describing Quantitative Data with Numbers Part 2
Descriptive Statistics
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Intro to Descriptive Statistics
1 Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Range Standard Deviation Interquartile Range (IQR)
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Unit 4 – Probability and Statistics
5 Number Summary Box Plots. The five-number summary is the collection of The smallest value The first quartile (Q 1 or P 25 ) The median (M or Q 2 or.
Vocabulary for Box and Whisker Plots. Box and Whisker Plot: A diagram that summarizes data using the median, the upper and lowers quartiles, and the extreme.
Box and Whisker Plots A diagram that summarizes data by dividing it into four parts. It compares two sets of data.
Objectives 1.2 Describing distributions with numbers
Exploration of Mean & Median Go to the website of “Introduction to the Practice of Statistics”website Click on the link to “Statistical Applets” Select.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Modified by ARQ, from © 2002 Prentice-Hall.Chap 3-1 Numerical Descriptive Measures Chapter %20ppts/c3.ppt.
Applied Quantitative Analysis and Practices LECTURE#08 By Dr. Osman Sadiq Paracha.
Review Measures of central tendency
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Percentiles and Box – and – Whisker Plots Measures of central tendency show us the spread of data. Mean and standard deviation are useful with every day.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Chapter 2 Describing Data.
Section 3.3 Measures of Relative Position HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems,
Numerical Statistics Given a set of data (numbers and a context) we are interested in how to describe the entire set without listing all the elements.
INVESTIGATION 1.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
Box and Whisker Plots Measures of Central Tendency.
Numerical Measures of Variability
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
Essential Statistics Chapter 21 Describing Distributions with Numbers.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 4 – Slide 1 of 23 Chapter 3 Section 4 Measures of Position.
Chapter 3 Averages and Variation Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Summary Statistics, Center, Spread, Range, Mean, and Median Ms. Daniels Integrated Math 1.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Descriptive Statistics(Summary and Variability measures)
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
Warm Up What is the mean, median, mode and outlier of the following data: 16, 19, 21, 18, 18, 54, 20, 22, 23, 17.
Descriptive Statistics ( )
Notes 13.2 Measures of Center & Spread
2.5: Numerical Measures of Variability (Spread)
CHAPTER 2: Describing Distributions with Numbers
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Averages and Variation
Description of Data (Summary and Variability measures)
Chapter 3 Describing Data Using Numerical Measures
10-3 Data Distributions Warm Up Lesson Presentation Lesson Quiz
Numerical Descriptive Measures
Box and Whisker Plots Algebra 2.
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
Describing Distributions Numerically
Quartile Measures DCOVA
The absolute value of each deviation.
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
Essential Statistics Describing Distributions with Numbers
Basic Practice of Statistics - 3rd Edition
Numerical Descriptive Measures
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
The Five-Number Summary
Basic Practice of Statistics - 3rd Edition
Presentation transcript:

Asian School of Business PG Programme in Management ( ) Course: Quantitative Methods in Management I Instructor: Chandan Mukherjee Session 2: Summarising a Distribution

Modern/EDA Terminology Classical Terminology Cluster, Level, CentreCentral Tendency, Location Scatter, SpreadDispersion ShapeSkewness TailsKurtosis

Numerical Summaries (Descriptive Statistics) FeatureMean based summaryOrder based summary LevelArithmatic MeanMedian SpreadStandard DeviationMidspread The order based summaries are resistant to extreme values i.e. that are not unduly influenced by a small part of the data. That is why they are called Resistant Summaries.

Numerical Summaries (Descriptive Statistics) Both the mean based and the order based summaries of the spread of a distribution (Standard Deviation and Midspread) are scale dependent. That is why we need to neutralise the scale effect by dividing by their respective summaries of the Center. Coefficient of Variation = Standard Deviation / Mean Relative Midspread = Midspread / Median

Definitions Variance = Average Squared Distance from the Mean = Serial No. Data (X) X–Mean(X–Mean) Total Average = Variance where

DEFINITION (contd.) Standard Deviation (SD ) = Square root of Variance Co-efficient Variation = SD/Mean = (or, 5.17%)

Median = The value that divides the ordered data values into two equal halves To compute Median: Sort the data in ascending order Divide the number of observations (data values) by 2 If the result is an integer, say 9, then median is the average of the 9th and 10th observations If the result is not an integer, say 9.3, then round it up to the next integer above i.e. 10 in this case. The 10th observation is the median DEFINITION (contd.)

Average of 26 th and 27 th observations = Example: Finding the Median

DEFINITION (contd.) Quartiles = The three values that divide the ordered observations into four equal parts 25% of the observations lie below the First (Lower) Quartile 50% of the observations lie below the Second (Middle) Quartile 75% of the observations lie below the Third (Upper) Quartile The Third or the Middle Quartile is obviously the Median

DEFINITION (contd.) To compute the Lower and the Upper Quartile: Sort the data in ascending order Divide the total number of observations by 4 If the result is an integer, say 12, then take the average of the 12 th and the 13 th observations from the lowest observation (downward) as the Lower Quartile. Similarly, take the 12th and the 13th observations from the highest observation (upward) as the Upper Quartile If the result is not an integer, say, 12.7, then round it up to the next integer above, i.e. 13 in this case. The Lower Quartile is the 13 th observations from the lowest, and the Upper quartile is the 13 th observation from the highest

DEFINITION (contd.) Example: Finding the Lower & Upper Quatiles Lower Quartile ( )/2 = Upper quartile ( )/2 =

DEFINITION (contd.) Midspread = Upper Quartile – Lower Quartile The range that holds the middle 50% of the observations Relative Midspread = Midspread / Median = ( – 40.04)/61.80 = 1.41

Five Number Summary Five numbers can comprehensively summarise the features of a distribution without being unduly affected by a small part of the data Minimum (MN) Lower Quartile (LQ) Median (MD) Upper Quartile (UQ) Maximum (MX)

Lower Tail Upper Tail Five Number Summary is Comprehensive: The Grand Summary of a Distribution

Indentifying the Extreme Values: Who Are The Outliers? Here is a thumb rule (based on theory): Step = 1.5 times Midspread Lower Fence = Lower Quartile – Step Upper Fence = Upper Quartile + Step All observations below the Lower Fence are Negative Outliers All observations above the Upper Fence are Positive Outliers

Cotton Blended Yearn Companies: Five Number Summary & Fences MN 0.07 LQ MD UQ MX Midspread Step Lower Fence Upper Fence258.61

Box & Whisker Plot MD UQ LQ MX MN Outliers! Upper Fence Lower Fence

Comparing Two Distributions of Gross Fixed Asset: Fabrics and Yarn Companies

Summary Of The Points