Why statisticians were created Measure of dispersion FETP India.

Slides:



Advertisements
Similar presentations
Measures of Dispersion
Advertisements

Descriptive Statistics
Measures of Dispersion or Measures of Variability
Calculating & Reporting Healthcare Statistics
DESCRIBING DATA: 2. Numerical summaries of data using measures of central tendency and dispersion.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Intro to Descriptive Statistics
Biostatistics Unit 2 Descriptive Biostatistics 1.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
2.3. Measures of Dispersion (Variation):
Variability Ibrahim Altubasi, PT, PhD The University of Jordan.
 Deviation is a measure of difference for interval and ratio variables between the observed value and the mean.  The sign of deviation (positive or.
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
Describing Data: Numerical
Describing distributions with numbers
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
Measurement Tools for Science Observation Hypothesis generation Hypothesis testing.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
May 06th, Chapter - 7 INFORMATION PRESENTATION 7.1 Statistical analysis 7.2 Presentation of data 7.3 Averages 7.4 Index numbers 7.5 Dispersion from.
Summary statistics Using a single value to summarize some characteristic of a dataset. For example, the arithmetic mean (or average) is a summary statistic.
Where are we? Measure of central tendency FETP India.
Applied Quantitative Analysis and Practices LECTURE#08 By Dr. Osman Sadiq Paracha.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
1 1 Slide Descriptive Statistics: Numerical Measures Location and Variability Chapter 3 BA 201.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
Chapter 2 Describing Data.
Lecture 3 Describing Data Using Numerical Measures.
Skewness & Kurtosis: Reference
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
Measures of Dispersion
Copyright © 2014 by Nelson Education Limited. 3-1 Chapter 3 Measures of Central Tendency and Dispersion.
INVESTIGATION 1.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Variability Pick up little assignments from Wed. class.
1 Descriptive statistics: Measures of dispersion Mary Christopoulou Practical Psychology 1 Lecture 3.
Numerical Measures of Variability
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
Introduction to Statistics Santosh Kumar Director (iCISA)
Chapter 3, Part A Descriptive Statistics: Numerical Measures n Measures of Location n Measures of Variability.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 5. Measuring Dispersion or Spread in a Distribution of Scores.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
CHAPTER 2: Basic Summary Statistics
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Descriptive Statistics(Summary and Variability measures)
MR. MARK ANTHONY GARCIA, M.S. MATHEMATICS DEPARTMENT DE LA SALLE UNIVERSITY.
Dr Hidayathulla Shaikh. At the end of the lecture students should be able to  Enumerate various measures of central tendency  Enumerate various measures.
Introduction Dispersion 1 Central Tendency alone does not explain the observations fully as it does reveal the degree of spread or variability of individual.
Descriptive Statistics
Measures of dispersion
Mathematical Presentation of Data Measures of Dispersion
Measures of Central Tendency
Descriptive Statistics
Description of Data (Summary and Variability measures)
Summary descriptive statistics: means and standard deviations:
Numerical Descriptive Measures
Descriptive Statistics
Variability.
Summary descriptive statistics: means and standard deviations:
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
CHAPTER 2: Basic Summary Statistics
Presentation transcript:

Why statisticians were created Measure of dispersion FETP India

Competency to be gained from this lecture Calculate a measure of variation that is adapted to the sample studied

Key issues Range Inter-quartile variation Standard deviation

Measures of spread, dispersion or variability The measure of central tendency provides important information about the distribution However, it does not provide information concerning the relative position of other data points in the sample Measure of spread, dispersion or variability address are needed Range

Why one needs to measure variability Students Marks obtained BiologyPhysicsChemistry Mean200 VariationNilSlightSubstantial Range02200 Range

Every concept comes from a failure of the previous concept Mean is distorted by outliers Median takes care of the outliers Range

The range: A simple measure of dispersion Take the difference between the lowest value and the highest value Limitation:  The range says nothing about the values between extreme values  The range is not stable: As the sample size increases, the range can change dramatically  Statistics cannot be used to look at the range Range

Example of a range Take a sample of 10 heights:  70, 95, 100, 103, 105, 107, 110, 112, 115 and 140 cms Lowest (Minimum) value  70cm Highest (Maximum) value  140cm Range  140 – 70 = 70cm Range

Three different distributions with the same range (35 Kgs) XXXXXXXXX XXXXXXXX XX X Even Uneven Clumped XXXXXXX Range

The range increases with the sample size ValuesRange Initial set (5 values) New set (3 more values) New set (3 more values) New set (3 more values) Two ranges based on different sample sizes are not comparable Range

Percentiles and quartiles Percentiles  Those values in a series of observations, arranged in ascending order of magnitude, which divide the distribution into two equal parts  The median is the 50th percentile Quartiles  The values which divide a series of observations, arranged in ascending order, into 4 equal parts  The median is the 2nd quartile Inter-quartile range

First 25%2nd 25%3rd 25%4th 25% Q1 Q2 (Median) Q3 Sorting the data in increasing order Median  Middle value (if n is odd)  Average of the two middle values (if n is even)  A measure of the “centre” of the data Quartiles divide the set of ordered values into 4 equal parts

The inter-quartile range The central portion of the distribution Calculated as the difference between the third quartile and the first quartile Includes about one-half of the observations Leaves out one quarter of the observations Limitations:  Only takes into account two values  Not a mathematical concept upon which theories can be developed Inter-quartile range

The inter-quartile range: Example Values  29, 31, 24, 29, 30, 25 Arrange  24, 25, 29, 29, 30, 31 Q1  Value of (n+1)/4=1.75  = Q3  Value of (n+1)*3/4=5.2  Q3 = = 30.2 Inter-quartile range = Q3 – Q1 = 30.2 – Inter-quartile range

Graphic representation of the inter-quartile range Inter-quartile range

The mean deviation from the mean Calculate the mean of all values Calculate the difference between each value and the mean Calculate the average difference between each value and the mean Limitations:  The average between negative and positive deviations may generate a value of 0 while there is substantial variation Standard deviation

The mean deviation from the mean: Example Data Mean = 280/7 = 40 Mean deviation from mean ……… Sum = 0 Standard deviation

Absolute mean deviation from the mean Calculate the mean of all values Calculate the difference between each value and the mean and take the absolute value Calculate the average difference between each value and the mean Limitations:  Absolute value is not good from a mathematical point of view Standard deviation

Absolute mean deviation from the mean: Example Standard deviation Data Mean = 280/7 = 40 Mean deviation from mean ……… Absolute values Mean deviation from mean = 120/7 = 17.1

Calculating the variance (1/2) 1.Calculate the mean as a measure of central location (MEAN) 2.Calculate the difference between each observation and the mean (DEVIATION) 3.Square the differences (SQUARED DEVIATION) Negative and positive deviations will not cancel each other out Values further from the mean have a bigger impact Standard deviation

Calculating the variance (2/2) 4.Sum up these squared deviations (SUM OF THE SQUARED DEVIATIONS) 5.Divide this SUM OF THE SQUARED DEVIATIONS by the total number of observations minus 1 (n-1) to give the VARIANCE Why divide by n - 1 ?  Adjustment for the fact that the mean is just an estimate of the true population mean  Tends to make the variance larger Standard deviation

The standard deviation Take the square root of the variance Limitations:  Sensitive to outliers Standard deviation

Example PatientNo of X rays Deviation from mean Absolute deviation Square deviation Square of observations A1010-9= = = 100 B88-9= = 18 2 = 64 C66-9= = 96 2 = 36 D = = = 144 E99-9 = = 09 2 = 81 Total Mean = 45/9 = 9 x-raysMean deviation = 8/5 = 1.6 x-rays Variance = (20/(5-1)) = 20/4 = 5 x-raysStandard deviation =  5 = 2.2

Properties of the standard deviation Unaffected if same constant is added to (or subtracted from) every observation If each value is multiplied (or divided) by a constant, the standard deviation is also multiplied (or divided) by the same constant Standard deviation

Need of a measure of variation that is independent from the measurement unit The standard deviation is expressed in the same unit as the mean:  e.g., 3 cm for height, 1.4 kg for weight Sometimes, it is useful to express variability as a percentage of the mean  e.g., in the case of laboratory tests, the experimental variation is ± 5% of the mean Standard deviation

The coefficient of variation Calculate the standard deviation Divide by the mean  The standard deviation becomes “unit free” Coefficient of variation (%) =  [S.D / Mean] x 100 (Pure number) Standard deviation

Uses of the coefficient of variation Compare the variability in two variables studied which are measured in different units  Height (cm) and weight (kg) Compare the variability in two groups with widely different mean values  Incomes of persons in different socio- economic groups Standard deviation

A summary of measures of dispersion MeasureAdvantagesDisadvantages Range Obvious Easy to calculate Uses only 2 observations Increases with the sample size Can be distorted by outliers Inter-quartile range Not affected by extreme values Uses only 2 observations Not amenable for further statistical treatment Standard deviation Uses every value Suitable for further analysis Highly influenced by extreme values

Choosing a measure of central tendency and a measure of dispersion Type of distribution Measure of central tendency Measure of dispersion Normal Mean Standard deviation Skewed Median Inter-quartile range Exponential or logarithmic Geometric mean Consult with the statistician

Key messages Report the range but be aware of its limitations Report the inter-quartile deviation when you use the median Report the standard deviation when you use a mean