Measures of Dispesion Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.

Slides:



Advertisements
Similar presentations
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Advertisements

SUMMARIZING DATA: Measures of variation Measure of Dispersion (variation) is the measure of extent of deviation of individual value from the central value.
The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set.
Agricultural and Biological Statistics
Measures of Dispersion
Measures of Dispersion
Presentation on Statistics for Research Lecture 7.
DESCRIBING DATA: 2. Numerical summaries of data using measures of central tendency and dispersion.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Measures of Central Tendency
WFM 5201: Data Management and Statistical Analysis
Intro to Descriptive Statistics
Biostatistics Unit 2 Descriptive Biostatistics 1.
Learning Objectives for Section 11.3 Measures of Dispersion
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Describing Data: Numerical
Section 3.2 Measures of Variation Range Standard Deviation Variance.
Chapter-2 Statistical description of quantitative variable.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
Chapter 3 Averages and Variations
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Worked examples and exercises are in the text STROUD PROGRAMME 27 STATISTICS.
13-1 Copyright  2010 McGraw-Hill Australia Pty Ltd PowerPoint slides to accompany Croucher, Introductory Mathematics and Statistics, 5e Chapter 13 Measures.
Describing Data: Numerical Measures. GOALS 1.Calculate the arithmetic mean, weighted mean, median, mode, and geometric mean. 2.Explain the characteristics,
INVESTIGATION 1.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Measures of Central Tendency. These measures indicate a value, which all the observations tend to have, or a value where all the observations can be assumed.
Presentation on Statistics for Research Lecture 7.
CHAPTER 3  Descriptive Statistics Measures of Central Tendency 1.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
Chapter 3, Part A Descriptive Statistics: Numerical Measures n Measures of Location n Measures of Variability.
Chapter 3: Averages and Variation Section 2: Measures of Dispersion.
Summary Statistics: Measures of Location and Dispersion.
Measures of Variation Range Standard Deviation Variance.
Chapter 11 Data Descriptions and Probability Distributions Section 3 Measures of Dispersion.
Applied Quantitative Analysis and Practices LECTURE#07 By Dr. Osman Sadiq Paracha.
CHAPTER 2: Basic Summary Statistics
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
The Third lecture We will examine in this lecture: Mean Weighted Mean Median Mode Fractiles (Quartiles-Deciles-Percentiles) Measures of Central Tendency.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Chapter ( 2 ) Strategies for understanding the meanings of Data : Learning outcomes Understand how data can be appropriately organized and displayed Understand.
Measures of Central Tendency. Definition Measures of Central Tendency (Mean, Median, Mode)
Bio-Statistic KUEU 3146 & KBEB 3153 Bio-Statistic Data grouping and presentations Part II: Summarizing Data.
Day 2. GOALS 1.Calculate the arithmetic mean, weighted mean, median, mode, and geometric mean. 2.Explain the characteristics, uses, advantages, and disadvantages.
Summarizing Data with Numerical Values Introduction: to summarize a set of numerical data we used three types of groups can be used to give an idea about.
Central Tendency Quartiles and Percentiles (الربيعيات والمئينات)
Exploratory Data Analysis
Methods of mathematical presentation (Summery Statistics)
Measures of Dispersion
MATHEMATICS The Measure of Data Location
Descriptive Statistics
Measures of variation (dispersion) [مقاييس التشتت]
Dr.Fatima Alkhaledy M.B.Ch.B;F.I.C.M.S/C.M
Introductory Mathematics & Statistics
Dispersion.
Describing, Exploring and Comparing Data
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Numerical Measures: Centrality and Variability
NUMERICAL DESCRIPTIVE MEASURES
Characteristics of the Mean
MEASURES OF CENTRAL TENDENCY
Describing Data: Numerical Measures
Numerical Descriptive Measures
CHAPTER 2: Basic Summary Statistics
Presentation transcript:

Measures of Dispesion Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016

Measures of Dispersion After the central tendency of a frequency distribution is determined. The next step is to determine how spread out (Dispersed) the numbers are from each others or from the average. This can be done by calculating measures of dispersion. They are also called measures of spread or variation. Standard Measures of Dispersion Range Variance Standard Deviation Coefficient of variation

1. Range It is the simplest measure of dispersion Definition: It is the difference between the highest and lowest values. Advantages: It is quick and easy to calculate. Disadvantages: - It does not use directly the majority of the observations. - It is very sensitive to extreme values. Calculation: A)Ungrouped Data: Largest observation- Smallest observation B)Grouped Data: Upper limit of last interval – Lowest limit of first interval Example (1): The following data represent the height of 10 persons: 20 – 60 – 53 – 80 – 89 – 56 – 42 – 46 – 88 – 95 cm -Find the range Answer: Largest observation = 95, Smallest observation = 20 The range= = 75 cm

1. Range Example (2): The following data represent the age of 50 persons: -Compute the range Answer: Upper limit of last interval = 65 Lower limit of first interval= 15 The range= = 50 years Age (Years)No. of persons (Frequency) Total50

2. Variance (S 2 ) It is the average of the squared deviation of observations- from the mean. Formula based on definition: ∑(x i -x) 2 S 2 = n-1 Computation Formula: ∑x 2 - (∑x) 2 S 2 = n n-1 Where: ∑x 2 = Sum of squared observations (∑x) 2 = Square of sum observations N = No. of observations It is the average of the squared deviation of observations- from the mean. Formula based on definition: ∑(x i -x) 2 S 2 = n-1 Computation Formula: ∑x 2 - (∑x) 2 S 2 = n n-1 Where: ∑x 2 = Sum of squared observations (∑x) 2 = Square of sum observations N = No. of observations

3. Standard Deviation (S) Definition: It is a measure of the spread of data around their mean It is the positive square root of the variance  The value of standard deviation and variance are always positive. Advantages: -It is the preferred measure of dispersion. - It uses all of the measurements in the set. Disadvantages: - It is influenced by a few (or even only one) extreme values. - It is not calculated in open table. Calculation: A)Ungrouped Data Steps for calculation: 1- Determine the sum of observations (∑x i ) 2- Find (∑x i ) 2 3- Find the sure of each observation x i 2 4- Find the sum of the squared observation (∑x i 2 ) ∑x i 2 - (∑x i ) 2 S = n √ n-1

3. Standard Deviation (S) Example: The following data represent the weight of 5 persons. 14 – 15 – 16 – 17 – 18 kg - Find the standard deviation Answer: (∑x i )= = 80 (∑x i ) 2 = (80) 2 = 6400 (x i 2 ) = = S = 5 = 1.6 kg √ 5-1

3. Standard Deviation (S) Calculation: B) Grouped Data Steps for calculation: 1-Determine the mid-point of each interval x j 2- Find the product f j x j for each interval, then find the sum of these product∑ f j x j 3- Find the product f j x j 2 by multiplying the mid point x j by the corresponding product f j x j 4- Find the sum of the squared observation ∑f j x j 2 ∑f j x j 2 - (∑f j x j ) 2 S = ∑ f j √ ∑ f j -1

3. Standard Deviation (S) Example(1): The following table represents the weight of 20 persons. - Compute the standard deviation Weight (kg)Frequency Total20

3. Standard Deviation (S) Answer(1): (720) 2 S = 20 = 10.5 kg √ 20-1 Weight (kg)fjfj XjXj F j X j F j X j Total ∑ f j= = ∑f j x j 28000=∑f j x j 2

3. Standard Deviation (S) Example(2): The following table represents the number of attacks of diarrhea among 30 children. - Compute the standard deviation Attacks of diarrheaFrequency Total30

3. Standard Deviation (S) Answer(2): (178.5) 2 S = 30 = 2.98 = 3 attacks √ 30-1 Attacks of diarrheafjfj XjXj F j X j F j X j Total ∑ f j= =∑ f j x j =∑f j x j 2

4. Semi-inter-quartile-range(S.I.Q.R) Definition: It equal to half distance between third and first quartiles.. Advantages: -It can be calculated for closed or open –ended tables. - It is the preferable measure of variation when we want to avoid outlying observations. Disadvantages: - It does not use directly the majority of the observations. Calculation: A)Ungrouped Data Steps for calculation: 1- Arrange Data 2- Find Q1 (first quartile) and Q3 (third quartile) 3- Rank Q1= n+1/4, Rank Q3= 3( n+1)/4 4- Compute S.I.Q.R given by: Q3 - Q1 S.I.Q.R= 2

4. Semi-inter-quartile-range(S.I.Q.R) Example : From the following set of observations compute semi-inter-quartile range 15 – 34 – 7 – 12 – – 1 – 42 – 56 – 28 – 13 – 24 – 35 Answer : Arrange Data 1 -7 – 9 – 12 – 13 – 15 – 18 – 24 – 28 – 34 – 35 – 42 – 56 Q1 rank = (13+1)/ 4 = 14/ 4 = 3.5 Q1= 9+12= 10.5 Q3 rank= 3(13+1)/4= 42/4= 10.5 Q3= 34+35= S.I.Q.R= = 12 2

4. Semi-inter-quartile-range(S.I.Q.R) Calculation: B) Grouped Data Steps for calculation: 1- compute the general rank of Q1= n/4 and general rank of Q3 = 3n/4 2- Determine the Q1 by comparing the general rank of Q1 with CF values from above downwards and the first interval met with whose CF is equal to or greater than the general rank of the Q1 is the Q1 interval. 3- Determine the Q3 by same way of Q1 but for Q3. 4- Determine the special rank of Q1 and Q3 given by: special rank = general rank – CF of pre Q interval 5- Determine the value of Q1 and Q3 given by: Q i = LL of Q i interval + special rank X width of Q i interval frequency of Q i interval 6- S.I.Q.R = Q3 - Q1 2

4. Semi-inter-quartile-range(S.I.Q.R) B) Grouped Data Example: The following table represents data about age of 40 persons: - Compute the semi-inter-quartile range? Age (Years)Frequency Total40

4. Semi-inter-quartile-range(S.I.Q.R) B) Grouped Data Answer: Find the cumulative frequency Age (Years)FrequencyCF Total40

4. Semi-inter-quartile-range(S.I.Q.R) B) Grouped Data Answer: Q1: General Rank= n/4= 40/4= 10 Q1 Interval= Special rank= 10- 8= 2 Q1= 35+ 2X 10 = 37.5 years 8 Q3: General Rank= 3n/4= 120/4= 30 Q1 Interval= Special rank= = 2 Q1= 55+ 2X 10 = 57 years 10 S.I.Q.R= = 9.75 years 2

5. Coefficient of Variation(CV) Characteristics: It is a measure of relative variation. It is independent of the units used. Therefore can be used for comparing sets of measurements of different units.  Suppose we want to compare dispersion of two sets of data, use the standard deviation for comparison, it may lead to fallacious results. It may be that two variables involved are measured in different units for example, we may wish to know. For example, we may wish to know, for a certain population, whether serum cholesterol levels (measured in milligram per 100 ml) are more variable than body weight (measured in pounds). The coefficient variation is a measure that is independent of the units of measurement and it is the solution for such comparison. It is expressed as percentage.

5. Coefficient of Variation(CV) Calculation Ungrouped Data or Closed- ended tablesOpen or Closed-ended tables CV= S X 100 X C V = SIQR X 100 Median The standard deviation is expressed as percentage of mean. The semi-inter-quartile range is expressed as a percentage of the median.

5. Coefficient of Variation(CV) Example(1): The following are the age of 5 children: 5 – 3 – 4 – 7 – 6 - Find the coefficient of variation Answer (1): X = =25 5 (∑ X i ) 2 = (25) 2 = 625 (∑ X i 2 )= = S= 5 = 1.58 years √ CV= (1.58/5) X100 = 31.6%

5. Coefficient of Variation(CV) Example(2): The following table represents the data about age of 80 person: - Find the coefficient of variation. Age (Years)Frequency Total80

5. Coefficient of Variation(CV) Answer(2): Age (Years)FrequencyCF Total80 Q1Q2 = MedianQ3 General rank= n/4= 80/4= 20 n/2= 80/2=40=3n/4= 3(80)/4= 60 Interval Special rank= 20-15=5=40-35=5=60-35=25 Value25+ 5X10 20 =27.5 years 35+ 5X10 25 =37 years X10 25 = 45 years

5. Coefficient of Variation(CV) Answer(2): S. I. Q. R= 45 – 27.5 = CV= 8.75 X 100 = 23.6% 37