Measures of Central Tendency & Variability Dhon G. Dungca, M.Eng’g.

Slides:



Advertisements
Similar presentations
Lecture 2 Part a: Numerical Measures
Advertisements

Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
SUMMARIZING DATA: Measures of variation Measure of Dispersion (variation) is the measure of extent of deviation of individual value from the central value.
Measures of Dispersion
Statistics for the Social Sciences
Measures of Central Tendency
WFM 5201: Data Management and Statistical Analysis
Intro to Descriptive Statistics
Biostatistics Unit 2 Descriptive Biostatistics 1.
Introductory Mathematics & Statistics
WFM 5201: Data Management and Statistical Analysis
Central Tendency and Variability
Describing Data: Numerical Measures
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Descriptive Statistics
Chapter 3 Descriptive Measures
Measurement Tools for Science Observation Hypothesis generation Hypothesis testing.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Unit 3 Sections 3-1 & 3-2. What we will be able to do throughout this chapter…  Use statistical methods to summarize data  The most familiar method.
Chapter Eleven A Primer for Descriptive Statistics.
Lecture 3 A Brief Review of Some Important Statistical Concepts.
Worked examples and exercises are in the text STROUD (Prog. 28 in 7 th Ed) PROGRAMME 27 STATISTICS.
BUS250 Seminar 4. Mean: the arithmetic average of a set of data or sum of the values divided by the number of values. Median: the middle value of a data.
Measures of Central Tendency and Dispersion Preferred measures of central location & dispersion DispersionCentral locationType of Distribution SDMeanNormal.
Worked examples and exercises are in the text STROUD PROGRAMME 27 STATISTICS.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Describing Data: Numerical Measures. GOALS 1.Calculate the arithmetic mean, weighted mean, median, mode, and geometric mean. 2.Explain the characteristics,
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Measures of Central Tendency. These measures indicate a value, which all the observations tend to have, or a value where all the observations can be assumed.
Chapter 9 Statistics.
Sociology 5811: Lecture 3: Measures of Central Tendency and Dispersion Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Barnett/Ziegler/Byleen Finite Mathematics 11e1 Chapter 11 Review Important Terms, Symbols, Concepts Sect Graphing Data Bar graphs, broken-line graphs,
FARAH ADIBAH ADNAN ENGINEERING MATHEMATICS INSTITUTE (IMK) C HAPTER 1 B ASIC S TATISTICS.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Describing Data: Numerical Measures Chapter 3 McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.
LIS 570 Summarising and presenting data - Univariate analysis.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
Worked examples and exercises are in the text STROUD PROGRAMME 27 STATISTICS.
By Tatre Jantarakolica1 Fundamental Statistics and Economics for Evaluating Survey Data of Price Indices.
Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
STROUD Worked examples and exercises are in the text Programme 28: Data handling and statistics DATA HANDLING AND STATISTICS PROGRAMME 28.
Data Description Chapter 3. The Focus of Chapter 3  Chapter 2 showed you how to organize and present data.  Chapter 3 will show you how to summarize.
Statistics Josée L. Jarry, Ph.D., C.Psych. Introduction to Psychology Department of Psychology University of Toronto June 9, 2003.
Day 2. GOALS 1.Calculate the arithmetic mean, weighted mean, median, mode, and geometric mean. 2.Explain the characteristics, uses, advantages, and disadvantages.
Measures of Variation. Range, Variance, & Standard Deviation.
CHAPTER 3 – Numerical Techniques for Describing Data 3.1 Measures of Central Tendency 3.2 Measures of Variability.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Numerical Measures Chapter 3.
Measures of Dispersion Measures of Variability
Chapter 2 The Mean, Variance, Standard Deviation, and Z Scores.
Central Tendency Quartiles and Percentiles (الربيعيات والمئينات)
PRESENTATION OF DATA.
Descriptive Statistics
Describing, Exploring and Comparing Data
Chapter 5 STATISTICS (PART 1).
PROGRAMME 27 STATISTICS.
Central Tendency and Variability
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Descriptive Statistics
Lecture 5,6: Measures in Statistics
Characteristics of the Mean
MEASURES OF CENTRAL TENDENCY
Numerical Descriptive Measures
Descriptive Statistics
Numerical Descriptive Measures
Measures of Dispersion
Check Homework, Ask ?s.
Numerical Descriptive Measures
Presentation transcript:

Measures of Central Tendency & Variability Dhon G. Dungca, M.Eng’g.

MEASURES OF CENTRAL TENDENCY Is the value around which the whole set of data tend to cluster. It is represented by a single number which summarizes and describes the whole set Most commonly used measures of central tendency are: –The arithmetic mean –The median –The mode

FOR UNGROUPED DATA Ungrouped Data –Refer to data not organized into frequency distribution. ARITHMETIC MEAN –May be defined as an arithmetic average. It is the sum of the observed values divided by the number of observations. –This can either be a population mean (denoted by  ) or a sample mean (denoted by x)

ARITHMETIC MEAN Population Mean: n  x i  = i=1 N where:N = total # of items in a population x i = the ith observed value Sample Mean: n  x i x = i=1 n where:n = total # of items in a population x i = the ith observed value

WEIGHTED MEAN Takes into consideration the proper weights assigned to the observed values according to their relative importance. n  W i x i x = i=1  W i where:W i = weight of each item x i = value of each item

EXAMPLE 1 What is the mean age of a group of 8 children whose ages are: 7, 8, 8.5, 9, 10, 10.5, 11, 12, and 13? Answer: x = x = 88 / 8 x = 11 years old

EXAMPLE 2 A market vendor sold 3 dozens of eggs at P per dozen, 5 dozens at P per dozen, and 2 dozens at P per dozen. Find the weighted mean price per dozen eggs the vendor sold.

MEDIAN (M e ) Is the midpoint of the distribution. Half of the values in a distribution fall below the median, and the other half fall above it. For distributions having an even number of arrayed observations, the median is the average of the two middle values

EXAMPLE 3 Find the median of the following set of observations: 8, 4, 1, 3 and 7 Answer: 1, 3, 4, 7, 8M e = 4

EXAMPLE 4 Find the median of the following set of observations: 12, 9, 6, 10, 7 and 14 Answer: 6, 7, 9, 10, 12, 14 Median = (9+10) / 2 M e = 9.5

MODE (M o ) Is the value that appears with the highest (greatest) frequency. That is, the value that appears most often. Example 5 Determine the mode of the following distribution: 3, 8, 10, 5, 3, 5, 2, 5, 7 Answer: M o = 5 (uni-modal)

EXAMPLES Example 6 Determine the mode of the following distribution: 20, 15, 10, 9, 7, 20, 10, 10, 20 Answer: M o = 20 and 10 Example 7 Determine the mode of the following distribution: 7, 5, 10, 22, 20, 6, 11, and 21 Answer: There is no mode.

MEAN: GROUPED DATA Two methods of computing for the mean: LONG Method: n  x i f i x = i=1 n where:n = total # of items x i = class midpoints f i = corresponding frequencies SHORT Method (Use of CODING): n  u i f i x = x o + i=1 C n where:x o = assumed mean or coded mean C = class size

MEAN: GROUPED DATA Coding technique: –Choose the assumed mean xo (the class with the highest frequency). –Under a new column u, write the zero value code opposite xo and assign positive numbers to the classes higher in value than the class with the assumed mean, and consecutive negative integers to those classes lower in value. –Multiply the coded values with their corresponding frequencies and compute for the algebraic sum. –Substitute in the given coded formula and compute for the mean.

EXAMPLE 8 The following is the distribution of the wages of 50 workers of Hazel’s Garments Manufacturing Co. taken during a particular week last June. Determine the mean using both methods. Weekly Wages # of workers Total50

MEDIAN: GROUPED DATA (M e ) The median of a frequency distribution (grouped data) could be found by the following formula: n _ F  Me M e = L Me + 2 C f Me where:L Me = lower boundary of the median class n = total # of observations F  Me = cumulative frequency immediately preceding the median class f Me = frequency of the median class C = class size The median class is the class which contains the n/2th value.

MODE: GROUPED DATA (M o ) The mode of a frequency distribution (grouped data) could be found by the following formula: Mo = L Mo + d 1 C d 1 + d 2 where:L Me = lower boundary of the modal class d 1 = difference between the frequency of the modal class and the frequency of the class next lower in value. d 2 = difference between the frequency of the modal class and the frequency of the class next higher in value. The modal class is the class with the highest frequency.

EXAMPLE 9 Find the mean, median and mode of the following distribution of length of service in years of 30 employees of KITS INC. Length of service# of employees Total30

MEASURES OF VARIABILITY Indicate the extent to which values in a distribution are spread around the central tendency. Example: Suppose 2 applicants, A & B were given an examination to test and compare their typing speed (Assume all factors being equal). Each was given 10 trials and the results were as follows: A: 14, 16, 18, 20, 22, 24, 26, 28, 30 B: 18, 18, 20, 22, 24, 24, 24, 24, 24 Both A & B have the same mean of 22 minutes tending to show that there is no difference in the performance of both if the only basis of comparison is the mean. However, B has an advantage over A since her completion time varied only as much as 4 minutes above and below the mean while that of A varied as much as 8 minutes above and below the mean.

VARIANCE: UNGROUPED DATA Is the average of the squared deviation values from the distribution’s mean. n  (x i – x) 2 Sample Variance (S 2 ) = i=1 n - 1 If all values are identical the variance is zero (0). The greater the dispersion of values the greater the variance.

STANDARD DEVIATION: UNGROUPED DATA Is the positive square root of the variance. Measures the spread or dispersion of each value from the mean of the distribution. It is the most used measure of spread since it improves interpretability by removing the variance square and expressing deviations in their original unit. It is the most important measure of dispersion since it enables us to determine with a great deal of accuracy where the values of the distribution are located in relation to the mean. n  (x i – x) 2 Sample SD (S) = i=1 n - 1 n  (xi –  ) 2 Population SD (  ) = i=1 N

EXAMPLE 10 Solve for the variance and the standard deviation of the following set of data: 6, 2, 4, 7, 8, and 10

FOR GROUPED DATA n n 2 n  x i 2  f i --  x i  f i Sample Variance (S 2 ) = i=1 i=1 n (n – 1) n n 2 n  x i 2  f i --  x i  f i Sample SD (S) = i=1 i=1 n (n – 1)

EXAMPLE 11 Find the values of the variance and standard deviation of the following: Monthly Savings# of families