Central Tendency & Variability

Slides:



Advertisements
Similar presentations
Calculating & Reporting Healthcare Statistics
Advertisements

DESCRIBING DATA: 2. Numerical summaries of data using measures of central tendency and dispersion.
PPA 415 – Research Methods in Public Administration
Intro to Descriptive Statistics
Introduction to Educational Statistics
Chapter 3: Central Tendency
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
1 Measures of Central Tendency Greg C Elvers, Ph.D.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Today: Central Tendency & Dispersion
Chapter 4 Measures of Central Tendency
Measures of Central Tendency
Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately describes the center of the.
Summarizing Scores With Measures of Central Tendency
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Measures of Central Tendency or Measures of Location or Measures of Averages.
PPA 501 – Analytical Methods in Administration Lecture 5a - Counting and Charting Responses.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Central Tendency Introduction to Statistics Chapter 3 Sep 1, 2009 Class #3.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Descriptive Statistics
1 Univariate Descriptive Statistics Heibatollah Baghi, and Mastee Badii George Mason University.
Statistics 11 The mean The arithmetic average: The “balance point” of the distribution: X=2 -3 X=6+1 X= An error or deviation is the distance from.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Psychology’s Statistics. Statistics Are a means to make data more meaningful Provide a method of organizing information so that it can be understood.
Copyright © 2014 by Nelson Education Limited. 3-1 Chapter 3 Measures of Central Tendency and Dispersion.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Descriptive Statistics The goal of descriptive statistics is to summarize a collection of data in a clear and understandable way.
IE(DS)1 Descriptive Statistics Data - Quantitative observation of Behavior What do numbers mean? If we call one thing 1 and another thing 2 what do we.
LIS 570 Summarising and presenting data - Univariate analysis.
Descriptive and Inferential Statistics Or How I Learned to Stop Worrying and Love My IA.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Measures of Central Tendency (MCT) 1. Describe how MCT describe data 2. Explain mean, median & mode 3. Explain sample means 4. Explain “deviations around.
Descriptive Statistics(Summary and Variability measures)
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Measures of Central Tendency. What is a measure of central tendency? Measures of Central Tendency Mode Median Mean Shape of the Distribution Considerations.
Descriptive Statistics ( )
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Measures of Central Tendency
Practice Page Practice Page Positive Skew.
Descriptive Statistics: Overview
Teaching Statistics in Psychology
Central Tendency and Variability
Summarizing Scores With Measures of Central Tendency
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Wednesday, September 26 Appreciating the beautiful world of data…
Descriptive Statistics
Description of Data (Summary and Variability measures)
Science of Psychology AP Psychology
Chapter 3 Describing Data Using Numerical Measures
Numerical Descriptive Measures
Central Tendency.
Descriptive Statistics
Numerical Descriptive Measures
MEASURES OF CENTRAL TENDENCY
Chapter 3: Central Tendency
Numerical Descriptive Measures
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Chapter 3: Central Tendency
Lecture 4 Psyc 300A.
Numerical Descriptive Measures
The Mean Variance Standard Deviation and Z-Scores
Univariate Description
Presentation transcript:

Central Tendency & Variability Heibatollah Baghi, Mastaneh Badii, and Farrokh Alemi Ph.D. This lecture was organized by Dr. Alemi. It is based on the work done by Dr. Baghi and Mastaneh Badii. The purpose of these slides are to Define measures of central tendency and dispersion. We help you select the appropriate measures to use for a particular dataset. Graphs may be useful, but the information they offer is often inexact. A frequency distribution provides many details, but often we want to condense a distribution further. Two concepts can help: Measures of Central Tendency. Measures of Variability or Scatter.

Measures of Central Tendency: Mean Add up observations Measures of Central Tendency: Mean Count of observations 23, 23, 24, 25, 25 ,25, 26, 26, 27, 28. 25.2 The mean describes the center or the balance point of a frequency distribution.

Measures of Central Tendency: Mode 20, 21, 21, 22, 22, 22, 22, 23, 23, 24 The most frequent value or category in a distribution is called a mode. We Calculate the mode for the following set of values to be 22.

Measures of Central Tendency: Median 21, 22, 22, 23, 24, 26, 26, 27, 28, 29 50% 50% The median is the middle value of a set of ordered numbers. 50% of data are above it and 50% below it. The calculation of median depends on whether we have an even or odds number of data points. For example to calculate for an even number of data points shown here. The median is 25

Median 22, 23, 23, 24, 25, 26, 27, 27, 28 50% 50% Here are data with odd number of data points with no duplicates. The median is 25

Most frequently occurring value Mode Most frequently occurring value Nominal, Ordinal, and (sometimes) Interval/Ratio-Level Data Median Exact center (when odd N) of rank-ordered data or average of two middle values (when even N) Ordinal-Level Data and Interval/Ratio-Level data (particularly when skewed) Mean Arithmetic average (Sum of Xs/N) Interval/Ratio-Level Data One of the common ways of describing data is to describe the central tendency of the data. We want to answer the question, “Around what value does the data tend to cluster?” Which statistics we use to answer that question depends on the level of the data. With nominal-level data, the only appropriate measure of central tendency is the mode, or the most frequently occurring value. The mode may also be used with ordinal-level data, and sometimes with interval-level data.

Comparison of Measures of Central Tendency Mode Most frequently occurring value Nominal, Ordinal, and (sometimes) Interval/Ratio-Level Data Median Exact center (when odd N) of rank-ordered data or average of two middle values (when even N) Ordinal-Level Data and Interval/Ratio-Level data (particularly when skewed) Mean Arithmetic average (Sum of Xs/N) Interval/Ratio-Level Data With Ordinal-level data, the most common measure of central tendency is the median, or the value of the middle case in rank-ordered data. Put another way, the median is the value at the 50th percentile. Note that, with highly skewed, interval-level data, the median might be more useful than the mean. We’ll return to that notion later.

Comparison of Measures of Central Tendency Mode Most frequently occurring value Nominal, Ordinal, and (sometimes) Interval/Ratio-Level Data Median Exact center (when odd N) of rank-ordered data or average of two middle values (when even N) Ordinal-Level Data and Interval/Ratio-Level data (particularly when skewed) Mean Arithmetic average (Sum of Xs/N) Interval/Ratio-Level Data With Interval-Level data, the mean is commonly used. This is the arithmetic average, arrived at by adding up all the values and dividing by the number of cases.

Comparison of Measures of Central Tendency in Normal Distribution Frequency When Mean, median and mode are the same, then the shape of distribution is symmetric

Comparison of Measures of Central Tendency in Bimodal Distribution Frequency In this distribution we have the situation where Mean & median are the same but there are two modes different from mean and median

Outliers pull the mean away From the median Skewed to Left Frequency Negatively Skewed Skewed to Left Frequency Outliers pull the mean away From the median When Mean, median & mode are different, if Mode is greater than Median and median is greater than Mean then we have a distribution skewed to the left.

Outliers pull the mean away From the median Skewed to Right Frequency Positively Skewed Skewed to Right Frequency Outliers pull the mean away From the median Mean, median & mode are different Mean > Median > Mode

Uniform Frequency In a uniform distribution, like normal distribution, Mean, median & mode are at the same point

Frequency Exponential Here the Mode is to extreme right and Mean is to the right of median

Measures of Variability or Scatter Same Average Different Variability Reporting only an average without an accompanying measure of variability may misrepresent a set of data. Two datasets can have the same average but very different variability.

Measures of Variability or Scatter: Range Max 110, 120, 130, 140, 150, 160, 170, 180, 190 Min Range is the difference between the highest and lowest score. It is Easy to calculate but highly unstable. If we were to calculate range for the data displayed here, The highest value is 190 and the lowest value is 110 and the difference or range is 80

Measures of Variability or Scatter: Semi Inter-quartile Range Standard Quartile Range SQR = (Q3-Q1)/2 Standard quartile range Is Half of the difference between the 25% quartile and 75% quartile. It is more stable than range

Sum Squares &Variance ss = Σ (X - M)2 Sample variance is The sum of squared differences between observations and their mean divided by n -1.

Standard Deviation Standard deviation is The squared root of the variance

Sum of Squares Variance Standard Deviation Sum of Squares is key in calculation of the Sample Variance and Sample Standard Deviation Standard Deviation

Calculating Standard Deviation Data X-M (X - M)2  110 -40 1600 120 -30 900 130 -20 400 140 -10 100 150 160 10 170 20 180 30 190 40 Total 6000 (SS) Calculating Standard Deviation N-1 9 Sample Variance 667 Standard Deviation 25.8 SS is the key to many statistics Calculate the difference, square the difference, sum it, divide it by n minus 1 to get sample variance. Square root of sample variance is standard deviation.

Calculating formula Defining formula Sum of squares Variance Formula Variations Calculating formula Defining formula Sum of squares Variance Standard deviation

6 Standard Deviations partition most data in Normal Distribution

Standardized Scores: Z Scores Standard Score Z = (x-m) / s Mean & standard deviations are also used to compute standard scores. Standard scores have distributions that are published and available on the web so one can easily calculate the frequency of an observation.

Standardized Scores: Z Scores Z = (x-m) / s Z = 140 – 110 / 10 = 3 For example, if we want to Calculate standard score for blood pressure of 140 if the sample mean is 110 and the standard deviation is 10. Now we can use the standard score to read from the web how often does the blood pressure of 140 occur in our distribution.

Observed Expected Standard deviations Allows comparison of observed distribution to expected distribution

Measures of Central Tendency & Variability Can Describe the Distribution of Data