Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.

Slides:



Advertisements
Similar presentations
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Advertisements

Math Qualification from Cambridge University
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
Agricultural and Biological Statistics
Statistics for the Social Sciences
Calculating & Reporting Healthcare Statistics
Lecture 2 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
PSY 307 – Statistics for the Behavioral Sciences
Intro to Descriptive Statistics
Introduction to Educational Statistics
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chapter 3: Central Tendency
1 Measures of Central Tendency Greg C Elvers, Ph.D.
Chapter 3 Measures of Central Tendency. 3.1 Defining Central Tendency Central tendency Purpose:
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Chapter 3: Central Tendency Peer Tutor Slides Instructor: Mr. Ethan W. Cooper, Lead Tutor.
Measures of Central Tendency
Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately describes the center of the.
Summarizing Scores With Measures of Central Tendency
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
COURSE: JUST 3900 TIPS FOR APLIA Developed By: Ethan Cooper (Lead Tutor) John Lohman Michael Mattocks Aubrey Urwick Chapter 3: Central Tendency.
Ibrahim Altubasi, PT, PhD The University of Jordan
Chapter 3 EDRS 5305 Fall 2005 Gravetter and Wallnau 5 th edition.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
CENTRAL TENDENCY Mean, Median, and Mode.
Chapters 1 & 2 Displaying Order; Central Tendency & Variability Thurs. Aug 21, 2014.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
What is Business Statistics? What Is Statistics? Collection of DataCollection of Data –Survey –Interviews Summarization and Presentation of DataSummarization.
Central Tendency Introduction to Statistics Chapter 3 Sep 1, 2009 Class #3.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Chapter 2 Describing Data.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
Chapter 4 – 1 Chapter 4: Measures of Central Tendency What is a measure of central tendency? Measures of Central Tendency –Mode –Median –Mean Shape of.
Describing Data: Numerical Measures. GOALS 1.Calculate the arithmetic mean, weighted mean, median, mode, and geometric mean. 2.Explain the characteristics,
Statistics 11 The mean The arithmetic average: The “balance point” of the distribution: X=2 -3 X=6+1 X= An error or deviation is the distance from.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
CHAPTER 3  Descriptive Statistics Measures of Central Tendency 1.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Numerical Measures Chapter 3.
Central Tendency A statistical measure that serves as a descriptive statistic Determines a single value –summarize or condense a large set of data –accurately.
Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. Describing Data: Numerical Measures.
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
LIS 570 Summarising and presenting data - Univariate analysis.
Introduction to statistics I Sophia King Rm. P24 HWB
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Anthony J Greene1 Central Tendency 1.Mean Population Vs. Sample Mean 2.Median 3.Mode 1.Describing a Distribution in Terms of Central Tendency 2.Differences.
Chapter 3: Central Tendency 1. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Chapter 2 Describing and Presenting a Distribution of Scores.
Measures of Central Tendency (MCT) 1. Describe how MCT describe data 2. Explain mean, median & mode 3. Explain sample means 4. Explain “deviations around.
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Data Description Chapter 3. The Focus of Chapter 3  Chapter 2 showed you how to organize and present data.  Chapter 3 will show you how to summarize.
Summarizing Data with Numerical Values Introduction: to summarize a set of numerical data we used three types of groups can be used to give an idea about.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Descriptive Statistics
Descriptive Statistics
Topic 3: Measures of central tendency, dispersion and shape
Summarizing Scores With Measures of Central Tendency
Characteristics of the Mean
Central Tendency.
MEASURES OF CENTRAL TENDENCY
LESSON 3: CENTRAL TENDENCY
Chapter 3: Central Tendency
Chapter 3: Central Tendency
Presentation transcript:

Chapter 3: Central Tendency

Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately describes the center of the distribution and represents the entire distribution of scores. The goal of central tendency is to identify the single value that is the best representative for the entire set of data.

Central Tendency (cont'd.) By identifying the "average score," central tendency allows researchers to summarize or condense a large set of data into a single value. Thus, central tendency serves as a descriptive statistic because it allows researchers to describe or present a set of data in a very simplified, concise form. In addition, it is possible to compare two (or more) sets of data by simply comparing the average score (central tendency) for one set versus the average score for another set.

The Mean, the Median, and the Mode It is essential that central tendency be determined by an objective and well ‑ defined procedure so that others will understand exactly how the "average" value was obtained and can duplicate the process. No single procedure always produces a good, representative value. Therefore, researchers have developed three commonly used techniques for measuring central tendency: the mean, the median, and the mode.

The Mean The mean is the most commonly used measure of central tendency. Computation of the mean requires scores that are numerical values measured on an interval or ratio scale. The mean is obtained by computing the sum, or total, for the entire set of scores, then dividing this sum by the number of scores.

Population Mean For ungrouped data, the population mean is the sum of all the population values divided by the total number of population values:

Sample Mean For ungrouped data, the sample mean is the sum of all the sample values divided by the number of sample values:

The Mean (cont'd.) Conceptually, the mean can also be defined in the following ways: 1.The mean is the amount that each individual receives when the total (ΣX) is divided equally among all N individuals. 2.The mean is the balance point of the distribution because the sum of the distances below the mean is exactly equal to the sum of the distances above the mean.

Properties of the Arithmetic Mean 1.Every set of interval-level and ratio-level data has a mean. 2.All the values are included in computing the mean. 3.The mean is unique. 4.The sum of the deviations of each value from the mean is zero.

Weighted Mean The weighted mean of a set of numbers X 1, X 2,..., X n, with corresponding weights w 1, w 2,...,w n, is computed with the following formula:

The Weighted Mean (p.77) Combine 2 sets of data and find the overall mean overall mean = (∑X 1 + ∑X 2 )/(n 1 +n 2 ) 1 st sample: n=12, ∑X=72, M=6 2 nd sample: n=8, ∑X=56, M=7 Combined sample: n=20 (=12+8) ∑X=72+56=128, M= 128/20 = 6.4

If the info we have is.... (ex.3, p.79) 1 st sample: n=5, M=4 2 nd sample: n=3, M=10 What is the overall mean? overall mean = (n 1 M 1 +n 2 M 2 )/(n 1 +n 2 ) = (5*4 + 3*10)/(5+3) = 50/8 = 6.25

Changing the Mean Because the calculation of the mean involves every score in the distribution, changing the value of any score will change the value of the mean. Modifying a distribution by discarding scores or by adding new scores will usually change the value of the mean. To determine how the mean will be affected for any specific situation you must consider: 1) how the number of scores is affected, and 2) how the sum of the scores is affected.

ex. 4 (p. 79) A sample of n=6, mean M=40. One new score is added to the sample and new mean M=35, what is the new score? Is it greater or less than 40? It should be less than 40 The new value is 5

ex. 4 (p.82) A sample of n=4 has a mean of 9. If one person with a score of X=3 is removed from the sample, what is the value for the new sample mean? Ans: old sample n = 4 ∑X= 4*9 = 36 new sample n = 3 ∑X= 4*9 – 3 = 33 new mean = 33/3 = 11

Changing the Mean (cont'd.) If a constant value is added to every score in a distribution, then the same constant value is added to the mean. Also, if every score is multiplied by a constant value, then the mean is also multiplied by the same constant value.

ex. 3 (p. 82) A population has a mean of μ = 40 a.If 5 points were added to every score, what would be the value for the new mean? b.If every score were multiplied by 3, what would be the value for the new mean? Ans: a.new mean = old mean + 5 = 45 b.new mean = old mean * 3 = 120

ex. 1, 2 (p. 82) Adding a new score to a distribution always changes the mean, true or false? false Changing the value of a score in a distribution always changes the mean, true or false? true

When the Mean Won’t Work Although the mean is the most commonly used measure of central tendency, there are situations where the mean does not provide a good, representative value, and there are situations where you cannot compute a mean at all. When a distribution contains a few extreme scores (or is very skewed), the mean will be pulled toward the extremes (displaced toward the tail). In this case, the mean will not provide a "central" value.

When the Mean Won’t Work (cont'd.) With data from a nominal scale it is impossible to compute a mean, and when data are measured on an ordinal scale (ranks), it is usually inappropriate to compute a mean. Thus, the mean does not always work as a measure of central tendency and it is necessary to have alternative procedures available.

The Median If the scores in a distribution are listed in order from smallest to largest, the median is defined as the midpoint of the list. The median divides the scores so that 50% of the scores in the distribution have values that are equal to or less than the median. Computation of the median requires scores that can be placed in rank order (smallest to largest) and are measured on an ordinal, interval, or ratio scale.

The Median (cont'd.) Usually, the median can be found by a simple counting procedure: 1.With an odd number of scores, list the values in order, and the median is the middle score in the list. 2.With an even number of scores, list the values in order, and the median is half-way between the middle two scores.

Examples - Median The ages for a sample of five college students are: 21, 25, 19, 20, 22 Arranging the data in ascending order gives: 19, 20, 21, 22, 25. Thus the median is 21. The ages for a sample of five college students are: 21, 25, 19, 20, 22 Arranging the data in ascending order gives: 19, 20, 21, 22, 25. Thus the median is 21. The heights of four basketball players, in inches, are: 76, 73, 80, 75 Arranging the data in ascending order gives: 73, 75, 76, 80. Thus the median is The heights of four basketball players, in inches, are: 76, 73, 80, 75 Arranging the data in ascending order gives: 73, 75, 76, 80. Thus the median is 75.5.

The Median (cont'd.) If the scores are measurements of a continuous variable, it is possible to find the median by first placing the scores in a frequency distribution histogram with each score represented by a box in the graph. Then, draw a vertical line through the distribution so that exactly half the boxes are on each side of the line. The median is defined by the location of the line. i.e. 50 th percentile

Example (p ) Finding the precise median for a continuous variable 1 st list X from the smallest to the largest value n=5, X=3, 5, 8, 10, 11 n=6, X=1, 1, 4, 5, 7, 8 n=8, X=1, 2, 3, 4, 4, 4, 4, 6 (scores clustered at the median) If X is discrete variable, median = 4 If X is continuous variable, median = 50 th percentile & is between [ ] There’re four 4s, the median is after the first 4 and before the last three 4s.  i.e. the ¼ distance from 3.5  median= = 3.75

The Median (cont'd.) One advantage of the median is that it is relatively unaffected by extreme scores. Thus, the median tends to stay in the "center" of the distribution even when there are a few extreme scores or when the distribution is very skewed. In these situations, the median serves as a good alternative to the mean.

Figure 3.6 (p. 86) A population N=6, X = 2, 2, 2, 3, 3, 12 ΣX = 24, μ = 4 median = (2+3)/2 = 2.5 < mean median is true midpoint mean is affected by extreme value, i.e. x=12

ex.3 (p. 87) The following is a distribution of measurements for a continuous variable. Find the precise median that divides the distribution exactly in half. Scores: 1,2,2,3,4,4,4,4,4,5 continuous variable median is between the firs two 4s  [3.5, 4.5]  median = /5 = = 3.7

ex. 1-2 (p. 87) If you have a score of 52 on an 80-point exam, then you definitely scored above the median, true or false? false Median for the following distributions: 3, 4, 6, 7, 9, 10, 11 8, 10, 11, 12, 14, 15

The Mode The mode is defined as the most frequently occurring category or score in the distribution. In a frequency distribution graph, the mode is the category or score corresponding to the peak or high point of the distribution. The mode can be determined for data measured on any scale of measurement: nominal, ordinal, interval, or ratio.

The Mode (cont'd.) The primary value of the mode is that it is the only measure of central tendency that can be used for data measured on a nominal scale. In addition, the mode often is used as a supplemental measure of central tendency that is reported along with the mean or the median. mean and median are both single-valued mode can be multiple-valued

Fig. 3.7, ex. 1 (p. 89) Fig 3.7, mode = ? ex. 1: mean=47/20=2.35 Xfcfc%fX sum =47

Bimodal Distributions It is possible for a distribution to have more than one mode. Such a distribution is called bimodal. (Note that a distribution can have only one mean and only one median.) In addition, the term "mode" is often used to describe a peak in a distribution that is not really the highest point. Thus, a distribution may have a major mode at the highest peak and a minor mode at a secondary peak in a different location.

Central Tendency and the Shape of the Distribution Because the mean, the median, and the mode are all measuring central tendency, the three measures are often systematically related to each other. In a symmetrical distribution, for example, the mean and median will always be equal. p

Central Tendency and the Shape of the Distribution (cont'd.) If a symmetrical distribution has only one mode, the mode, mean, and median will all have the same value. In a skewed distribution, the mode will be located at the peak on one side and the mean usually will be displaced toward the tail on the other side. The median is usually located between the mean and the mode.

The Relative Positions of the Mean, Median and the Mode

Selecting a measurement of central tendency: Mean Mean is usually the preferred measure of central tendency. Mean uses every score in the distribution.  a good representative for central tendency Mean is closely related to variance and standard deviation.  valuable measure for inferential statistics It is consider the best and most popular central tendency measure.

Selecting a measurement of central tendency: Median Extreme scores or skewed distribution (mean is affected by the extreme value) Undetermined values (impossible to compute the mean value) Open-ended distributions (when there’s no upper limit or lower limit) Ordinal scale (can show direction but not the distance)

Selecting a measurement of central tendency: Mode Nominal scales (because distance and direction are meaningless) Discrete variables (if variable cannot be divided into fraction, mode is preferred) Describing shape (mode = peak)

Reporting Central Tendency in Research Reports (p ) In manuscripts and in published research reports, the sample mean is identified with the letter M. There is no standardized notation for reporting the median or the mode. In research situations where several means are obtained for different groups or for different treatment conditions, it is common to present all of the means in a single graph. (p.94)

Reporting Central Tendency in Research Reports (cont'd.)(p.94) The different groups or treatment conditions are listed along the horizontal axis and the means are displayed by a bar or a point above each of the groups. The height of the bar (or point) indicates the value of the mean for each group. Similar graphs are also used to show several medians in one display.