Chapter 3: Central Tendency

Slides:



Advertisements
Similar presentations
Population vs. Sample Population: A large group of people to which we are interested in generalizing. parameter Sample: A smaller group drawn from a population.
Advertisements

© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
Agricultural and Biological Statistics
Measures of Central Tendency. Central Tendency “Values that describe the middle, or central, characteristics of a set of data” Terms used to describe.
PRED 354 TEACH. PROBILITY & STATIS. FOR PRIMARY MATH
Lecture 2 PY 427 Statistics 1 Fall 2006 Kin Ching Kong, Ph.D
Descriptive Statistics
Chapter 3: Central Tendency
1 Measures of Central Tendency Greg C Elvers, Ph.D.
Chapter 3 Measures of Central Tendency. 3.1 Defining Central Tendency Central tendency Purpose:
Today: Central Tendency & Dispersion
Measures of Central Tendency
Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately describes the center of the.
Chapter 1: Introduction to Statistics
Summarizing Scores With Measures of Central Tendency
Objective To understand measures of central tendency and use them to analyze data.
BIOSTATISTICS II. RECAP ROLE OF BIOSATTISTICS IN PUBLIC HEALTH SOURCES AND FUNCTIONS OF VITAL STATISTICS RATES/ RATIOS/PROPORTIONS TYPES OF DATA CATEGORICAL.
Ibrahim Altubasi, PT, PhD The University of Jordan
Chapter 3 Statistical Concepts.
Chapter 3 EDRS 5305 Fall 2005 Gravetter and Wallnau 5 th edition.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
CENTRAL TENDENCY Mean, Median, and Mode.
Reasoning in Psychology Using Statistics Psychology
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Central Tendency Introduction to Statistics Chapter 3 Sep 1, 2009 Class #3.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Describing Data Lesson 3. Psychology & Statistics n Goals of Psychology l Describe, predict, influence behavior & cognitive processes n Role of statistics.
Lecture 3 Describing Data Using Numerical Measures.
Chapter 4 – 1 Chapter 4: Measures of Central Tendency What is a measure of central tendency? Measures of Central Tendency –Mode –Median –Mean Shape of.
Statistics 11 The mean The arithmetic average: The “balance point” of the distribution: X=2 -3 X=6+1 X= An error or deviation is the distance from.
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Central Tendency A statistical measure that serves as a descriptive statistic Determines a single value –summarize or condense a large set of data –accurately.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
LIS 570 Summarising and presenting data - Univariate analysis.
Anthony J Greene1 Central Tendency 1.Mean Population Vs. Sample Mean 2.Median 3.Mode 1.Describing a Distribution in Terms of Central Tendency 2.Differences.
Chapter 3: Central Tendency 1. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Measures of Central Tendency (MCT) 1. Describe how MCT describe data 2. Explain mean, median & mode 3. Explain sample means 4. Explain “deviations around.
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Data Description Chapter 3. The Focus of Chapter 3  Chapter 2 showed you how to organize and present data.  Chapter 3 will show you how to summarize.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Chapter 11 Summarizing & Reporting Descriptive Data.
Exploratory Data Analysis
Descriptive Statistics
Figure 2-7 (p. 47) A bar graph showing the distribution of personality types in a sample of college students. Because personality type is a discrete variable.
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Chapter 2: Methods for Describing Data Sets
Topic 3: Measures of central tendency, dispersion and shape
Reasoning in Psychology Using Statistics
Central Tendency and Variability
Summarizing Scores With Measures of Central Tendency
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Description of Data (Summary and Variability measures)
1 Chapter 1: Introduction to Statistics. 2 Variables A variable is a characteristic or condition that can change or take on different values. Most research.
Chapter 3 Describing Data Using Numerical Measures
Numerical Descriptive Measures
Central Tendency.
An Introduction to Statistics
Introduction to Statistics
MEASURES OF CENTRAL TENDENCY
Myers Chapter 1 (F): Statistics in Psychological Research: Measures of Central Tendency A.P. Psychology.
Chapter Nine: Using Statistics to Answer Questions
Chapter 3: Central Tendency
Measures of Central Tendency
Central Tendency & Variability
Displaying the Order in a Group of Numbers Using Tables and Graphs
Presentation transcript:

Chapter 3: Central Tendency

Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately describes the center of the distribution and represents the entire distribution of scores. The goal of central tendency is to identify the single value that is the best representative for the entire set of data.

Central Tendency (cont.) By identifying the "average score," central tendency allows researchers to summarize or condense a large set of data into a single value. Thus, central tendency serves as a descriptive statistic because it allows researchers to describe or present a set of data in a very simplified, concise form. In addition, it is possible to compare two (or more) sets of data by simply comparing the average score (central tendency) for one set versus the average score for another set.

Frequency Distributions 1 2 3 4 5 6 7 8 9 X (a) 1 2 3 4 5 6 7 8 9 X (b) 1 2 3 4 5 6 7 8 9 X (c)

The Mean, the Median, and the Mode It is essential that central tendency be determined by an objective and well‑defined procedure so that others will understand exactly how the "average" value was obtained and can duplicate the process. No single procedure always produces a good, representative value. Therefore, researchers have developed three commonly used techniques for measuring central tendency: the mean, the median, and the mode.

The Mean The mean is the most commonly used measure of central tendency. Computation of the mean requires scores that are numerical values measured on an interval or ratio scale. The mean is obtained by computing the sum, or total, for the entire set of scores, then dividing this sum by the number of scores.

Simple frequency distribution 5, 3, 8, 7, 2 3 f 2 1 1 2 3 4 5 6 7 8 9 X Score

Computing the mean: X 5 3 8 7 2 For a population: For a sample:

Visual demonstration of mean 1 2 3 4 5 6 7 8 9 10

The Weighted Mean

Characteristics of the Mean: Changing a score or introducing a new score Adding or subtracting a constant from each score Multiplying or dividing each score by a constant

When the Mean Won’t Work Although the mean is the most commonly used measure of central tendency, there are situations where the mean does not provide a good, representative value, and there are situations where you cannot compute a mean at all. When a distribution contains a few extreme scores (or is very skewed), the mean will be pulled toward the extremes (displaced toward the tail). In this case, the mean will not provide a "central" value.

When the Mean Won’t Work (cont.) With data from a nominal scale it is impossible to compute a mean, and when data are measured on an ordinal scale (ranks), it is usually inappropriate to compute a mean. Thus, the mean does not always work as a measure of central tendency and it is necessary to have alternative procedures available.

The Median If the scores in a distribution are listed in order from smallest to largest, the median is defined as the midpoint of the list. The median divides the scores so that 50% of the scores in the distribution have values that are equal to or less than the median. Computation of the median requires scores that can be placed in rank order (smallest to largest) and are measured on an ordinal, interval, or ratio scale.

The Median (cont.) Usually, the median can be found by a simple counting procedure: 1. With an odd number of scores, list the values in order, and the median is the middle score in the list. 2. With an even number of scores, list the values in order, and the median is half-way between the middle two scores.

Frequency distributions showing median 1 2 3 4 5 6 7 8 9 10 11 12 X Median Frequency 1 2 3 4 5 6 7 8 9 10 11 12 X Median Frequency

The Median (cont.) If the scores are measurements of a continuous variable, it is possible to find the median by first placing the scores in a frequency distribution histogram with each score represented by a box in the graph. Then, draw a vertical line through the distribution so that exactly half the boxes are on each side of the line. The median is defined by the location of the line.

1, 2, 2, 3, 4, 4, 4, 4, 4, 5

Comparison of median distribution 1 2 3 4 5 6 X (a) Frequency 1 2 3 4 5 6 X (b) Frequency 1/5 4/5 Median = 3.70

Percentile rank data table 1, 2, 2, 3, 4, 4, 4, 4, 4, 5 x f cf c% 5 1 10 100% 4 9 90% 3 40% 2 30% 10%

The Median (cont.) One advantage of the median is that it is relatively unaffected by extreme scores. Thus, the median tends to stay in the "center" of the distribution even when there are a few extreme scores or when the distribution is very skewed. In these situations, the median serves as a good alternative to the mean.

The Mode The mode is defined as the most frequently occurring category or score in the distribution. In a frequency distribution graph, the mode is the category or score corresponding to the peak or high point of the distribution. The mode can be determined for data measured on any scale of measurement: nominal, ordinal, interval, or ratio.

Frequency distribution x f 6 1 5 2 4 3 4 3 Frequency 2 1 1 2 3 4 5 6 7 Scores

The Mode (cont.) The primary value of the mode is that it is the only measure of central tendency that can be used for data measured on a nominal scale. In addition, the mode often is used as a supplemental measure of central tendency that is reported along with the mean or the median.

Frequency vs. Major Frequency Academic Major 6 4 2 Biology Psychology Sociology Business Academic Major

Bimodal Distributions It is possible for a distribution to have more than one mode. Such a distribution is called bimodal. (Note that a distribution can have only one mean and only one median.) In addition, the term "mode" is often used to describe a peak in a distribution that is not really the highest point. Thus, a distribution may have a major mode at the highest peak and a minor mode at a secondary peak in a different location.

Frequency vs. Time of day 12 10 8 (number caught) Frequency 6 4 2 4 5 6 7 8 9 10 11 12 1 2 3 4 5 6 7 8 a.m Noon p.m. Time of day

Mean is the most commonly used measure of central tendency because: It uses every score. It’s usually a very good representative value. It’s mathematically related to variance and standard deviation (most common measures of variability ).

Advantages of the mode over the mean: Easy to compute Can be used with any scale of measurement Especially useful when you can’t calculate a median or a mean

When to Use the Median: Extreme scores or skewed distribution Undetermined values Open-ended distribution Ordinal scale

Frequency vs. Number of Errors 5 4 3 Frequency 2 1 10 11 12 13 14 15 100 Number of errors

Frequency vs. Number of Errors 5 4 3 Frequency 2 1 10 11 12 13 14 15 100 Number of errors

Amount of time to complete a puzzle: Person Time (min.) 1 8 2 11 3 12 4 13 5 17 6 Never finished

Open-ended distribution: Number of Children f 5 or more 3 4 2 1 6

Ordinal Scale 1st Contest 2nd contest Child Rank # of Baskets A 1st 10 C 7 B 2nd 4 6 3rd 2 5 Compare mean ranks for child A vs. C Compare them on number of baskets made

Central Tendency and the Shape of the Distribution Because the mean, the median, and the mode are all measuring central tendency, the three measures are often systematically related to each other. In a symmetrical distribution, for example, the mean and median will always be equal.

Central Tendency and the Shape of the Distribution (cont.) If a symmetrical distribution has only one mode, the mode, mean, and median will all have the same value. In a skewed distribution, the mode will be located at the peak on one side and the mean usually will be displaced toward the tail on the other side. The median is usually located between the mean and the mode.

Skewed distributions (a) (b) Frequency X Mode Median Mean Frequency X

Reporting Central Tendency in Research Reports In manuscripts and in published research reports, the sample mean is identified with the letter M. There is no standardized notation for reporting the median or the mode. In research situations where several means are obtained for different groups or for different treatment conditions, it is common to present all of the means in a single graph.

Reporting Central Tendency in Research Reports (cont.) The different groups or treatment conditions are listed along the horizontal axis and the means are displayed by a bar or a point above each of the groups. The height of the bar (or point) indicates the value of the mean for each group. Similar graphs are also used to show several medians in one display.

Mean number of errors on easy vs. difficult tasks for males vs. females 1.45 8.36 Male 3.83 14.77

Food consumption vs. drug dose 30 20 Mean food consumption 15 10 5 1 2 3 4 Drug dose

House cost vs. Region of U.S. Northeast South Midwest West Region of the United States 25 50 75 100 125 150 Mean new house cost (in $1,000’s)