1 Psych 5500/6500 Measures of Central Tendency Fall, 2008.

Presentation transcript:

1 Psych 5500/6500 Measures of Central Tendency Fall, 2008

2 Measures of Central Tendency Various ways of indicating the most typical or average score. 1. Mean 2. Median 3. Mode

3 The Mean The mean is the sum of the scores divided by ‘n’, where ‘n’ is the number of scores in the sample. Some people use ‘n’ to represent the size of a sample and ‘N’ to represent the size of a population. I use either ‘n’ or ‘N’ apparently arbitrarily and then depend upon context to make it clear.

4 Summation Symbols Y = 3, 4, 5, 8 (n = 4), so Y_1 = 3, Y_2 = 4, Y_3 = 5, Y_4 = 8, where Y_i denotes the i-th score.
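As a rough sketch of what the summation notation means in practice, the snippet below adds up the four scores listed on this slide and divides by n. The function name `sample_mean` is a hypothetical helper chosen for illustration, not something from the course materials.

```python
# Scores from the "Summation Symbols" slide: Y = 3, 4, 5, 8
scores = [3, 4, 5, 8]


def sample_mean(values):
    """Return the sum of the values divided by n, the number of values."""
    n = len(values)
    if n == 0:
        raise ValueError("the mean of an empty sample is undefined")
    return sum(values) / n


total = sum(scores)           # sum of Y_i for i = 1..n
mean_y = sample_mean(scores)  # total / n
print(total, len(scores), mean_y)
```

Here the sum of the scores is 20 and n is 4, so the mean works out to 5.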

5 The Mean (Computation Example)

6 Rounding Conventions The sample mean was 3.666…, with an infinite number of 6’s to the right of the decimal point. I would like to establish the following rules about rounding off your answers: 1. Go at least two places to the right of the decimal point (e.g. reporting 3.67 or 3.667 is ok but 3.7 is not). If you are using SPSS or having your calculator keep track of your intermediate calculations it won’t be rounding off at all, and that is fine. 2. If the first number after that is ‘5’ or greater, round up; if it is ‘4’ or less, don’t round up. Thus 3.666… is rounded to 3.67, while 3.333… is rounded to 3.33. Now, if you know something about the topic of ‘significant figures’, this policy doesn’t make any sense. It will, however, keep all of you in the same ballpark when it comes to computing answers and handing them in to be graded.
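The rounding rule can be sketched in code as follows. This is only an illustration under stated assumptions: the helper name `round_half_up` and the sample values (11/3, 10/3, 2.675) are chosen here for demonstration. Python's built-in `round` is deliberately avoided, because it rounds exact ties to the nearest even digit and works on binary floats, which does not always match the "5 or greater rounds up" rule.

```python
from decimal import Decimal, ROUND_HALF_UP


def round_half_up(value, places=2):
    """Round to `places` decimals; a following digit of 5 or more rounds up."""
    quantum = Decimal(10) ** -places  # Decimal('0.01') when places == 2
    # Going through str() keeps the decimal digits a person would read.
    return Decimal(str(value)).quantize(quantum, rounding=ROUND_HALF_UP)


print(round_half_up(11 / 3))  # a value with repeating 6's
print(round_half_up(10 / 3))  # a value with repeating 3's
print(round_half_up(2.675))   # a tie case where built-in round() differs
```

The function returns `Decimal` values rather than floats so that the rounded digits are exact.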

7 Mean (Interesting Property #1) The mean is the balance point of a frequency distribution

8 Effect of Outliers One extreme score can have a big effect on the mean.

9 Outliers (cont.) Thus one outlier can dramatically affect the mean, making it no longer an effective representation of the majority of the scores.

10 Outliers and Skewed Data An extreme score (extreme when compared to the other scores in the distribution) is called an outlier. A distribution that has a number of extreme scores off in just one direction is said to be skewed. In general the mean is not a good measure of central tendency when you have an outlier or with skewed data as it is affected by the extreme scores off in one direction, making it no longer representative of the majority of the scores.

11 The Median The median is the middle score, the score that half of the scores are less than and half of the scores are greater than.

12 The Median (Computation) Step 1: First put the scores in order from smallest to largest. Step 2: If n is odd then the median is the one score in the middle; if n is even then the median is the mean of the two middle scores.

13 Median (Computation example) Example when ‘n’ is odd. Y = 1, 6, 5, 3, 2, 4, 2 Step 1: 1, 2, 2, 3, 4, 5, 6 Step 2: as n is odd (n=7) there is one score in the middle. The median = 3.

14 Median (Computation example) Example when ‘n’ is even. Y = 12, 9, 10, 8, 11, 7 Step 1: 7, 8, 9, 10, 11, 12 Step 2: as n is even (n=6) there are two scores in the middle, the median = (9+10)/2=9.5
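The two-step procedure from the computation slides can be sketched as a short function. The name `simple_median` is a hypothetical helper used only for illustration; it is applied to the odd-n and even-n examples from the slides.

```python
def simple_median(values):
    """Median by the two-step rule: sort, then take the middle score(s)."""
    ordered = sorted(values)  # Step 1: smallest to largest
    n = len(ordered)
    if n == 0:
        raise ValueError("the median of an empty sample is undefined")
    middle = n // 2
    if n % 2 == 1:
        # Step 2, odd n: the single score in the middle
        return ordered[middle]
    # Step 2, even n: the mean of the two middle scores
    return (ordered[middle - 1] + ordered[middle]) / 2


odd_example = simple_median([1, 6, 5, 3, 2, 4, 2])    # n = 7
even_example = simple_median([12, 9, 10, 8, 11, 7])   # n = 6
print(odd_example, even_example)
```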

15 Median (Interesting Property #1) The median divides the area of the histogram into two equal parts.

16 Effect of Outliers The median is not affected by an outlier.

17 Median: Special Case Sometimes, when the median is a value that occurs more than once in the data, then the simple formula I gave doesn’t quite work. For example, say your data are: Y = 1, 2, 2, 2, 3, 4 The median is ‘2’ but there is only one score below ‘2’ while there are two scores above ‘2’. In this case a median of 2 does not divide the area of the distribution into two equal pieces (see next slide).

18 Note we have 1+1/2+1/2+1/2 = 2.5 boxes below the median, while we have 2+1/2+1/2+1/2 = 3.5 boxes above the median.

19 If we tweak the value of the median a tad, then we get 1+2/3+2/3+2/3=3 boxes below the median, and 2 + 1/3+1/3+1/3= 3 boxes above the median.

20 Final Word on Median The ‘tweaking’ of the median to preserve its definition of dividing the area of the distribution into two equal parts is rarely done. Usually the simpler formula I have given (arrange the scores, then find the middle of that list) is used; this is what SPSS does. Consequently, we will state that the median of Y = 1, 2, 2, 2, 3, 4 is ‘2’.
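For readers curious about the ‘tweaked’ value, one way to compute it is sketched below. It assumes each score occupies a histogram box one unit wide, centered on the score (so the boxes for ‘2’ run from 1.5 to 2.5), and slides the cut point through the middle box until half of the total area lies below it. The function name `interpolated_median` and the `width` parameter are assumptions made for this illustration.

```python
def interpolated_median(values, width=1.0):
    """Point that splits the histogram area in half, assuming each score
    fills a box `width` wide centered on the score."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("the median of an empty sample is undefined")
    m = ordered[n // 2]                       # score whose boxes hold the midpoint
    below = sum(1 for y in ordered if y < m)  # whole boxes left of m's boxes
    at = ordered.count(m)                     # boxes stacked at m
    lower_edge = m - width / 2
    # Move into the stack at m until n/2 boxes' worth of area is below.
    return lower_edge + width * (n / 2 - below) / at


tweaked = interpolated_median([1, 2, 2, 2, 3, 4])
print(tweaked)
```

For Y = 1, 2, 2, 2, 3, 4 this gives about 2.17, which leaves 1 + 2/3 + 2/3 + 2/3 = 3 boxes below the cut and 3 boxes above it.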

21 Mean, Median, and Skewed Data The median is often preferred over the mean when you have skewed data. Price of homes: $100,000 $130,000 $160,000 $180,000 $2,200,000 Mean = $554,000 Median = $160,000
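The home-price example can be checked with a few lines of code. This sketch uses Python's standard `statistics` module; the variable names, and the extra comparison with the most expensive home removed, are additions made here for illustration.

```python
from statistics import mean, median

# Home prices from the slide, in dollars
prices = [100_000, 130_000, 160_000, 180_000, 2_200_000]

mean_price = mean(prices)      # pulled upward by the one expensive home
median_price = median(prices)  # the middle price

# Illustrative comparison: the mean of the four ordinary homes only
mean_without_outlier = mean(prices[:-1])

print(mean_price, median_price, mean_without_outlier)
```

Dropping the single $2,200,000 home moves the mean from $554,000 to $142,500, while the median of the full list stays at $160,000, a value close to what most of the homes cost.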

22 The Mode The mode is the score that occurs most often. Y= 2, 4, 5, 5, 5, 7, 8, 9 Mode = 5 Sometimes there is no mode, sometimes there is more than one mode.
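A small sketch of finding the mode, including the ‘no mode’ and ‘more than one mode’ cases, is given below. The helper name `modes` is hypothetical, and the convention that a sample has no mode when every score occurs exactly once is an assumption made here; other conventions exist.

```python
from collections import Counter


def modes(values):
    """Return the most frequent score(s) as a sorted list.

    Assumption for this sketch: if there are several distinct scores and
    each occurs exactly once, the sample is treated as having no mode.
    """
    counts = Counter(values)
    if not counts:
        return []
    highest = max(counts.values())
    if highest == 1 and len(counts) > 1:
        return []  # no score repeats, so no mode
    return sorted(score for score, c in counts.items() if c == highest)


print(modes([2, 4, 5, 5, 5, 7, 8, 9]))  # the slide's example
print(modes([1, 2, 3, 4]))              # no repeated score
print(modes([1, 1, 2, 4, 4]))           # two scores tie for most frequent
```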

23 Mode (Semi-Interesting Property) The mode is the peak of a histogram

24 Bimodal Distributions The term bimodal is used when there are two peaks in the distribution even if both peaks aren’t exactly the same size. On a survey question measuring people’s views on a very controversial topic (one that few people feel neutral about) you might get a clump of low scores (with its own mode) and a clump of high scores (with its own mode), and the distribution could be called bimodal even if the two peaks are not identical in height (see graph below).

25 Nominal Scales and Central Tendency Racial background: 1=African American 2=Asian American 3=European American 4=Native American Y= 1, 1, 2, 4 Mean=2 Median=1.5 Mode=1 Only the mode makes sense.

26 Ordinal Scales and the Mean Size of household debt: 1=None ($0) 2=Tiny ($1 to $500) 3=Very Small ($501-$1000) 4=Large ($1000+) One person had a debt of $200 (Y=2) and one person had a debt of $2,000,000 (Y=4). Y= 2, 4 Mean=3. With a mean of 3 you are saying that the average debt in the sample was ‘Very Small’. This obviously isn’t working.

27 Ordinal Scales and the Median Size of household debt: 1=None ($0) 2=Tiny ($1 to $500) 3=Very Small ($501-$1000) 4=Large ($1000+) Y= 1, 3, 4 Median=3. With a median of 3 you are saying that half the sample had a debt that was very small or less, and half had a debt that was very small or larger. This makes sense.

28 Ordinal Scales and the Mode Size of household debt: 1=None ($0) 2=Tiny ($1 to $500) 3=Very Small ($501-$1000) 4=Large ($1000+) Y= 1, 2, 3, 3, 3, 4 Mode=3. With a mode of 3 you are saying that the score that happened the most in the sample was a ‘3’; this also makes sense.

29 Rank Scales and Central Tendency (1) Within a sample: Order of finish in a foot race: Y = 1, 2, 3, 4 Mean=2.5, Median=2.5, no mode. You will get exactly the same values anytime you race four people, so what good are they?

30 Rank Scales and Central Tendency (2) When rank scores are used it is usually within a somewhat more complicated experimental design (e.g. one involving two groups). An example would be to take ten out-of-shape people, randomly divide them into two groups of 5 people each, have one group do a lot of training, then have all ten run a race and measure how they place in the race (a rank measure). The data might look like this: Training group: Y = 1, 2, 4, 5, 7, median = 4. No training group: Y = 3, 6, 8, 9, 10, median = 8. It looks like the ‘training group’ placed better in the race than the ‘no training group’. To compare the performance of the two groups you could compare the medians of the two groups (it would be inappropriate to use the means of the groups because these are ordinal-type numbers). There is no mode so you can’t use that.
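The two-group comparison can be sketched with the race placings from the slide. The variable names are assumptions made for illustration, and `statistics.median` from Python's standard library stands in for the two-step median procedure.

```python
from statistics import median

# Order of finish for each runner (1 = first place), from the slide
training = [1, 2, 4, 5, 7]
no_training = [3, 6, 8, 9, 10]

median_training = median(training)
median_no_training = median(no_training)

# A lower median placing means the group tended to finish nearer the front.
print(median_training, median_no_training)
```

The training group's median placing of 4 is better than the no-training group's median of 8, which matches the comparison described on the slide.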

31 Cardinal Scales and Central Tendency How many magazines various households subscribe to: Y= 1, 1, 2, 4 Mean=2 Median=1.5 Mode=1 They all make sense.

32 Selecting a Measure of Central Tendency 1. The most important guideline for selecting which measure of central tendency to use is to select the one that does the best job of representing the data given what you are trying to determine. Sometimes you would be more interested in knowing that most families in some sample had 2 children than you would in knowing the mean number of children per household. Common sense, what you need to know, and which measure best represents what you need to know, will all determine which measure(s) you select.

33 Selecting a Measure of Central Tendency 2.By far, more statistical tools (including the ones we will be covering in this class) are developed around the mean than for any other measure of central tendency. Also, more people understand the mean as ‘the average’ than they do the other measures.

34 Selecting a Measure of Central Tendency 3. The median does a better job than the mean at describing skewed data. There are many more tools you can apply to the mean, however, and so it may make more sense to make the data less skewed so you can use the mean. We will learn how to deskewify the data later in this class (don’t try to find that word in a dictionary).

35 Selecting a Measure of Central Tendency 4.The measurement scale you use might determine which measure of central tendency would be appropriate (see the earlier slides)