Descriptive Statistics Measures of Center


Descriptive Statistics: Measures of Center
Outline: Essentials; Notation; Measures of Center (Mean, Median, Mode, Midrange); Example; Which to Use: Mean vs. Median vs. Mode; Additional Topics (not addressed)

Essentials: Measures of Center (The great mean vs. median conundrum.)
Be able to identify the characteristics of the median, mean, and mode, and the types of data to which each applies.
Be able to calculate the median, mean, and mode, as appropriate, for a set of data.
Affected by vs. resistant to extreme values: what are the implications for the mean and median?

Some Notation
Σ (uppercase Greek sigma) denotes the addition of a set of values.
X (capital) is the variable usually used to represent the individual data values.
xi (small letter) represents a single value of a variable, from the first value, x1, to the last value, xn.
n represents the number of data values in a sample.
N represents the number of data values in a population.

Measures of Center (Measures of Central Tendency)
Indicate where the center or most typical value of a data set lies.
Are often thought of as averages.
Include the mean, median, mode, and midrange.

The Mean (Arithmetic)
The Formula: x̄ = Σx / n
The “average” of a set of data. Is the sum of the observations divided by the number of observations. Is used only with quantitative data.
There are several other types of means, including:
Harmonic mean, for averaging rates such as speed: n / Σ(1/x)
Geometric mean, for average rates of change or growth such as interest: nth root of (x1 ∙ x2 ∙ … ∙ xn), where n = number of values
Quadratic mean, for power (voltage, etc.): sqrt(Σx² / n)
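The means named on this slide can be compared side by side. The following is a minimal sketch, assuming Python 3.8 or later (for `statistics.geometric_mean`); the sample values are hypothetical, and the quadratic mean is computed by hand because the standard `statistics` module has no function for it.

```python
import math
import statistics

# Hypothetical positive values, chosen only for illustration.
values = [2.0, 4.0, 8.0]

arithmetic = statistics.mean(values)            # sum(x) / n
harmonic = statistics.harmonic_mean(values)     # n / sum(1/x)
geometric = statistics.geometric_mean(values)   # nth root of the product
quadratic = math.sqrt(sum(x * x for x in values) / len(values))  # sqrt(sum(x^2) / n)

print(arithmetic, harmonic, geometric, quadratic)
```

For positive values that are not all equal, the four results come out in the order harmonic < geometric < arithmetic < quadratic.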

Population Mean vs. Sample Mean
A sample mean is represented by the lowercase letter x with a bar above it, x̄ (called x-bar).
A population mean is represented by the lowercase Greek letter μ (mu).

Median The middle observation in a set of data. Divides the data such that 50% of the observations lie below the median and 50% lie above it. Is used only with quantitative data. To obtain the median, the data must be placed in increasing order.

MEDIAN: The Formula
First: Arrange the scores in increasing order.
Second: Apply the formula (n+1)/2, where n is the number of data values. Remember, the formula computes a position, not a data value.
If there is an EVEN number of scores, the median lies between the two middle scores. E.g.: 1, 2, 8, 15 => median position is (n+1)/2 = (4+1)/2 = 2.5. So the median is the value that lies halfway between the second and third data values: (2 + 8)/2 = 5.
If there is an ODD number of scores, the middle score is the value of the median. E.g.: 1, 3, 6 => median position is (n+1)/2 = (3+1)/2 = 2. So the median is the value in the second position of the list, here the number 3.

Calculating a Median: Determine the median for the following backpack weights: Backpack weights (lb): 10, 14, 12, 18, 32, 15, 22, 19, 23, 61.
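One way to work this exercise is to code the (n+1)/2 position rule directly. This is a sketch rather than a reference implementation: the function name `median_by_position` is made up for illustration, and it assumes a non-empty list of numbers.

```python
def median_by_position(data):
    """Return the median using the (n + 1) / 2 position rule."""
    ordered = sorted(data)            # step 1: increasing order
    n = len(ordered)
    position = (n + 1) / 2            # step 2: 1-based position, ends in .5 when n is even
    if n % 2 == 1:
        return ordered[int(position) - 1]      # odd n: the middle value
    lower = ordered[n // 2 - 1]                # even n: the two middle values
    upper = ordered[n // 2]
    return (lower + upper) / 2

backpack_weights = [10, 14, 12, 18, 32, 15, 22, 19, 23, 61]
print(median_by_position(backpack_weights))
```

For the ten backpack weights the position is 5.5, so the median is halfway between the fifth and sixth ordered values, 18 and 19, which gives 18.5 lb.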

MODE: The Formula
The most frequently occurring score in a data set.
Obtain the frequency of each value. A frequency table based upon single-value grouping, or a dot plot, would display this information.
Used with both qualitative and quantitative data. It is the only measure of center for qualitative data.
There may be more than one mode. If there are two modes, the data set is bimodal. If there are more than two modes, the data set is multimodal.
If every value occurs the same number of times, there is no mode.
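The frequency-counting procedure can be sketched as follows. The function name `modes` and the sample data are hypothetical, the input is assumed to be non-empty, and the "no mode" rule follows this slide's convention rather than any library's.

```python
from collections import Counter

def modes(data):
    """Return every value tied for the highest frequency, or [] if all values tie."""
    counts = Counter(data)                       # frequency of each value
    highest = max(counts.values())
    if len(counts) > 1 and all(c == highest for c in counts.values()):
        return []                                # every value occurs equally often: no mode
    return [value for value, c in counts.items() if c == highest]

print(modes(["red", "blue", "red", "green"]))    # qualitative data, one mode
print(modes([1, 1, 2, 2, 3]))                    # two modes: bimodal
print(modes([4, 5, 6]))                          # all frequencies equal: no mode
```

Because the function only counts occurrences, it works on labels as well as numbers, which is why the mode is usable with qualitative data.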

Midrange
The midrange is a measure of center of a distribution. It indicates the value midway between the highest and lowest values in a data set.
To find the midrange: Midrange = (Highest Value + Lowest Value) / 2
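As a short sketch, the midrange is a one-line computation. The function name `midrange` is made up for illustration, and the data are the backpack weights from the median exercise.

```python
def midrange(data):
    """Return the value midway between the lowest and highest entries."""
    return (max(data) + min(data)) / 2

backpack_weights = [10, 14, 12, 18, 32, 15, 22, 19, 23, 61]
print(midrange(backpack_weights))
```

Here the midrange is (61 + 10) / 2 = 35.5 lb. Because it depends only on the two extreme values, the single 61 lb backpack pulls it well above most of the data.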

Example: Comparing the Mean, Median, and Mode
Find the mean, median, and mode of the sample ages of a class shown. Which measure of central tendency best describes a typical entry of this data set? Are there any outliers?
Ages in a class (distinct values): 20, 21, 22, 23, 24, 65
Source: Larson/Farber 4th ed.

Solution: Comparing the Mean, Median, and Mode
Ages in a class (distinct values): 20, 21, 22, 23, 24, 65
Mean: x̄ = Σx / n ≈ 23.8 years
Median: (21 + 22) / 2 = 21.5 years
Mode: 20 years (the entry occurring with the greatest frequency)
Source: Larson/Farber 4th ed.

Solution: Comparing the Mean, Median, and Mode Mean ≈ 23.8 yrs. Median = 21.5 yrs. Mode = 20 yrs. The mean takes every entry into account, but is influenced by the outlier of 65. The median here was determined by taking the middle two entries into account, and it is not affected by the outlier. In this case the mode exists, but it doesn't appear to represent a typical entry. Source: Larson/Farber 4th ed.
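The effect of the outlier can be checked numerically. The transcript preserves only the distinct ages, so the 20-entry list below is an assumption: a reconstruction chosen because it reproduces the stated mean, median, and mode, not data taken from the slides.

```python
import statistics

# ASSUMPTION: a 20-entry reconstruction of the class ages. The transcript
# lists only the distinct values 20, 21, 22, 23, 24, 65.
ages = [20] * 6 + [21] * 4 + [22] * 3 + [23] * 4 + [24] * 2 + [65]

# The same data with the outlier removed, for comparison.
without_outlier = [age for age in ages if age != 65]

print(statistics.mean(ages), statistics.median(ages), statistics.mode(ages))
print(statistics.mean(without_outlier), statistics.median(without_outlier))
```

Under that assumption, dropping the age of 65 moves the mean from 23.75 to about 21.6, while the median only moves from 21.5 to 21. That is the sense in which the median is resistant to extreme values and the mean is not.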

Solution: Comparing the Mean, Median, and Mode Sometimes a graphical comparison can help you decide which measure of central tendency best represents a data set. In this case, it appears that the median best describes the data set. Source: Larson/Farber 4th ed.

Mean vs. Median vs. Mode: Which is the best measure of center?
MEAN: Is sensitive to the influence of extreme scores (outliers), which will “pull” the mean away from the center. Involves ALL data values in the calculation.
MEDIAN: Is resistant to the influence of extreme values. Only uses one or two points in its calculation.
MODE: May not be anywhere near the center of the data. Not really aimed at finding the middle of the data. Is the ONLY “measure of center” for qualitative data.

Additional Topics

Weighted Means
Weighted mean: a mean computed with different scores assigned different weights.
To find the weighted mean: x̄ = Σ(x∙w) / Σw, where w is the weight of each entry x.

Weighted Example: Finding a Weighted Mean You are taking a class in which your grade is determined from five sources: 50% from your test mean, 15% from your midterm, 20% from your final exam, 10% from your computer lab work, and 5% from your homework. Your scores are 86 (test mean), 96 (midterm), 82 (final exam), 98 (computer lab), and 100 (homework). What is the weighted mean of your scores? If the minimum average for an A is 90, did you get an A? Source: Larson/Farber 4th ed.

Solution: Finding a Weighted Mean
Source          Score, x    Weight, w    x∙w
Test Mean       86          0.50         86(0.50) = 43.0
Midterm         96          0.15         96(0.15) = 14.4
Final Exam      82          0.20         82(0.20) = 16.4
Computer Lab    98          0.10         98(0.10) = 9.8
Homework        100         0.05         100(0.05) = 5.0
                            Σw = 1       Σ(x∙w) = 88.6
Your weighted mean for the course is 88.6. You did not get an A.
Source: Larson/Farber 4th ed.
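The same calculation can be sketched in a few lines. The function name `weighted_mean` is made up for illustration; the scores and weights are the ones given in the example.

```python
def weighted_mean(values, weights):
    """Return sum(x * w) / sum(w)."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    return sum(x * w for x, w in zip(values, weights)) / sum(weights)

scores = [86, 96, 82, 98, 100]             # test mean, midterm, final exam, lab, homework
weights = [0.50, 0.15, 0.20, 0.10, 0.05]   # the weights sum to 1

course_grade = weighted_mean(scores, weights)
print(round(course_grade, 1))
```

Dividing by the sum of the weights means the function also works when the weights do not add up to 1, as with credit hours.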

Weighted Means Example: Calculating a GPA
Given the following four grades, calculate the semester GPA.
Statistics: A (of course; 3 CrHr; numeric value for an A = 4)
History: B (3 CrHr; B = 3)
Physics: C (3 CrHr; C = 2)
Physical Education: C (1 CrHr)
The grade numeric equivalents are the x values. The credit hour values are the weights. Calculate the student’s GPA.
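One way to work this out is with the weighted-mean formula, taking the grade points as x and the credit hours as w. It assumes the Physical Education C carries the same 2 grade points as the Physics C, which the slide implies but does not state.

```latex
\bar{x} = \frac{\sum (x \cdot w)}{\sum w}
        = \frac{4(3) + 3(3) + 2(3) + 2(1)}{3 + 3 + 3 + 1}
        = \frac{29}{10}
        = 2.9
```

The one-credit course counts for only a tenth of the total weight, so it moves the GPA less than any of the three-credit courses.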

Finding a Mean From a Frequency Table (Grouped Data)
When we view data in a frequency table, it is impossible to know the exact values falling in a particular class, so the mean can only be approximated. To approximate it, obtain the product of each frequency and class midpoint (here “x”), add the products, and then divide by the sum of the frequencies.

Finding the Mean of a Frequency Distribution
In Words / In Symbols
1. Find the midpoint of each class: x = (lower limit + upper limit) / 2
2. Find the sum of the products of the midpoints and the frequencies: Σ(x∙f)
3. Find the sum of the frequencies: n = Σf
4. Find the mean of the frequency distribution: x̄ = Σ(x∙f) / n
Source: Larson/Farber 4th ed.

Example: Find the Mean of a Frequency Distribution
Use the frequency distribution to approximate the mean number of minutes that a sample of Internet subscribers spent online during their most recent session.
Class      Midpoint    Frequency, f
7 – 18     12.5        6
19 – 30    24.5        10
31 – 42    36.5        13
43 – 54    48.5        8
55 – 66    60.5        5
67 – 78    72.5        6
79 – 90    84.5        2
Source: Larson/Farber 4th ed.

Example: Find the Mean of a Frequency Distribution
Use the frequency distribution to approximate the mean number of minutes that a sample of Internet subscribers spent online during their most recent session.
Class      Midpoint, x    Frequency, f    (x∙f)
7 – 18     12.5           6               12.5∙6 = 75.0
19 – 30    24.5           10              24.5∙10 = 245.0
31 – 42    36.5           13              36.5∙13 = 474.5
43 – 54    48.5           8               48.5∙8 = 388.0
55 – 66    60.5           5               60.5∙5 = 302.5
67 – 78    72.5           6               72.5∙6 = 435.0
79 – 90    84.5           2               84.5∙2 = 169.0
                          n = 50          Σ(x∙f) = 2089.0
x̄ = Σ(x∙f) / n = 2089 / 50 ≈ 41.8 minutes
Source: Larson/Farber 4th ed.
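The grouped-data procedure can be sketched as below, using the class limits and frequencies from the table. The frequency of 6 for the 67 – 78 class is taken from the product 72.5∙6 = 435.0, and the variable names are made up for illustration.

```python
# (lower limit, upper limit, frequency) for each class in the table.
classes = [
    (7, 18, 6),
    (19, 30, 10),
    (31, 42, 13),
    (43, 54, 8),
    (55, 66, 5),
    (67, 78, 6),   # frequency taken from the product 72.5 * 6 = 435.0
    (79, 90, 2),
]

n = sum(f for _, _, f in classes)                          # sum of the frequencies
total = sum((lo + hi) / 2 * f for lo, hi, f in classes)    # sum of midpoint * frequency
grouped_mean = total / n

print(n, total, round(grouped_mean, 1))
```

The result, about 41.8 minutes, is an approximation: each entry is treated as if it sat at its class midpoint, because the exact values are not recoverable from the table.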

End of Slides