Descriptive Statistics The goal of descriptive statistics is to summarize a collection of data in a clear and understandable way.

Slides:



Advertisements
Similar presentations
Chapter 3 Properties of Random Variables
Advertisements

© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
Descriptive Statistics
Basic Statistical Concepts
Calculating & Reporting Healthcare Statistics
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
PPA 415 – Research Methods in Public Administration
Intro to Descriptive Statistics
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
1 Measures of Central Tendency Greg C Elvers, Ph.D.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Today: Central Tendency & Dispersion
Describing Data: Numerical
Chapter 3 Descriptive Measures
Chapter 3 Statistics for Describing, Exploring, and Comparing Data
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Created by Tom Wegleitner, Centreville, Virginia Section 3-1 Review and.
Measures of Central Tendency or Measures of Location or Measures of Averages.
Chapters 1 & 2 Displaying Order; Central Tendency & Variability Thurs. Aug 21, 2014.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
PPA 501 – Analytical Methods in Administration Lecture 5a - Counting and Charting Responses.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
Central Tendency Introduction to Statistics Chapter 3 Sep 1, 2009 Class #3.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
Describing Data Lesson 3. Psychology & Statistics n Goals of Psychology l Describe, predict, influence behavior & cognitive processes n Role of statistics.
Lecture 3 Describing Data Using Numerical Measures.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Univariate Descriptive Statistics Heibatollah Baghi, and Mastee Badii George Mason University.
Statistics 11 The mean The arithmetic average: The “balance point” of the distribution: X=2 -3 X=6+1 X= An error or deviation is the distance from.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Measures of Central Tendency or Measures of Location or Measures of Averages.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
Introduction to Statistics Santosh Kumar Director (iCISA)
 Two basic types Descriptive  Describes the nature and properties of the data  Helps to organize and summarize information Inferential  Used in testing.
Central Tendency & Dispersion
Central Tendency. Variables have distributions A variable is something that changes or has different values (e.g., anger). A distribution is a collection.
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
LIS 570 Summarising and presenting data - Univariate analysis.
Descriptive Statistics for one variable. Statistics has two major chapters: Descriptive Statistics Inferential statistics.
Chapter 2 Review Using graphs/tables/diagrams to show variable relationships Understand cumulative frequency, percentile rank, and cross-tabulations Perform.
Descriptive Statistics(Summary and Variability measures)
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
Summarizing Data with Numerical Values Introduction: to summarize a set of numerical data we used three types of groups can be used to give an idea about.
Descriptive Statistics Dr.Ladish Krishnan Sr.Lecturer of Community Medicine AIMST.
Chapter 4: Measures of Central Tendency. Measures of central tendency are important descriptive measures that summarize a distribution of different categories.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
PRESENTATION OF DATA.
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
Central Tendency and Variability
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Description of Data (Summary and Variability measures)
Summary descriptive statistics: means and standard deviations:
Numerical Descriptive Measures
Means & Medians Chapter 4.
MEASURES OF CENTRAL TENDENCY
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Numerical Descriptive Measures
Descriptive Statistics: Describing Data
Summary descriptive statistics: means and standard deviations:
Measure of Central Tendency
Numerical Descriptive Measures
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Numerical Descriptive Measures
Central Tendency & Variability
Presentation transcript:

Descriptive Statistics The goal of descriptive statistics is to summarize a collection of data in a clear and understandable way.

Summary Measures Central Tendency Mean Median Mode Quartile Geometric Mean Summary Measures Variation Variance Standard Deviation Coefficient of Variation Range

INVESTIGATION

Measures of Central Tendency or Measures of Location or Measures of Averages

Central Tendency Measures of Central Tendency: Mean The sum of all scores divided by the number of scores. Median The value that divides the distribution in half when observations are ordered. Mode The most frequent score.

Measure of central tendency Population Sample Arithmetic Mean (Mean) Definition: Sum of all the observation s divided by the number of the observations The arithmetic mean is the most common measure of the central location of a sample.

Mean Population Sample “mu” “X bar” “sigma”, the sum of X, add up all scores “n”, the total number of scores in a sample “N”, the total number of scores in a population “sigma”, the sum of X, add up all scores

Mean: Example Data: {1,3,6,7,2,3,5} number of observations: 7 Sum of observations: 27 Mean: 3.9

Simple Frequency Distributions nameX Student120 Student223 Student315 Student421 Student515 Student621 Student715 Student820 fX raw-score distributionfrequency distribution   ff N Mean

Is the balance point of a distribution.

Central Tendency Example: Mean 52, 76, 100, 136, 186, 196, 205, 150, 257, 264, 264, 280, 282, 283, 303, 313, 317, 317, 325, 373, 384, 384, 400, 402, 417, 422, 472, 480, 643, 693, 732, 749, 750, 791, 891 Mean hotel rate: Mean hotel rate: $371.60

Pros and Cons of the Mean Pros Mathematical center of a distribution. Good for interval and ratio data. Does not ignore any information. Inferential statistics is based on mathematical properties of the mean. Cons Influenced by extreme scores and skewed distributions. May not exist in the data.

Median Definition: The value that is larger than half the population and smaller than half the population n is odd: the median score 5, 8, 9, 10, 28 median = 9 n is even: the th score 6, 17, 19, 20, 21, 27 median = 19.5

Pros and Cons of Median Pros Not influenced by extreme scores or skewed distributions. Good with ordinal data. Easier to compute than the mean. Cons May not exist in the data. Doesn’t take actual values into account.

Mode Most frequently occurring value Data{1,3,7,3,2,3,6,7} Mode : 3 Data {1,3,7,3,2,3,6,7,1,1} Mode : 1,3 Data {1,3,7,0,2,-3, 6,5,-1} Mode : none

Central Tendency Example: Mode 52, 76, 100, 136, 186, 196, 205, 150, 257, 264, 264, 280, 282, 283, 303, 313, 317,317, 325, 373, 384, 384, 400, 402, 417, 422, 472, 480, 643, 693, 732, 749, 750, 791, 891 Mode: most frequent observation Mode(s) for hotel rates: 264, 317, 384

Pros and Cons of the Mode Pros Good for nominal & ordinal data. Easiest to compute and understand. The score comes from the data set. Cons Ignores most of the information in a distribution. Small samples may not have a mode.

Example: Central Location Suppose the age in years of the first 10 subjects enrolled in your study are: 34, 24, 56, 52, 21, 44, 64, 34, 42, 46 Then the mean age of this group is 41.7 years To find the median, first order the data: 21, 24, 34, 34, 42, 44, 46, 52, 56, 64 The median is = 43 years 2 The mode is 34 years.

Comparison of Mean and Median Mean is sensitive to a few very large (or small) values “outliers” so sometime mean does not reflect the quantity desired. Median is “resistant” to outliers Mean is attractive mathematically

Suppose the next patient enrolls and their age is 97 years. How does the mean and median change? To get the median, order the data: 21, 24, 34, 34, 42, 44, 46, 52, 56, 64, 97 If the age were recorded incorrectly as 977 instead of 97, what would the new median be? What would the new mean be?

Calculating the Mean from a Frequency Distribution

MEASURES OF Central Tendency Geometric Mean & Harmonic Mean

The Shape of Distributions Distributions can be either symmetrical or skewed, depending on whether there are more frequencies at one end of the distribution than the other.

Symmetrical Distributions A distribution is symmetrical if the frequencies at the right and left tails of the distribution are identical, so that if it is divided into two halves, each will be the mirror image of the other. In a symmetrical distribution the mean, median, and mode are identical.

Almost Symmetrical distribution Mean= 13.4 Mode= 13.0

Skewed Distribution F ew extreme values on one side of the distribution or on the other. Positively skewed distributions: distributions which have few extremely high values (Mean>Median) Negatively skewed distributions: distributions which have few extremely low values(Mean<Median)

Positively Skewed Distribution Mean=1.13 Median=1.0

Negatively Skewed distribution Mean=3.3 Median=4.0

Mean, Median and Mode

Distributions Bell-Shaped (also known as symmetric” or “normal”) Skewed: positively (skewed to the right) – it tails off toward larger values negatively (skewed to the left) – it tails off toward smaller values

Choosing a Measure of Central Tendency IF variable is Nominal.. Mode IF variable is Ordinal... Mode or Median(or both) IF variable is Interval-Ratio and distribution is Symmetrical… Mode, Median or Mean IF variable is Interval-Ratio and distribution is Skewed… Mode or Median

EXAMPLE: (1) 7,8,9,10,11 n=5, x=45, =45/5=9 (2) 3,4,9,12,15 n=5, x=45, =45/5=9 (3) 1,5,9,13,17 n=5, x=45, =45/5=9 S.D. : (1) 1.58 (2) 4.74 (3) 6.32