Intro to Descriptive Statistics

Slides:



Advertisements
Similar presentations
Class Session #2 Numerically Summarizing Data
Advertisements

Descriptive Statistics
Measures of Dispersion or Measures of Variability
Chapter 3 Describing Data Using Numerical Measures
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Calculating & Reporting Healthcare Statistics
DESCRIBING DATA: 2. Numerical summaries of data using measures of central tendency and dispersion.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 3-1.
Biostatistics Unit 2 Descriptive Biostatistics 1.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Measures of Dispersion
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
Measures of Central Tendency
Measures of Central Tendency
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data: Numerical
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Numerical Descriptive Techniques
Chapter 3 – Descriptive Statistics
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
1 1 Slide Descriptive Statistics: Numerical Measures Location and Variability Chapter 3 BA 201.
Central Tendency Introduction to Statistics Chapter 3 Sep 1, 2009 Class #3.
Descriptive Statistics: Numerical Methods
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Created by Tom Wegleitner, Centreville, Virginia Section 2-4 Measures of Center.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
Descriptive Statistics
Lecture 3 Describing Data Using Numerical Measures.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Univariate Descriptive Statistics Heibatollah Baghi, and Mastee Badii George Mason University.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Copyright © 2014 by Nelson Education Limited. 3-1 Chapter 3 Measures of Central Tendency and Dispersion.
INVESTIGATION 1.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
Measures of Central Tendency: The Mean, Median, and Mode
Chapter 2 Means to an End: Computing and Understanding Averages Part II  igma Freud & Descriptive Statistics.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Chapter 3, Part A Descriptive Statistics: Numerical Measures n Measures of Location n Measures of Variability.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data Descriptive Statistics: Central Tendency and Variation.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Summary Statistics: Measures of Location and Dispersion.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
LIS 570 Summarising and presenting data - Univariate analysis.
MATH 1107 Elementary Statistics Lecture 3 Describing and Exploring Data – Central Tendency, Variation and Relative Standing.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Bio-Statistic KUEU 3146 & KBEB 3153 Bio-Statistic Data grouping and presentations Part II: Summarizing Data.
Descriptive Statistics(Summary and Variability measures)
Data Description Chapter 3. The Focus of Chapter 3  Chapter 2 showed you how to organize and present data.  Chapter 3 will show you how to summarize.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Lecture 8 Data Analysis: Univariate Analysis and Data Description Research Methods and Statistics 1.
Topic 3: Measures of central tendency, dispersion and shape
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Descriptive Statistics
Description of Data (Summary and Variability measures)
Chapter 3 Describing Data Using Numerical Measures
Descriptive Statistics
Descriptive Statistics
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Presentation transcript:

Intro to Descriptive Statistics GTECH 201 Lecture 12

Topics for Today Measures of Central Tendency Measures of Dispersion Mean, Median, Mode Sample and Population Mean Weighted Means Selecting Appropriate Measures of Central Tendency Measures of Dispersion Variance Standard Deviation

Descriptive vs. Inferential Descriptive Statistics Methods for organizing and summarizing information Inferential Statistics Methods for drawing and measuring the reliability of conclusions about a population based on information obtained from a sample of the population

Looking at This Data Set… Student Performance in Class Tests Observations: 8 Examples of Variables: Test 1, Test 2 Data Value: for Observation 1 in Test 2, Data Value = A Variables – can have one data value

Overview Mean Median Mode Sample and Population Mean Weighted Means Selecting Appropriate Measures of Central Tendency Applying these measures

Mean The mean of a set of n observations is the arithmetic average Mean of n observations x1, x2,x3,….xn is In Excel, =AVERAGE(insert range)

Median The data value that is exactly in the middle of an ordered list if the number of pieces of data is odd The mean of the two middle pieces of data in an ordered list if the number of pieces of data is even The median is a typical value; it is the midpoint of observations when they are arranged in an ascending or descending order

Mode The most frequent data value; i.e., any value having the highest frequency among the observations In Excel,you use the functions =MEDIAN (insert range) =MODE (insert range) Unimodal, Bimodal, Multimodal data sets Outliers

Sample and Population Means Mean of a data set Population mean if data set includes entire population Sample mean if data set is only a sample of the population

Weighted Means To calculate the mean when your information is available only in the form of summary data C Interval Freq 25 – 29.9 4 30 – 34.9 5 35 – 39.9 12

Skewed Distributions

Skewed Distributions When there is one mode and the distribution is symmetric mean, median, mode are the same Positive skew mean moves towards the positive tail median also pulls towards the positive tail Negative skew mean moves towards the negative tail median also moves towards the negative tail

Selecting Appropriate Measures Mean affected by extreme values includes all observations, therefore comprehensive (useful for interval/ratio data) Median not affected by the number of observations reveals typical situations (used for ordinal data) Mode useful for nominal variables

Other Useful Calculations In addition to the sum of data, Sx we need to be able to calculate:

Variability or Spread Mean and the median - limits Range – coarse measure of variability Percentiles kth percentile is the point at which k percent of the numbers fall below it and the rest are fall above it 25th percentile (lower quartile) 50th percentile (median) 75th percentile (upper quartile) Interquartile range (difference between the 25th percentile value and the 75th percentile value)

Describing the Spread A five number summary Median Quartiles Extremes Variance and Standard Deviation Measures spread about the mean Standard deviation cannot be discussed without the mean

Calculating Percentiles In the list of twelve observations 4 7 11 11 11 11 14 16 16 24 29 Compute median, 25th and 75th percentiles The lower quartile is the median of the 6 observations that fall below the median The upper quartile is the median of the 6 observations that fall above the median

Five Number Summary Median = 11 Lower Quartile = 9 Upper Quartile = 16 Extremes are 2 and 29 Can compute the range = 27 In a symmetric distribution, the lower and upper quartiles are equally distant from the median

Variance Is the mean of the squares of the deviations of the observations from their mean Population variance Sample variance

Example The heights, in inches for five starting players in a men’s college basket ball team are: 67 72 76 76 84 Compute the mean and standard deviation. = 75

Standard Deviation Standard deviation is positive square root of the variance Variance in our basketball example: = 39

Formulas – Standard Deviation Standard deviation of a sample Standard deviation of a population

Example (Continued)

Short Cut – Simpler Formula Standard Deviation of a sample Sum of the squares of data values, i.e., you square each data value and then sum those squared values Square of the sum of data values, i.e., you sum all the data values and then square that sum

Example (using the short cut)

Interpreting Std. Deviation s and s 2 will be small when all the data are close together The deviations from the mean Will be both positive and negative Sum will always be 0 s is always 0 or a positive number s = 0 means no spread; as s value increases, the spread of the data increases The units of s are the same as the original observations s is heavily influenced by outliers

Coefficient of Variation CV is the standard deviation described as a percent of the mean CV = CV is useful when comparing different sets of data where sample size and standard deviation are different