Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.

Slides:



Advertisements
Similar presentations
Chapter 3, Numerical Descriptive Measures
Advertisements

Descriptive Statistics
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Calculating & Reporting Healthcare Statistics
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Descriptive Statistics – Central Tendency & Variability Chapter 3 (Part 2) MSIS 111 Prof. Nick Dedeke.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Numerical Descriptive Techniques
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 3-1.
Slides by JOHN LOUCKS St. Edward’s University.
Chapter 3, Part 1 Descriptive Statistics II: Numerical Methods
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 2 Describing Data with Numerical Measurements
Numerical Descriptive Techniques
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
Copyright © 2009 Cengage Learning 4.1 Day 5 Numerical Descriptive Techniques.
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Programming in R Describing Univariate and Multivariate data.
1 Tendencia central y dispersión de una distribución.
Economics 173 Business Statistics Lecture 2 Fall, 2001 Professor J. Petry
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
BIOSTAT - 2 The final averages for the last 200 students who took this course are Are you worried?
Numerical Descriptive Techniques
Chapter 3 – Descriptive Statistics
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
Methods for Describing Sets of Data
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
QBM117 Business Statistics Descriptive Statistics Numerical Descriptive Measures.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
1 MATB344 Applied Statistics Chapter 2 Describing Data with Numerical Measures.
Descriptive Statistics: Numerical Methods
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Three Averages and Variation.
Chapter Four Numerical Descriptive Techniques Sir Naseer Shahzada.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
1 Measures of Center. 2 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely.
Statistics Lecture Notes Dr. Halil İbrahim CEBECİ Chapter 03 Numerical Descriptive Techniques.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
Descriptive Statistics for one variable. Statistics has two major chapters: Descriptive Statistics Inferential statistics.
Copyright © 2009 Cengage Learning 4.1 Chapter Four Numerical Descriptive Techniques.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Descriptive Statistics ( )
Chapter 3 Describing Data Using Numerical Measures
Ch 4 實習.
Midrange (rarely used)
Descriptive Statistics
Keller: Stats for Mgmt & Econ, 7th Ed Numerical Descriptive Techniques
Chapter 3 Describing Data Using Numerical Measures
Keller: Stats for Mgmt & Econ, 7th Ed
Keller: Stats for Mgmt & Econ, 7th Ed Numerical Descriptive Techniques
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
St. Edward’s University
Business and Economics 7th Edition
Presentation transcript:

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.2 Numerical Descriptive Techniques… Measures of Central Location Mean, Median, Mode Measures of Variability Range, Standard Deviation, Variance, Coefficient of Variation Measures of Relative Standing Percentiles, Quartiles Measures of Linear Relationship Covariance, Correlation, Least Squares Line

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.3 Measures of Central Location… The arithmetic mean, a.k.a. average, shortened to mean, is the most popular & useful measure of central location. It is computed by simply adding up all the observations and dividing by the total number of observations: Sum of the observations Number of observations Mean =

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.4 Notation… When referring to the number of observations in a population, we use uppercase letter N When referring to the number of observations in a sample, we use lower case letter n The arithmetic mean for a population is denoted with Greek letter “mu”: The arithmetic mean for a sample is denoted with an “x-bar”:

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.5 Statistics is a pattern language… PopulationSample Size Nn Mean

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.6 Arithmetic Mean… Population Mean Sample Mean

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.7 Statistics is a pattern language… PopulationSample Size Nn Mean

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.8 The Arithmetic Mean… …is appropriate for describing measurement data, e.g. heights of people, marks of student papers, etc. …is seriously affected by extreme values called “outliers”. E.g. as soon as a billionaire moves into a neighborhood, the average household income increases beyond what it was previously!

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.9 Measures of Central Location… The median is calculated by placing all the observations in order; the observation that falls in the middle is the median. Data: {0, 7, 12, 5, 14, 8, 0, 9, 22} N=9 (odd) Sort them bottom to top, find the middle: Data: {0, 7, 12, 5, 14, 8, 0, 9, 22, 33} N=10 (even) Sort them bottom to top, the middle is the simple average between 8 & 9: median = (8+9)÷2 = 8.5 Sample and population medians are computed the same way.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Measures of Central Location… The mode of a set of observations is the value that occurs most frequently. A set of data may have one mode (or modal class), or two, or more modes. Sample and population modes are computed the same way.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Mode… E.g. Data: {0, 7, 12, 5, 14, 8, 0, 9, 22, 33} N=10 Which observation appears most often? The mode for this data set is 0. How is this a measure of “central” location? Frequency Variable A modal class

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc =MODE(range) in Excel… Note: if you are using Excel for your data analysis and your data is multi-modal (i.e. there is more than one mode), Excel only calculates the smallest one. You will have to use other techniques (i.e. histogram) to determine if your data is bimodal, trimodal, etc.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Mean, Median, Mode… If a distribution is symmetrical, the mean, median and mode may coincide… mode mean median

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Mean, Median, Mode… If a distribution is asymmetrical, say skewed to the left or to the right, the three measures may differ. E.g.: mode mean median

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Measures of Variability… Measures of central location fail to tell the whole story about the distribution; that is, how much are the observations spread out around the mean value? For example, two sets of class grades are shown. The mean (=50) is the same in each case… But, the red class has greater variability than the blue class.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Range… The range is the simplest measure of variability, calculated as: Range = Largest observation – Smallest observation E.g. Data: {4, 4, 4, 4, 50}Range = 46 Data: {4, 8, 15, 24, 39, 50}Range = 46 The range is the same in both cases, but the data sets have very different distributions…

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Variance… Variance and its related measure, standard deviation, are arguably the most important statistics. Used to measure variability, they also play a vital role in almost all statistical inference procedures. Population variance is denoted by (Lower case Greek letter “sigma” squared) Sample variance is denoted by (Lower case “S” squared)

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Statistics is a pattern language… PopulationSample Size Nn Mean Variance

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Variance… The variance of a population is: The variance of a sample is: population mean sample mean Note! the denominator is sample size (n) minus one ! population size

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Application… Example 4.7. The following sample consists of the number of jobs six randomly selected students applied for: 17, 15, 23, 7, 9, 13. Finds its mean and variance. What are we looking to calculate? The following sample consists of the number of jobs six randomly selected students applied for: 17, 15, 23, 7, 9, 13. Finds its mean and variance. …as opposed to  or  2

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Sample Mean & Variance… Sample Mean Sample Variance Sample Variance (shortcut method)

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Standard Deviation… The standard deviation is simply the square root of the variance, thus: Population standard deviation: Sample standard deviation:

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Statistics is a pattern language… PopulationSample Size Nn Mean Variance Standard Deviation

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Standard Deviation… Consider Example 4.8 where a golf club manufacturer has designed a new club and wants to determine if it is hit more consistently (i.e. with less variability) than with an old club.Example 4.8 Using Tools > Data Analysis [may need to “add in” … > Descriptive Statistics in Excel, we produce the following tables for interpretation… You get more consistent distance with the new club.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc The Empirical Rule… If the histogram is bell shaped Approximately 68% of all observations fall within one standard deviation of the mean. Approximately 95% of all observations fall within two standard deviations of the mean. Approximately 99.7% of all observations fall within three standard deviations of the mean.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chebysheff’s Theorem…Not often used because interval is very wide. A more general interpretation of the standard deviation is derived from Chebysheff’s Theorem, which applies to all shapes of histograms (not just bell shaped). The proportion of observations in any sample that lie within k standard deviations of the mean is at least: For k=2 (say), the theorem states that at least 3/4 of all observations lie within 2 standard deviations of the mean. This is a “lower bound” compared to Empirical Rule’s approximation (95%).

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Box Plots… These box plots are based on data in Xm04-15.Xm04-15 Wendy’s service time is shortest and least variable. Hardee’s has the greatest variability, while Jack-in- the-Box has the longest service times.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Coefficient of Correlation… [Cause and effect?]  or r = +1 0 Strong positive linear relationship No linear relationship Strong negative linear relationship

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Problems: Descriptive Statistics -Numerical The number of sick days due to colds and flu last year at UTA was recorded for 5 faculty resulting in [ 5, 4, 0, 6, 0 ]. Calculate the following statistics *mean *median *variance *standard deviation *max *min *range

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Problems: Emperical Rule/Chebychev’s Rule The mean grade point average (gpa) for UTA students is 2.5 with a standard deviation of 0.5 *If the histogram for gpa’s is approximately mounded, what percent of the gpa’s would you expect between 1.5 and 3.5? *If the histogram for gpa’s is approximately mounded, what percent of the gpa’s would you expect greater than 3.5? *If the histogram for gpa’s is NOT mounded, what percent of the gpa’s would you expect between 1.5 and 3.5? *If the histogram for gpa’s is approximately mounded, what percent of the gpa’s would you expect between 1.0 and 4.0?

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Problem: Graphical Box and Whisker Plot The following box plot describes the last 200 grades made in this statistics course. Tell me everything you know about these grades.

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Problem: Graphical Box and Whisker Plots Grade distributions for three professors are shown below. What’s going on?