Univariate Descriptive Statistics Chapter 2. Lecture Overview Tabular and Graphical Techniques Distributions Measures of Central Tendency Measures of.

Slides:



Advertisements
Similar presentations
BCOR 1020 Business Statistics Lecture 4 – January 29, 2008.
Advertisements

Introduction to Summary Statistics
FREQUENCY ANALYSIS Basic Problem: To relate the magnitude of extreme events to their frequency of occurrence through the use of probability distributions.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Descriptive Statistics A.A. Elimam College of Business San Francisco State University.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 3-1.
Intro to Descriptive Statistics
Descriptive statistics (Part I)
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
Summarizing Scores With Measures of Central Tendency
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
(c) 2007 IUPUI SPEA K300 (4392) Outline: Numerical Methods Measures of Central Tendency Representative value Mean Median, mode, midrange Measures of Dispersion.
Chapter 3 Statistical Concepts.
Numerical Descriptive Techniques
Chapter 3 – Descriptive Statistics
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Created by Tom Wegleitner, Centreville, Virginia Section 3-1 Review and.
1 DATA DESCRIPTION. 2 Units l Unit: entity we are studying, subject if human being l Each unit/subject has certain parameters, e.g., a student (subject)
Methods for Describing Sets of Data
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Chapter 3 Descriptive Measures
Lecture 3 A Brief Review of Some Important Statistical Concepts.
Measures of Spread Chapter 3.3 – Tools for Analyzing Data I can: calculate and interpret measures of spread MSIP/Home Learning: p. 168 #2b, 3b, 4, 6, 7,
1 Review Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central Measures (mean,
QBM117 Business Statistics Descriptive Statistics Numerical Descriptive Measures.
STA Lecture 131 STA 291 Lecture 13, Chap. 6 Describing Quantitative Data – Measures of Central Location – Measures of Variability (spread)
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Psyc 235: Introduction to Statistics Lecture Format New Content/Conceptual Info Questions & Work through problems.
Graphical Displays of Information 3.1 – Tools for Analyzing Data Learning Goal: Identify the shape of a histogram MSIP / Home Learning: p. 146 #1, 2, 4,
Graphical Displays of Information Chapter 3.1 – Tools for Analyzing Data Mathematics of Data Management (Nelson) MDM 4U.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
Lecture 3 Describing Data Using Numerical Measures.
Skewness & Kurtosis: Reference
Warm up The following graphs show foot sizes of gongshowhockey.com users. What shape are the distributions? Calculate the mean, median and mode for one.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Review Sections Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Measures of Central Tendency Chapter 3.2 – Tools for Analyzing Data Mathematics of Data Management (Nelson) MDM 4U.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
1 Review Sections 2.1, 2.2, 1.3, 1.4, 1.5, 1.6 in text.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Symbol Description It would be a good idea now to start looking at the symbols which will be part of your study of statistics.  The uppercase Greek letter.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
Graphical Displays of Information
LIS 570 Summarising and presenting data - Univariate analysis.
Chapter 2 Descriptive Statistics
Unit 2 Section 2.3. What we will be able to do throughout this part of the chapter…  Use statistical methods to summarize data  The most familiar method.
Univariate Statistics Demonstration Day. Topics Questions Calculating measures of central tendency & dispersion by hand and in Excel Summation Notation.
Engineering College of Engineering Engineering Education Innovation Center Analyzing Measurement Data Rev: , MCAnalyzing Data1.
Summation Notation, Percentiles and Measures of Central Tendency Overheads 3.
Data Description Chapter 3. The Focus of Chapter 3  Chapter 2 showed you how to organize and present data.  Chapter 3 will show you how to summarize.
CHAPTER 3 – Numerical Techniques for Describing Data 3.1 Measures of Central Tendency 3.2 Measures of Variability.
StatisticsStatistics Unit 5. Example 2 We reviewed the three Measures of Central Tendency: Mean, Median, and Mode. We also looked at one Measure of Dispersion.
Chapter 3 Numerical Descriptive Measures. 3.1 Measures of central tendency for ungrouped data A measure of central tendency gives the center of a histogram.
An Introduction to Statistics
Chapter 4 – Statistics II
PRESENTATION OF DATA.
Methods for Describing Sets of Data
Chapter 3 Describing Data Using Numerical Measures
Chapter 2: Methods for Describing Data Sets
Dr.Fatima Alkhaledy M.B.Ch.B;F.I.C.M.S/C.M
4. Interpreting sets of data
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
An Introduction to Statistics
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Measures of Dispersion
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Presentation transcript:

Univariate Descriptive Statistics Chapter 2

Lecture Overview Tabular and Graphical Techniques Distributions Measures of Central Tendency Measures of Dispersion

Tabular and Graphical Techniques Frequency Tables –Ungrouped –Grouped Histograms Cumulative Frequency Histogram

Frequency Tables BinFrequency

Histograms Note: sometimes percent is on the Y axis rather than frequency

Cumulative Frequency Histograms

Key Concepts Choosing Intervals (i.e., choosing your “bins”) Rules from the textbook (pages 38 – 39) Commonly Used Examples from GIS –Equal Interval –Quantiles (e.g., quartiles and quintiles) –Natural Breaks –Standard Deviation

Rules For Bin Sizes Note: This is very relevant for GIS Rule 1: Use intervals with simple bounds Rule 2: Respect natural breakpoints Rule 3: Intervals should not overlap Rule 4: Intervals should be the same width Rule 5: Select an appropriate number of classes

The Effect of Classification Equal Interval –Splits data into user-specified number of classes of equal width –Each class has a different number of observations

The Effect of Classification Quantiles –Data divided so that there are an equal number of observations are in each class –Some classes can have quite narrow intervals

The Effect of Classification Natural Breaks –Splits data into classes based on natural breaks represented in the data histogram

The Effect of Classification Standard Deviation –Mean + or – Std. Deviation(s)

Key Concepts Making sense of your histograms using distributions –Rectangular –Unimodal –Bimodal –Multimodal –Skew (positive and negative)

Bimodal Distribution

Multimodal Distribution

Skew An asymmetrical distribution

Measures of Central Tendency Measures of central tendency –Measures of the location of the middle or the center of a distribution –Mean, median, mode, midrange

Definitions Midrange Mode Median –Quantiles Mean

Definitions Sample Mean Population Mean

Description of Mean Mean – Most commonly used measure of central tendency Average of all observations The sum of all the scores divided by the number of scores Note: Assuming that each observation is equally significant

Symbols n: the number of observations N: the number of elements in the whole population Σ: this (capital sigma) is the symbol for sum i: the starting point of a series of numbers X: one element in our dataset, usually has a subscript (e.g., i, min, max) : the sample mean : the population mean

Summation Notation: Components indicates we are taking a sum refers to where the sum of terms begins refers to where the sum of terms ends indicates what we are summing up

Mathematical Notation of Mean The mathematical notation used most often in this course is the summation notation The Greek letter capital sigma is used as a shorthand way of indicating that a sum is to be taken: The expression is equivalent to:

A summation will often be written leaving out the upper and/or lower limits of the summation, assuming that all of the terms available are to be summed Summation Notation: Simplification

Equation for Mean Sample mean: Population mean:

Example Mean Calculations Example I –Data: 8, 4, 2, 6, 10 Example II –Sample: 10 trees randomly selected from Battle Park –Diameter (inches): 9.8, 10.2, 10.1, 14.5, 17.5, 13.9, 20.0, 15.5, 7.8, 24.5

Example Mean Calculations Example III Monthly mean temperature (°F) at Chapel Hill, NC (2001). Annual mean temperature (°F)

Mean annual precipitation (mm) Mean annual temperature (°F) (°F) Mean (mm) Mean Examples IV & V Chapel Hill, NC ( )

Advantage –Sensitive to any change in the value of any observation Disadvantage –Very sensitive to outliers Explanation of Mean #Tree Height (m) #Tree Height (m) Mean = 6.19 mwithout #8 Mean = 8.10 mwith #8

Measures of Dispersion Used to describe the data dispersion/spread/variation/deviation numerically Usually used in conjunction with measures of central tendency

Measures of variation score # of obs score Low variationHigh variation Groups have equal means and equal n, but one varies more than the other

Definitions Range Mean Deviation Variance Standard Deviation Coefficient of Variation Pearson’s

Symbols s 2 : the sample variance σ 2 : the population variance s: the sample standard deviation σ : the population standard deviation

Sample Variance and Standard Deviation Note: as with the mean there are both sample and population standard deviations & variances VarianceStandard Deviation

Next Class Read chapter 3 Work on the homework Come with questions Bring your laptop