LECTURE 7 THURSDAY, 11 FEBRUARY STA291 Fall 2008.

Slides:



Advertisements
Similar presentations
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Advertisements

Descriptive Measures MARE 250 Dr. Jason Turner.
CHAPTER 1 Exploring Data
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Intro to Descriptive Statistics
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Homework Questions. Quiz! Shhh…. Once you are finished you can work on the warm- up (grab a handout)!
Business Statistics BU305 Chapter 3 Descriptive Stats: Numerical Methods.
QBM117 Business Statistics
CHAPTER 2: Describing Distributions with Numbers
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Describing Data: Numerical
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Programming in R Describing Univariate and Multivariate data.
Describing distributions with numbers
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
(c) 2007 IUPUI SPEA K300 (4392) Outline: Numerical Methods Measures of Central Tendency Representative value Mean Median, mode, midrange Measures of Dispersion.
LECTURE 6 TUESDAY, 10 FEBRUARY 2008 STA291. Administrative Suggested problems from the textbook (not graded): 4.2, 4.3, and 4.4 Check CengageNow for second.
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
CHAPTER 2: Describing Distributions with Numbers ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
STA Lecture 111 STA 291 Lecture 11 Describing Quantitative Data – Measures of Central Location Examples of mean and median –Review of Chapter 5.
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
STA Lecture 131 STA 291 Lecture 13, Chap. 6 Describing Quantitative Data – Measures of Central Location – Measures of Variability (spread)
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Chapter 2 Describing Data.
Lecture 3 Describing Data Using Numerical Measures.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Measure of Central Tendency Measures of central tendency – used to organize and summarize data so that you can understand a set of data. There are three.
THURSDAY, 24 SEPTEMBER 2009 STA291. Announcement Exam 1: September 30 th at 5pm to 7pm. Location MEH, Memorial Auditoriam. The make-up will be at 7:30pm.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Lecture 2 Dustin Lueker.  Center of the data ◦ Mean ◦ Median ◦ Mode  Dispersion of the data  Sometimes referred to as spread ◦ Variance, Standard deviation.
Box and Whisker Plots Measures of Central Tendency.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Summary Statistics: Measures of Location and Dispersion.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Chapter 3 Averages and Variation Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Chapter 6: Interpreting the Measures of Variability.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
LIS 570 Summarising and presenting data - Univariate analysis.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
What is a box-and-whisker plot? 5-number summary Quartile 1 st, 2 nd, and 3 rd quartiles Interquartile Range Outliers.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Lecture 16 Sec – Tue, Feb 12, 2008 The Five-Number Summary.
TUESDAY, 22 SEPTEMBER 2009 STA291. Exam 1: September 30 th at 5pm to 7pm. Location MEH, Memorial Auditoriam. The make-up will be at 7:30pm to 9:30pm at.
Descriptive Measures Descriptive Measure – A Unique Measure of a Data Set Central Tendency of Data Mean Median Mode 2) Dispersion or Spread of Data A.
CHAPTER 2: Describing Distributions with Numbers
Averages and Variation
CHAPTER 1 Exploring Data
Descriptive Statistics
Please take out Sec HW It is worth 20 points (2 pts
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
The absolute value of each deviation.
Basic Practice of Statistics - 3rd Edition
CHAPTER 2: Describing Distributions with Numbers
Chapter 1: Exploring Data
BUSINESS MATHEMATICS & STATISTICS.
Basic Practice of Statistics - 3rd Edition
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Presentation transcript:

LECTURE 7 THURSDAY, 11 FEBRUARY STA291 Fall 2008

Chap. 4: Numerical Descriptive Techniques 4.1 Measures of Central Location (last time) 4.2 Measures of Variability (this time) 4.3 Measures of Relative Standing and Box Plots (next time?) 2

Homework and Suggested Study Material 3 [10 points] Due Saturday, 27 September, 11pm Assignment HW3 on CengageNOW. Use the Study Tools at Thomson Now, click on our Courseware Book, and work through “Chapter 4 – Numerical Descriptive Techniques”. (Pre-test, study plan, and post-test) Suggested problems from the textbook: 4.20, 4.23, 4.24, 4.25, 4.40 If you are interested in Finance, please read p.96 and try problem 4.8. Feel free to ask about it in lab/office hour.

Review: Shapes of Distributions 4 Symmetric Distribution Skewed to the right Skewed to the left

Examples 5 Exponential Distribution Uniform Distribution Bimodal Distribution

Summarizing Data Numerically 6 Center of the data – Mean – Median – Mode Dispersion of the data – Variance, Standard deviation – Interquartile range – Range

Measuring Central Tendency (review) 7 “What is a typical measurement in the sample/ population?” Mean: Arithmetic average Median: Midpoint of the observations when they are arranged in increasing order Mode: Most frequent value

Mean vs. Median vs. Mode 8 The mean is sensitive to outliers, median and mode are not In general, the median is more appropriate for skewed data than the mean In some situations, the median may be too insensitive to changes in the data The mode may not be unique

Mean vs. Median vs. Mode 9 Mean: Interval data with an approximately symmetric distribution Median: Interval or ordinal data Mode: All types of data

Mean and Median 10 Example: For towns with population size 2500 to 4599 in the U.S. Northeast in 1994, the mean salary of chiefs of police was $37,527, and the median was $30,500. Does this suggest that the distribution of salary was skewed to the left, symmetric, or skewed to the right?

Mean, Median, Mode—Another Example 11 Identify the mode Identify the median response Mean?

Percentiles 12 The p th percentile is a number such that p% of the observations take values below it, and (100-p)% take values above it 50th percentile = median 25th percentile = lower quartile = Q 1 75th percentile = upper quartile = Q 3 In general, L p = (n+1)p/100 th is the spot in the ordered list of observations to find the p th percentile

Quartiles 13 25th percentile = lower quartile = approximately median of the observations below the median 75th percentile = upper quartile = approximately median of the observations above the median

Median and Quartiles can be found from a stem and leaf plot 14 Example: Murder Rate Data (w/o DC—key: 20|3 = 20.3) A quarter of the states has murder rate above… The median murder rate is… A quarter of the states has murder rate below…

Five-Number Summary 15 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software SAS output (Murder Rate Data) Note the distance from the median to the maximum compared to the median to the minimum.

Interquartile Range 16 The Interquartile Range (IQR) is the difference between upper and lower quartile IQR = Q 3 – Q 1 IQR= Range of values that contains the middle 50% of the data IQR increases as variability increases

Box Plot (AKA Box-and-Whiskers Plot) A box plot is basically a graphical version of the five- number summary (unless there are outliers) It consists of a box that contains the central 50%of the distribution (from lower quartile to upper quartile), A line within the box that marks the median, And whiskers that extend to the maximum and minimum values, unless there are outliers 17

Outliers An observation is an outlier if it falls – more than 1.5 IQR above the upper quartile or – more than 1.5 IQR below the lower quartile Example: Murder Rate Data w/o DC – upper quartile Q3 = 10.3 – IQR = 6.4 – Q IQR = _______ – Any outliers? 18

Five-Number Summary/Box Plot 19 On right-skewed distributions, minimum, Q 1, and median will be “bunched up”, while Q 3 and the maximum will be farther away. For left-skewed distributions, the “mirror” is true: the maximum, Q 3, and the median will be relatively close compared to the corresponding distances to Q 1 and the minimum. Guess on symmetric distributions?

Attendance Survey Question 7 20 On a your index card: – Please write down your name and section number – Today’s Question: