Chapter 3 (continued) Nutan S. Mishra. Exercises 3.11-3.15 Size of the data set = 12 for all the five problems In 3.11 variable x 1 = monthly rent of.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Descriptive Measures MARE 250 Dr. Jason Turner.
Class Session #2 Numerically Summarizing Data
The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set.
Numerically Summarizing Data
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Descriptive Statistics
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Calculating & Reporting Healthcare Statistics
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Descriptive Statistics – Central Tendency & Variability Chapter 3 (Part 2) MSIS 111 Prof. Nick Dedeke.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Slides by JOHN LOUCKS St. Edward’s University.
Measures of Dispersion
1 Numerical Summary Measures Lecture 03: Measures of Variation and Interpretation, and Measures of Relative Position.
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Describing Data Using Numerical Measures
Programming in R Describing Univariate and Multivariate data.
Department of Quantitative Methods & Information Systems
Describing distributions with numbers
Chapter 3 - Part B Descriptive Statistics: Numerical Methods
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.
Chapter 3 – Descriptive Statistics
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Copyright © 2005 Pearson Education, Inc. Slide 6-1.
4 - 1 Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Review Measures of central tendency
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
Describing distributions with numbers
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
INVESTIGATION 1.
Chapter 3 Data Description Section 3-3 Measures of Variation.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Chapter 3, Part B Descriptive Statistics: Numerical Measures n Measures of Distribution Shape, Relative Location, and Detecting Outliers n Exploratory.
Copyright © 2005 Pearson Education, Inc. Slide 6-1.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Using Measures of Position (rather than value) to Describe Spread? 1.
Outline of Today’s Discussion 1.Displaying the Order in a Group of Numbers: 2.The Mean, Variance, Standard Deviation, & Z-Scores 3.SPSS: Data Entry, Definition,
Last chapter... Four Corners: Go to your corner based on if your birthday falls in the Winter, Spring, Summer, or Fall; 1 minute In your group, come to.
Stat 2411 Statistical Methods Chapter 4. Measure of Variation.
Copyright © 2016 Brooks/Cole Cengage Learning Intro to Statistics Part II Descriptive Statistics Intro to Statistics Part II Descriptive Statistics Ernesto.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Section 2.1 Visualizing Distributions: Shape, Center, and Spread.
Lecture #3 Tuesday, August 30, 2016 Textbook: Sections 2.4 through 2.6
NUMERICAL DESCRIPTIVE MEASURES
Chapter 3 Describing Data Using Numerical Measures
Chapter 6 ENGR 201: Statistics for Engineers
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
NUMERICAL DESCRIPTIVE MEASURES
Descriptive Statistics
Summary Statistics 9/23/2018 Summary Statistics
Chapter 3 Describing Data Using Numerical Measures
Descriptive Statistics
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Advanced Algebra Unit 1 Vocabulary
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Presentation transcript:

Chapter 3 (continued) Nutan S. Mishra

Exercises Size of the data set = 12 for all the five problems In 3.11 variable x 1 = monthly rent of an apartment ($) In 3.12 variable x 2 = monthly phone bill ($) In 3.13 variable x 3 = price of gasoline ($/gallon) In 3.14 variable x 4 = amount paid to doctor ($/month) In 3.15 variable x 5 = prices of beer in a city ($) More description of the variables is given on page 82 Since all these five variables describe the amount of money, they are all continuous variables. They may take values between 0 to infinity.

Note Most of the statistical software including minitab prefer the raw (ungrouped) data as input and not the grouped data.

Shapes of frequency distributions Bell-shaped A bell-shaped picture, shown here, usually represents a normal distribution Bimodal A bimodal shape, shown here, has two peaks. This shape may show that the data has come from two different systems. If this shape occurs, the two sources should be separated and analyzed separately.

Shapes of frequency distributions Some histograms will show a skewed distribution to the right. A distribution skewed to the right is said to be positively skewed. This kind of distribution has a large number of occurrences in the lower value cells (left side) and few in the upper value cells (right side). Some histograms will show a skewed distribution to the left, as shown below. A distribution skewed to the left is said to be negatively skewed. This kind of distribution has a large number of occurrences in the upper value cells (right side) and few in the lower value cells (left side).

Parameters and Statistics Values of different numerical measures for population are called population parameters. For example population mean µ and population standard deviation σ are population parameters Values of different numerical measures for sample are called sample statistics. For example sample mean and sample standard deviation s are sample statistics. When the population is very very large, (most often) the population parameters are unknown and then we use sample statistics instead.

Interpreting the Standard Deviation Given two samples from a population, the sample with the larger standard deviation (SD) is the more variable –Example : We are using the SD as a relative or comparative measure How does the SD provide a measure of variability for a single sample or, what does 29.6 really mean?

Interpreting the Standard Deviation (continued) Consider the list of numbers: 10, 20, 30, 45, 50, 70, 85, 90 How many measurements are within 1 SD, 2 SDs of the mean? For 1 SD 4 out of 8, or 50% For 2 SD 8 out of 8, or 100%

Chebyshev’s Rule Applies to any data set, regardless of the shape of its frequency distribution No useful information on fraction of measurements falling within for samples and for populations At least of the measurements will fall w/in 2 SD of the mean; at least of the measurements will fall w/in 3 SD of the mean

Chebyshev’s Rule (continued) General formulation: For any number, at least of the measurements will fall within k SDs of the mean Gives the smallest percentages that are mathematically possible; the observed percentages can be much higher

The Empirical Rule A rule of thumb that applies to data sets that have a bell shaped, symmetric distribution –Approximately 68% of the measurements will fall within 1 SD of the mean –Approximately 95% of the measurements will fall within 2 SDs of the mean –Approximately 99.7% of the measurements will fall within 3 SDs of the mean

Solution to 3.78(a) Variable x = time taken to complete the race by a participant Given µ = 220 minutes σ = 20 minutes To find the percentage of people who completed their race between 180 and 260 minutes thus the numbers 180 and 260 are equi distant from the mean. In terms of σ, 40 = k σ i.e. 40 = k 20 i.e. k = 2 That is 180 and 260 are at a distance 2σ from mean Then by Chebyshev’s theorem at least (1 – ¼)% of runners completed the race between 180 and 260 minutes.

Solution to 3.83 (a) Variable x = annual salary of a teacher assistant in the state of Connecticut Given that µ = 24,317 σ = 2000 To find the percentage of the teacher assistants in the state whose annual salary is between 20,317 and 28,317 Also given that salary distribution has bell shaped curve. Let us compute the distance between mean and 20,317 in terms of σ : 24,317 – 20,317 = 4000 = 2 σ Similarly 28, ,317 = 4000 = 2 σ Thus using Empirical rule approximately 95% teacher assistants earn between the given two numbers.

Quartiles Are the values of variable x those divide the ordered dataset into four equal parts. There are three quartiles which divide an ordered data set into four equal parts; Q1. Q2, Q3 Q1 Q2 Q3 Obviously Q2 is the value which divides dataset into two equal parts thus Q2 is the median Q3 – Q1 is called inter quartile range.

Examples of quartiles N = 15 (N odd) Original Data Ordered Data Quartile Positions... Q1... M.. Q3... Positions N = 16 (N even) Original Data Ordered Data Quartile positions... Q1... -MED-... Q3... Positions Median at average the two middle positions when N is even.

Box plots To draw a box plot for the given dataset we need five summary measures Max value, Min value and three quartiles Inner fences = 1.5 * inter quartile range We will draw box plots with the help of minitab.