1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Lecture 2 Describing Data II ©. Summarizing and Describing Data Frequency distribution and the shape of the distribution Frequency distribution and the.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Descriptive Statistics: Numerical Measures
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Five-Number Summary 1 Smallest Value 2 First Quartile 3 Median 4
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 3-1.
Slides by JOHN LOUCKS St. Edward’s University.
Chapter 3, Part 1 Descriptive Statistics II: Numerical Methods
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Chapter 13 Statistics © 2008 Pearson Addison-Wesley. All rights reserved.
Correlation and Linear Regression
Economics 173 Business Statistics Lecture 2 Fall, 2001 Professor J. Petry
Chapter 3 - Part B Descriptive Statistics: Numerical Methods
1 1 Slide © 2001 South-Western /Thomson Learning  Anderson  Sweeney  Williams Anderson  Sweeney  Williams  Slides Prepared by JOHN LOUCKS  CONTEMPORARYBUSINESSSTATISTICS.
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Chapter 3 (continued) Nutan S. Mishra. Exercises Size of the data set = 12 for all the five problems In 3.11 variable x 1 = monthly rent of.
Numerical Descriptive Techniques
Chapter 3 – Descriptive Statistics
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
6.1 What is Statistics? Definition: Statistics – science of collecting, analyzing, and interpreting data in such a way that the conclusions can be objectively.
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1 Chapter 4 Numerical Methods for Describing Data.
© 2008 Brooks/Cole, a division of Thomson Learning, Inc. 1 Chapter 4 Numerical Methods for Describing Data.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
QBM117 Business Statistics Descriptive Statistics Numerical Descriptive Measures.
Business Statistics: Communicating with Numbers
4 - 1 Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Business and Finance Colleges Principles of Statistics Eng. Heba Hamad week
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Review Measures of central tendency
Descriptive Statistics: Numerical Methods
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
1 1 Slide The Weighted Mean and Working with Grouped Data n The Weighted Mean n Mean for Grouped Data n Variance for Grouped Data n Standard Deviation.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Chapter 3 Descriptive Statistics II: Additional Descriptive Measures and Data Displays.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Chapter 3, Part B Descriptive Statistics: Numerical Measures n Measures of Distribution Shape, Relative Location, and Detecting Outliers n Exploratory.
1 Chapter 4 Numerical Methods for Describing Data.
Unit 2 Section 2.4 – Day 2.
Data Summary Using Descriptive Measures Sections 3.1 – 3.6, 3.8
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Interpreting Histograms
1 1 Slide © 2003 South-Western/Thomson Learning TM Chapter 3 Descriptive Statistics: Numerical Methods n Measures of Variability n Measures of Relative.
Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Seventy efficiency apartments were randomly Seventy efficiency apartments were randomly sampled in a college town. The monthly rent prices for the apartments.
Chapter 3 Descriptive Statistics: Numerical Methods.
1 1 Slide © 2003 Thomson/South-Western. 2 2 Slide © 2003 Thomson/South-Western Chapter 3 Descriptive Statistics: Numerical Methods Part B n Measures of.
Chapter 3 Section 3 Measures of variation. Measures of Variation Example 3 – 18 Suppose we wish to test two experimental brands of outdoor paint to see.
Business and Economics 6th Edition
Descriptive Statistics
BUS173: Applied Statistics
Distribution Shape: Skewness
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
St. Edward’s University
Business and Economics 7th Edition
Presentation transcript:

1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach

2 2 Slide IS 310 – Business Statistics Measures of Distribution Shape, Relative Location, and Detecting Outliers n Distribution Shape n Chebyshev’s Theorem n Empirical Rule n Detecting Outliers

3 3 Slide IS 310 – Business Statistics Distribution Shape n In order to understand the shape of a distribution, we will refer to Histogram discussed in Chapter 2. n By looking at the Histogram, we will determine the shape of the distribution. n The shape of a distribution is measured with a quantity called Skewness.

4 4 Slide IS 310 – Business Statistics Distribution Shape: Skewness n An important measure of the shape of a distribution is called skewness. n The formula for computing skewness for a data set is somewhat complex. n Skewness can be easily computed using statistical software.

5 5 Slide IS 310 – Business Statistics Distribution Shape: Skewness n Symmetric (not skewed) Skewness is zero. Skewness is zero. Mean and median are equal. Mean and median are equal. Relative Frequency Skewness = 0 Skewness = 0

6 6 Slide IS 310 – Business Statistics Relative Frequency Distribution Shape: Skewness n Moderately Skewed Left Skewness is negative. Skewness is negative. Mean will usually be less than the median. Mean will usually be less than the median. Skewness = .31 Skewness = .31

7 7 Slide IS 310 – Business Statistics Distribution Shape: Skewness n Moderately Skewed Right Skewness is positive. Skewness is positive. Mean will usually be more than the median. Mean will usually be more than the median. Relative Frequency Skewness =.31 Skewness =.31

8 8 Slide IS 310 – Business Statistics Distribution Shape: Skewness n Highly Skewed Right Skewness is positive (often above 1.0). Skewness is positive (often above 1.0). Mean will usually be more than the median. Mean will usually be more than the median. Relative Frequency Skewness = 1.25 Skewness = 1.25

9 9 Slide IS 310 – Business Statistics Application of Standard Deviation Chebyshev’s Theorem How could we apply standard deviation to real-world problems? Let’s take the example of downtown LA. If we go further and further away from the downtown, we will capture more and more people. Similarly, if we go further and further from the mean of a data set, we will capture more and more of the data values. If we go three standard deviations from the mean, we will capture more data values than if we go two standard deviations. o o o o o o Mean Mean o o o o o o Mean Mean

10 Slide IS 310 – Business Statistics Chebyshev’s Theorem Chebyshev’s Theorem allows us to calculate the percent of data values that will be captured if we go so many standard deviations from the mean. 2 The formula is (1 – 1/z ) where z is the standard deviation and must be greater than 1. If we go 2 standard deviations from the mean, we will capture at least 2 (1 – 1/2 ) which is 75 percent (1 – 1/2 ) which is 75 percent

11 Slide IS 310 – Business Statistics At least of the data values must be At least of the data values must be within of the mean. within of the mean. At least of the data values must be At least of the data values must be within of the mean. within of the mean. 75%75% z = 2 standard deviations z = 2 standard deviations Chebyshev’s Theorem At least of the data values must be At least of the data values must be within of the mean. within of the mean. At least of the data values must be At least of the data values must be within of the mean. within of the mean.89%89% z = 3 standard deviations z = 3 standard deviations At least of the data values must be At least of the data values must be within of the mean. within of the mean. At least of the data values must be At least of the data values must be within of the mean. within of the mean. 94%94% z = 4 standard deviations z = 4 standard deviations

12 Slide IS 310 – Business Statistics Sample Problem Problem # 29 (10-Page 103; 11-Page 107) Mean = 6.9 Standard Deviation = 1.2 o o o o o o The distance between 6.9 and 4.5 and between 9.3 and 6.9 is 2.4 which is two standard deviations. Using z = 2, we calculate z = 2, we calculate 2 (1 – 1/2 ) = 75 (1 – 1/2 ) = 75 That means at least 75 percent of individuals sleep between 4.5 and 9.3 hours

13 Slide IS 310 – Business Statistics Sample Problem #29 Calculate the percentage of individuals who sleep between 4.5 hours and 10.5 hours       o o o o o o First, calculate the percentage who sleep between 4.5 and 6.9 hours. Then, calculate the percentage who sleep between 6.9 and 10.5 hours 37.5 percent individuals sleep between 4.5 and 6.9 hours. Percentage of individuals who sleep between 6.9 and 10.5 hours is half of 2 (1- 1/3 ) = 89 which is 44.5 Add 37.5 and 44.5 to get 82 percent. So, 82 percent of individuals sleep between 4.5 and 10.5 hours.

14 Slide IS 310 – Business Statistics Empirical Rule For data having a bell-shaped distribution: For data having a bell-shaped distribution: of the values of a normal random variable of the values of a normal random variable are within of its mean. are within of its mean. of the values of a normal random variable of the values of a normal random variable are within of its mean. are within of its mean.68.26%68.26% +/- 1 standard deviation of the values of a normal random variable of the values of a normal random variable are within of its mean. are within of its mean. of the values of a normal random variable of the values of a normal random variable are within of its mean. are within of its mean %95.44% +/- 2 standard deviations of the values of a normal random variable of the values of a normal random variable are within of its mean. are within of its mean. of the values of a normal random variable of the values of a normal random variable are within of its mean. are within of its mean.99.72%99.72% +/- 3 standard deviations

15 Slide IS 310 – Business Statistics Empirical Rule x  – 3   – 1   – 2   + 1   + 2   + 3  68.26% 95.44% 99.72%

16 Slide IS 310 – Business Statistics Sample Problem Problem # 30 (10-Page 103; ) Mean = 2.3 Standard Deviation = 0.1 o o o o o o The distance between 2.3 and 2.2 and between 2.4 and 2.3 is 0.1 which is one standard deviation. Therefore, approximately 68 percent of regular grade gasoline sold between $2.20 and $2.40.

17 Slide IS 310 – Business Statistics Continuation of Problem # 30 b. What percent of gasoline sold between $2.20 and $2.50? 34% 47.5% 34% 47.5% o o o o o o The distance between 2.3 and 2.2 is 0.1 which is one standard deviation. The distance between 2.5 and 2.3 is 0.2 which is two standard deviation. Take half of 68% and take half of 95% which gives us 81.5 percent. That means approximately 81.5 percent of regular grade gasoline sold between $2.2 and $2.5

18 Slide IS 310 – Business Statistics Measures of Association Between Two Variables n Covariance n Correlation Coefficient

19 Slide IS 310 – Business Statistics Covariance Positive values indicate a positive relationship. Positive values indicate a positive relationship. Negative values indicate a negative relationship. Negative values indicate a negative relationship. The covariance is a measure of the linear association The covariance is a measure of the linear association between two variables. between two variables. The covariance is a measure of the linear association The covariance is a measure of the linear association between two variables. between two variables.

20 Slide IS 310 – Business Statistics Covariance The covariance is computed as follows: The covariance is computed as follows: forsamples forpopulations

21 Slide IS 310 – Business Statistics Correlation Coefficient Just because two variables are highly correlated, it Just because two variables are highly correlated, it does not mean that one variable is the cause of the does not mean that one variable is the cause of the other. other. Just because two variables are highly correlated, it Just because two variables are highly correlated, it does not mean that one variable is the cause of the does not mean that one variable is the cause of the other. other. Correlation is a measure of linear association and not Correlation is a measure of linear association and not necessarily causation. necessarily causation. Correlation is a measure of linear association and not Correlation is a measure of linear association and not necessarily causation. necessarily causation.

22 Slide IS 310 – Business Statistics The correlation coefficient is computed as follows: The correlation coefficient is computed as follows: forsamplesforpopulations Correlation Coefficient

23 Slide IS 310 – Business Statistics Correlation Coefficient Values near +1 indicate a strong positive linear Values near +1 indicate a strong positive linear relationship. relationship. Values near +1 indicate a strong positive linear Values near +1 indicate a strong positive linear relationship. relationship. Values near -1 indicate a strong negative linear Values near -1 indicate a strong negative linear relationship. relationship. Values near -1 indicate a strong negative linear Values near -1 indicate a strong negative linear relationship. relationship. The coefficient can take on values between -1 and +1. The coefficient can take on values between -1 and +1.

24 Slide IS 310 – Business Statistics Covariance Problem n Use the data in Table 3.7 (10-Page 110; 11-Page 115) n Two variables are: Number of commercials and Sales volume n Week No. of Comm. Sales Volume ($100s) n n n s = 99/9 = 11 n xy n n calculations are in Table 3.8 n n n n

25 Slide IS 310 – Business Statistics Interpretation on Covariance n Look at Figure 3.8 (10-Page 112) Figure 3.9 (11-Page 117) n There are four quadrants. _ n Points in Quadrant I correspond to x’s greater than x n _ n and y’s greater than y; points in Quadrant II n _ _ n correspond to x’s less than x and y’s greater than y and so on. n If the value of s is positive, the points with the greatest influence on n xy n s must be in Quadrants I and III. If the value of s is negative, the n xy xy n points with the greatest influence must be in Quadrants II and IV.

26 Slide IS 310 – Business Statistics The Weighted Mean and Working with Grouped Data n Weighted Mean n Mean for Grouped Data n Variance for Grouped Data n Standard Deviation for Grouped Data

27 Slide IS 310 – Business Statistics Weighted Mean When the mean is computed by giving each data When the mean is computed by giving each data value a weight that reflects its importance, it is value a weight that reflects its importance, it is referred to as a weighted mean. referred to as a weighted mean. In the computation of a grade point average (GPA), In the computation of a grade point average (GPA), the weights are the number of credit hours earned for the weights are the number of credit hours earned for each grade. each grade. When data values vary in importance, the analyst When data values vary in importance, the analyst must choose the weight that best reflects the must choose the weight that best reflects the importance of each value. importance of each value.

28 Slide IS 310 – Business Statistics Weighted Mean where: x i = value of observation i x i = value of observation i w i = weight for observation i w i = weight for observation i

29 Slide IS 310 – Business Statistics Grouped Data The weighted mean computation can be used to The weighted mean computation can be used to obtain approximations of the mean, variance, and obtain approximations of the mean, variance, and standard deviation for the grouped data. standard deviation for the grouped data. To compute the weighted mean, we treat the To compute the weighted mean, we treat the midpoint of each class as though it were the mean midpoint of each class as though it were the mean of all items in the class. of all items in the class. We compute a weighted mean of the class midpoints We compute a weighted mean of the class midpoints using the class frequencies as weights. using the class frequencies as weights. Similarly, in computing the variance and standard Similarly, in computing the variance and standard deviation, the class frequencies are used as weights. deviation, the class frequencies are used as weights.

30 Slide IS 310 – Business Statistics Mean for Grouped Data where: f i = frequency of class i f i = frequency of class i M i = midpoint of class i M i = midpoint of class i n Sample Data n Population Data

31 Slide IS 310 – Business Statistics Given below is the previous sample of monthly rents for 70 efficiency apartments, presented here as grouped data in the form of a frequency distribution. Sample Mean for Grouped Data Sample Mean for Grouped Data

32 Slide IS 310 – Business Statistics Sample Mean for Grouped Data Sample Mean for Grouped Data This approximation differs by $2.41 from the actual sample mean of $

33 Slide IS 310 – Business Statistics End of Chapter 3, Part B