Distribution Shape: Skewness

Slides:



Advertisements
Similar presentations
The procedure for finding the variance and standard deviation for grouped data is similar to that for finding the mean for grouped data, and.
Advertisements

Terminology A statistic is a number calculated from a sample of data. For each different sample, the value of the statistic is a uniquely determined number.
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Descriptive Statistics: Numerical Measures
Chapter 8 Confidence Intervals 8.1 Confidence Intervals about a Population Mean,  Known.
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Slides by JOHN LOUCKS St. Edward’s University.
Chapter 3, Part 1 Descriptive Statistics II: Numerical Methods
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Review of Basic Statistics. Definitions Population - The set of all items of interest in a statistical problem e.g. - Houses in Sacramento Parameter -
Chapter 3 - Part B Descriptive Statistics: Numerical Methods
1 1 Slide © 2001 South-Western /Thomson Learning  Anderson  Sweeney  Williams Anderson  Sweeney  Williams  Slides Prepared by JOHN LOUCKS  CONTEMPORARYBUSINESSSTATISTICS.
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.
Chapter 3 – Descriptive Statistics
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Unit 3 Section 3-3 – Day : Measures of Variation  Range – the highest value minus the lowest value.  The symbol R is used for range.  Variance.
Probabilistic and Statistical Techniques
Some Useful Continuous Probability Distributions.
Chapter 3.2 Measures of Variance.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
Section 3-3 Measures of Variation. WAITING TIMES AT DIFFERENT BANKS Jefferson Valley Bank (single waiting line) Bank of Providence.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Refer to Ex 3-18 on page Record the info for Brand A in a column. Allow 3 adjacent other columns to be added. Do the same for Brand B.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
B AD 6243: Applied Univariate Statistics Data Distributions and Sampling Professor Laku Chidambaram Price College of Business University of Oklahoma.
Data Set: Apartment Rents (in ascending order)
Chapter 3, Part B Descriptive Statistics: Numerical Measures n Measures of Distribution Shape, Relative Location, and Detecting Outliers n Exploratory.
Chapter 3, part C. III. Uses of means and standard deviations Of course we don’t just calculate measures of location and dispersion just because we can,
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
1 Chebyshev’s Theorem. 2 However, the concept of data spread about the mean can be expressed quite generally for all data distributions (skewed, symmetric,
The Normal Distribution Chapter 2 Continuous Random Variable A continuous random variable: –Represented by a function/graph. –Area under the curve represents.
Seventy efficiency apartments were randomly Seventy efficiency apartments were randomly sampled in a college town. The monthly rent prices for the apartments.
1 1 Slide © 2003 Thomson/South-Western. 2 2 Slide © 2003 Thomson/South-Western Chapter 3 Descriptive Statistics: Numerical Methods Part B n Measures of.
Do Now Find the mean and standard deviation for the data set (assume sample data):
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Chapter 3 Section 3 Measures of variation. Measures of Variation Example 3 – 18 Suppose we wish to test two experimental brands of outdoor paint to see.
Math 201: Chapter 2 Sections 3,4,5,6,7,9.
Sampling Distributions
Business and Economics 6th Edition
Chapter 6 Continuous Probability Distribution
Understanding Sampling Distributions: Statistics as Random Variables
Measures of Dispersion
Chapter 3 Descriptive Statistics: Numerical Measures Part A
Chebyshev’s Theorem and the Empirical Rule
To compare information such as the mean and standard deviation it is useful to be able to describe how far away a particular observation is from the mean.
Chapter 12 Statistics 2012 Pearson Education, Inc.
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Week 10 Chapter 16. Confidence Intervals for Proportions
ANATOMY OF THE EMPIRICAL RULE
12/1/2018 Normal Distributions
Chapter 6 Confidence Intervals.
Monthly Starting Salary (TRL)
The Normal Distribution
The Normal Distribution
Empirical Rule Empirical rule only works for a bell-shaped distribution. That is, empirical rule cannot be applied to other distributions such as left-skewed,
10-5 The normal distribution
St. Edward’s University
The Normal Distribution
Chapter 5 A Normal World.
The Normal Distribution
Refer to Ex 3-18 on page Record the info for Brand A in a column. Allow 3 adjacent other columns to be added. Do the same for Brand B.
Elementary Statistics: Picturing The World
St. Edward’s University
Section 2.5 notes continued
Normal Distributions and the Empirical Rule
Chapter 12 Statistics.
Presentation transcript:

Distribution Shape: Skewness Example: Apartment Rents Seventy efficiency apartments were randomly sampled in a college town. The monthly rent prices for the apartments are listed below in ascending order. 525 530 535 540 545 550 560 565 570 572 575 580 585 590 600 610 615 625 635 649 650 670 675 680 690 700 715

z-Scores The z-score is often called the standardized value. It denotes the number of standard deviations a data value xi is from the mean. Excel’s STANDARDIZE function can be used to compute the z-score.

z-Scores An observation’s z-score is a measure of the relative location of the observation in a data set. A data value less than the sample mean will have a z-score less than zero. A data value greater than the sample mean will have a z-score greater than zero. A data value equal to the sample mean will have a z-score of zero.

Chebyshev’s Theorem At least (1 - 1/z2) of the items in any data set will be within z standard deviations of the mean, where z is any value greater than 1. Chebyshev’s theorem requires z > 1, but z need not be an integer.

Chebyshev’s Theorem At least of the data values must be 75% within of the mean. 75% z = 2 standard deviations At least of the data values must be within of the mean. 89% z = 3 standard deviations At least of the data values must be within of the mean. 94% z = 4 standard deviations

Empirical Rule When the data are believed to approximate a bell-shaped distribution … The empirical rule can be used to determine the percentage of data values that must be within a specified number of standard deviations of the mean. The empirical rule is based on the normal distribution, which is covered in Chapter 6.

+/- 1 standard deviation Empirical Rule For data having a bell-shaped distribution: of the values of a normal random variable are within of its mean. 68.26% +/- 1 standard deviation of the values of a normal random variable are within of its mean. 95.44% +/- 2 standard deviations of the values of a normal random variable are within of its mean. 99.72% +/- 3 standard deviations

Empirical Rule x 99.72% 95.44% 68.26% m m – 3s m – 1s m + 1s m + 3s

Detecting Outliers An outlier is an unusually small or unusually large value in a data set. A data value with a z-score less than -3 or greater than +3 might be considered an outlier. It might be: an incorrectly recorded data value a data value that was incorrectly included in the data set a correctly recorded data value that belongs in the data set

A survey of local companies found that the mean amount of travel allowances for executives was 0.25 USD per mile. The standard deviation was 0.02 USD. Using Chebyshev`s Theorem find the minimum percentage of the data values that will fall between 0.20 USD and 0.30 USD.

Problem Based on a survey of dental practitioners, the study reported that the mean number of units of local anesthetics used per week by dentists was 79, with a standard deviation of 23. Suppose we want to determine the percentage of dentists who use less than 102 units of local anesthetics per week. a- Assuming nothing is known about the shape of the distribution for the data, what percentage of dentists use less than 102 units of local anesthetics per week? b- Assuming that the data has a mound-shaped (bell-shaped or symmetric) distribution, what percentage of dentists use less than 102 units of local anesthetics per week?

Mean Standard Deviation Hand rubbing 35 59 Hand washing 69 106 Problem Based on the study to compare the effectiveness of washing the hands with soap and rubbing the hands with alcohol-based antiseptics. Table: Descriptive statistics on bacteria counts for the two groups of health care workers. Mean Standard Deviation Hand rubbing 35 59 Hand washing 69 106 a- For hand rubbers, form an interval that contains at least 75% of the bacterial counts. b- For hand washers, form an interval that contains at least 75% of the bacterial counts. (Note that the bacterial count cannot be less than 0) c- On the basis of your results in parts a and b, make an inference about the effectiveness of the two hand cleaning methods.