Variability GrowingKnowing.com © 2011 GrowingKnowing.com © 2011.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Statistics 1: Introduction to Probability and Statistics Section 3-3.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
GrowingKnowing.com © GrowingKnowing.com © 2011.
12.3 – Measures of Dispersion
Math 116 Chapter 12.
Describing Data: Numerical
Economics 173 Business Statistics Lecture 2 Fall, 2001 Professor J. Petry
GrowingKnowing.com © Variability We often want to know the variability of data. Please give me $1000, I will give you… 8% to 9% in a year. Small.
GrowingKnowing.com © Percentile What ‘s the difference between percentile and percent? Percent measures ratio 90% percent on a test shows you.
Chapter 3 – Descriptive Statistics
GrowingKnowing.com © GrowingKnowing.com © 2011.
Smith/Davis (c) 2005 Prentice Hall Chapter Six Summarizing and Comparing Data: Measures of Variation, Distribution of Means and the Standard Error of the.
4 - 1 Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
GrowingKnowing.com © Expected value Expected value is a weighted mean Example You put your data in categories by product You build a frequency.
Section 3.2 Measures of Dispersion. Range = the difference between the largest value and smallest value range = (largest value) – (smallest value)
Basic Measurement and Statistics in Testing. Outline Central Tendency and Dispersion Standardized Scores Error and Standard Error of Measurement (Sm)
GrowingKnowing.com © Estimates We are often asked to predict the future! When will you complete your team project? When will you make your first.
Central Tendency & Dispersion
GrowingKnowing.com © Estimates We are often asked to predict the future! When will you complete your team project? When will you make your first.
Copyright © 2011 Pearson Education, Inc. Describing Numerical Data Chapter 4.
Data Summary Using Descriptive Measures Sections 3.1 – 3.6, 3.8
Measures of Variation Range Standard Deviation Variance.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
Measures of Variation. Range, Variance, & Standard Deviation.
Copyright © 2013, 2010 and 2007 Pearson Education, Inc.
Discrete Probability Distributions
Chapter Six Summarizing and Comparing Data: Measures of Variation, Distribution of Means and the Standard Error of the Mean, and z Scores PowerPoint Presentation.
Position GrowingKnowing.com © 2011 GrowingKnowing.com © 2011.
Confidence Intervals Excel
INTRODUCTION TO STATISTICS
Measures of Dispersion
Different Types of Data
Confidence Intervals GrowingKnowing.com © 2011
Descriptive Statistics
Normal distribution GrowingKnowing.com © 2012
Measures of Dispersion
Chapter 3 Describing Data Using Numerical Measures
Teaching Statistics in Psychology
Introduction to Summary Statistics
Descriptive Statistics: Presenting and Describing Data
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Midrange (rarely used)
NUMERICAL DESCRIPTIVE MEASURES
Chapter 3 Describing Data Using Numerical Measures
Section 3.2 Measures of Spread.
Numerical Descriptive Measures
Descriptive Statistics
Introduction to Summary Statistics
Please take out Sec HW It is worth 20 points (2 pts
Introduction to Summary Statistics
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Fall 2017 Room 150 Harvill Building 10: :50 Mondays, Wednesdays.
BUS7010 Quant Prep Statistics in Business and Economics
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Introduction to Summary Statistics
Measures in Variability
Measures of Dispersion (Spread)
Quantitative Methods PSY302 Quiz Normal Curve Review February 6, 2017
Numerical Descriptive Measures
Describing distributions with numbers
Introduction to Summary Statistics
Summary (Week 1) Categorical vs. Quantitative Variables
Click the mouse button or press the Space Bar to display the answers.
Summary (Week 1) Categorical vs. Quantitative Variables
Introduction to Summary Statistics
Introduction to Summary Statistics
Business and Economics 7th Edition
Presentation transcript:

Variability GrowingKnowing.com © 2011 GrowingKnowing.com © 2011

Variability We often want to know the variability of data. Please give me $1000, I will give you… 8% to 9% in a year. Small variability. -50% to 300% in a week. Large variability. Most people prefer certainty to variability. We won’t meet in this classroom next week, and I am not certain where we will meet? Sound good? Everyone happy? Business people consider variability a risk. Business people like to avoid risks. We can measure variability using range, variance, standard deviation, and coefficient of variation. GrowingKnowing.com © 2011

Range The range is the largest number minus the smallest. What is the Range of 1, 3, 4 and 9? Range = 9 – 1 = 8 The range is fast and easy but a crude measure We don’t know if most or a few data items are variable. The common mistake in range is going too fast and missing the actual smallest or largest number. GrowingKnowing.com © 2011

Excel Excel does not have an =RANGE function. Use two functions TIP: if you do a search in Excel Help files you will find lots of references to range because Excel uses the term often in a non-statistical way to refer to a group of cells. Use two functions =MAX to find the largest number =MIN for the smallest number Subtract =max from =min to get the range. Example: =MAX(a1:a9)-MIN(a1:a9) GrowingKnowing.com © 2011

Variance Variance subtracts every data item from the mean Variance is a better measure of variability than range because you look at every data item rather than just 2 data items. Variance is not easy to understand as the measure is units squared. For example, if your data measures how long a job takes in seconds, variance will be seconds squared (seconds2). Most people cannot understand or visualize a squared second. Variance is important because variance is used to calculate the standard deviation which is a very useful measure. GrowingKnowing.com © 2011

Formula Sample variance: Population variance: Σ is called "Sigma" (upper case) and requires you sum the data values. xi represents each data value. x̄ is pronounced "x bar" and it represents a sample mean. μ is called "mu" and it represents a population mean. n is the count of the number of data values in a set of sample data. N is the count of the number of data values in a population data set. Note: Sample and population formulas are different! GrowingKnowing.com © 2011

Excel =VAR(a1:a5) for a sample =VARP(a1:a5) for a population To use the correct formula, you must know, are you are working with a sample or population? GrowingKnowing.com © 2011

Hard - Manual Calculation What is the sample variance of 1, 2, 3 days? 1) Calculate the mean: =(1+2+3) / n = 6 /3 = 2 2)Data xi - x̄ (xi - x̄)2 1 1 - 2 = -1 1 2 2 - 2 = 0 0 3 3 - 2 = +1 1 Totals: 0 2 Variance = 2 / (3-1) = 1 days2 GrowingKnowing.com © 2011

Easy- Manual Calculation Hard formula: Easy formula: s2 = n∑x2 – (∑x) 2 n(n – 1) For sample, divide by n(n-1) in the easy method For population, divide by n(n) in the easy method No need to calculate a mean in the easy formula. GrowingKnowing.com © 2013

Easy- Variance Manual Calculation What is the sample variance of 1, 2, 3 days? x x2 1 1 2 4 3 9 6 14 s2 = n∑x2 – (∑x)2 n(n – 1) = 3(14) – 62 = 42 – 36 = 1 days2 3(3-1) 6 GrowingKnowing.com © 2013

Standard deviation Standard deviations (abbreviated S.D.) is used often. Calculate S.D. by taking the square root of the variance. If variance = 9, calculate the standard deviation? Manual S.D. = 9 = 3 Excel uses the square root function. =SQRT(9) Excel can calculate S.D. without calculating variance Assuming data is in cells A1 to A9 Sample standard deviation =STDEV(A1:A9) Population standard deviation =STDEVP(A1:A9) S.D. uses same unit of measurement as the data If your data is dollars, S.D. is dollars, but variance is in dollars2 GrowingKnowing.com © 2011

Skewed When you chart your data is the data symmetrical, or lopsided to the right or left? A skew value of zero indicates symmetrical Notice the long tail on the skewed diagram Symmetrical Skewed right GrowingKnowing.com © 2011

Formula There is more than one formula for skewness. The above formula is used for the test questions on our website. Excel has a function called =SKEW(a1:a9) Excel uses a different formula than the one above. Check with your teacher to see what method is preferred. GrowingKnowing.com © 2011

Skew questions The question may provide raw data, if so, calculate the mean, median, and standard deviation and use the results to find skewness The question may give you the mean, median, and standard deviation, so 3 fewer calculations are needed. A popular test question asks if data is skewed right or left by comparing the mean, median, and mode. If the mean, median, or mode are approximately equal then the data is symmetrical If the data is skewed, the mean will be pulled towards the long tail since the mean is easily influenced by extreme values If the mean is larger than the median, data is skewed right If mean is smaller than the median, data is skewed left GrowingKnowing.com © 2011

This example is skewed to the left. Calculate if the data is skewed for these numbers: 1, 2, 3, 4, 5, 9, 23, -5, and -39. In Excel: =3*(AVERAGE(a1:a9)-MEDIAN(a1:a9))/STDEV(a1:a9) = -0.4813 This example is skewed to the left. GrowingKnowing.com © 2011

Empirical Rule Many books are about six sigma; a concept using the empirical rule. Six sigma is popular in business to set quality objectives If your data is normally distributed, the empirical rule states (S.D. is abbreviated for standard deviation) 68% of the data will fall between 1 S.D. from the mean 95% of the data will fall between 2 S.D. from the mean 99.7% of the data will fall between 3 S.D. from the mean We recommend you memorize the values: .68, .95, or .997 The symbol for S.D. is sigma, so 3 S.D. above the mean plus 3 S.D. below the mean adds to 6 sigma 99.7% of work must meet quality objectives to be six sigma Is six sigma quality good enough? GrowingKnowing.com © 2011

Calculate the empirical rule There are 2 types of questions You are given the probability and asked for the data interval You are given the data interval and asked for the probability GrowingKnowing.com © 2011

Probability given: With a mean of 600, and S. D Probability given: With a mean of 600, and S.D. of 10, what is the interval needed to hold 95% of the data? 95% is 2 S.D. above and below the mean (memorized) Upper value is 600+10+10 = 620 Lower value is 600-10-10 = 580 Answer is 580 to 620 GrowingKnowing.com © 2011

With a mean of 20, standard deviation of 2, and interval of 18 to 22, what is the probability data lies in this interval? 22 is 1 S.D. above the mean, 18 is 1 S.D. below Answer = .68 because we memorized it 1 S.D. is 68%. GrowingKnowing.com © 2011

Chebyshev Empirical rule is for data that is normally distributed. Chebyshev is for data NOT normally distributed. Formula Percent of data = 1 – 1/standard deviation2 The questions are similar to the empirical rule GrowingKnowing.com © 2011

For data that is NOT normally distributed, what is the probability data will fall within 2 standard deviations of the mean? Probability = 1 – 1/22 = 1 – ¼ = 75% Answer: 75% of data will fall within 2 standard deviations of the mean. Using Chebyshev, what is the interval and percentage of data that will fall within 3 standard deviations if the mean is 100 and standard deviation is 10. For 3 standard deviations, mean +/- 3 x S.D. Upper interval = 100 +30 =130 Lower interval = 100 -30 = 70 Percent = 1 – 1/32 = 1 – 1/9 = .89 There is 89% probability data will fall within 3 std deviations. GrowingKnowing.com © 2011

Coefficient of Variation Is standard deviation (S.D) of 100 large variability? We are not sure until we know the size of the mean Mean = 200, then S.D. = 100 is very large Mean = 10 million then variability of S.D. = 100 is small How do we compare variability if one variable measures payment by the hour and another measures payment by commission in dollars? We can compare using Coefficient of Variation Formula: Coefficient Variation = (S.D/ Mean) x 100 GrowingKnowing.com © 2011

Coefficient of Variation Example Teacher A grades students finding a mean of 80 with standard deviation of 10. Teacher B grades students finds a mean of 1000 and standard deviation of 50. Which teacher has more consistent results? Step 1: CV Teacher A = 10 / 80 × 100% = 12.5% Step 2: CV Teacher B = 50 / 1000 × 100% = 5% Step 3: Teacher B has more consistent student results. Teacher A has more variability. GrowingKnowing.com © 2011

or the beginning of the end, of the beginning or the beginning of the end, if you have not been practising enough problems Keep practising questions is the key to success. End when you can get 3 questions right in a row at the hardest difficulty level for every topic. Run the Progress report to see your completion results. GrowingKnowing.com © 2011