Ch3: Exploring Your Data with Descriptives 15 Sep 2011 BUSI275 Dr. Sean Ho HW1 due tonight 10pm Download and open “02-SportsShoes.xls”02-SportsShoes.xls.

Slides:



Advertisements
Similar presentations
Descriptive Measures MARE 250 Dr. Jason Turner.
Advertisements

Measures of Dispersion
Numerically Summarizing Data
Chapter 3 Describing Data Using Numerical Measures
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Chapter 3 Describing Data Using Numerical Measures
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 3-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
1 Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Range Standard Deviation Interquartile Range (IQR)
Slides by JOHN LOUCKS St. Edward’s University.
1 Business 260: Managerial Decision Analysis Professor David Mease Lecture 1 Agenda: 1) Course web page 2) Greensheet 3) Numerical Descriptive Measures.
Basic Business Statistics 10th Edition
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 3 Describing Data Using Numerical Measures.
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 2 Describing Data with Numerical Measurements
Describing Data Using Numerical Measures
Exploring Data 17 Jan 2012 Dr. Sean Ho busi275.seanho.com HW1 due Thu 10pm By Mon, send to set proposal meeting For lecture, please download: 01-SportsShoes.xls.
Department of Quantitative Methods & Information Systems
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
Objectives 1.2 Describing distributions with numbers
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.
Numerical Descriptive Techniques
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Methods for Describing Sets of Data
Anthony J Greene1 Dispersion Outline What is Dispersion? I Ordinal Variables 1.Range 2.Interquartile Range 3.Semi-Interquartile Range II Ratio/Interval.
Copyright © 2005 Pearson Education, Inc. Slide 6-1.
Ch2: Exploring Data: Charts 13 Sep 2011 BUSI275 Dr. Sean Ho HW1 due Thu 10pm Download and open “SportsShoes.xls”SportsShoes.xls.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Review Measures of central tendency
1 MATB344 Applied Statistics Chapter 2 Describing Data with Numerical Measures.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Descriptive Statistics1 LSSG Green Belt Training Descriptive Statistics.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Lecture 3 Describing Data Using Numerical Measures.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Business Statistics, A First Course.
Numerical Measures of Variability
Basic Business Statistics Chapter 3: Numerical Descriptive Measures Assoc. Prof. Dr. Mustafa Yüzükırmızı.
Describing Data Descriptive Statistics: Central Tendency and Variation.
Unit 3: Averages and Variations Week 6 Ms. Sanchez.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Basic Business Statistics 11 th Edition.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
Statistical Methods © 2004 Prentice-Hall, Inc. Week 3-1 Week 3 Numerical Descriptive Measures Statistical Methods.
Welcome to MM305 Unit 2 Seminar Dr. Bob Statistical Foundations for Quantitative Analysis.
Descriptive Statistics ( )
Methods for Describing Sets of Data
Descriptive Measures Descriptive Measure – A Unique Measure of a Data Set Central Tendency of Data Mean Median Mode 2) Dispersion or Spread of Data A.
Chapter 3 Describing Data Using Numerical Measures
Description of Data (Summary and Variability measures)
Laugh, and the world laughs with you. Weep and you weep alone
Chapter 3 Describing Data Using Numerical Measures
Numerical Descriptive Measures
Descriptive Statistics
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
Quartile Measures DCOVA
The absolute value of each deviation.
Section 2.4 Measures of Variation.
Numerical Descriptive Measures
Measures of Center and Spread
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Presentation transcript:

Ch3: Exploring Your Data with Descriptives 15 Sep 2011 BUSI275 Dr. Sean Ho HW1 due tonight 10pm Download and open “02-SportsShoes.xls”02-SportsShoes.xls

15 Sep 2011 BUSI275: Descriptives2 Outline for today Exploring data with charts: line, scatter Exploring data with descriptives: Measures of Centre  Mean, median, mode Quartiles, percentiles, boxplot Measures of Variation  Range, IQR  Standard deviation, variance, coef of var Empirical Rule and z-scores

15 Sep 2011 BUSI275: Descriptives3 2 quant. vars: scatterplot Each participant in the dataset is plotted as a point on a 2D graph (x,y) coordinates are that participant's observed values on the two variables Insert > XY Scatter If more than 2 vars, then either 3D scatter (hard to see), or Match up all pairs: matrix scatter

15 Sep 2011 BUSI275: Descriptives4 Time series: line graph Think of time as another variable Horizontal axis is time Insert > Line > Line

15 Sep 2011 BUSI275: Descriptives5 Descriptives: centres Visualizations are good, but numbers also help: Mostly just for quantitative vars Many ways to find the “centre” of a distribution Mean: AVERAGE()  Pop mean: μ ; sample mean: x  What happens if we have outliers? Median: line up all observations in order and pick the middle one Mode: most frequently occurring value  Usually not for continuous variables Statisti c AgeIncome Mean34.71$27, Median30$23, Mode24$19,000.00

15 Sep 2011 BUSI275: Descriptives6 Descriptives: quantiles The first quartile, Q 1, is the value ¼ of the way through the list of observations, in order Similarly, Q 3 is ¾ of the way through What's another name for Q 2 ? In general the p th percentile is the value p% of the way through the list of observations Rank = (p/100)n: if fractional, round up  If exactly integer, average the next two Median = which percentile? Excel: QUARTILE(data, 3), PERCENTILE(data,.70)

15 Sep 2011 BUSI275: Descriptives7 Box (and whiskers) plot Plot: median, Q 1, Q 3, and upper/lower limits: Upper limit = Q (IQR) Lower limit = Q 1 – 1.5(IQR) IQR = interquartile range = (Q 3 – Q 1 ) Observations outside the limits are considered outliers: draw as asterisks (*) Excel: try tweaking bar chartstry tweaking bar charts 25% 25% Outliers * * Lower lim Q1 Median Q3 Upper lim

15 Sep 2011 BUSI275: Descriptives8 Boxplots and skew Right-SkewedLeft-SkewedSymmetric Q1Q2Q3Q1Q2Q3 Q1Q2Q3

15 Sep 2011 BUSI275: Descriptives9 Boxplot Example Data: Right skewed, as the boxplot depicts: Min Q 1 Q 2 Q 3 Max * Upper limit = Q (Q 3 – Q 1 ) = (6 – 2) = is above the upper limit so is shown as an outlier

15 Sep 2011 BUSI275: Descriptives10 Measures of variation Spread (dispersion) of a distribution: are the data all clustered around the centre, or spread all over a wide range? Same center, different variation High variation Low variation

15 Sep 2011 BUSI275: Descriptives11 Range, IQR, standard deviation Simplest: range = max – min Is this robust to outliers? IQR = Q 3 – Q 1 (“too robust”?) Standard deviation: Population: Sample: In Excel: STDEV() Variance is the SD w/o square root Pop.Samp. Mean μx SD σ s

15 Sep 2011 BUSI275: Descriptives12 Coefficient of variation Coefficient of variation: SD relative to mean Expressed as a percentage / fraction e.g., Stock A has avg price x=$50 and s=$5 CV = s / x = 5/50 = 10% variation Stock B has x =$100 same standard deviation CV = s / x = 5/100 = 5% variation Stock B is less variable relative to its average stock price

15 Sep 2011 BUSI275: Descriptives13 SD and Empirical Rule Every distribution has a mean and SD, but for most “nice” distribs two rules of thumb hold: Empirical rule: for “nice” distribs, approximately 68% of data lie within ± 1 SD of the mean 95% within ±2 SD of the mean 99.7% within ±3 SD NausicaaDistribution

15 Sep 2011 BUSI275: Descriptives14 SD and Tchebysheff's Theorem For any distribution, at least (1-1/k 2 ) of the data will lie within k standard deviations of the mean Within (μ ± 1σ): ≥(1-1/1 2 ) = 0% Within (μ ± 2σ): ≥(1-1/2 2 ) = 75% Within (μ ± 3σ): ≥(1-1/3 2 ) = 89%

15 Sep 2011 BUSI275: Descriptives15 z-scores Describes a value's position relative to the mean, in units of standard deviations: z = (x – μ )/ σ e.g., you got a score of 35 on a test: is this good or bad? Depends on the mean, SD: μ=30, σ=10: then z = +0.5: pretty good μ=50, σ=5: then z = -3: really bad!

15 Sep 2011 BUSI275: Descriptives16 TODO HW1 (ch1-2): due tonight at 10pm Format as a clear, neat document Also upload your Excel spreadsheet HWs are to be individual work Get to know your classmates and form teams me when you know your team You can come up with a good name, too Discuss topics/variables you are interested in Find existing data, or gather your own?