Part 1: Data Presentation 1-1/41 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics.

Presentation transcript:

Part 1: Data Presentation 1-1/41 Statistics and Data Analysis Professor William Greene Stern School of Business IOMS Department Department of Economics

Part 1: Data Presentation 1-2/41 Statistics and Data Analysis Part 1 – Data Presentation Telling your story statistically

Part 1: Data Presentation 1-3/41 The Visual Data Do Tell the Story: Napoleon’s March to and from Moscow

Part 1: Data Presentation 1-4/41 What is the story?

Part 1: Data Presentation 1-5/41 Life Expectancy: Highest 15 Countries, 2010 Disability Adjusted Life Expectancy

Part 1: Data Presentation 1-6/41 A Dynamic Picture

Part 1: Data Presentation 1-7/41 Healthcare ‘Efficiency:’ Source: Bloomberg. August 2013 What do we mean by ‘efficiency?’

Part 1: Data Presentation 1-8/41

Part 1: Data Presentation 1-9/41 Source: Bloomberg. August 2013

Part 1: Data Presentation 1-10/41 Probability of Survival to Age 50, Female at Birth, U.S. and 20 Other Wealthy Countries. It is possible to be misled (slightly) by a presentation such as this one. Note the vertical axis. What does this graph tell you?

Part 1: Data Presentation 1-11/41

Part 1: Data Presentation 1-12/41 Does living longer make people happier? Or do people live longer because they are happier?

Part 1: Data Presentation 1-13/41 Does the Picture Tell the Story? New York Times, Page RE1, July 24, 2014 This is the only graphic in the article. The article compares default rates on VA vs. FHA mortgages. Is there anything wrong with this picture?

Part 1: Data Presentation 1-14/41 Data Presentation Agenda
• Data Types: Cross Section and Time Series
• Summarizing Data Graphically
    Pie chart, bar chart
    Box plot, histogram
• Summarizing Data with Descriptive Statistics
    Central tendency
    Spread
    Distribution (shape)

Part 1: Data Presentation 1-15/41 Data = A Set of Facts A picture of some aspect of the world Pizza Sales by Type What do the data tell you? How can you use the information? What additional information would make these data (more) informative?

Part 1: Data Presentation 1-16/41 Data Types and Measurement
• Quantitative
    Discrete = count: Number of car accidents by city by time
    Continuous = measurement: Housing prices
• Qualitative
    Categorical: Shopping mall, car brand, trip mode
    Ordinal: Survey data on attitudes; “How do you feel about…?” Strongly disagree, Disagree, Neutral, Agree, Strongly agree
    Moody’s bond ratings: Aaa, Aa, A, Baa, Ba, B, and so on.
• Frameworks
    Cross section
    Time series

Part 1: Data Presentation 1-17/41 Discrete, Count Data

Part 1: Data Presentation 1-18/41 Discrete Data – US Crime Statistics; Counts of Occurrences.

Part 1: Data Presentation 1-19/41 Continuous Data Housing Prices and Incomes

Part 1: Data Presentation 1-20/41 Unordered Qualitative Data Travel Mode Between Sydney and Melbourne by 210 Travelers

Part 1: Data Presentation 1-21/41 Ordered Qualitative Data German Health Satisfaction Survey; 27,326 individuals. On a scale from 0 to 10, how do you feel about your health?

Part 1: Data Presentation 1-22/41 Bond Ratings Movie Ratings Ordered Qualitative Outcomes

Part 1: Data Presentation 1-23/41 A Problem with Ordered Survey Response Data: “Differential Item Functioning”
Stern Students’ Ranking of Subway Safety (1994)*
Table columns: Safety, Count, Percent, Cum Pct
Table rows: Very Unsatisfactory, Unsatisfactory, OK, Satisfactory, Very Satisfactory
Is there an objective meaning to “3” on some standard scale? Does everyone’s “1” or “2” or “3” … mean the same thing?
* Jeff Simonoff: Data Presentation and Summary, pp. 3-4
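
A frequency table with Count, Percent, and Cum Pct columns, like the one on this slide, can be built from raw ordered responses. The sketch below is only an illustration: the responses are made-up values (the Stern survey counts are not in the transcript), and the 1 to 5 coding and the function name `frequency_table` are assumptions.

```python
from collections import Counter

# Assumed coding of the ordered scale: 1 = Very Unsatisfactory ... 5 = Very Satisfactory.
LABELS = {
    1: "Very Unsatisfactory",
    2: "Unsatisfactory",
    3: "OK",
    4: "Satisfactory",
    5: "Very Satisfactory",
}

# Hypothetical responses, NOT the 1994 Stern survey data.
responses = [1, 2, 2, 3, 3, 3, 3, 4, 4, 5]


def frequency_table(values, labels):
    """Return (label, count, percent, cumulative percent) rows in scale order."""
    counts = Counter(values)
    total = len(values)
    rows = []
    running = 0
    for code in sorted(labels):
        count = counts.get(code, 0)
        running += count
        rows.append((labels[code], count,
                     100.0 * count / total, 100.0 * running / total))
    return rows


table = frequency_table(responses, LABELS)
print(f"{'Safety':<20}{'Count':>6}{'Percent':>9}{'Cum Pct':>9}")
for label, count, pct, cum in table:
    print(f"{label:<20}{count:>6}{pct:>9.1f}{cum:>9.1f}")
```

The cumulative column only makes sense because the categories are ordered. It says nothing about whether two respondents mean the same thing by "3", which is the slide's point.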

Part 1: Data Presentation 1-24/41 Quantitative vs. Qualitative Data
Qualitative Data: No units of measurement. Arithmetic manipulation is usually meaningless. The average of Air and Bus is not Train.
Quantitative Data: Units of measurement make sense. Arithmetic computations make sense.
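
One way to see why "the average of Air and Bus is not Train": any numeric codes attached to unordered categories are arbitrary, so an average of the codes changes when the labels are renumbered. The two codings below are hypothetical and chosen only for illustration. Counting, by contrast, does not depend on the coding.

```python
from collections import Counter

# Two equally arbitrary (hypothetical) numeric codings of the same travel modes.
coding_a = {"Air": 0, "Train": 1, "Bus": 2, "Car": 3}
coding_b = {"Train": 0, "Air": 1, "Bus": 2, "Car": 3}


def mean_code(choices, coding):
    """Average the numeric codes of a list of category labels."""
    return sum(coding[c] for c in choices) / len(choices)


choices = ["Air", "Bus"]
avg_a = mean_code(choices, coding_a)  # lands on the code that coding_a gives to Train
avg_b = mean_code(choices, coding_b)  # lands between two codes under coding_b

# Counting is meaningful for qualitative data regardless of the coding.
sample = ["Air", "Train", "Bus", "Car", "Car", "Air", "Car"]
counts = Counter(sample)

print(avg_a, avg_b)
print(counts.most_common())
```

Under the first coding the "average" happens to equal the code for Train; under the second it corresponds to no category at all. Neither number describes the travelers.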

Part 1: Data Presentation 1-25/41 Cross Section Data Housing Prices and Incomes

Part 1: Data Presentation 1-26/41 Time Series Data: Car Thefts

Part 1: Data Presentation 1-27/41 Representing Data
• In raw form
• Transformed to a visual form
• Summarized graphically
• Summarized statistically

Part 1: Data Presentation 1-28/41 Pie Chart vs. Frequency Table Pizza Pies Sold, by Type

Part 1: Data Presentation 1-29/41 Data Representation: Bar Chart vs. Pie Chart Same data. Which is easier to understand? BAR CHART PIE CHART

Part 1: Data Presentation 1-30/41 data. Source: Bloomberg

Part 1: Data Presentation 1-31/41 Football Baseball 2013 Valuation of U.S. Sports Teams What story do these figures reveal?

Part 1: Data Presentation 1-32/41 A Box Plot Describes the Distribution of Values in a Set of Data Hawaii Box and Whisker Plot for House Price Listings

Part 1: Data Presentation 1-33/41 Raw Data on Housing Prices and Incomes

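Part 1: Data Presentation 1-34/41 Making a Box Plot for Per Capita Income
Maximum = 31136
Median = 22610
Minimum = (value missing in transcript)
1st Quartile = (value missing in transcript)
3rd Quartile = (value missing in transcript)
Interquartile Range = IQR = 3rd Quartile – 1st Quartile = 3256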
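
The quantities a box plot needs (minimum, quartiles, median, maximum, IQR) can be computed as in the sketch below. The income figures are hypothetical, not the per capita income data from the slide. Quartile conventions also differ across textbooks and software; this uses the "inclusive" method of Python's `statistics.quantiles`, which is an assumption about the convention and not necessarily the one used in the lecture.

```python
import statistics

# Hypothetical per capita incomes, NOT the values behind the slide.
incomes = [18000, 20000, 21000, 22000, 23000, 24000, 25000, 27000, 31000]


def five_number_summary(values):
    """Return min, quartiles, median, max and IQR for a list of numbers."""
    data = sorted(values)
    # n=4 splits the data into quarters and returns the three cut points.
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    return {
        "min": data[0],
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": data[-1],
        "iqr": q3 - q1,  # interquartile range = 3rd quartile - 1st quartile
    }


summary = five_number_summary(incomes)
for name, value in summary.items():
    print(f"{name:>6}: {value}")
```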

Part 1: Data Presentation 1-35/41 Box and Whisker Plot
Box: 25th Percentile, Median, 75th Percentile; Interquartile range = IQR
Lower whisker: Larger of (Minimum, Median – 1.5 IQR)
Upper whisker: Smaller of (Maximum, Median + 1.5 IQR)
Outliers = extreme observations
HOG, pp
What is an outlier? Why do we believe a particular point is an outlier?
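
The whisker limits and the outlier flag can be sketched as follows. The `"median"` rule follows the slide's wording, reading the garbled upper limit as Median + 1.5 IQR (an assumption). The `"quartile"` rule is the widely used alternative that measures 1.5 IQR from the 1st and 3rd quartiles instead. The data, the function name, and the quartile method are all illustrative assumptions.

```python
import statistics


def whisker_limits(values, rule="median", k=1.5):
    """Return (lower whisker, upper whisker, outliers) for a list of numbers.

    rule="median":   fences at median -/+ k * IQR (the slide's wording).
    rule="quartile": fences at Q1 - k * IQR and Q3 + k * IQR (common alternative).
    """
    data = sorted(values)
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    if rule == "median":
        low_fence, high_fence = median - k * iqr, median + k * iqr
    elif rule == "quartile":
        low_fence, high_fence = q1 - k * iqr, q3 + k * iqr
    else:
        raise ValueError("rule must be 'median' or 'quartile'")
    lower = max(data[0], low_fence)    # larger of (minimum, lower fence)
    upper = min(data[-1], high_fence)  # smaller of (maximum, upper fence)
    outliers = [x for x in data if x < lower or x > upper]
    return lower, upper, outliers


# Hypothetical listing prices with one extreme value.
prices = [10, 20, 30, 40, 50, 60, 70, 80, 200]
lower_m, upper_m, outliers_m = whisker_limits(prices, rule="median")
lower_q, upper_q, outliers_q = whisker_limits(prices, rule="quartile")
print(lower_m, upper_m, outliers_m)
print(lower_q, upper_q, outliers_q)
```

Either rule is a convention for flagging points worth a second look. Whether a flagged point is really an error or just an extreme observation is a judgment about the data, not something the rule decides.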

Part 1: Data Presentation 1-36/41 Histogram Showing Counts

Part 1: Data Presentation 1-37/41 A Frequency Distribution for Grouped Data
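
A frequency distribution for grouped data counts how many observations fall into each interval; a histogram then draws one bar per interval. The sketch below uses hypothetical listing prices (in thousands of dollars) and assumed bin settings, and prints a crude text histogram.

```python
def grouped_frequencies(values, start, width, n_bins):
    """Count values in n_bins equal-width intervals [low, high) beginning at start."""
    counts = [0] * n_bins
    for x in values:
        index = int((x - start) // width)
        if not 0 <= index < n_bins:
            raise ValueError(f"{x} falls outside the binned range")
        counts[index] += 1
    return [((start + i * width, start + (i + 1) * width), c)
            for i, c in enumerate(counts)]


# Hypothetical house price listings, in thousands of dollars.
listings = [120, 150, 180, 210, 240, 260, 310, 450, 720]
distribution = grouped_frequencies(listings, start=100, width=200, n_bins=4)

for (low, high), count in distribution:
    print(f"{low:>4} to {high:<4} {count:>2} {'*' * count}")
```

With these made-up values most listings sit in the lowest interval and a single one sits far to the right, the kind of right-skewed shape the later slides describe for listing prices.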

Part 1: Data Presentation 1-38/41 Histogram for House Price Listings HOG, pp A histogram describes the sample data and suggests the nature of the underlying data generating process. Note the “skewness” of the distribution of listings.

Part 1: Data Presentation 1-39/41 Distribution of House Price Listings Asymmetry (skewness) in the histogram of listing prices… … shows up in the box and whisker plot. Note the long whisker at the top of the figure.

Part 1: Data Presentation 1-40/41 A Caution About Graphical Data Summaries Graphical tools can be very badly behaved when: (1) The data have only a few observations. (2) There are wild observations in the data set. The box and whisker plot is distorted (and dominated) by one wildly errant observation.
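
The effect of one wildly errant observation can be seen numerically as well as graphically. In the sketch below (hypothetical values, with an assumed quartile convention) a single bad point is appended to a small, well-behaved sample, and the summaries are compared before and after.

```python
import statistics


def summarize(values):
    """Return mean, median, range and IQR for a list of numbers."""
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "mean": statistics.mean(values),
        "median": median,
        "range": max(values) - min(values),
        "iqr": q3 - q1,
    }


clean = [21, 22, 23, 24, 25, 26, 27]  # hypothetical sample
contaminated = clean + [250]          # same sample plus one wild observation

before = summarize(clean)
after = summarize(contaminated)
for key in before:
    print(f"{key:>6}: {before[key]:>7} -> {after[key]:>7}")
```

The range and the mean move a great deal while the median and IQR barely change. On a plot, the axis must stretch to include the wild point, which squeezes the box into a sliver and lets one observation dominate the picture.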

Part 1: Data Presentation 1-41/41 Summary
• What story does the data presentation tell?
    Data in raw form tell no story.
    Visual representation of data tells something about the data.
    The representation of the data may reveal something about the underlying process that the data measure.
• What tool is most informative?
    Reduction to a small number of features
    Visual displays of data: pie chart, box and whisker plots, bar charts, histograms, time series plots
“There are lies, damned lies and statistics.” (Benjamin Disraeli)