Presentation is loading. Please wait.

Presentation is loading. Please wait.

Big Data Big Deal or Big Distraction? Agenda: 1.What is Big Data? 2.Why YOU Should Care About (Big) Data? 3.A Brief Introduction to Big Data Econometrics.

Similar presentations


Presentation on theme: "Big Data Big Deal or Big Distraction? Agenda: 1.What is Big Data? 2.Why YOU Should Care About (Big) Data? 3.A Brief Introduction to Big Data Econometrics."— Presentation transcript:

1 Big Data Big Deal or Big Distraction? Agenda: 1.What is Big Data? 2.Why YOU Should Care About (Big) Data? 3.A Brief Introduction to Big Data Econometrics

2 Internet of People

3

4 Today computers… are almost wholly dependent on human beings for information -- by typing, pressing a record button, taking a digital picture or scanning a bar code... The problem is, people have limited time, attention and accuracy—all of which means they are not very good at capturing data about things in the real world… Kevin Ashton, 'That 'Internet of Things' Thing', RFID Journal, July 22, 2009RFID Journal The Problem With People

5 Internet of Things

6 What do people want? But remember Jobs… Where are the people we want? Customization, add placement What will sales be? Predicting the future… Are my ads working? : The ATTRIBUTION problem

7

8 + “…the Publicis CEO noted that "the communication and marketing landscape has undergone dramatic changes in recent years, including the exponential development of new media giants, the explosion of Big Data, blurring of the roles of all players and profound changes in consumer behavior.“ WSJ 7/28/13 “…a $35.1 billion cross-border linkup that shows how Big Data is making Madison Avenue look more like Wall Street.” WSJ 7/28/13

9 Creative vs Analytical

10 A Brief Introduction to Big Data Econometrics A.What can we do with data? B.Correlation vs. Causation C. Types of data i. Cross section ii. Time series iii. Panel D. Fit, overfit, validation E. Tools of the trade i. Regression, logit, probit ii. Trees & Forests iii. Baysean simulation

11 1.Prediction 2.Summarization 3.Estimation 4.Hypothesis Testing

12 Is Marriage Good for Your Health? Tara Parker-Pope, 4/14/10 Contemporary studies, for instance, have shown that married people are less likely to get pneumonia, have surgery, develop cancer or have heart attacks. A group of Swedish researchers has found that being married or cohabiting at midlife is associated with a lower risk for dementia. A study of two dozen causes of death in the Netherlands found that in virtually every category, ranging from violent deaths like homicide and car accidents to certain forms of cancer, the unmarried were at far higher risk than the married. Correlation vs. Causation

13 What can get in the way of determining CAUSATION? ENDOGENEITY 1. Reverse causality (also selection bias): healthier people are more likely to get married 2. Unobservable characteristics such as time preference, aptitude, genetics

14 Counterfactual 1. What would happen if we change the “cause”? 2. Is there a plausible alternative explanation? What would sales have been if the ad did not run? What would people do if they did not use Google? What would people buy if the weather was warmer?

15 Cross-section Data: Lots of observations at one point in time.

16 Time Series Data: One observation over time.

17 Panel Data: Multiple observations of the same thing over time.

18

19 Fit, Overfit, Validation and Out of Sample Prediction

20

21 Linear RegressionLogit/Probit Regression Book: http://elsa.berkeley.edu/books/choice2.html On-line lectures: http://www.nera.com/7440.htm

22 Trees and Forests

23 “Uninformative” Prior Probability Gather Data Conditional probability of observing data “Updated” Probability Bayesian Statistics With BIG DATA we can repeat this process over and over again with multiple models to get better predictions!

24 Corsea Machine Learning by Andrew Ng https://www.coursera.org/course/ml An Introduction to Statistical Learning Book: http://www- bcf.usc.edu/~gareth/ISL/http://www- bcf.usc.edu/~gareth/ISL/ Lecture videos & problem sets: http://www.alsharif.info/#!iom530/c21o7 http://www.alsharif.info/#!iom530/c21o7

25 Feeling a bit overwhelmed?

26 “…it’s no wonder that the latest fad in the business world is Big Data … Big data can be an extraordinary tool, helping to gather new information about our behavior and preferences. What it can’t explain is why we do what we do.” WSJ 3/22/14


Download ppt "Big Data Big Deal or Big Distraction? Agenda: 1.What is Big Data? 2.Why YOU Should Care About (Big) Data? 3.A Brief Introduction to Big Data Econometrics."

Similar presentations


Ads by Google