Big Data Big Deal or Big Distraction? Agenda: 1.What is Big Data? 2.Why YOU Should Care About (Big) Data? 3.A Brief Introduction to Big Data Econometrics.

Slides:



Advertisements
Similar presentations
ENT4310 Business Economics and Marketing A six-step model for marketing research Arild Aspelund.
Advertisements

Introduction to Multimedia Adeyemi Adeniyi Bsc, MCP MCTS
GRA 5917 Public Opinion and Input Politics. Lecture September 16h 2010 Lars C. Monkerud, Department of Public Governance, BI Norwegian School of Management.
Authority 2. HW 8: AGAIN HW 8 I wanted to bring up a couple of issues from grading HW 8. Even people who got problem #1 exactly right didn’t think about.
Digital Marketing Applications MBA-IV Sem Unit-2 Salil Bhatia.
Evaluating IBP Strategies Professor Close. What is the best marketing? The one that works. The one where your target market and consumers get the message.
LECTURE 9 : EMPRICIAL EVIDENCE : CAPM AND APT
1 The Effect of Benefits on Single Motherhood in Europe Libertad González Universitat Pompeu Fabra May 2006.
TOOLS OF POSITIVE ANALYSIS
Research Methodologies in Allied Health SAHP 418/518 Research Planning Sandra Gunselman, Ph.D.
AGEC 622 Mission is prepare you for a job in business Have you ever made a price forecast? How much confidence did you place on your forecast? Was it correct?
Prelude of Machine Learning 202 Statistical Data Analysis in the Computer Age (1991) Bradely Efron and Robert Tibshirani.
Part 3: Regression and Correlation 3-1/41 Regression Models Professor William Greene Stern School of Business IOMS Department Department of Economics.
WHAT IS SCIENCE WORD WALL PART 2. REPETITION Making multiple sets of measurements or observations in a scientific investigation. Running through the experiment.
LSS Black Belt Training Forecasting. Forecasting Models Forecasting Techniques Qualitative Models Delphi Method Jury of Executive Opinion Sales Force.
GCSE Data Handling Coursework 1 Examining the Data examine carefully the data you are given it’s important to get a feel for the raw data before you use.
B. RAMAMURTHY EAP#2: Data Mining, Statistical Analysis and Predictive Analytics for Automotive Domain CSE651C, B. Ramamurthy 1 6/28/2014.
Correlation and Correlational Research Slides Prepared by Alison L. O’Malley Passer Chapter 5.
1 Chapter No 3 ICT IN Science,Maths,Modeling, Simulation.
1 10. Causality and Correlation ECON 251 Research Methods.
Lecture 2: Key Concepts of Econometrics Prepared by South Asian Network on Economic Modeling Reference Introductory Econometrics: Jeffrey M Wooldridge.
Longitudinal Data: An introduction to some conceptual issues Vernon Gayle.
Lecturer’s desk INTEGRATED LEARNING CENTER ILC 120 Screen Row A Row B Row C Row D Row E Row F Row G Row.
Lecture 12 Model Building BMTRY 701 Biostatistical Methods II.
10/15/11 New Media and The Rise of the Crowd Lanre Aina Business Analyst, Google.
Statistics and Quantitative Analysis U4320 Segment 8 Prof. Sharyn O’Halloran.
Chapter 1: Psychology, Research, and You Pages 2 – 21.
LECTURE DISCUSSION TODAY (9/17/02) LECTURE: RESEARCH METHODS IN SOCIAL PSYCHOLOGY DISCUSSION (time permitting) HUMANS VS. OTHER ANIMALS.
Time Series Analysis and Forecasting
CORRELATIONS: TESTING RELATIONSHIPS BETWEEN TWO METRIC VARIABLES Lecture 18:
Next Colin Clarke-Hill and Ismo Kuhanen 1 Analysing Quantitative Data 1 Forming the Hypothesis Inferential Methods - an overview Research Methods Analysing.
Ch. 2 Tools of Positive Economics. Theoretical Tools of Public Finance theoretical tools The set of tools designed to understand the mechanics behind.
STAT 3610/5610 – Time Series Analysis Topics to be covered Chapters 1 – 8 Chapter 9 – Skip Chapters 10 – 12.
1 REGRESSION ANALYSIS WITH PANEL DATA: INTRODUCTION A panel data set, or longitudinal data set, is one where there are repeated observations on the same.
Econ 488 Lecture 2 Cameron Kaplan. Hypothesis Testing Suppose you want to test whether the average person receives a B or higher (3.0) in econometrics.
Scientific Method. My 4 out of 5 Rule If you make an observation.
C M Clarke-Hill1 Analysing Quantitative Data Forming the Hypothesis Inferential Methods - an overview Research Methods.
Christine Pal Chee October 9, 2013 Research Design.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Chapter 10 Finding Relationships Among Variables: Non-Experimental Research.
 What is an association between variables?  Explanatory and response variables  Key characteristics of a data set 1.
1 Econometrics (NA1031) Lecture 1 Introduction. 2 ”How much” type questions oBy how much a unit change in income affects consumption? oBy how much should.
Business Forecasting 1 Dr. Mohammed Alahmed (011) Dr. Mohammed Alahmed.
Marketing Research Process and Types of Marketing Research.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 March 6, 2013.
1 Empirical methods: endogeneity, instrumental variables and panel data Advanced Corporate Finance Semester
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
DEPARTMENT OF MECHANICAL ENGINEERING VII-SEMESTER PRODUCTION TECHNOLOGY-II 1 CHAPTER NO.4 FORECASTING.
Economics 20 - Prof. Anderson1 Time Series Data y t =  0 +  1 x t  k x tk + u t 1. Basic Analysis.
Lecture 1 Introduction to econometrics
Annual Report Template for Nonprofits By Jackie LalleyJackie Lalley Jackie Lalley is a Managing Partner at Yodelpop. Prior to co-founding Yodelpop, Jackie.
AP Statistics. Issues Interpreting Correlation and Regression  Limitations for r, r 2, and LSRL :  Can only be used to describe linear relationships.
Yandell - Econ 216 Chap 1-1 Chapter 1 Introduction and Data Collection.
Chapter Two: Tools of Environmental Science
NATURAL WORLD. OBSERVATION INFERENCE. HYPOTHESIS.
Ch. 2 Tools of Positive Economics
Linear Regression.
William Greene Stern School of Business New York University
Research, Methodology, and Ethics
A Logit model of brand choice calibrated on scanner data
Introduction to microeconometrics
Dr. Morgan C. Wang Department of Statistics
Correlations: testing linear relationships between two metric variables Lecture 18:
Entrepreneurship, Strategy and Information Systems
Research Design Shamindra Nath Sanyal 12/4/2018 SNS.
Lecture 12 Model Building
DSS-ESTIMATING COSTS Cost estimation is the process of estimating the relationship between costs and cost driver activities. We estimate costs for three.
How do mortality rates in the U.S. compare to other countries?
Positive analysis in public finance
How do mortality rates in the U.S. compare to other countries?
Presentation transcript:

Big Data Big Deal or Big Distraction? Agenda: 1.What is Big Data? 2.Why YOU Should Care About (Big) Data? 3.A Brief Introduction to Big Data Econometrics

Internet of People

Today computers… are almost wholly dependent on human beings for information -- by typing, pressing a record button, taking a digital picture or scanning a bar code... The problem is, people have limited time, attention and accuracy—all of which means they are not very good at capturing data about things in the real world… Kevin Ashton, 'That 'Internet of Things' Thing', RFID Journal, July 22, 2009RFID Journal The Problem With People

Internet of Things

What do people want? But remember Jobs… Where are the people we want? Customization, add placement What will sales be? Predicting the future… Are my ads working? : The ATTRIBUTION problem

+ “…the Publicis CEO noted that "the communication and marketing landscape has undergone dramatic changes in recent years, including the exponential development of new media giants, the explosion of Big Data, blurring of the roles of all players and profound changes in consumer behavior.“ WSJ 7/28/13 “…a $35.1 billion cross-border linkup that shows how Big Data is making Madison Avenue look more like Wall Street.” WSJ 7/28/13

Creative vs Analytical

A Brief Introduction to Big Data Econometrics A.What can we do with data? B.Correlation vs. Causation C. Types of data i. Cross section ii. Time series iii. Panel D. Fit, overfit, validation E. Tools of the trade i. Regression, logit, probit ii. Trees & Forests iii. Baysean simulation

1.Prediction 2.Summarization 3.Estimation 4.Hypothesis Testing

Is Marriage Good for Your Health? Tara Parker-Pope, 4/14/10 Contemporary studies, for instance, have shown that married people are less likely to get pneumonia, have surgery, develop cancer or have heart attacks. A group of Swedish researchers has found that being married or cohabiting at midlife is associated with a lower risk for dementia. A study of two dozen causes of death in the Netherlands found that in virtually every category, ranging from violent deaths like homicide and car accidents to certain forms of cancer, the unmarried were at far higher risk than the married. Correlation vs. Causation

What can get in the way of determining CAUSATION? ENDOGENEITY 1. Reverse causality (also selection bias): healthier people are more likely to get married 2. Unobservable characteristics such as time preference, aptitude, genetics

Counterfactual 1. What would happen if we change the “cause”? 2. Is there a plausible alternative explanation? What would sales have been if the ad did not run? What would people do if they did not use Google? What would people buy if the weather was warmer?

Cross-section Data: Lots of observations at one point in time.

Time Series Data: One observation over time.

Panel Data: Multiple observations of the same thing over time.

Fit, Overfit, Validation and Out of Sample Prediction

Linear RegressionLogit/Probit Regression Book: On-line lectures:

Trees and Forests

“Uninformative” Prior Probability Gather Data Conditional probability of observing data “Updated” Probability Bayesian Statistics With BIG DATA we can repeat this process over and over again with multiple models to get better predictions!

Corsea Machine Learning by Andrew Ng An Introduction to Statistical Learning Book: bcf.usc.edu/~gareth/ISL/ bcf.usc.edu/~gareth/ISL/ Lecture videos & problem sets:

Feeling a bit overwhelmed?

“…it’s no wonder that the latest fad in the business world is Big Data … Big data can be an extraordinary tool, helping to gather new information about our behavior and preferences. What it can’t explain is why we do what we do.” WSJ 3/22/14