Computational Campaign Coverage with PollyVote.com

Slides:



Advertisements
Similar presentations
Part II – TIME SERIES ANALYSIS C3 Exponential Smoothing Methods © Angel A. Juan & Carles Serrat - UPC 2007/2008.
Advertisements

Validation and Monitoring Measures of Accuracy Combining Forecasts Managing the Forecasting Process Monitoring & Control.
General Mining Issues a.j.m.m. (ton) weijters Overfitting Noise and Overfitting Quality of mined models (some figures are based on the ML-introduction.
International Marketing Research: Practices and Challenges
A Search-based Method for Forecasting Ad Impression in Contextual Advertising Defense.
EFFECTIVE PREDICTIVE MODELING- DATA,ANALYTICS AND PRACTICE MANAGEMENT Richard A. Derrig Ph.D. OPAL Consulting LLC Karthik Balakrishnan Ph.D. ISO Innovative.
Demand Forecasts The three principles of all forecasting techniques: –Forecasting is always wrong –Every forecast should include an estimate of error –The.
Chapter 2 Data Patterns and Choice of Forecasting Techniques
Methodology for Architectural Level Reliability Risk Analysis Lalitha Krothapalli CSC 532.
BIS Application Chapter two
Demand Management and Forecasting
29 August 2005MBA III (Research Methodology) Course Instructor: Dr. Aurangzeb Z. Khan1 RESEARCH METHODOLOGY (Business Research Methods) Week 5.
Fourteenth Lecture Hour 9:30 – 10:20 am, Sunday, September 16 Software Management Disciplines Project Control and Process Automation (from Part III, Chapter.
Operations Fall 2015 Bruce Duggan Providence University College.
Chapter 3: Software Project Management Metrics
Project Portfolio Management Business Priorities Presentation.
Paolo Valente - UNECE Statistical Division Slide 1 Technology for census data coding, editing and imputation Paolo Valente (UNECE) UNECE Workshop on Census.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 The information we gather with experiments and surveys is collectively called data Example:
1 BABS 502 Moving Averages, Decomposition and Exponential Smoothing Revised March 14, 2010.
1 Decision Making ADMI 6510 Forecasting Models Key Sources: Data Analysis and Decision Making (Albrigth, Winston and Zappe) An Introduction to Management.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 1 Statistics: The Art and Science of Learning from Data Section 1.1 Using Data to Answer.
1 A latent information function to extend domain attributes to improve the accuracy of small-data-set forecasting Reporter : Zhao-Wei Luo Che-Jung Chang,Der-Chiang.
Demand Management and Forecasting Chapter 11 Portions Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Forecasting. Model with indicator variables The choice of a forecasting technique depends on the components identified in the time series. The techniques.
Chapter 11 – With Woodruff Modications Demand Management and Forecasting Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
G-20 Conference on Commodity Price Volatility Istanbul, September 13, 2011 Commodity Price Booms, the Global Economy, and Low-income Countries Thomas Helbling.
Chapter 9 Forecasting Copyright 2015 Health Administration Press.
Demand Forecasting Production and Operations Management Judit Uzonyi-Kecskés Ph.D. Student Department of Management and Corporate Economics Budapest University.
Welcome to MM305 Unit 5 Seminar Dr. Bob Forecasting.
BUS 173: Lecture 10 Forecasting for Businesses. Outline  What is a forecast?  Why do we need forecasting?  What are the common tools of forecasting?
Predictive Analytics Market to Global Analysis and Forecasts by Applications, Business Function, Deployment Model No of Pages: 150 Publishing Date:
Predictive Analytics Market to Global Analysis and Forecasts by Applications, Business Function, Deployment Model No of Pages: 150 Publishing Date:
COMPARING POLLS AND MARKETS
Prediction Games Players compete by making predictions about upcoming event/observation in the real world Predictions are scored after event At TAMU, we.
SAP Trade Repository Reporting by Virtusa
UNDP NIGERIA Democratic Governance for Development Project (DGD II)
Systems Analysis and Design in a Changing World, Fifth Edition
Operations Management Contemporary Concepts and Cases
Investment Appraisal - Is it worth it?
Tutorial 3 Working with Formulas and Functions
Supply Chain Management for Non Supply Chain Management Professionals
Demand Estimation and Forecasting
University of Southampton
the 2016 Presidential Campaign: Where We’ve Been, Where We’re GOING
Healix Risk Rating A Verisk Analytics Business
Establishing and Managing a Schedule Baseline
Mechanical Engineering Haldia Institute of Technology
Forecasting Chapter 11.
Understanding Results
AI emerging trend in QA Sanjeev Kumar Jha, Senior Consultant
Part C State Performance Plan/Annual Performance Report:
Type and impact of HTA-related scientific advice given during development The variability in HTA organisations and methodologies that are utilised in HTA.
Chapter 2: Working with Formulas and Functions
Predicting Frost Using Artificial Neural Network
The information Content of IPO Prospectuses
A Full Spectrum Approach to Election Polling and Forecasting
Subscript and Summation Notation
Part III Exchange Rate Risk Management
DATA ANALYSIS: STATISTICS AND GRAPHING
GEE’S Writing RULES.
Chapter 8 Software Evolution.
Model Combination.
CHANGE IS INEVITABLE, PROGRESS IS A CHOICE
Methodology for Architectural Level Reliability Risk Analysis
BEC 30325: MANAGERIAL ECONOMICS
Spreadsheet software.
Building Topic/Trend Detection System based on Slow Intelligence
Demand Management and Forecasting
A handbook on validation methodology. Metrics.
Presentation transcript:

Computational Campaign Coverage with PollyVote.com Andreas Graefe graefe.andreas@gmail.com Tow Center September 13, 2016

PollyVote History The PollyVote started in 2004 as a forecasting research project and was applied Prospectively to U.S. presidential elections in 2004, 2008, 2012, the 2013 German federal election Retrospectively to U.S. presidential elections in 1992, 1996, and 2000.

PollyVote Goals Demonstrate the power of combining forecasts for improving accuracy Assess the relative accuracy of different methods over time Develop models that aid decision-making Validate (new) methods for forecasting elections Transparency and accessibility Distribute PollyVote to other countries Provide automated campaign coverage (texts and visualizations) Make the project sustainable

How does the PollyVote work? Mechanical combination of forecasts Within different components of methods by calculating unweighted averages. Indicate that this is an example of 2 methods. Also show the averages for each/

How does the PollyVote work? Mechanical combination of forecasts Within Across different components of methods by calculating unweighted averages. Combined forecasts are accurate because They use more information Errors of individual forecasts cancel out Indicate that this is an example of 2 methods. Also show the averages for each/

Relative accuracy of forecasting methods On average across the last 100 days prior to each of the past 6 elections… Polls performed worst (MAE: 2.8 percentage points) Expectations (citizen forecasts) performed best (MAE: 1.2 percentage points) Performance of methods varies (within and) across elections Mean absolute error 1992 1996 2000 2004 2008 2012 Average Polls 4.8 4.7 2.7 1.5 1.7 1.3 2.8 Prediction markets 0.8 1.2 0.6 Econometric models 3.6 3.9 3.2 1.1 2.4 Expert judgment 1.4 Index models 0.4 2.3 Citizen forecasts 0.7 1.8 COLORS AGAIN

Power of combining forecasts On average across the last 100 days prior to each of the past 6 elections the PollyVote missed the final election outcome on average by 1 percentage point. was more accurate than each of the (combined) component forecasts. Also, since 2004, the PollyVote always predicted the correct winner, at any given day. Mean absolute error 1992 1996 2000 2004 2008 2012 Average Polls 4.8 4.7 2.7 1.5 1.7 1.3 2.8 Prediction market 0.8 1.2 0.6 Econometric models 3.6 3.9 3.2 1.1 2.4 Expert judgment 1.4 Index models 0.4 2.3 Citizen forecasts 0.7 1.8 PollyVote 0.9 1.0

PollyVote is a resource for election observers Provides accurate forecasts of the election outcome

PollyVote is a resource for election observers Provides accurate forecasts of the election outcome Allows for Comparing methods/models Discovering “interesting” stories

Our project Goals: Make our content more accessible by Visualizing data Providing first drafts of a story  Develop automated news Based on data from our component methods / models In English and German language

Automated („Robot“) Journalism Process of using software or algorithms to automatically generate news stories without human intervention …after the initial programming of the algorithm

AJ in newsrooms Leading media outlets have started to experiment with automated news. Early expansion phase Few news organizations Limited topics (sports, finance, crime, weather) No prior work for political news (but see WaPo‘s Heliograf)

Status quo The good news Completely automated process, from collecting the forecasting data to publishing texts in both languages More than 13,000 texts published since April But… Many texts contain errors Some texts are missing, others are published many times This is due to errors In the raw data With the data export / import In the algorithm

Key learnings to date (I) Data management Clean and accurate data are most crucial But difficult to maintain

Key learnings to date (II) Article quality Simple texts are easy to automate That is, texts based on a single row in the data Example: Description of poll results Tradeoff: adding complexity (i.e., insights) vs. risk of introducing new errors Additions to texts had to be delayed due to constant error fixing Source of error can be difficult to detect

Key learnings to date (III) Algorithm development Large onboarding effort to educate the NLG provider about our project NLG software difficult to use (but constantly improving) Very challenging to set the rules for the algorithm, for example, How to refer to the margin in polls? When does a candidate have a momentum? When is there a trend in the data?  Boundaries of automation?