The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou.

Slides:



Advertisements
Similar presentations
Chapter 10: Designing Databases
Advertisements

University of Washington Database Group The Complexity of Causality and Responsibility for Query Answers and non-Answers Alexandra Meliou, Wolfgang Gatterbauer,
SCIP Optimization Suite
DISCOVER: Keyword Search in Relational Databases Vagelis Hristidis University of California, San Diego Yannis Papakonstantinou University of California,
6 - 1 Lecture 4 Analysis Using Spreadsheets. Five Categories of Spreadsheet Analysis Base-case analysis What-if analysis Breakeven analysis Optimization.
1Key – Report Creation with DB2. DB2 Databases Create Domain for DB2 Test Demo.
University of Washington Database Group Tiresias The Database Oracle for How-To Queries Alexandra Meliou § ✜ Dan Suciu ✜ § University of Massachusetts.
Data Model driven applications using CASE Data Models as the nucleus of software development in a Computer Aided Software Engineering environment.
Modelling the Aerospace Aftermarket with Multi-agents Systems Ken Woghiren Technical Director - Lost Wax.
CoPhy: A Scalable, Portable, and Interactive Index Advisor for Large Workloads Debabrata Dash, Anastasia Ailamaki, Neoklis Polyzotis 1.
1 9. Evaluation of Queries Query evaluation – Quantifier Elimination and Satisfiability Example: Logical Level: r   y 1,…y n  r’ Constraint.
University of Washington Database Group Reverse Data Management … and the case for Reverse What-If queries 1 Alexandra Meliou, Wolfgang Gatterbauer, Dan.
Statistics 350 Lecture 16. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Data to Knowledge to Results Review and Analysis of Paper by Davenport et al Team: Something Different Myron Burr Kevin McComas Easwar Srinivasan Bill.
Supply Chain Design Problem Tuukka Puranen Postgraduate Seminar in Information Technology Wednesday, March 26, 2009.
3-1 Chapter 3 Data and Knowledge Management
George Papastefanatos 1, Panos Vassiliadis 2, Alkis Simitsis 3,Yannis Vassiliou 1 (1) National Technical University of Athens
SmartSQL AlfaTech Software Solutions Application Requirements Document  Radi Bekker  Vladimir Goldman  Marina Shaevich  Alexander Shapiro Team Members:
RBNetERP or Enterprise Resource Planning is a software that allows companies to integrate all their operations and resources and manage them through one.
Analysis Categories Base-case analysis What-if (sensitivity) analysis Breakeven Analysis Optimization Analysis Risk Analysis.
Explaining The Data Dan Suciu Or: is there anything left to do except helping out the 3 big DB vendors, or improving google’s click-through rate?
Software Reengineering 2003 년 12 월 2 일 최창익, 고광 원.
Objectives of the Lecture :
University of Toronto Department of Computer Science © 2001, Steve Easterbrook CSC444 Lec22 1 Lecture 22: Software Measurement Basics of software measurement.
More on Data Mining KDnuggets Datanami ACM SIGKDD
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
Introduction: The essential background
Database Design - Lecture 1
DBS201: DBA/DBMS Lecture 13.
©Silberschatz, Korth and Sudarshan5.1Database System Concepts Chapter 5: Other Relational Languages Query-by-Example (QBE) Datalog.
Context Tailoring the DBMS –To support particular applications Beyond alphanumerical data Beyond retrieve + process –To support particular hardware New.
Using Visual Basic 6.0 to Create Web-Based Database Applications
LAB CVP 2009 ‘Leveraging the LIMS Investment’. Invested in a Laboratory Information Management System (LIMS) Solution is limited to Storing and Reporting.
General Database Statistics Using Maximum Entropy Raghav Kaushik 1, Christopher Ré 2, and Dan Suciu 3 1 Microsoft Research 2 University of Wisconsin--Madison.
CHAPTER EIGHT Accessing Data Processing Databases.
Types of IP Models All-integer linear programs Mixed integer linear programs (MILP) Binary integer linear programs, mixed or all integer: some or all of.
CHAPTER EIGHT Accessing Data Processing Databases.
Query Processing. Steps in Query Processing Validate and translate the query –Good syntax. –All referenced relations exist. –Translate the SQL to relational.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
A COURSE ON PROBABILISTIC DATABASES Dan Suciu University of Washington June, 2014Probabilistic Databases - Dan Suciu 1.
Chapter 9 Query-by-Example Pearson Education © 2009.
© ETH Zürich Eric Lo ETH Zurich a joint work with Carsten Binnig (U of Heidelberg), Donald Kossmann (ETH Zurich), Tamer Ozsu (U of Waterloo) and Peter.
McGraw-Hill/Irwin © 2008 The McGraw-Hill Companies, All Rights Reserved Chapter 7 Storing Organizational Information - Databases.
CHAPTER 3 DATABASES AND DATA WAREHOUSES. 2 OPENING CASE STUDY Chrysler Spins a Competitive Advantage with Supply Chain Management Software Chapter 2 –
Goal Programming Linear program has multiple objectives, often conflicting in nature Target values or goals can be set for each objective identified Not.
University of Toronto Department of Computer Science Lifting Transformations to Product Lines Rick Salay, Michalis Famelis, Julia Rubin, Alessio Di Sandro,
View 1. Lu Chaojun, SJTU 2 View Three-level vision of DB users Virtual DB views DB Designer Logical DB relations DBA DBA Physical DB stored info.
A Framework for Testing Database Application Author: David Chays, Saikat Dan, Filippos I. Vokolos, Elaine J. Weyuker Presenter: Liping Liu.
CS4432: Database Systems II Query Processing- Part 2.
I2 U Intelligent Supply Chain Management Course Module Seven: Inventory Planning.
Robust Estimation With Sampling and Approximate Pre-Aggregation Author: Christopher Jermaine Presented by: Bill Eberle.
IT Applications for Decision Making. Operations Research Initiated in England during the world war II Make scientifically based decisions regarding the.
Types of Information Systems Basic Computer Concepts Types of Information Systems  Knowledge-based system  uses knowledge-based techniques that supports.
03/02/20061 Evaluating Top-k Queries Over Web-Accessible Databases Amelie Marian Nicolas Bruno Luis Gravano Presented By: Archana and Muhammed.
SQL Introduction to database and SQL. Chapter 1: Databases and Database Users 6 Introduction to Databases Databases touch all aspects of our lives. Examples:
Default-all is dangerous! Wolfgang Gatterbauer Alexandra Meliou Dan Suciu Database group University of Washington.
Chapter 13: Query Processing
Chapter 3 The Relational Model. Why Study the Relational Model? Most widely used model. Vendors: IBM, Informix, Microsoft, Oracle, Sybase, etc. “Legacy.
3-1 Chapter 3 Data and Knowledge Management
01-Business intelligence
Presentation by Team 2.
Lecture 16: Probabilistic Databases
Lecture 12: Data Wrangling
Probabilistic Databases
Updating Databases With Open SQL
Query Processing.
Implementing Data Models & Reports with Microsoft SQL Server
Updating Databases With Open SQL
Presentation transcript:

The Power of How-to Queries joint work with Dan Suciu (University of Washington) Alexandra Meliou

Hypothetical (What-if) Queries Brokerage company DB Key Performance Indicators (KPI) Example from [Balmin et al. VLDB’00]: “An analyst of a brokerage company wants to know what would be the effect on the return of customers’ portfolios if during the last 3 years they had suggested Intel stocks instead of Motorola.” change something in the source (hypothesis) observe the effect in the target forward

How-To Queries Brokerage company DB Key Performance Indicators (KPI) Modified example: “An analyst wants to ask how to achieve a 10% return in customer portfolios, with the least number of changes.” find changes to the source that achieve the desired effect declare a desired effect in the target reverse

TPC-H example  A manufacturing company keeps records of inventory orders in a LineItem table.  KPI: Cannot order more than 8% of the inventory from any single country  Can reassign orders to new suppliers as long as the supplier can supply the part  Minimize the number of changes (constraints) (variables) (optimization objective) constraint optimization

extract data Constraint Optimization on Big Data DB construct optimization model this is for a set of 10 lineitems and 40 suppliers Mixed Integer Programming (MIP) solver transform into data updates MathProg Impractical!

Demo: Tiresias a tool that makes how-to queries practical

Tiresias: How-To Query Engine DBMS MIP solver Tiresias TiQL (Tiresias Query Language) Declarative interface, extension to Datalog

Overview MathProg or AMPL TiQL Visualizations

MathProg or AMPL TiQL Visualizations Overview Demo

MathProg or AMPL TiQL Visualizations Overview Language semantics Evaluation of a TiQL program: Translation from TiQL to linear constraints Performance optimizations

Optimizing Performance  Model optimizer  eliminates variables, constraints, and parameters  uses key constraints, functional dependencies, and provenance  Partitioning optimizer Significantly faster than letting the MIP solver do it

Evaluation of the Model Optimizer baseline with optimization

Evaluation of Tiresias Partitioning 10k tuples 1M tuples granularity of partitioning complex dependency on the granularity of partitioning

 Next steps  Non-partitionable problems and approximations  Soft constraints, diversification of results  Interactive visualizations, feedback-based problem generation  Applications  Business intelligence, strategy planning  View updates  Data cleaning  Take-aways  Databases should support how-to queries  Data-driven optimizations could benefit the performance of external tools