Hippo a System for Computing Consistent Query Answers to a Class of SQL Queries Jan Chomicki University at Buffalo Jerzy Marcinkowski Wroclaw University.

Slides:



Advertisements
Similar presentations
Uncertainty in Data Integration Ai Jing
Advertisements

Information Integration Using Logical Views Jeffrey D. Ullman.
CS4432: Database Systems II
Entangled Queries :Enabling Declarative Data Driven Coordination Johannes Gehrke Department of Computer Science, Cornell University With Gabriel Bender,
Database Management Systems, R. Ramakrishnan and J. Gehrke1 The Relational Model Chapter 3.
Efficient Management of Inconsistent and Uncertain Data Renée J. Miller University of Toronto.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 Relational Algebra Chapter 4, Part A Modified by Donghui Zhang.
INFS614, Fall 08 1 Relational Algebra Lecture 4. INFS614, Fall 08 2 Relational Query Languages v Query languages: Allow manipulation and retrieval of.
Database Theory Why use database? Data is a valuable corporate resource which needs adequate accuracy, consistency and security controls. The centralized.
How to (repeatedly) change preferences * Jan Chomicki University at Buffalo * FOIKS’06, AMAI.
Database Systems: A Practical Approach to Design, Implementation and Management International Computer Science S. Carolyn Begg, Thomas Connolly Lecture.
Physical Database Monitoring and Tuning the Operational System.
1 Relational Model. 2 Relational Database: Definitions  Relational database: a set of relations  Relation: made up of 2 parts: – Instance : a table,
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Relational Algebra Chapter 4, Part A.
1 Query Planning with Limited Source Capabilities Chen Li Stanford University Edward Y. Chang University of California, Santa Barbara.
Rutgers University Relational Algebra 198:541 Rutgers University.
Relational Algebra Chapter 4 - part I. 2 Relational Query Languages  Query languages: Allow manipulation and retrieval of data from a database.  Relational.
CSCD343- Introduction to databases- A. Vaisman1 Relational Algebra.
Chapter 17 Methodology – Physical Database Design for Relational Databases Transparencies © Pearson Education Limited 1995, 2005.
Chapter 4 Relational Databases Copyright © 2012 Pearson Education 4-1.
Relational Algebra, R. Ramakrishnan and J. Gehrke (with additions by Ch. Eick) 1 Relational Algebra.
The Relational Model. Review Why use a DBMS? OS provides RAM and disk.
Lecture 9 Methodology – Physical Database Design for Relational Databases.
Chapter 16 Methodology – Physical Database Design for Relational Databases.
Lecture 05 Structured Query Language. 2 Father of Relational Model Edgar F. Codd ( ) PhD from U. of Michigan, Ann Arbor Received Turing Award.
A Survey Based Seminar: Data Cleaning & Uncertain Data Management Speaker: Shawn Yang Supervisor: Dr. Reynold Cheng Prof. David Cheung
Database Management Systems Introduction. In the Beginning… Customer Program 1.
Object Oriented Analysis and Design 1 Chapter 7 Database Design  UML Specification for Data Modeling  The Relational Data Model and Object Model  Persistence.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
FALL 2004CENG 351 File Structures and Data Management1 Relational Model Chapter 3.
MIS 673: Database Analysis and Design u Objectives: u Know how to analyze an environment and draw its semantic data model u Understand data analysis and.
Lecture2: Database Environment Prepared by L. Nouf Almujally 1 Ref. Chapter2 Lecture2.
Database Management System (DBMS) an Introduction DeSiaMore 1.
Database Management Systems, R. Ramakrishnan and J. Gehrke1 Relational Algebra.
Database Management Systems 1 Raghu Ramakrishnan Relational Algebra Chpt 4 Xin Zhang.
ICS 321 Fall 2011 The Relational Model of Data (i) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 8/29/20111Lipyeow.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
1 Relational Algebra Chapter 4, Sections 4.1 – 4.2.
Methodology – Physical Database Design for Relational Databases.
Database Management Systems 1 Raghu Ramakrishnan Relational Algebra Chpt 4 Xin Zhang.
Indexes and Views Unit 7.
CSCD34-Data Management Systems - A. Vaisman1 Relational Algebra.
IST 210 The Relational Language Todd S. Bacastow January 2004.
Database Management Systems, R. Ramakrishnan1 Relational Algebra Module 3, Lecture 1.
The Relational Database Model & The Database Development Process
QUERY PROCESSING RELATIONAL DATABASE KUSUMA AYU LAKSITOWENING
ConQuer: Efficient Management of Inconsistent Databases Presented by: Presented by: Ariel Fuxman (Univ. of Toronto) Ariel Fuxman (Univ. of Toronto) Joint.
Chapter 3: Relational Databases
Scheduling of Transactions on XML Documents Author: Stijin Dekeyser Jan Hidders Reviewed by Jason Chen, Glenn, Steven, Christian.
IIS 645 Database Management Systems DDr. Khorsheed Today’s Topics 1. Course Overview 22. Introduction to Database management 33. Components of Database.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 The Relational Model Chapter 3.
Introduction: Databases and Database Systems Lecture # 1 June 19,2012 National University of Computer and Emerging Sciences.
1 CS122A: Introduction to Data Management Lecture #4 (E-R  Relational Translation) Instructor: Chen Li.
Databases and DBMSs Todd S. Bacastow January
Module 11: File Structure
CS4432: Database Systems II
Methodology – Physical Database Design for Relational Databases
Computing Full Disjunctions
Relational Algebra Chapter 4, Part A
CIS 336 str Competitive Success/snaptutorial.com
CIS 336 str Education for Service- -snaptutorial.com.
Relational Algebra 461 The slides for this text are organized into chapters. This lecture covers relational algebra, from Chapter 4. The relational calculus.
Database Applications (15-415) Relational Calculus Lecture 6, September 6, 2016 Mohammad Hammoud.
Lecture 16: Probabilistic Databases
Relational Algebra Chapter 4, Sections 4.1 – 4.2
Data Model.
Consistent Query Answering: a personal perspective
Test-Driven Ontology Development in Protégé
Database Dr. Roueida Mohammed.
Lecture 23 CS 507.
Presentation transcript:

Hippo a System for Computing Consistent Query Answers to a Class of SQL Queries Jan Chomicki University at Buffalo Jerzy Marcinkowski Wroclaw University Slawomir Staworko University at Buffalo

Motivation - Inconsistent data Enforcing data consistency no longer applicable: Data Integration – Consistent data sources, but inconsistent global view. Long-running transactions. Efficiency reasons.

Consistent Query Answers Repair Instance satisfying the constraint. The set of changes is minimal. There can be an exponential number of repairs. Tuple t is a consistent answer to Q if t is an answer to Q in every repair.

Computing CQA Query rewriting For query Q construct Q’ which evaluation returns consistent answers of Q. Logic programming Use disjunctive program to specify repairs and query result. Condensed representations of repairs

Conflict Hypergraphs Vertex – database tuple Edge – conflicting tuples NameTown J. SmithBuffalo J. SmithChicago D. GibsBuffalo M. AdamsBuffalo M. AdamsChicago M. AdamsNew York (J.S.,BUF) (D.G.,BUF) (M.A.,BUF) (M.A.,CHO) (M.A.,NYC) (J.S.,CHO) Repair – Maximal Independent Set

Hippo – System Description Conflict hypergraph – stored in RAM Denial integrity constraints Queries: SQL frontend – RDMBS independent Platform independent (Java2)

Hippo is fast Selection and Join – as fast as underlying database system. (QR takes approx. twice the time) Union and Difference – takes approx. Twice the time of simple query evaluation. (QR the same for difference).

Future Work Projection – In general problem is co-NP-data-complete. – Find an efficient heuristic. – Characterize hypergraphs where projection is easy Preferences – User provides preferences on resolving conflicts. – Computing Preferred CQA still easy.

References 1. M. Arenas, L. Bertossi, J. Chomicki. Consistent Query Answers in Inconsistent Databases. PODS’99 2. J. Chomicki, J. Marcinkowski. Minimal Change Integrity Maintenance using Tuple Deletions. Under revision for Information and Computation. 3. J. Chomicki, J. Marcinkowski, S. Staworko. Computing Consistent Query Answers using Conflict Hypergraphs. Under conference submission. 4.