Information Integration Using Logical Views Jeffrey D. Ullman.

Slides:



Advertisements
Similar presentations
Università di AnconaCoopIS01 September 6, C OOPERATION S TRATEGIES FOR I NFORMATION I NTEGRATION Maurizio Panti, Luca Spalazzi, Loris Penserini
Advertisements

1 Datalog: Logic Instead of Algebra. 2 Datalog: Logic instead of Algebra Each relational-algebra operator can be mimicked by one or several Database Logic.
Some Prolog Prolog is a logic programming language
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
CSE 636 Data Integration Conjunctive Queries Containment Mappings / Canonical Databases Slides by Jeffrey D. Ullman.
2005conjunctive-ii1 Query languages II: equivalence & containment (Motivation: rewriting queries using views)  conjunctive queries – CQ’s  Extensions.
Information Integration Using Logical Views Jeffrey D. Ullman.
Logic.
1 Global-as-View and Local-as-View for Information Integration CS652 Spring 2004 Presenter: Yihong Ding.
ICDT'2001, London, UK1 On Answering Queries in the Presence of Limited Access Patterns Chen Li Stanford University joint work with Edward Chang, UC Santa.
2005lav-ii1 Local as View: Some refinements  IM: Filtering irrelevant sources  Views with restricted access patterns  A summary of IM.
1 Answering Queries Using Views Alon Y. Halevy Based on Levy et al. PODS ‘95.
Generating Efficient Plans for Queries Using Views Chen Li Stanford University with Foto Afrati (National Technical University of Athens) and Jeff Ullman.
1 Models for Information Integration: Tuning Local-as-view into Global-as-view Andrea Cali Giuseppe De Giacomo Maurizio Lenzerini.
BYU 2003BYU Data Extraction Group Combining the Best of Global-as-View and Local-as-View for Data Integration Li Xu Brigham Young University Funded by.
Data Exchange & Composition of Schema Mappings Phokion G. Kolaitis IBM Almaden Research Center.
Data Integration Techniques Zachary G. Ives University of Pennsylvania CIS 550 – Database & Information Systems October 30, 2003 Some slide content may.
Inference and Resolution for Problem Solving
Local-as-View Mediators Priya Gangaraju(Class Id:203)
1 CIS607, Fall 2005 Semantic Information Integration Presentation by Paea LePendu Week 8 (Nov. 16)
Logical Agents Chapter 7. Why Do We Need Logic? Problem-solving agents were very inflexible: hard code every possible state. Search is almost always exponential.
Local-as-View Data Integration Zachary G. Ives University of Pennsylvania CIS 650 – Database & Information Systems February 21, 2005.
1 Describing and Utilizing Constraints to Answer Queries in Data-Integration Systems Chen Li Information and Computer Science University of California,
Credit: Slides are an adaptation of slides from Jeffrey D. Ullman 1.
2005Integration-intro1 Data Integration Systems overview The architecture of a data integration system:  Components and their interaction  Tasks  Concepts.
CSE 636 Data Integration Answering Queries Using Views Overview.
Infomaster: An information Integration Tool O. M. Duschka and M. R. Genesereth Presentation by Cui Tao.
Describing data sources. Outline Overview Schema mapping languages.
2005lav-iii1 The Infomaster system & the inverse rules algorithm  The InfoMaster system  The inverse rules algorithm  A side trip – equivalence & containment.
Distributed Database Management Systems. Reading Textbook: Ch. 4 Textbook: Ch. 4 FarkasCSCE Spring
Rada Chirkova (North Carolina State University) and Chen Li (University of California, Irvine) Materializing Views With Minimal Size To Answer Queries.
Credit: Slides are an adaptation of slides from Jeffrey D. Ullman 1.
1 Information Integration Mediators Warehousing Answering Queries Using Views.
2005lav-i1 Local as View: First steps  Introduction and an example  Rewriting queries using views  The Information Manifold system.
Containment CSE 590 DB Rachel Pottinger. Outline zIntroduction zMotivation zFormal definition zAlgorithms for different complexities zAn application:
1 Data integration Most slides are borrowed from Dr. Chen Li, UC Irvine.
1 Searching and Integrating Information on the Web Seminar 2: Data Integration Professor Chen Li UC Irvine.
CSE 636 Data Integration Limited Source Capabilities Slides by Hector Garcia-Molina Fall 2006.
Navigational Plans For Data Integration Marc Friedman Alon Levy Todd Millistein Presented By Avinash Ponnala Avinash Ponnala.
The TSIMMIS Approach to Mediation: Data Models and Languages Hector Garcia-Molina Yannis Papakonstantinou Dallan Quass Anand Rajaraman Yehoshua Sagiv Jeffrey.
1 Lessons from the TSIMMIS Project Yannis Papakonstantinou Department of Computer Science & Engineering University of California, San Diego.
Mediators, Wrappers, etc. Based on TSIMMIS project at Stanford. Concepts used in several other related projects. Goal: integrate info. in heterogeneous.
Answering Queries Using Views LMSS’95 Laks V.S. Lakshmanan Dept. of Comp. Science UBC.
Preference Revision via Declarative Debugging Pierangelo Dell’Acqua Dept. of Science and Technology - ITN Linköping University, Sweden EPIA’05, Covilhã,
CSE 636 Data Integration Conjunctive Queries Containment Mappings / Canonical Databases Slides by Jeffrey D. Ullman Fall 2006.
1 Information Integration. 2 Information Resides on Heterogeneous Information Sources different interfaces different data representations redundant and.
Background Motivation Implementation Conclusion 2.
Solving Problems by searching Well defined problems A probem is well defined if it is easy to automatically asses the validity (utility) of any proposed.
Presented by Jiwen Sun, Lihui Zhao 24/3/2004
Answering Queries Using Views: The Last Frontier.
Data Integration Zachary G. Ives University of Pennsylvania CIS 550 – Database & Information Systems November 14, 2007.
Bridging Different Data Representations Zachary G. Ives University of Pennsylvania CIS 550 – Database & Information Systems October 28, 2003 Some slide.
Data Integration Hanna Zhong Department of Computer Science University of Illinois, Urbana-Champaign 11/12/2009.
Data Integration: Achievements and Perspectives in the Last Ten Years AiJing.
3.2 Solving Equations by Using Addition and Subtraction Addition Property of Equality –If the same number is added to each side of an equation, the resulting.
Semi-structured Data In many applications, data does not have a rigidly and predefined schema: –e.g., structured files, scientific data, XML. Managing.
Data Integration Approaches
1 Integration of data sources Patrick Lambrix Department of Computer and Information Science Linköpings universitet.
MID-SEGMENT & TRIANGLE PROPORTIONALITY Day 8.  A midsegment of a triangle is a segment that connects the midpoints of two sides of a triangle. In the.
Extensions of Datalog Wednesday, February 13, 2001.
Answering Queries using Templates with Binding Patterns
Capability Based Mediation in TSIMMIS
Containment Mappings Canonical Databases Sariaya’s Algorithm
Logical Agents Chapter 7.
Solving Inequalities by Adding or Subtracting
INFORMATION INTEGRATION
Local-as-View Mediators
Information Integration
Chen Li Information and Computer Science
Semi-structured Data In many applications, data does not have a rigidly and predefined schema: e.g., structured files, scientific data, XML. Managing such.
Presentation transcript:

Information Integration Using Logical Views Jeffrey D. Ullman

Overview Theoretical Background Synthesizing Queries From Views Information Integration Systems Conclusions

Theoretical Background

Conjunctive Queries (CQ) (NP-Complete) – Query containment – Query equivalence – Q1: p(X,Z) :- a(X,Y) & a(Y,Z) – Q2: p(X,Z) :- a(X,Y) & a(V,Z) CQ’s With Negation ( -Complete) – Q1: p(X,Z) :- a(X,Y) & a(Y,Z) & NOT a(X,Z) CQ’s With Arithmetic Comparision ( -Complete) – Q1: p(X,Z) :- a(X,Y) & a(Y,Z) & X<Y Datalog Programs – p(A,C) :- a(A,B) & b(B,C)

Synthesizing Queries From Views Views Assumption: – Conjunctive Queries Answers – Equivalence – Containment

Solving Queries by Views Views – v1(Y,Z) :- p(X,Y) & p(Y,Z) – v2(X,Z) :- p(X,Y) & p(Y,Z) Query: q(c) :- p(O,A) & p(A,B) & p(B,C) Answer: s1(C) :- v2(O,D) & v1(D,C) Expansion: – e1(C) :- p(O,E) & p(E,D) & p(F,D) & p(D,C)

Solving Queries by Views (Cont. ) Query: q(c) :- p(O,A) & p(A,B) & p(B,C) Answer: – s1(C) :- v2(O,D) & v1(D,C) – e1(C) :- p(O,E) & p(E,D) & p(F,D) & p(D,C) Other Solutions – s2(C) :- v1(O,D) & v2(D,C) – s3(C) :- v1(O,D) & v2(D,E) & v1(E,C) – s4(C) :- v2(O,D) & v1(D,C) & v2(C,E)

Minimal-Solution Theorems NP-complete (Query Length) Minimal Solution – S  Q – There is no solution T for Q such that S  T  Q and T has fewer sub goals than S Theorem1 Theorem2

Information Integration Systems Information Manifold (IM) – AT&T – Local-as-View (Lav) – Source relations defined as views of mediator relations Tsimmis – Stanford and IBM – Global-as-View (Gav) – Mediator relations defined as views of source relations

Information Manifold (IM) Local-as-view approach Description Logic called CARIN Characters (top-down) – A collection of global predicates – Views as source descriptions (some) – Constraints

IM Example Global Predicates

IM Example (Cont.) Views Query: “What are Sally’s phone and office?” Mediator

Query Rewriting

IM Example (Cont.) Answer: Expansion:

Answering Queries in IM Global Predicates View Definitions Source Constraints How much do the mediators know? What’s the hard part? (Query Rewriting!)

Tsimmis OEM and MSL

Tsimmis Example Exported OEM Objects Query: “What are Sally’s phone and office?” Mediator

Answering Queries in Tsimmis MSL Mediator Description View Definitions How much do the mediators know? What’s the hard part? (Mediator description!)

IM vs. Tsimmis Query Processing Adding Sources Levels of Mediation Semistructured Data Constraints Automatic Generation of Components