May 28, 2002 P2P Databases 1 Philip A. Bernstein Microsoft Research Fausto Giunchiglia Univ. of Trento Anastasios Kementsietsidis Univ. of Toronto John.

Slides:



Advertisements
Similar presentations
The ANSI/SPARC Architecture of a Database Environment
Advertisements

An Introduction to Description Logics
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
Dynamic Ontology Matching Pavel Shvaiko OpenKnowledge meetings 9 February, 13 March, 2006 Trento, Italy.
A Probabilistic Framework for Information Integration and Retrieval on the Semantic Web by Livia Predoiu, Heiner Stuckenschmidt Institute of Computer Science,
Chapter 3 The Relational Model Transparencies © Pearson Education Limited 1995, 2005.
1 Basic abstract interpretation theory. 2 The general idea §a semantics l any definition style, from a denotational definition to a detailed interpreter.
Chapter 3. 2 Chapter 3 - Objectives Terminology of relational model. Terminology of relational model. How tables are used to represent data. How tables.
Implementing Database Coordination in P2P Networks * Ilya Zaihrayeu SemPGRID-04, 18 May 2004, New York, USA * work with Fausto Giunchiglia.
Distributed Database Management Systems. Reading Textbook: Ch. 4 Textbook: Ch. 4 FarkasCSCE Spring
1 Relational Algebra and Calculus Yanlei Diao UMass Amherst Feb 1, 2007 Slides Courtesy of R. Ramakrishnan and J. Gehrke.
1 Joint work with Antonio Bucchiarone (Fondazione Bruno Kessler - IRST, Trento) and Fabrizio Montesi (University of Bologna/INRIA, Bologna) A Framework.
Peer-to-Peer Databases David Andersen Advanced Databases.
Domain Modelling the upper levels of the eframework Yvonne Howard Hilary Dexter David Millard Learning Societies LabDistributed Learning, University of.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
Evaluating Centralized, Hierarchical, and Networked Architectures for Rule Systems Benjamin Craig University of New Brunswick Faculty of Computer Science.
IMS 4212: Distributed Databases 1 Dr. Lawrence West, Management Dept., University of Central Florida Distributed Databases Business needs.
Lecture 2 The Relational Model. Objectives Terminology of relational model. How tables are used to represent data. Connection between mathematical relations.
Chapter 4 The Relational Model Pearson Education © 2014.
Chapter 4 The Relational Model.
Chapter 3 The Relational Model Transparencies Last Updated: Pebruari 2011 By M. Arief
Reasoning with context in the Semantic Web … or contextualizing ontologies Fausto Giunchiglia July 23, 2004.
思科网络技术学院理事会. 1 Application Layer Functionality and Protocols Network Fundamentals – Chapter 3.
National Survey and Cadastre – Denmark Conceptual Modeling of Geographic Databases - Emphasis on Relationships among Geographic Databases Anders Friis-Christensen.
MAHI Research Database Data Validation System Software Prototype Demonstration September 18, 2001
Ming Fang 6/12/2009. Outlines  Classical logics  Introduction to DL  Syntax of DL  Semantics of DL  KR in DL  Reasoning in DL  Applications.
Chapter 3 The Relational Model. 2 Chapter 3 - Objectives u Terminology of relational model. u How tables are used to represent data. u Connection between.
Data Management for Peer-to-Peer Computing: A Vision Ali Rahbari.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
LDK R Logics for Data and Knowledge Representation Modeling First version by Alessandro Agostini and Fausto Giunchiglia Second version by Fausto Giunchiglia.
Navigational Plans For Data Integration Marc Friedman Alon Levy Todd Millistein Presented By Avinash Ponnala Avinash Ponnala.
An Algebra for Composing Access Control Policies (2002) Author: PIERO BONATTI, SABRINA DE CAPITANI DI, PIERANGELA SAMARATI Presenter: Siqing Du Date:
HEPTOX 1 : Marrying XML and Heterogeneity in Your P2P Databases Angela Bonifati (Icar CNR, Italy), Elaine Q.Chang, Laks V.S.Lakshmanan, Terence Ho, Rachel.
9/7/2012ISC329 Isabelle Bichindaritz1 The Relational Database Model.
AOIS’02 - June 02, 2002 Coordinating Peer-to-Peer information sources1 Fausto Giunchiglia, University of Trento Coordinating Peer-to-Peer information sources.
DRAGO: Distributed Reasoning Architecture for the Semantic Web Andrei Tamilin and Luciano Serafini Work is supported by 1 June 2005 Second European Semantic.
An Introduction to Description Logics (chapter 2 of DLHB)
CMU SCS Carnegie Mellon Univ. Dept. of Computer Science Database Applications Lecture#6: Relational calculus.
Lesson Overview 3.1 Components of the DBMS 3.1 Components of the DBMS 3.2 Components of The Database Application 3.2 Components of The Database Application.
P2P Concept Search Fausto Giunchiglia Uladzimir Kharkevich S.R.H Noori April 21st, 2009, Madrid, Spain.
Data Sharing in the Hyperion Peer Database System Patricia Rodriguez-Gianolli Anastasios Kementsietsidis Maddalena Garzetti Iluju Kiringa Lei Jiang Mehedi.
1 Incorporating Data Mining Applications into Clinical Guidelines Reza Sherafat Dr. Kamran Sartipi Department of Computing and Software McMaster University,
Object Oriented Multi-Database Systems An Overview of Chapters 4 and 5.
Proposed NWI KIF/CG --> Common Logic Standard A working group was recently formed from the KIF working group. John Sowa is the only CG representative so.
Distributed DBMSs- Concept and Design Jing Luo CS 157B Dr. Lee Fall, 2003.
Scaling Heterogeneous Databases and Design of DISCO Anthony Tomasic Louiqa Raschid Patrick Valduriez Presented by: Nazia Khatir Texas A&M University.
Syntax and Semantics CIS 331 Syntax: the form or structure of the expressions, statements, and program units. Semantics: the meaning of the expressions,
Aggregate Queries in Peer-to-Peer OLAP Mauricio Minuto Espil Faculty of Engineering Universidad Católica Argentina Alejandro A. Vaisman Computer Science.
Logical Agents Chapter 7. Outline Knowledge-based agents Logic in general Propositional (Boolean) logic Equivalence, validity, satisfiability.
Making Peer Databases Interact – A Vision for an Architecture Supporting Data Coordination Working Group (in alph. order): Bernstein Phil (4) Kementsietsidis.
A Data Stream Publish/Subscribe Architecture with Self-adapting Queries Alasdair J G Gray and Werner Nutt School of Mathematical and Computer Sciences,
INRIA - Progress report DBGlobe meeting - Athens November 29 th, 2002.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Lecture 7: Foundations of Query Languages Tuesday, January 23, 2001.
1 Reasoning with Infinite stable models Piero A. Bonatti presented by Axel Polleres (IJCAI 2001,
Object storage and object interoperability
ece 627 intelligent web: ontology and beyond
The Relational Model © Pearson Education Limited 1995, 2005 Bayu Adhi Tama, M.T.I.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 The Relational Model Chapter 3.
Physical Data Model – step-by-step instructions and template
Chapter 4 The Relational Model Pearson Education © 2009.
Chapter 4 The Relational Model Pearson Education © 2009.
The Relational Model Transparencies
Working Group (in alph. order): Bernstein Phil (4)
Query Optimization.
Representations & Reasoning Systems (RRS) (2.2)
Presentation transcript:

May 28, 2002 P2P Databases 1 Philip A. Bernstein Microsoft Research Fausto Giunchiglia Univ. of Trento Anastasios Kementsietsidis Univ. of Toronto John Mylopoulos Univ. of Toronto Luciano Serafini Univ. of Trento Ilya Zaihrayeu Univ. of Trento Data Management for Peer-to-Peer Computing: A Vision

May 28, 2002 P2P Databases 2 A Peer to Peer (P2P) Research Project Database and AI researchers intermittently connect to exchange research ideas … about P2P DBs Seattle Toronto Trento (Italy)

May 28, 2002 P2P Databases 3 Is There a Role for P2P DBs ? Peers come and go, but must still be able to interoperate. To us, the big question is how to cope with DBs that  are incomplete, overlapping, and mutually inconsistent  dynamically appear and disappear  have limited connectivity. Scenario  Databases of medical patients  Complete integration is likely to be infeasible  But dynamic integration of DBs relevant to one patient could have high value.

May 28, 2002 P2P Databases 4 Contributions Why P2P databases are different A P2P database scenario A logic for P2P databases Architecture and implementation issues

May 28, 2002 P2P Databases 5 A Model for P2P Databases Each peer is a node with a database. It exchanges data and services with acquaintances (i.e. other peers). The set of acquaintances changes often, due to  site availability  changing usage patterns Peers are fully autonomous.  No global control or central server. Hence no global schema  It would be impractical to build one for each peer  And it might be impossible because of inconsistencies

May 28, 2002 P2P Databases 6 A Motivating Scenario  A patient may be described in several DBs, which use different patient id formats, disease descriptions, etc.  When a patient is admitted to the hospital, H becomes acquainted with D  The acquaintance is dropped when treatment is over  When the doctor prescribes a drug, D becomes acquainted with P  A patient is injured skiing, so more DBs get involved H: Hospital P: Pharmacist D: Doctor Ski Clinic

May 28, 2002 P2P Databases 7 Proposal: Local Relational Model (LRM) A logic for P2P data integration Instead of a global schema, each peer has  coordination formulas – each specifies semantic interdependencies between two acquaintances  binary domain relations – each specifies how symbols in one database translate to symbols in an acquaintance’s database. Each expression in a coordination formula is relative to just one participating database Use coordination formulas and domain relations for query and update processing.

May 28, 2002 P2P Databases 8 A Coordination Formula p: pharmacist DB medication(PrescriptionID, Pid, Prod) d: doctor DB treatment(Tid, Pid, Description, Type) where type  {“hospital”, “home”}. (  i:x).A(x) means for all v in the domain of database i, A(v) is true. A coordination formula (  p:y).(  p:z).(p: (  x).medication(x, y, z)  d: (  w).treatment(w, y, z, “home”) ) “There’s a row in treatment in the doctor DB for each row in medication in the pharmacist DB”

May 28, 2002 P2P Databases 9 Domain Relation A row in domain relation r ik specifies that value d 1 in DB i corresponds to value d 2 in DB k r ik may be partial r ik,r ki need not be symmetric Example - DB i contains lengths in meters and DB k in kilometers (total but not symmetric)  r ik (x) = roundToClosestK(x) r ik (653)=1, r ik (453)=0  r ki (x) = x*1000 r ki (1)=1000

May 28, 2002 P2P Databases 10 Queries A query is a coordination formula of the form A(x)  i: q(x), where  A(x) is a coordination formula  x has n variables  i is the database against which the query is posed  q is a new n-ary predicate symbol A relational space is a pair where db is a set of DBs and r associates an r ik with each pair of DBs ⊨  A relational space satisfies a coordination formula  The answer to a query: {d  dom i | ⊨ ((  i:x).A(x)  i:x=d)} n

May 28, 2002 P2P Databases 11 Interpreting a Query A query ((i:P(x)  j:R(y))  k:S(x,y) )  h: q(x,y) Evaluate P,R,S in i,j,k (respectively) Map these results via r ih,r jh,r kh to sets s i,s j,s k And then compute ((s i  s j )  s k )

May 28, 2002 P2P Databases 12 View-Based Data Integration The standard model for data integration is based on  Global as view  Local as view How does this relate to LRM? Could use LRM to express view definitions  Either map “standard” view defns to LRM, or  Specify a restricted LRM for view definitions Could customize LAV/GAV query interpretation for such LRM views and queries This is work in progress.

May 28, 2002 P2P Databases 13 Implementation Architecture A classic multi-database system, with  A protocol for adding/dropping acquaintances  LRM query processing (domain mapping logic) that can cope with chains of acquaintances  Dynamic approach to materialized view creation Tools to help a user establish an acquaintance (e.g. yellow pages, defining domain relations & coord formulas,...) Local Information Source Wrapper Query Mgr Update Mgr User Interface LRM layer A Node P2P Network

May 28, 2002 P2P Databases 14 Summary Why P2P databases are different A P2P database scenario A logic for P2P databases (LRM)  Coordination formulas and domain relations  Query semantics Architecture and implementation issues

May 28, 2002 P2P Databases 15 Theoretical Results Provide inference rules for coordination formulas Prove that the rules are sound and complete. Define a generalized relational theory as a theory with domain closure, distinct domain values, and a finite number of possible relation extensions (CWA). Define relational multi-context system as a family of relational languages (one per database) with a generalized relational theory (in T) and a set of coordination formulas (in R) for each one. Prove that for any relational multi-context system, there’s a unique maximal relational space that satisfies it. (Generalizes Reiter’s result on CWA.) Other results on recursive queries and query evaluation.