Grigoris Karvounarakis Zachary G. Ives University of Pennsylvania Bidirectional Mappings for Data and Update Exchange WebDB 2008.

Slides:



Advertisements
Similar presentations
Update Exchange with Mappings and Provenance Todd J. Green Grigoris Karvounarakis Zachary G. Ives Val Tannen University of Pennsylvania VLDB 2007 Vienna,
Advertisements

Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 More SQL: Complex Queries, Triggers, Views, and Schema Modification.
Maintenance of Materialized Views: Problems, Techniques, and Applications Ashish Gupta IBM almaden Research Center Inderpal Singh Mumick AT&T Bell Laboratories.
O RCHESTRA : Rapid, Collaborative Sharing of Dynamic Data Zachary Ives, Nitin Khandelwal, Aneesh Kapur, University of Pennsylvania Murat Cakir, Drexel.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 More SQL: Complex Queries, Triggers, Views, and Schema Modification.
© nCode 2000 Title of Presentation goes here - go to Master Slide to edit - Slide 1 Reliable Communication for Highly Mobile Agents ECE 7995: Term Paper.
Implementing Mapping Composition Todd J. Green * University of Pennsylania with Philip A. Bernstein (Microsoft Research), Sergey Melnik (Microsoft Research),
Data Exchange & Composition of Schema Mappings Phokion G. Kolaitis IBM Almaden Research Center.
Chapter 6: Database Evolution Title: AutoAdmin “What-if” Index Analysis Utility Authors: Surajit Chaudhuri, Vivek Narasayya ACM SIGMOD 1998.
Data Integration and Exchange for Scientific Collaboration DILS 2009 July 20, 2009 Zachary G. Ives University of Pennsylvania Funded by NSF IIS ,
Database Systems More SQL Database Design -- More SQL1.
1 Provenance in O RCHESTRA T.J. Green, G. Karvounarakis, Z. Ives, V. Tannen University of Pennsylvania Principles of Provenance (PrOPr) Philadelphia, PA.
UT DALLAS Erik Jonsson School of Engineering & Computer Science FEARLESS engineering Secure Data Storage and Retrieval in the Cloud Bhavani Thuraisingham,
Query Optimization Allison Griffin. Importance of Optimization Time is money Queries are faster Helps everyone who uses the server Solution to speed lies.
Context Tailoring the DBMS –To support particular applications Beyond alphanumerical data Beyond retrieve + process –To support particular hardware New.
CSE314 Database Systems More SQL: Complex Queries, Triggers, Views, and Schema Modification Doç. Dr. Mehmet Göktürk src: Elmasri & Navanthe 6E Pearson.
Application-Layer Anycasting By Samarat Bhattacharjee et al. Presented by Matt Miller September 30, 2002.
Compiling Mappings to Bridge Applications and Databases Melnik, Adya and Research.
Session-9 Data Management for Decision Support
Chapter 16 Methodology – Physical Database Design for Relational Databases.
SEC835 Practical aspects of security implementation Part 1.
IDB, SNU Dong-Hyuk Im Efficient Computing Deltas between RDF Models using RDFS Entailment Rules (working title)
Web Caching By Neeraj Agrawal. Caching Caching is widely used for improving performance in many context( e.g processor caches in hardware, buffer pool.
DATA-DRIVEN UNDERSTANDING AND REFINEMENT OF SCHEMA MAPPINGS Data Integration and Service Computing ITCS 6010.
Computer Security: Principles and Practice
1 Schema Mediation and Query Processing in Peer Data Management Systems Presenter: Jie Zhao Supervisor: Rachel Pottinger Sept. 29, 2006.
Join Synopses for Approximate Query Answering Swarup Achrya Philip B. Gibbons Viswanath Poosala Sridhar Ramaswamy Presented by Bhushan Pachpande.
First Edition by William Stallings and Lawrie Brown Lecture slides by Lawrie Brown Chapter 5 – Database Security.
1 Copyright © 2004, Oracle. All rights reserved. Introduction.
Using Procedures & Functions Oracle Database PL/SQL 10g Programming Chapter 9.
M.Kersten Dec 31, Cracking the database store The far side of the Moon Martin Kersten, Stefan Manegold Centre for Mathematics and Computer Science.
Reconcilable Differences Todd J. GreenZachary G. IvesVal Tannen University of Pennsylvania March 24, ICDT 09, Saint Petersburg.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
WP3: Provenance and Access Policies Giorgos Flouris (FORTH) - Irini Fundulaki (CWI & FORTH) -
Scaling Heterogeneous Databases and Design of DISCO Anthony Tomasic Louiqa Raschid Patrick Valduriez Presented by: Nazia Khatir Texas A&M University.
KAIS T High-throughput multicast routing metrics in wireless mesh networks Sabyasachi Roy, Dimitrios Koutsonikolas, Saumitra Das, and Y. Charlie Hu ICDCS.
Query Processing – Query Trees. Evaluation of SQL Conceptual order of evaluation – Cartesian product of all tables in from clause – Rows not satisfying.
Understanding Data Intensive Systems Using Dynamic Analysis and Visualization Nesrine NOUGHI.
The Object-Oriented Database System Manifesto Malcolm Atkinson, François Bancilhon, David deWitt, Klaus Dittrich, David Maier, Stanley Zdonik DOOD'89,
Update Exchange with Provenance Schemas are related by GLAV schema mappings (tgds) : M4: Domain_Ref(SrcID, 'Interpro', ITAcc), Entry2Meth(ITAcc, DBAcc,
WP3: Data Provenance and Access Control Irini Fundulaki, FORTH December 11-12, 2012, Luxembourg.
Copyright © 2004 Pearson Education, Inc.. Chapter 24 Enhanced Data Models for Advanced Applications.
1 SQL: The Query Language. 2 Example Instances R1 S1 S2 v We will use these instances of the Sailors and Reserves relations in our examples.
Database Security Advanced Database Dr. AlaaEddin Almabhouh.
Fundamental of Database Systems
More SQL: Complex Queries, Triggers, Views, and Schema Modification
SEDEX: Scalable Entity Preserving Data Exchange
Databases (CS507) CHAPTER 2.
Oracle & SQL Introduction
Chapter 6 - Database Implementation and Use
The Object-Oriented Database System Manifesto
A paper on Join Synopses for Approximate Query Answering
Rapid, Collaborative Sharing of Dynamic Data
Information Systems Database Management
Methodology – Physical Database Design for Relational Databases
Chapter 4 Relational Databases
SQL: Advanced Options, Updates and Views Lecturer: Dr Pavle Mogin
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
ADMIN TRAINING SESSION
Chapter 2 Database Environment Pearson Education © 2009.
Implementing Mapping Composition
Implementing Row Level Security (RLS)
Semantic Adaptation of Schema Mappings when Schemas Evolve
Session #, Speaker Name Views 1/2/2019.
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Chapter 9: Database Systems
Views 1.
Chapter 9 Query-by-Example Pearson Education © 2009.
Chapter 2 Database Environment Pearson Education © 2009.
Course Instructor: Supriya Gupta Asstt. Prof
Presentation transcript:

Grigoris Karvounarakis Zachary G. Ives University of Pennsylvania Bidirectional Mappings for Data and Update Exchange WebDB 2008

2 ● Each peer has a local instance ● Relate peers by schema mappings ● Support update exchange according to different administrator policies Collaborative Data Sharing Systems [CIDR05] DBMS Queries, edits PUBLISH ∆ B +/− ∆ C +/− Peer A Peer B Peer C ∆ A +/−

3 T T Unidirectional update exchange ● O RCHESTRA [ SIGMOD06,VLDB07 ]: ▪ Each mapping (tgd[ Fagin+03 ]/GLAV[ Lenzerini02 ]) has source and target peers ▪ Propagate effects of updates forward, to target ● Related to view maintenance (1,2) U U m1m1 m2m2 (1,3) + m2m2 m 1 : R (xy) Æ S(yz) ! T(xyz) m 2 : T(xyz) ! U(xyz) (1,2) (1,3) – R R (1,2) + S S (2,3) +

4 Bidirectional mappings and update exchange ● Goal: Tighter coupling between peers ▪ Both forward and backward propagation of updates (insertions/deletions) along mappings ▪ E.g., for mirroring of content between peers ● Our contributions: ▪ Language for specifying bidirectional mappings and update policies (for backwards propagation of deletions) ▪ Algorithms for propagation of updates in both directions and for propagation with detection and prevention of side effects [Dayal+82] at run-time ▪ Experimental evaluation, illustrating feasibility

5 m 1 :T(xyz) Æ V(wx) ! R(xy) Æ S(xzw) S Bidirectional update exchange: insertions R (3, 3, 5) T (1, 1, 1) (1, 1, 2) (3, 2, 3) + (1, 1, 4) (1, 2, 4) + (1, 1) + (3, 2) m1m1 m1m1 m1m1 m 1 :R(xy) Æ S(xzw) $ T(xyz) Æ V(wx) U (1, 1, 1) (1, 1, 2) (3, 2, 3) m2m2 m2m2 m2m2 + + m 2 :T(xyz) $ U(xyz) m 1 :R(xy) Æ S(xzw) ! T(xyz) Æ V(wx) m 2 : U(xyz) ! T(xyz) m 2 : T(xyz) ! U(xyz) V (4,1) (5,3) +

6 Propagating deletions to source tuples ● Need to track down and delete source tuples from which deleted ones were derived ● Multiple options for propagating updates backwards over joins ● User specified update policies resolve this ambiguity: ● Guaranteed to perform any deletion, as long as there is at least one * in each side R (xy) Æ S(xzw) $ T(xyz) Æ V(wx) **

7 S Bidirectional update exchange: deletions R (3, 3, 5) T (1, 1, 1) (1, 1, 2) (3, 2, 3) + (1, 1, 4) (1, 2, 4) + (1, 1) + (3, 2) m1m1 m1m1 m1m1 m 1 : * R(xy) Æ S(xzw) $ T(xyz) Æ * V(wx) U (1, 1, 1) (1, 1, 2) (3, 2, 3) m2m2 m2m2 m2m2 + + m 2 : * T(xyz) $ * U(xyz) V (4,1) (5,3) + (1, 1, 2) (3, 3, 5) (1, 1, 2)

8 S Avoiding side effects on T R (3, 3, 5) T (1, 1, 1) (1, 1, 2) (3, 2, 3) + (1, 1, 4) (1, 2, 4) + (1, 1) + (3, 2) m1m1 m1m1 m1m1 m 1 : * R(xy) Æ S(xzw) $ T(xyz) Æ * V(wx) U (1, 1, 1) (1, 1, 2) (3, 2, 3) m2m2 m2m2 m2m2 + + m 2 : * T(xyz) $ * U(xyz) V (4,1) (5,3) +

9 Experimental evaluation ● Implementation strategy: Mappings -> Datalog programs -> SQL + fixpoint ▪ Java layer over RDBMS (DB2) ● Synthetic update workload sampled from SWISS- PROT biological data set ▪ Randomly-generated schemas and mappings ▪ 2000 initial tuples in each peer before propagation ● Questions: ▪ Overhead of bidirectional over unidirectional (in paper) ▪ Feasibility of deletion propagation and overhead of side effect detection

10 Propagation of 10% dels is feasible for 5 peers

11 Related work ● Data exchange [Haas+99, Miller+00, Popa+02, Fagin+03, Hernich+07], peer data exchange [Fuxman+05] ● View update [Dayal+82, Bancilhon+81], Harmony [Bohannon+06], [Matsuda+07] ● Incremental view maintenance [Gupta+93] ● Peer data management systems (PDMS) Piazza[Halevy+03,04],Hyperion[Kementsietsidis+04], [Bernstein+02], [Calvanese+04],...

12 Conclusions and future work ● Contributions: ▪ Bidirectional mappings for update exchange between peers that want closer collaboration ▪ Combine forward and backward propagation to compute peer instances incrementally ▪ Incorporate update policies for backward propagation ▪ Dynamic detection and prevention of side effects ● Future work: ▪ Combine unidirectional and bidirectional mappings, to also guarantee that all initial deletions are performed ▪ Study opportunities for performance optimization