Sangam: A Transformation Modeling Framework Kajal T. Claypool (U Mass Lowell) and Elke A. Rundensteiner (WPI)

Slides:



Advertisements
Similar presentations
Chungnam National University DataBase System Lab
Advertisements

Inside an XSLT Processor Michael Kay, ICL 19 May 2000.
The eXtensible Markup Language (XML) An Applied Tutorial Kevin Thomas.
XML: Extensible Markup Language
TIMBER A Native XML Database Xiali He The Overview of the TIMBER System in University of Michigan.
Relational Databases for Querying XML Documents: Limitations & Opportunities VLDB`99 Shanmugasundaram, J., Tufte, K., He, G., Zhang, C., DeWitt, D., Naughton,
Presented by: Thabet Kacem Spring Outline Contributions Introduction Proposed Approach Related Work Reconception of ADLs XTEAM Tool Chain Discussion.
An Extensible System for Merging Two Models Rachel Pottinger University of Washington Supervisors: Phil Bernstein and Alon Halevy.
Using Schema Matching to Simplify Heterogeneous Data Translation Tova Milo, Sagit Zohar Tel Aviv University.
CMPT 354, Simon Fraser University, Fall 2008, Martin Ester 52 Database Systems I Relational Algebra.
Interactive Generation of Integrated Schemas Laura Chiticariu et al. Presented by: Meher Talat Shaikh.
Xyleme A Dynamic Warehouse for XML Data of the Web.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
Merging Models Based on Given Correspondences Rachel A. Pottinger Philip A. Bernstein.
Web-site Management System Strudel Presented by: LAKHLIFI Houda Instructor: Dr. Haddouti.
Visual Web Information Extraction With Lixto Robert Baumgartner Sergio Flesca Georg Gottlob.
A First Attempt towards a Logical Model for the PBMS PANDA Meeting, Milano, 18 April 2002 National Technical University of Athens Patterns for Next-Generation.
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
Query Languages Aswin Yedlapalli. XML Query data model Document is viewed as a labeled tree with nodes Successors of node may be : - an ordered sequence.
The Data Mining Visual Environment Motivation Major problems with existing DM systems They are based on non-extensible frameworks. They provide a non-uniform.
TU/e eindhoven university of technology / faculty of mathematics and informatics Exporting Databases in XML DTD A Conceptual and Generic Approach Philippe.
Summary. Chapter 9 – Triggers Integrity constraints Enforcing IC with different techniques –Keys –Foreign keys –Attribute-based constraints –Schema-based.
Page 1 Multidatabase Querying by Context Ramon Lawrence, Ken Barker Multidatabase Querying by Context.
QoM: Qualitative and Quantitative Measure of Schema Matching Naiyana Tansalarak and Kajal T. Claypool (Kajal Claypool - presenter) University of Massachusetts,
CIS607, Fall 2005 Semantic Information Integration Article Name: Clio Grows Up: From Research Prototype to Industrial Tool Name: DH(Dong Hwi) kwak Date:
1 Advanced Topics XML and Databases. 2 XML u Overview u Structure of XML Data –XML Document Type Definition DTD –Namespaces –XML Schema u Query and Transformation.
Quete: Ontology-Based Query System for Distributed Sources Haridimos Kondylakis, Anastasia Analyti, Dimitris Plexousakis Kondylak, analyti,
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
H ECATAEUS A Framework for Representing SQL Constructs as Graphs George Papastefanatos 1, Kostis Kyzirakos 1, Panos Vassiliadis 2, Yannis Vassiliou 1 1.
XML-to-Relational Schema Mapping Algorithm ODTDMap Speaker: Artem Chebotko* Wayne State University Joint work with Mustafa Atay,
Selective and Authentic Third-Party distribution of XML Documents - Yashaswini Harsha Kumar - Netaji Mandava (Oct 16 th 2006)
© 2007 Open Grid Forum OGF Modeling Activities DMTF Alliance Partner Symposium Portland, 2007 July 18 Ellen Stokes
Context Tailoring the DBMS –To support particular applications Beyond alphanumerical data Beyond retrieve + process –To support particular hardware New.
1 CIS336 Website design, implementation and management (also Semester 2 of CIS219, CIS221 and IT226) Lecture 6 XSLT (Based on Møller and Schwartzbach,
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
An Algebra for Composing Access Control Policies (2002) Author: PIERO BONATTI, SABRINA DE CAPITANI DI, PIERANGELA SAMARATI Presenter: Siqing Du Date:
Querying Structured Text in an XML Database By Xuemei Luo.
Minor Thesis A scalable schema matching framework for relational databases Student: Ahmed Saimon Adam ID: Award: MSc (Computer & Information.
Dimitrios Skoutas Alkis Simitsis
RRXS Redundancy reducing XML storage in relations O. MERT ERKUŞ A. ONUR DOĞUÇ
CSE 636 Data Integration Schema Matching Cupid Fall 2006.
Hybrid Transformation Modeling Integrating a Declarative with an Imperative Model Transformation Language Pieter Van Gorp
Rainbow: XML and Relational Database Design, Implementation, Test, and Evaluation Project Members: Tien Vu, Mirek Cymer, John Lee Advisor:
XML and Database.
VLDB2005 CMS-ToPSS: Efficient Dissemination of RSS Documents Milenko Petrovic Haifeng Liu Hans-Arno Jacobsen University of Toronto.
Sept. 27, 2002 ISDB’02 Transforming XPath Queries for Bottom-Up Query Processing Yoshiharu Ishikawa Takaaki Nagai Hiroyuki Kitagawa University of Tsukuba.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Visualization Four groups Design pattern for information visualization
Dec. 13, 2002 WISE2002 Processing XML View Queries Including User-defined Foreign Functions on Relational Databases Yoshiharu Ishikawa Jun Kawada Hiroyuki.
Architectural Design Introduction Design has been described as a multistep process in which representations of data and program structure,
Conceptualization Relational Model Incomplete Relations Indirect Concept Reflection Entity-Relationship Model Incomplete Relations Two Ways of Concept.
Deriving Relation Keys from XML Keys by Qing Wang, Hongwei Wu, Jianchang Xiao, Aoying Zhou, Junmei Zhou Reviewed by Chris Ying Zhu, Cong Wang, Max Wang,
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
Lecture 15: Query Optimization. Very Big Picture Usually, there are many possible query execution plans. The optimizer is trying to chose a good one.
1 Storing and Maintaining Semistructured Data Efficiently in an Object- Relational Database Mo Yuanying and Ling Tok Wang.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
1 Towards Integrated Tool Support for the User Requirements Notation Jean-François Roy
Of 24 lecture 11: ontology – mediation, merging & aligning.
XML: Extensible Markup Language
SEAA 2014 Automatic Production of Transformation Chains Using Structural Constraints on Output Models Cuauhtémoc Castellanos Etienne Borde Thomas Vergnaud.
Web Ontology Language for Service (OWL-S)
Microsoft Access 2003 Illustrated Complete
Associative Query Answering via Query Feature Similarity
Presented by: Jacky Ma Date: 11 Dec 2001
Ivan Kurtev, Klaas van den Berg Software Engineering Group
Database Dr. Roueida Mohammed.
ER Example Expressed as IMM Core Instance Specifications
Presentation transcript:

Sangam: A Transformation Modeling Framework Kajal T. Claypool (U Mass Lowell) and Elke A. Rundensteiner (WPI)

The Era of Electronic Information Age of electronic information Age of electronic information Data exists in many different formats Data exists in many different formats  Different data models  Different schemas Users need to Users need to  Publish data in many formats  Integrate and transform data  Query and expect results in common format Underlying problem Underlying problem  Need to express mapping of data from one format to another  Need to perform transformation of data based on expressed mappings.

Schema Translation: State of the Art Naïve approach [Zhang01,Shanmugasundram99] Naïve approach [Zhang01,Shanmugasundram99]  Write specific programs to translate data from one format to another  Examples:  Algorithms: translate XML documents into relational data [zhang01,shanmugasundram99]  Latex2html: convert latex into HTML documents

Schema Translation: State of Art Matching approach [milo98] Matching approach [milo98]  Automatically discover the semantic correspondences between two schemas  Generate translations based on discovered matches Modeling approach [bernstein00,atzeni96] Modeling approach [bernstein00,atzeni96]  Transform local schema into common data model  Translation language to express mappings between schemas in middle layer

The Sangam Framework Goals: Flexible, extensible, and re-usable Goals: Flexible, extensible, and re-usable Allow users to: Allow users to:  Explicitly model translations between schemas  Compose translations from an existing library of modeled translation patterns  Choose from a library of translation operators  Generate translation model from based on schema match process  For all modeled translations: transform the data based on translation

Overview of Sangam Framework Matcher Schema S1 Schema S2 Matches Transformation Framework Transformation Model Pattern Interface TranInterfaceTranInterface User feedback Displayed to User Evaluator Data D2 Data D1 Transformed Data Transformed Schema Transform- ation Patterns User Input System generated output System Input Legend : ToolSet

Outline Sangam graphs Sangam graphs Cross algebra operators Cross algebra operators Composition techniques Composition techniques Cross algebra graphs Cross algebra graphs Execution strategies Execution strategies Architecture Architecture Conclusions Conclusions

Sangam Graphs Sangam Sangam 1. Common data model:  Sangam graph model 2. Translation language  Algebra-based RDB Import XML Export Sangam graph Cross Algebra for translation

Requirements for a Common Data Model Graph-based Graph-based  Common denominator for most data models Expressiveness Expressiveness  Represent schemas from different data models Fundamental constraints Fundamental constraints  Represent constraints such as quantifier, order and key constraints Existing data models not completely suitable Existing data models not completely suitable  Relational, and OO cannot represent order in clean manner  XML (older spec) can not represent key constraints

Sangam Graph Model Satisfies requirements Satisfies requirements  Graph based  Based on SIGs[ Miller93 ]  Can model schemas from different data models  Can represent quantifier, order, key and foreign key constraints Graph Graph  Nodes represent entities  Eg. Relation, attribute, element  Edge relationships between them  Eg. Containment relationship between relation and attribute

<!ATTLIST item id ID #REQUIRED featured CDATA #IMPLIED> Example: Sangam Graph

Cross Algebra Requirements for a transformation language Requirements for a transformation language  Node and edge manipulations  Minimal granularity  Eg: Relation has name and attributes  Allow composition Unique contribution: algebra-based translation language Unique contribution: algebra-based translation language  Translate from one Sangam graph to another Four Operators: Four Operators:  Represent core set of graph linear transformations [ GBook00 ]  Can be composed to formulate more complex operations such as a join operation – not our focus

Cross Algebra Operators cross, connect, smooth and subdivide cross, connect, smooth and subdivide

Composition of Operators Context Dependency Context Dependency  Output = union of output of all operators Derivation Derivation  Output = output of root operator

Cross Algebra Graphs

Evaluating a Cross Algebra Graph Function EvaluateCAT (input: Operator op, Sangam Graph G, output: Sangam Graph G’) { if (!op.hasChildren ()) G’  p.evaluate (G, G’) op.markDone () out  G’ // cached local output return G’ while (op.hasChildren()) { operator opC  op.getNextChild () if (e: == derivation) G_local  EvaluateCAT (opC, G, G’) G’  op.evaluate (G_local, G’) op.markDone () out  G’ // cached local output return G’ elseif (e: == context dependency) G_local  EvaluateCAT (opC, G, G’) G’_local  op.evaluate (G, G’) G’  G_local U G’_local op.markDone () out  G’_local // local cached output return G’

Architecture

Conclusions Sangam: Flexible, extensible, and re-usable transformation modeling framework Sangam: Flexible, extensible, and re-usable transformation modeling framework Key contributions: Key contributions:  Concept of Sangam  Cross Algebra  An algebra for modeling linear transformations  Composition techniques to deal with finer granularity  Evaluation techniques Future work Future work  Modeling the data model layer  Optimization of evaluation strategies  Non-linear transformations