Interactive Generation of Integrated Schemas Laura Chiticariu et al. Presented by: Meher Talat Shaikh.

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Modeling and Querying Possible Repairs in Duplicate Detection George Beskales Mohamed A. Soliman Ihab F. Ilyas Shai Ben-David.
A theory of Attribute Equivalence in Databases with Application to Schema Integration JAMES A.LARSON SHAMKANT B. NAVATHE RAMEZ ELMASRI Presented by REEMA.
Page 1 Integrating Multiple Data Sources using a Standardized XML Dictionary Ramon Lawrence Integrating Multiple Data Sources using a Standardized XML.
Distributed DBMS© M. T. Özsu & P. Valduriez Ch.4/1 Outline Introduction Background Distributed Database Design Database Integration ➡ Schema Matching ➡
Reducing the Cost of Validating Mapping Compositions by Exploiting Semantic Relationships Eduard C. Dragut Ramon Lawrence Eduard C. Dragut Ramon Lawrence.
An Extensible System for Merging Two Models Rachel Pottinger University of Washington Supervisors: Phil Bernstein and Alon Halevy.
The Relational Model System Development Life Cycle Normalisation
Merging Models Based on Given Correspondences Rachel A. Pottinger Philip A. Bernstein.
© 2005 by Prentice Hall Chapter 3a Database Design Modern Systems Analysis and Design Fourth Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich.
A Review of Ontology Mapping, Merging, and Integration Presenter: Yihong Ding.
24/1/20081 Architecture of multiple databases integration.
Page 1 MDBS Schema Integration: The Relational Integration Model Ramon Lawrence MDBS Schema Integration: The Relational Integration Model Candidacy Exam.
Slides adapted from A. Silberschatz et al. Database System Concepts, 5th Ed. Entity-Relationship Model Database Management Systems I Alex Coman, Winter.
Distributed Database Management Systems. Reading Textbook: Ch. 4 Textbook: Ch. 4 FarkasCSCE Spring
QoM: Qualitative and Quantitative Measure of Schema Matching Naiyana Tansalarak and Kajal T. Claypool (Kajal Claypool - presenter) University of Massachusetts,
1 Relational Algebra and Calculus Yanlei Diao UMass Amherst Feb 1, 2007 Slides Courtesy of R. Ramakrishnan and J. Gehrke.
Sangam: A Transformation Modeling Framework Kajal T. Claypool (U Mass Lowell) and Elke A. Rundensteiner (WPI)
CIS607, Fall 2005 Semantic Information Integration Article Name: Clio Grows Up: From Research Prototype to Industrial Tool Name: DH(Dong Hwi) kwak Date:
The information integration wizard (Iwiz) project Report on work in progress Joachim Hammer Presented by Muhammed Al-Muhammed.
Modeling & Designing the Database
Michael F. Price College of Business Chapter 6: Logical database design and the relational model.
Overview of the Database Development Process
Lecture 2 The Relational Model. Objectives Terminology of relational model. How tables are used to represent data. Connection between mathematical relations.
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall 9.1.
MIS 385/MBA 664 Systems Implementation with DBMS/ Database Management Dave Salisbury ( )
Software School of Hunan University Database Systems Design Part III Section 5 Design Methodology.
9/14/2012ISC329 Isabelle Bichindaritz1 Database System Life Cycle.
DBSQL 3-1 Copyright © Genetic Computer School 2009 Chapter 3 Relational Database Model.
Chapter 2 Adapted from Silberschatz, et al. CHECK SLIDE 16.
Querying Structured Text in an XML Database By Xuemei Luo.
10/3/2012ISC329 Isabelle Bichindaritz1 Logical Design.
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
Minor Thesis A scalable schema matching framework for relational databases Student: Ahmed Saimon Adam ID: Award: MSc (Computer & Information.
Methodology - Conceptual Database Design. 2 Design Methodology u Structured approach that uses procedures, techniques, tools, and documentation aids to.
Dimitrios Skoutas Alkis Simitsis
Data integration and transformation 3. Data Exchange Paolo Atzeni Dipartimento di Informatica e Automazione Università Roma Tre 28/10/2009.
P15 Lai Xiaoni (U077151L) Qiao Li (U077194E) Saw Woei Yuh (U077146X) Wang Yong (U077138Y)
1 5 Normalization. 2 5 Database Design Give some body of data to be represented in a database, how do we decide on a suitable logical structure for that.
RRXS Redundancy reducing XML storage in relations O. MERT ERKUŞ A. ONUR DOĞUÇ
Methodology - Conceptual Database Design
Expert Systems with Applications 34 (2008) 459–468 Multi-level fuzzy mining with multiple minimum supports Yeong-Chyi Lee, Tzung-Pei Hong, Tien-Chin Wang.
Object Oriented Multi-Database Systems An Overview of Chapters 4 and 5.
Part4 Methodology of Database Design Chapter 07- Overview of Conceptual Database Design Lu Wei College of Software and Microelectronics Northwestern Polytechnical.
Scaling Heterogeneous Databases and Design of DISCO Anthony Tomasic Louiqa Raschid Patrick Valduriez Presented by: Nazia Khatir Texas A&M University.
1 Resolving Schematic Discrepancy in the Integration of Entity-Relationship Schemas Qi He Tok Wang Ling Dept. of Computer Science School of Computing National.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
DeepDive Model Dongfang Xu Ph.D student, School of Information, University of Arizona Dec 13, 2015.
Top-K Generation of Integrated Schemas Based on Directed and Weighted Correspondences by Ahmed Radwan, Lucian Popa, Ioana R. Stanoi, Akmal Younis Presented.
Class Diagrams. Terms and Concepts A class diagram is a diagram that shows a set of classes, interfaces, and collaborations and their relationships.
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Semantic Alignment Spring 2009 Ben-Gurion University of the Negev.
Logical Design 12/10/2009GAK1. Learning Objectives How to remove features from a local conceptual model that are not compatible with the relational model.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
1 Storing and Maintaining Semistructured Data Efficiently in an Object- Relational Database Mo Yuanying and Ling Tok Wang.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Chapter (12) – Old Version
Lec 6: Practical Database Design Methodology and Use of UML Diagrams
Entity-Relationship Model
Relations and Functions
Chapter 2: Intro to Relational Model
Associative Query Answering via Query Feature Similarity
Entity Relationship Diagrams
Instance Model Structure
[jws13] Evaluation of instance matching tools: The experience of OAEI
Chapter 2: Intro to Relational Model
Tantan Liu, Fan Wang, Gagan Agrawal The Ohio State University
Practical Database Design and Tuning Objectives
Presentation transcript:

Interactive Generation of Integrated Schemas Laura Chiticariu et al. Presented by: Meher Talat Shaikh

Objectives: Creation of unified schema based on a set of existing source schemas. provides standard representation of data that deals with heterogeneous sources. Applications: Unified schema provides a single access point against which queries are posed. Consolidation of databases of merged organization.

Overview convert each schema into a graph of concepts with Has-A relationships. Identify matching concepts between different graphs. For every pair of matching concept merge/separate the concepts. Allow user to specify constraints on the merging process. Result is adaptive and interactive enumeration method.

Overview

Correspondences Signify “semantic equivalent” elements in two schemas. Bidirectional. Can be specified by the user or discovered by schema matching techniques. The approach considers correspondences between attributes.

Correspondences

Schema Integration The integrated target schema T is capable of representing all the attributes in the source schemas. Every attribute in T must represent some attribute of the input source schemas. The source schema data is transformed into T data via mapping ‘M’. In short, all basic relationships in the source schemas are preserved in T.

Graph of Concepts Each schema is converted into a logical view (graph of concepts). Each concept is a relation name with an associated set of attributes. Concepts in a schema may have references to other concepts and these references are captured by Has-A edges. A concept graph is a pair (V; HasA).

Concepts

Graph of Concepts A concept graph is a pair (V; HasA).

Matching of Concepts Use the correspondences to match the concepts. FORMAL DEFINITION: Let S1 and S2 be two source schemas and let C be a set of correspondences between attributes of S1 and S2. Let A be a concept of S1 and B be a concept of S2. We say that A and B match if there is at least one attribute a in A and one attribute b in B such that there is a correspondence at the schema level between attribute a and attribute b. Result is a matching graph.

Matching Concepts

Matching graph is represented by G=(V, HasA, E) x0 to x7 are the matching edge Matching edges will be the candidates for merging x 0..x 7 are the matching edges Matching Graph

Merging of Concepts Input assignment X. Merges all the concepts according to input assignment. Takes union of attributes. HasA relationship exists as per the source concept graphs.

Integrated Concept Graph

Redundancy constraints Similarly, using redundancy constrains loops can be eliminated.

Removal of Redundancy

Merging of Concepts Maintains an integration function f x for the given assignment X. Specifies how each individual concept, attribute and HasA edge in a source concept graph relates to integrated concept graph.

Mapping Input: integrated concept graph G 1, matching graph G and the integration function fx. Output: mapping M between source and integrated concepts. For every source concept C a mapping M C is created in M, which specifies how an instance of C, together with all its relationships, is to be transformed into an instance of an integrated concept C 1.

Adaptive Enumeration Does not enumerate all integrated schemas. Target schemas are output one by one and the user is allowed to browse through the schemas. Enumeration constraints: The user can express additional constraints on how concepts should be merged. Apply(x), (NOT)Apply(x), Merge(A1…An), (NOT)Merge(A1..An). eg. (NOT) Merge(org, location, emp, phone, fund).

Experimental Results

Strengths And Weakness Simple model: includes HasA edges as the basic form of relationships, and a simpler form of Contains. Graphs of concepts can express most of the essential features that appear in schemas or in conceptual models. The input is just a set of atomic correspondences. Weaknesses: Do not resolve type and representation conflicts. No weight or probabilities involved into matching between concepts.

Conclusion Provides a systematic and effective way of enumerating multiple integrated schemas. Allows user interaction to refine the schemas and generated a final integrated schema.

Thank you.