Affinity-based Schema Matching Silvana Castano Università di Milano D2I –– Modena, 27 aprile 2001.

Slides:



Advertisements
Similar presentations
ISDSI 2009 Francesco Guerra– Università di Modena e Reggio Emilia 1 DB unimo Searching for data and services F. Guerra 1, A. Maurino 2, M. Palmonari.
Advertisements

Università di Modena e Reggio Emilia ;-)WINK Maurizio Vincini UniMORE Researcher Università di Modena e Reggio Emilia WINK System: Intelligent Integration.
Ontology Assessment – Proposed Framework and Methodology.
D2I Project, Rome, October ARTEMIS The ARTEMIS prototype for the construction of reconciled views based on affinity evaluation and interactive.
The Relational Model Much of the material presented in these slides was developed by Dr. Ramon Lawrence at the University of Iowa.
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
Distributed DBMS© M. T. Özsu & P. Valduriez Ch.4/1 Outline Introduction Background Distributed Database Design Database Integration ➡ Schema Matching ➡
Reducing the Cost of Validating Mapping Compositions by Exploiting Semantic Relationships Eduard C. Dragut Ramon Lawrence Eduard C. Dragut Ramon Lawrence.
An Extensible System for Merging Two Models Rachel Pottinger University of Washington Supervisors: Phil Bernstein and Alon Halevy.
SSP Re-hosting System Development: CLBM Overview and Module Recognition SSP Team Department of ECE Stevens Institute of Technology Presented by Hongbing.
Interactive Generation of Integrated Schemas Laura Chiticariu et al. Presented by: Meher Talat Shaikh.
Diego Milano, Monica Scannapieco and Tiziana Catarci Università di Roma “La Sapienza” Dipartimento di Informatica e Sistemistica
Università degli Studi di Modena e Reggio Emilia The MOMIS project - Sonia Bergamaschi, Alberto Corni, Francesco Guerra,
Tema 1: Applicazioni per basi di dati su Internet e Intranet Use of ontologies and extensional inter-schema properties for integration D. Beneventano,
The Data Mining Visual Environment Motivation Major problems with existing DM systems They are based on non-extensible frameworks. They provide a non-uniform.
D2I Modena, 27 Aprile 2001 Methodologies and techniques for translating information from source to target data models Unità Responsabile: CS-RC Unità Coinvolte:
SEQUOIAS YR-SOC'07 - Leicester June A NOVEL APPROACH TO WEB SERVICES DISCOVERY Marco Comerio Università di Milano-Bicocca
Summary. Chapter 9 – Triggers Integrity constraints Enforcing IC with different techniques –Keys –Foreign keys –Attribute-based constraints –Schema-based.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Multifaceted Exploitation of Metadata for Attribute Match Discovery in Information Integration Li Xu David W. Embley David Jackman.
Generic Schema Matching with Cupid Jayant Madhavan Philip A. Bernstein Erhard Raham Proceedings of the 27 th VLDB Conference.
QoM: Qualitative and Quantitative Measure of Schema Matching Naiyana Tansalarak and Kajal T. Claypool (Kajal Claypool - presenter) University of Massachusetts,
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
Universe Design Concepts Business Intelligence Copyright © SUPINFO. All rights reserved.
Ontology-based Access Ontology-based Access to Digital Libraries Sonia Bergamaschi University of Modena and Reggio Emilia Modena Italy Fausto Rabitti.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Why XML ? Problems with HTML HTML design - HTML is intended for presentation of information as Web pages. - HTML contains a fixed set of markup tags. This.
Lecture 6 of Advanced Databases XML Schema, Querying & Transformation Instructor: Mr.Ahmed Al Astal.
BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
A Z Approach in Validating ORA-SS Data Models Scott Uk-Jin Lee Jing Sun Gillian Dobbie Yuan Fang Li.
Development of Front End Tools for Semantic Grid Services Dr.S.Thamarai Selvi, Professor & Head, Dept. of Information Technology, Madras Institute of Technology,
Database Management System Lecture 4 The Relational Database Model- Introduction, Relational Database Concepts.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
WEB SEARCH PERSONALIZATION WITH ONTOLOGICAL USER PROFILES Data Mining Lab XUAN MAN.
September 5-7, Trento Deriving “sub-source” similarities from heterogeneous, semi-stuctured information sources D. Rosaci, G. Terracina, D. Ursino Dipartimento.
Dimitrios Skoutas Alkis Simitsis
Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA
A Declarative Similarity Framework for Knowledge Intensive CBR by Díaz-Agudo and González-Calero Presented by Ida Sofie G Stenerud 25.October 2006.
Interoperable Visualization Framework towards enhancing mapping and integration of official statistics Haitham Zeidan Palestinian Central.
Taxonomic Concept Transfer Schema Robert Kukla. Transfer Schema Taxonomic units of interest? Which details do we need to record? What relationships between.
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
1 Tutorial 14 Validating Documents with Schemas Exploring the XML Schema Vocabulary.
Web-site Building Methodologies Current Research.
Announcements Reading for Monday –4.6 Homework 3 – Due 9/29.
Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.
Mar 27, 2008 Christiano Santiago1 Schema Matching Matching Large XML Schemas Erhard Rahm, Hong-Hai Do, Sabine Maßmann Putting Context into Schema Matching.
CELLULAR MANUFACTURING. Definition Objectives of Cellular Manufacturing  To reduce WIP inventory  To shorten manufacturing lead times  To simplify.
Ranking of Database Query Results Nitesh Maan, Arujn Saraswat, Nishant Kapoor.
An Ontology-based Approach to Context Modeling and Reasoning in Pervasive Computing Dejene Ejigu, Marian Scuturici, Lionel Brunie Laboratoire INSA de Lyon,
HKU CSIS DB Seminar: HKU CSIS DB Seminar: COMA-A system for flexible combination of schema matching approaches - VLDB Hong-Hai Do and Erhard Rahm.
An Ontological Approach to Financial Analysis and Monitoring.
Modeling Security-Relevant Data Semantics Xue Ying Chen Department of Computer Science.
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Question Answering Passage Retrieval Using Dependency Relations (SIGIR 2005) (National University of Singapore) Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan,
Of 24 lecture 11: ontology – mediation, merging & aligning.
SERVICE ANNOTATION WITH LEXICON-BASED ALIGNMENT Service Ontology Construction Ontology of a given web service, service ontology, is constructed from service.
CCNT Lab of Zhejiang University
OPM/S: Semantic Engineering of Web Services
Network analysis.
Dr. Sudha Ram Huimin Zhao Department of MIS University of Arizona
Nov. 29, 2001 Ontology Based Recognition of Complex Objects --- Problems to be Solved Develop Base Object Recognition algorithms that identify non-decomposable.
A Graph-Based Approach to Learn Semantic Descriptions of Data Sources
MOMA - A Mapping-based Object Matching System
Tantan Liu, Fan Wang, Gagan Agrawal The Ohio State University
Presentation transcript:

Affinity-based Schema Matching Silvana Castano Università di Milano D2I –– Modena, 27 aprile 2001

Affinity relationships ySchema-level matching (no instances) yFind mappings between schema elements that correspond semantically to each other yAffinity mappings have an associated calculated degree of similarity yARTEMIS is the schema matching component of the MOMIS system which evaluates schema element (e.g., ODLI3 classes) affinity based on the following comparison features xclass names xclass attributes (name and domain) xclass references (name and domain)

Affinity Coefficients Name Affinity  [0,1] Names are compared by exploiting knowledge provided by a reference ontology O (e.g., the Common Thesaurus) O xOntology O is organized according to given strengthened terminological relationships (e.g., synonymy, hypernymy).  Given two names n and n’ an affinity function A(n,n’)  [0,1] returns the strength of the path between them in O; they have affinity iff A(n,n’) is greater than 

Name Affinity NA(School_Member, Professor) = 0.8 * 0.8 = 0.64 School_Member Professor Student University_Student Research_Staff CS_Person 0.8 Thesaurus Definition Determining the path with highest strength Computing Affinity coefficient NA(o,o’) = { otherwise0  1  *  2  * … *  (m-1)  o  m o’ and  1  *  2  *…*  (m-1)    With o  m o’ path of length m, m  1, with highest strength  i  strength of i-th terminological relationship in o  m o’

Affinity Coefficients Structural Affinity  [0,1] yDomains are compared based on compatibility relationships (e.g., validated attributes in the Common Thesaurus) yThe coefficient is proportional to the number of matching attributes and referenced classes Global Affinity  [0,1] yComprehensive affinity value, weighted sum of the two previous coefficients

Semantic Correspondences Validity Check Thesaurus Domain Compatibility Structural Affinity (Interactive validation) School_Member - name (string) - faculty (string) Professor - title (string) - first_name (string) - last_name (string) SA(School_Member, Professor) = 0.25 ODL I 3 Classes first_name last_name name

Affinity-based Clustering yHierachical clustering techniques are employed to identify all schema elements candidates to integration based on evaluated affinities. yClusters of candidates are selected interactively based on affinity thresholds yClusters selection occurs in a way that schema elements inside a cluster are characterized by high values of affinity with other elements of the cluster and by lower levels of affinity with elements outside.

Affinity Tree: Candidate Clusters Candidate cluster choice Threshold choice (es. 0,6) Professor Student CS_Person CourseSection Room School_Memeber University_Student Research_Staff Location OfficeDepartment

Ongoing work and open issues yApplication of the affinity/clustering schema matching techniques to integration of XML datasources has been started by Milano and first results have been published [IWASS00,Retis01,DS-901]. Ongoing work is on XML data reconciliation by providing a further ontological layer on top of integrated representations. yAffinity and clustering are actually performed based on intensional inter-schema properties yExtension of affinity and clustering techniques to consider also extensional inter-schema properties will be performed in collaboration with Modena and Reggio-Emilia.