Model Based Mediation With Domain Maps ___________________________ Xiaosen Li Guanrao William

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

Three-Step Database Design
Università di Modena e Reggio Emilia ;-)WINK Maurizio Vincini UniMORE Researcher Università di Modena e Reggio Emilia WINK System: Intelligent Integration.
Intelligent Technologies Module: Ontologies and their use in Information Systems Revision lecture Alex Poulovassilis November/December 2009.
Interoperability of Distributed Component Systems Bryan Bentz, Jason Hayden, Upsorn Praphamontripong, Paul Vandal.
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
Xyleme A Dynamic Warehouse for XML Data of the Web.
Knowledge Acquisitioning. Definition The transfer and transformation of potential problem solving expertise from some knowledge source to a program.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
1 CIS607, Fall 2006 Semantic Information Integration Instructor: Dejing Dou Week 10 (Nov. 29)
1 Lecture 13: Database Heterogeneity Debriefing Project Phase 2.
TAMBIS Transparent Access to Multiple Biological Information Sources.
Infomaster: An information Integration Tool O. M. Duschka and M. R. Genesereth Presentation by Cui Tao.
Page 1 Multidatabase Querying by Context Ramon Lawrence, Ken Barker Multidatabase Querying by Context.
Automatic Data Ramon Lawrence University of Manitoba
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Advanced Database CS-426 Week 2 – Logic Query Languages, Object Model.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
XML, distributed databases, and OLAP/warehousing The semantic web and a lot more.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
revised CmpE 583 Fall 2006Discussion: OWL- 1 CmpE 583- Web Semantics: Theory and Practice DISCUSSION: OWL Atilla ELÇİ Computer Engineering.
Metadata Tools and Methods Chris Nelson Metanet Conference 2 April 2001.
San Diego Supercomputer Center EDBT'02, Prague 1 EDBT Panel, March 2002, Prague: Scientific Data Integration for Complex Multiple-Worlds Scenarios: Databases.
EXCS Sept Knowledge Engineering Meets Software Engineering Hele-Mai Haav Institute of Cybernetics at TUT Software department.
GEON AHM, April 16-18, SDSC C YBERINFRASTRUCTURE FOR THE G EOSCIENCES Towards Semantic Mediation for GEON: Facilitating Scientific Data Integration using.
Alignment of ATL and QVT © 2006 ATLAS Nantes Alignment of ATL and QVT Ivan Kurtev ATLAS group, INRIA & University of Nantes, France
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
XML & Mediators Thitima Sirikangwalkul Wai Sum Mong April 10, 2003.
1 Lessons from the TSIMMIS Project Yannis Papakonstantinou Department of Computer Science & Engineering University of California, San Diego.
XML Database Integration for Visualizing US Election Results Isabel F. Cruz, Afsheen Rajendran, and William Sunna University of Illinois at Chicago
Knowledge Modeling, use of information sources in the study of domains and inter-domain relationships - A Learning Paradigm by Sanjeev Thacker.
Dimitrios Skoutas Alkis Simitsis
1 Model-Based Information Integration in a Neuroscience Mediator System Bertram Ludaescher Amarnath Gupta Maryann E. Martone University of California San.
Interoperability & Knowledge Sharing Advisor: Dr. Sudha Ram Dr. Jinsoo Park Kangsuk Kim (former MS Student) Yousub Hwang (Ph.D. Student)
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
Knowledge-Based Integration of Neuroscience Data Sources Amarnath Gupta Bertram Ludäscher Maryann Martone University of California San Diego.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Working with Ontologies Introduction to DOGMA and related research.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Data Integration Hanna Zhong Department of Computer Science University of Illinois, Urbana-Champaign 11/12/2009.
1 Resolving Schematic Discrepancy in the Integration of Entity-Relationship Schemas Qi He Tok Wang Ling Dept. of Computer Science School of Computing National.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Model-Based Mediation with Domain Maps Bertram Ludäscher * Amarnath Gupta * Maryann E. Martone + * San Diego Supercomputer Center (SDSC) + National Center.
Semantic Mediation and Scientific Workflows Bertram Ludäscher Data and Knowledge Systems San Diego Supercomputer Center University of California, San Diego.
Object storage and object interoperability
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
An Extensible Model-Based Mediator System with Domain Maps Amarnath Gupta * Bertram Ludäscher * Maryann E. Martone + * San Diego Supercomputer Center (SDSC)
1 Integration of data sources Patrick Lambrix Department of Computer and Information Science Linköpings universitet.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
A Rule Driven Bi-Directional Translation System for Remapping Queries and Result Sets Between a Mediated Schema and Heterogeneous Data Sources R. Shaker.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Improvement of Semantic Interoperability based on Metadata Registry(MDR) Doo-Kwon Baik Dept. of CSE Korea University.
National Partnership of Advanced Computational Infrastructure San Diego Supercomputer Center KNOW-ME (KNOWledge-Map-Explorer) Semantic Browsing of Integrated.
VERA AULIA ( ).  Oil palm is one of the major edible oil traded in the global market.  Oil palm tree will start to produce fruits within three.
Of 24 lecture 11: ontology – mediation, merging & aligning.
1 © 2013 Cengage Learning. All Rights Reserved. This edition is intended for use outside of the U.S. only, with content that may be different from the.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
UCSD Neuron-Centered Database
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Piotr Kaminski University of Victoria September 24th, 2002
Metadata Framework as the basis for Metadata-driven Architecture
Model Based Mediation With Domain Maps ___________________________
Query Optimization.
Ontologies: Introduction and Some Uses
Presentation transcript:

Model Based Mediation With Domain Maps ___________________________ Xiaosen Li Guanrao William Instructor (Prof. Isabel Cruz) The University of Illinois at Chicago

Outline Introduction XML-based Mediation Model-based Mediation Model-based Mediation with Domain Maps Application in Bioinformatics ISIS Comparison of ISIS and Model-based Mediation

Different Schemes Federated Databases XML-Based Mediation Model-Based Mediation One-World One-/Multiple-Worlds Complex Multiple-Worlds

Our Goal Given different data sources: And we have different queries: (Q1,Q2,……..Qk) over (S1,S2…..Sk) Find answers to these questions: (A1,A2,……..Ak) S1 S2 Sn …….

Introduction Model-Based Mediation: Integration of different data sources to retrieve information that cannot be retrieved using one source. Domain Map Domain Maps (Ontology's): Glue Knowledge Sources

“One Simple World” example Given: car Dealer A, Car Dealer B Find cars from Dealer A and Dealer B, Join on Make. Group by Manufacturing year, and Price. Solution: we can use XML-Based Mediation to find the answer.

XML-Mediator (Abstract) S1 Wrapper XML VIEW MEDIATOR Integrated XML View IVD (S1,S2) Query(s1,s2) USER …... SnS2 You can add multiple sources XML QUERIES/RESULTS CAR DEALER A CAR DEALER B

Integrated View Definition for the Car example XMAS XML Matching And Structuring language CONSTRUCT $m1 $p $ma { $ma } { $m1, $p } WHERE $m1 : $p : IN WRAP(“Dealer_A”) AND $m2 : $ma : IN WRAP(“Dealer_B”) AND value( $m1 ) = value( $m2 )

XMAS QUERY PROCESSING Translator COMPOSITION Rewriter, Optimizer XMAS QUERY XMAS VIEW DEFINITION PLAN EXECUTION

XML-Based Mediation: –XML Models –XML Elements –Structural Constraints: DTD (Parent, Child, Sibling) –No classes relationships (is-a, has-a) –No logical Domain constraints

Complex Multiple-Worlds Navigating the multiresolution data using knowledge-based mediation with domain maps ___________________________ Genes Proteins Cells Tissues Organism s Different Species Different Techniques Different Disciplines

Complex Multiple-Worlds

Strategies Take all the huge different databases and put them into an even larger database (warehouse) Or develop a system to talk to different databases and correlates the results

Database What is the cerebellar distribution of rat proteins with more than 80% homolgy with human NCS-1? How about other rodents? Protein localizationMorphologyNeurotransmision Database CaBP SYSTEM THAT CAN PROCESS THE QUERY FROM MULTIPLE COMPLEX WORLD DATABASES Query/Result

Model-based Mediation XML Wrapper Mediator CM Integrated View User/Client Integrated View Definition IVD(S_1,S_2,…,S_k) S_2 S_k S_1 CM Plug-ins CM Wrapper CM S_1CM S_2CM S_k GCM CM Queries & Results

Model-based Mediation “Lift” from syntax level to conceptual level Lift: –before: the source has element names that are NOT related –after: the element names are linked to a domain map Data provider adds links from raw data to domain maps

Model-based Mediation CM plug-in To make the mediator independent of CM formalism: --Sources export all CM information in XML --Use GCM so that the mediator no longer needs one module per CM formalism

Model-based Mediation CM to GCM GCM is a meta-model that any conceivable CM formalism can be expressed in. F-Logic as GCM --Convenience: root in knowledge representation and Object-Oriented database --Availability: FLORA, FLORID

A Question Different data sources contains different aspects of data. How to integrate them? For example Calcium channel Cell membrane Ca++ Intracellular Extracellular

Structural vs. Semantic Integration Source 1 Physiological data of calcium current through calcium channels Source 2 Immunolocalization of calcium channels Structurally they are isolated Conceptually and Semantically they are related Physiology data Immunolocalization data

Domain Maps Domain Map = Ontology –definition of “things” that are relevant to your application –representation of terminological knowledge –explicit specification of a conceptualization –concept hierarchy (“is-a”) –further semantic relationships between concepts abstractions of relational schemas, (E)ER, UML classes, XML Schemas Formalisms: Semantic nets, Frame-logic, Description logic,...

Domain Maps Formal definition --A finite set containing: --Description Logic (DL) --Logic rules --Facts expressed as edge-labeled digraphs with nodes representing concepts and edge labeled as roles: C r D : if c belongs to C then there is some d in D such that r(c,d) holds

Domain Map Use in Model-Based Mediation --“Provide declarative means for specifying additional knowledge that is not present in the source but that can be used to navigate through and interrelate the multiple data sources.” --when used as part of the IVD, can infer knowledge or derive virtual relations

Brain Neuron Cerebellum Purkinje cell layer Purkinje cell has_a is_a Knowledge based mediation (Use of Domain Maps) Using ontology maps to encode these semantic relationships

Domain Maps

The Whole Picture XML Wrapper Mediator CM Integrated View User/Client Integrated View Definition IVD(S_1,S_2,…,S_k) S_2S_kS_1 CM Plug-ins CM Wrapper CM S_1CM S_2CM S_k GCM CM Queries & Results Domain Map

XML-Based vs. Model-Based Mediation Raw Data IF  THEN  Logical Domain Constraints Integrated-CM := CM-QL(Src1-CM,...) Integrated-CM := CM-QL(Src1-CM,...) (XML) Objects Conceptual Models XML Elements XML Models C2 C3 C1 R Classes, Relations, is-a, has-a,... Domain Maps Domain Maps Integrated-DTD := XML-QL(Src1-DTD,...) Integrated-DTD := XML-QL(Src1-DTD,...) No Domain Constraints A = (B*|C),D B =... Structural Constraints (DTDs), Parent, Child, Sibling,... CM ~ {Descr.Logic, ER, UML, RDF/XML(-Schema), …} CM-QL ~ {F-Logic, …}

Achieving Interoperability of Genome Databases Through Intelligent Web Mediators Problem: There are hundreds or even thousands of biology databases, each with its own interface. Querying these databases are tedious, expensive and error prone. Solution: Developing a database-independent, intelligent user interface using their existing query systems and architecture.

Abstraction Hierarchy of the Genome Database on the Web accept into clustalx (select clean(a.sequence) from GlobalDB as g, AnimalDB as a where g.organism = “Drosophila” and g.source(country)=“Kenya” and g.journal like “USA” and a.accession in (select b.accession from blast(AnimalDB, clean(g.sequence)) as b where b.e-value >= 0.98)) GlobalDB GenBankAnimalDBPlantDB AceDB FlyBaseMaizeDBRiceDB GQL Example

LifeDB Web Browser Web Interface Response Web Interface XML Negotiator Query Processor Interpreter Generalizer Mediator Database Schema Mappings Global Schema Query Mappings GQL Query GSchema Query S or Query Mappings Answer A Web Server Ontology Web Interface Response Query G Parameterized queries and responses Data queries and responses Query map info map infoGlobal scheme Query plan GQL query schema info feedback loop schema queries test data probes more databases

ISIS Mediation Architecture ISIS : Interoperable Spatial Information System –Integration of Heterogeneous Spatial or Geographic information system. –Multi-Agent Paradigm  Sharing spatial knowledge and Services. –Web Oriented Information System –Example of Geographic information systems (GIS’s ): Road, Traffic Information on an area Land use information Population Distribution Marketing research Demographics University of Bourgogne (France)

S2 Wrapper Agent CA Cooperation Bus Query Processing Agent Interface Agent ISIS Mediation Architecture MULTI-agent System Architecture Ontology Agent S1 Wrapper Agent CA Semantic router Agent USER CA = Cooperation agent CA

WRAPPER AGENT: ~ processes OQL (Object Query Language) queries from Corresponding Cooperation Agent Difference between SQL, OQL: refer to this suggested website: ~ Forwards the results to the Cooperation Agent ~ A wrapper Agent is an “Employee” of one Cooperation Agent. Responsive when triggered by the “boss” ~ Schemas are represented by AMUN (Multi-level data Model) objects Which Lacks Semantics COOPERATION AGENT: ~ Contains knowledge of one source only ( represented by Semantic Cooperation Objects) ~ Semantic Objects are created with the help of the semantic router agent ~ Process self initiated Queries or sub Queries initiated by other agents ~ Queries are written in terms of the local objects and passed to the wrapper

ONTOLOGY AGENT: ~ provides Mutual understanding of concepts between the various agents to help them work with each other without a need for a global schema ~ defines ontological set of terms to be used by the cooperation agents and the semantic router SEMANTIC ROUTER AGENT: ~ To achieve communication between Cooperation agents, The semantic Router provides information about the location and identity of every Cooperation agent. Cooperation agents can participate in executing queries Query PROCESSOR AGENT: ~ It identifies relevant information sources and creates an execution plan INTERFACE AGENT: ~ Receives Queries from the user and pass them to one Cooperation agent. ~ reports back the results of the query to the user ~ only connected with one Cooperation agent

AMUN DATA MODEL used to represent schemas on both the wrapper level and the cooperation level. Geometry Coordinate Geometry CurvePointSurfaceSolid Line StringPolygon Line ringLine ISIS Page 6 Polyhedral surface Type hierarchy of AMUN

ISIS vs Model-Based Mediation With Domain Maps ISIS: Application: developed to integrate heterogeneous geographic systems in the first place Terminological Knowledge: Uses Ontology Agent Schemas: Represented by AMUN Data Model in all stages of mediation Model-Based: Application: developed to integrate heterogeneous Biological data bases in the first place Terminological Knowledge : Uses domain maps Schemas: represented in different models in different stages (XML,CM,GCM)

QUESTONS? COMMENTS?

References [1] Model –based Mediation with Domain Maps, B. Ludäscher, A. Gupta, M. E. Martone, 17 th Intl. Conference on Data Engineering, Heidelberg, Germany, IEEE Computer Society, April [2].Model-Based Information Integration in a Neuroscience Mediator System, B. Ludäscher, A. Gupta, M. E. Martone, demonstration track, 26th Intl. Conference on Very Large Databases (VLDB), Cairo, Egypt, September [3] ISIS: A Semantic Mediation Model and an Agent Based Architecture for GIS Interoperability, Eric Leclercq, Djamal Benslimane and Kokou Yétongnon, In Proceedings of the 1999 International Database Engineering and Applications Symposium, IDEAS 1999, August, 1999, Montreal, Canada. [4]. Model-Based Mediation: Framework and Challenges, B. Ludäscher, Faculty Research Seminar, Computer Science and Engineering, U.C. San Diego, November 28th, [5] Achieving interoperability of genome databases through intelligent web mediator, H. M. Jamil, In Proceedings of the IEEE International Symposium on Bio- Informatics and Biomedical Engineering (BIBE 2000), Washington, DC, November 8- 10,