Download presentation
Presentation is loading. Please wait.
Published byAntony Holt Modified over 6 years ago
1
Ontology-Based Information Integration Using INDUS System
Center for Computational Intelligence, Learning, and Discovery Artificial Intelligence Research Laboratory Department of Computer Science ETC 2007 Ontology-Based Information Integration Using INDUS System Doina Caragea, Cornelia Caragea, Jie Bao, and Vasant Honavar User view A user view with respect to a set of ontology-extended data sources is given by a user schema and ontology and a set of semantic correspondences from data source meta-data to user meta-data. Motivation: Collaborative and Interdisciplinary e-Science INDUS main features A clear distinction between data and the semantics of the data: makes it easy to define mappings from data source ontologies to user ontologies User-specified ontologies: each user can specify his or her ontology and mappings from data source ontologies to the user ontology; there is no single global ontology. A user-friendly ontology and mappings editor: this can be easily used to specify ontologies and mappings; however, a predefined set of ontologies and mappings are also available in a repository. Knowledge acquisition capabilities: machine learning algorithms can be easily linked to INDUS, making it an appropriate tool for information integration as well as knowledge acquisition tasks. Available: large amounts of data in many application domains (e.g., Bioinformatics, Social Informatic, and Bibliography Informatics). Semantic correspondences Opportunities: share data and findings between scientists working on related problems. INRIA UMBC CiteULike MIT Library Challenges: large amounts of data; heterogeneous structure; different ontological commitments; constraints imposed by autonomous data sources. Ontology Extended Relational Data Sources (OERDS) Making Data Sources Self Describing Structure Ontology Learning Classifiers from OERDS from a user’s point of view Needed: knowledge acquisition from semantically heterogeneous, networked data and knowledge. INDUS: An Ontology-Based Approach to Information Integration from Distributed, Semantically Heterogeneous, Autonomous Data Sources Work in progress Construct benchmark relational data sets - DBLP Evaluate the robustness of our approach wrt different user ontologies and mappings Evaluate the robustness of our approach wrt errors (inconsistencies) in mappings Use the results of learning to rank mappings the better the classifiers, the better the mappings Assess the scalability of the approach Content Ontology INDUS prototype: web address Acknowledgements: This work is supported in part by grants from the National Science Foundation (IIS ), and the National Institutes of Health (GM ) to Vasant Honavar.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.