Center for Computational Intelligence, Learning, and Discovery
Artificial Intelligence Research Laboratory
Department of Computer Science

Supported in part by grants from the National Science Foundation (IIS , IIS ) to Vasant Honavar.

Students, Computer Science: Doina Caragea (Ph.D., 2004), Jun Zhang (Ph.D., 2005), Jie Bao (Ph.D., 2007), Jyotishman Pathak (Ph.D., 2007), Cornelia Caragea, Oksana Yakhnenko, Neeraj Koul, Yeaser El-Manzalawy, Kewei Tu, Raphael Osorio, Flavian Vasile, Adrian Silvescu.

Students, Bioinformatics: Changhui Yan (Ph.D., 2004), Michael Terribilini, Feihong Wu, Tim Alcon, Carson Andorf, Laron Hughes.

Algorithms and Software for Distributed, Collaborative, Integrative e-Science

From data to knowledge
Statistically based machine learning offers one of the most cost-effective approaches to data-driven knowledge discovery in emerging data-rich application domains (e.g., Bioinformatics, Security Informatics, Medical Informatics, Social Informatics).

Cyber-enabled Discovery Applications

Bioinformatics and Computational Molecular and Systems Biology
• Plant Genome Annotation (with Brendel)
• Protein Function Prediction (with Dobbs, funded by NIH GM )
• Prediction of Protein-Protein, Protein-DNA, and Protein-RNA interfaces (with Dobbs and Jernigan, funded by NIH GM066387)
• Integrating Quantitative and Functional Genomics (with Tuggle et al., funded by USDA)
• Synthesis of Gene Networks (with Greenlee and Serb)
• Cross-species Comparative Animal Genomics (with Reecy, funded by USDA)

Critical Infrastructure Protection
• Distributed power systems management, monitoring, and protection (with McCalley et al., funded by NSF CNS )

Work in Progress
• INDUS (with Caragea, KSU, funded by NSF IIS )
• Ontology Federation and Distributed Inference (with Slutzki, funded by NSF IIS )
• Interactive service composition and adaptation (with Basu and Lutz, funded by NSF CCF )

Challenges
• Scalability: massive, distributed, autonomous data
• Differences in data semantics: terminological differences, different levels of abstraction
• Access constraints, e.g., due to privacy
• Multiple points of view

Research Questions
• Can we construct predictive models without centralized access to data?
• Can we learn in the presence of semantic gaps between user and data sources?
• How do the results compare with the centralized setting?

Learning from Distributed Data [Caragea et al., 2004]
• Decompose learning into an interleaving of statistical queries and computation
• Learning classifiers from distributed data then reduces to answering statistical queries from distributed data under
    • Different types of data fragmentation
    • Different constraints on access and query capabilities
    • Different bandwidth and resource constraints

Results:
• Efficient algorithms for learning predictive models from distributed data
• Strong performance guarantees relative to centralized counterparts
• Scalable implementations of the resulting algorithms

Learning from Semantically Heterogeneous Distributed Data [Caragea et al., 2005]
• Make data sources self-describing: ontology-extended data sources (OEDS)
    • Data source schema ontology
    • Data source content ontology
• Establish semantic correspondences from data source ontology to user ontology
• Query data sources from a user's point of view

[Figure: is-a hierarchies for the user ontology O_U and the data source ontologies O_1 and O_2]

Mappings between Ontologies
• Rainy:O_1 = Rain:O_U
• Snow:O_1 = Snow:O_U
• NoPrec:O_U < Outlook:O_1
• {Sunny, Cloudy}:O_1 = NoPrec:O_U
• Unit conversion (e.g., degrees F to degrees C)

Results:
• Tools for associating ontologies with data and for specifying mappings between ontologies
• Algorithms for querying distributed, semantically heterogeneous data

Learning from Partially Specified Data [Zhang et al., 2003, 2004, 2006]
• Semantic gaps lead to partially specified data
• Different data sources may describe data at different levels of abstraction
• If the description of data at the source is more abstract than what the user expects, additional statistical assumptions become necessary

Results:
• Efficient algorithms for learning concise predictive models from partially specified data under user-specified statistical assumptions

INDUS: Open source software for building predictive models from distributed, semantically heterogeneous, autonomous data sources
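
The two ideas above, answering statistical queries at each source and translating source terms into the user ontology, can be illustrated with a small Python sketch. This is only an illustration under stated assumptions, not the INDUS implementation: the `DataSource` class, the `count_query` method, the class labels, the sample rows, and the O_2 vocabulary (`Showers`, `Flurries`, `Clear`) are all hypothetical, and only the O_1 mappings are taken from the mappings listed above. Each source returns counts expressed in user-ontology terms, and the learner adds them up, so no raw rows leave a source.

```python
from collections import Counter

# Source-term -> user-ontology-term mappings.
# O1 entries follow the mappings listed above; O2 entries are hypothetical.
O1_TO_USER = {"Rainy": "Rain", "Snow": "Snow", "Sunny": "NoPrec", "Cloudy": "NoPrec"}
O2_TO_USER = {"Showers": "Rain", "Flurries": "Snow", "Clear": "NoPrec"}


class DataSource:
    """Hypothetical autonomous source: answers count queries, never ships raw rows."""

    def __init__(self, rows, mapping):
        self._rows = rows        # list of (precipitation_term, class_label)
        self._mapping = mapping  # source term -> user-ontology term

    def count_query(self):
        """Return counts of (user_term, label) pairs (the sufficient statistics)."""
        return Counter((self._mapping[term], label) for term, label in self._rows)


def learn_conditional_probs(sources):
    """Sum per-source counts, then estimate P(user_term | label) from the totals."""
    joint = Counter()
    for source in sources:
        joint.update(source.count_query())  # Counter.update adds counts
    label_totals = Counter()
    for (_, label), n in joint.items():
        label_totals[label] += n
    return {key: n / label_totals[key[1]] for key, n in joint.items()}


# Made-up sample data, purely for illustration.
source1 = DataSource(
    [("Rainy", "stay_in"), ("Sunny", "go_out"), ("Cloudy", "go_out")], O1_TO_USER
)
source2 = DataSource(
    [("Showers", "stay_in"), ("Flurries", "stay_in"), ("Clear", "go_out")], O2_TO_USER
)

probs = learn_conditional_probs([source1, source2])
for (term, label), p in sorted(probs.items()):
    print(f"P({term} | {label}) = {p:.3f}")
```

Because counts are additive across horizontally fragmented data, the estimates here match what a learner would compute from the pooled, already-translated rows. That is the sense in which the distributed result can be compared with the centralized setting for this simple count-based model.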