Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

The National Center for Biomedical Ontology Online Knowledge Resources for the Industrial Age Mark A. Musen Stanford University
Introduction to BioConductor Friday 23th nov 2007 Ståle Nygård Statistical methods and bioinformatics for the analysis of microarray.
SEVENPRO – STREP KEG seminar, Prague, 8/November/2007 © SEVENPRO Consortium SEVENPRO – Semantic Virtual Engineering Environment for Product.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
SONet (Scientific Observations Network) and OBOE (Extensible Observation Ontology): Mark Schildhauer, Director of Computing National Center for Ecological.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
A Paradigm for Space Science Informatics Kirk D. Borne George Mason University and QSS Group Inc., NASA-Goddard or
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Structural Genomics – an example of transdisciplinary research at Stanford Goal of structural and functional genomics is to determine and analyze all possible.
1 genSpace: Community- Driven Knowledge Sharing for Biological Scientists Gail Kaiser’s Programming Systems Lab Columbia University Computer Science.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
GenSpace: Exploring Social Networking Metaphors for Knowledge Sharing and Scientific Collaborative Work Chris Murphy, Swapneel Sheth, Gail Kaiser, Lauren.
Making the Most of What We Know: Towards Effective Use of Genomics Data Terence Critchlow Center for Applied Scientific Computing Lawrence Livermore National.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
1 Data Integration and Extraction over Molecular Biological Data Cui Tao supported by NSF.
1 CIS607, Fall 2006 Semantic Information Integration Instructor: Dejing Dou Week 10 (Nov. 29)
GenSpace: Exploring Social Networking Metaphors for Scientific Collaborative Work Gail Kaiser, Swapneel Sheth, Chris Murphy {kaiser, swapneel, cmurphy}
1 CIS607, Fall 2004 Semantic Information Integration Attendees: Vikash Agarwal, Julian M Catchen Kevin A Huck, Kushal M Koolwal, Paea J Le Pendu Xiangkui.
CBioC: Massive Collaborative Curation of Biomedical Literature Future Directions.
Luxembourg, Sep 2001 Pedro Fernandes Inst. Gulbenkian de Ciência, Oeiras, Portugal EMBER A European Multimedia Bioinformatics Educational Resource.
1 Digital Libraries and Evidence in the Developing World Context Dr. Jon Ferguson Senior Health Database Scientist IMMPACT Project University of Aberdeen.
BIOCMS: Resource Integration and Web Application Framework for Bioinformatics DHUNDY R BASTOLA †, *, ANIL KHADKA †, MOHAMMAD SHAFIULLAH † AND HESHAM ALI.
SYSTEMS ANALYSIS. Chapter Five Systems Analysis Define systems analysis Describe the preliminary investigation, problem analysis, requirements analysis,
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
COHSE Informed WWW Link Navigation Using Ontologies Prof. Carole Goble, Sean Bechhofer Dr. Leslie Carr, Prof. Wendy Hall, Prof. David De Roure, Steve Harris,
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Chapter 11 Managing Knowledge. Dimensions of Knowledge.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
>>> Korean BioInformation Center >>> KRIBB Korea Research institute of Bioscience and Biotechnology GS2PATH: Linking Gene Ontology and Pathways Jin Ok.
Srihari-CSE730-Spring 2003 CSE 730 Information Retrieval of Biomedical Text and Data Inroduction.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
Analysis Environments For Scientific Communities From Bases to Spaces Bruce R. Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign.
Chapter 1 Introduction to Data Mining
Ontology Summit2007 Survey Response Analysis -- Issues Ken Baclawski Northeastern University.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
19/10/20151 Semantic WEB Scientific Data Integration Vladimir Serebryakov Computing Centre of the Russian Academy of Science Proposal: SkTech.RC/IT/Madnick.
Data provenance in biomedical discovery Donald Dunbar Queen’s Medical Research Institute University of Edinburgh Workshop on Principles of Provenance in.
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
Introduction to the Semantic Web and Linked Data Module 1 - Unit 2 The Semantic Web and Linked Data Concepts 1-1 Library of Congress BIBFRAME Pilot Training.
Data Mining and Decision Trees 1.Data Mining and Biological Information 2.Data Mining and Machine Learning Techniques 3.Decision trees and C5 4.Applications.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
Mining the Biomedical Research Literature Ken Baclawski.
A collaborative tool for sequence annotation. Contact:
Proposed Research Problem Solving Environment for T. cruzi Intuitive querying of multiple sets of heterogeneous databases Formulate scientific workflows.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
APPLICATION OF ONTOLOGIES IN CANCER NANOTECHNOLOGY RESEARCH Faculty of Engineering in Foreign Languages 1 Student: Andreea Buga Group: 1241E – FILS Coordinating.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
International Workshop 28 Jan – 2 Feb 2011 Phoenix, AZ, USA Ontology in Model-Based Systems Engineering Henson Graves 29 January 2011.
Genomic Medicine Grid Juan Pedro Sánchez Merino Instituto de Salud Carlos III
Developing our Metadata: Technical Considerations & Approach Ray Plante NIST 4/14/16 NMI Registry Workshop BIPM, Paris 1 …don’t worry ;-) or How we concentrate.
Large Scale Semantic Data Integration and Analytics through Cloud: A Case Study in Bioinformatics Tat Thang Parallel and Distributed Computing Centre,
TDM in the Life Sciences Application to Drug Repositioning *
Biological Databases By: Komal Arora.
MATLAB Distributed, and Other Toolboxes
CCNT Lab of Zhejiang University
What contribution can automated reasoning make to e-Science?
Architecture Components
Using the Drupal Content Management Software (CMS) as a framework for OMICS/Imaging-based collaboration.
Gail Kaiser, Swapneel Sheth, Chris Murphy
CCO: concept & current status
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Collaborative RO1 with NCBO
Presentation transcript:

Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris (Plant Pathology)

Motivation Many fungal genomes being sequenced –100s in the next few years Important fungal genetics work done by large and strong UNL group –Have prototype fungal genome database Numerous groups around the world are developing disparate fungal genome databases –Databases dissimilar and widely distributed –Difficult to unify others’ results with one’s own –Unnecessarily complicates research But it’s impractical to unify the databases!

Semantic Web Develop an ontology to describe the fungal genome data –An ontology is a formal, explicit specification of shared concepts –Allows both human and machine processing –Concepts shared between ontology files on the WWW –Ontology describes properties of genes, relations between genes, and operations useful in analyzing them Participants keep their own data locally, but represent it in a way consistent with this framework Captures the semantic meaning of the data, facilitating automatic processing –This is where the fun starts

Semantic Web Architecture

What we can do with it Can do transitive reasoning on genes –E.g. if genes A and B are related via property 1 and B and C are related via 2, then perhaps so are A and C Inverse relationships to reduce data entry –E.g. “EvolvedFrom” data entered automatically implies “EvolvedTo” relation Consistency checking –E.g. verify that UNL’s assertions about fungal genomes don’t contradict those by others on the same genomes Hypothesis building and testing –E.g. identification of genes that function in specific cellular processes Knowledge discovery and data mining –Ontology includes appropriate techniques for users to apply to extract new knowledge from the data

What we can do with it (cont’d) Uniform interface to the world’s collection of genomic resources –Visualization, query & search –Instructional tool: Train postdocs/students as bi/tri- lingual scientists who can understand molecular (fungal) biology/genetics, bioinformatics, and computer science Can add active machine learning component to facilitate querying of database to classify new sequences –Computer learns how to classify biological sequences via labeled examples, interaction with the user, and interaction with other experts

Prior Work Application of semantic web technology to bioinformatics is not new Gene Ontology ( ) –Collection of ontologies related to molecular functions, biological processes, and cellular components –Takes a rather limited view of ontologies Little (if any) use of quantifiers, shared concepts, etc.

Prior Work (cont’d) Fungal Web ( –Built a fungal gene ontology based on GO –Developing technologies to parse on-line scientific literature to add data to database –Tools to query databases and perform analysis Similar to what we propose, but: –Their extensions to GO do not suit the needs of UNL scientists or the broader fungal community They focus on fungi that degrade cellulose Their annotations too limited to represent entire fungal kingdom –They support machine learning, but not active learning

Extending Other Repositories Use existing ontologies (e.g. GO) and data stores as a basis for fungal ontologies –Utilize existing concepts in other gene ontologies –Extend to meet needs of fungal genomes –Extensions can in turn be utilized by other researchers, both fungal and other kingdoms Because we use common concepts where applicable

Funding Opportunities NSF –Frontiers in Integrative Biological Research (FIBR): Oct prerop, Feb full –Science and Engineering Information Integration and Informatics (SEIII): December NIH –INNOVATIONS IN BIOMEDICAL COMPUTATIONAL SCIENCE AND TECHNOLOGY: Sept LOI, Oct full Nebraska Research Initiative: November

Conclusions Semantic web now popular within bioinformatics, but no support for the work of UNL’s fungal research community We plan to build the necessary infrastructure to unify disparate data sources and provide an interface conducive to knowledge discovery, hypothesis testing, and collaboration –Will build on existing fungal database here at UNL –Contributions: distributed infrastructure, means for querying, drawing inferences We should send someone to the Knowledge- Based Bioinformatics Workshop to learn more about the state of the art Knowledge- Based Bioinformatics WorkshopKnowledge- Based Bioinformatics Workshop