1 Bioinformatics in the Department of Computer Science Lenwood S. Heath Department of Computer Science Blacksburg, VA 24061 College of Engineering Northern.

Slides:



Advertisements
Similar presentations
Network Dynamics and Cell Physiology John J. Tyson Dept. Biological Sciences Virginia Tech.
Advertisements

John J. Tyson Biological Sciences, Virginia Tech
Analysis of Yeast Mutants to Test a Mathematical Model of the Cell Cycle Neil R. Adames 1, Logan Schuck 1, Kathy Chen 2, John J. Tyson 2, and Jean Peccoud.
Deterministic Global Parameter Estimation for a Budding Yeast Model T.D Panning*, L.T. Watson*, N.A. Allen*, C.A. Shaffer*, and J.J Tyson + Departments.
DARPA BioComp PI Meeting, 2001 “The Eukaryotic Cell Cycle as a Test Case for Modeling Cellular Regulation in a Collaborative Problem Solving Environment”
Computational biology and computational biologists Tandy Warnow, UT-Austin Department of Computer Sciences Institute for Cellular and Molecular Biology.
August 19, 2002Slide 1 Bioinformatics at Virginia Tech David Bevan (BCHM) Lenwood S. Heath (CS) Ruth Grene (PPWS) Layne Watson (CS) Chris North (CS) Naren.
Modeling and Understanding Stress Response Mechanisms with Expresso Ruth G. Alscher Lenwood S. Heath Naren Ramakrishnan Virginia Tech, Blacksburg, VA
AI and Bioinformatics From Database Mining to the Robot Scientist.
10/29/2004 Bioinformatics in Computer Science 1 Bioinformatics in Computer Science, the Virginia Bioinformatics Institute, and Opportunities for Engineering.
Prof. Jesús A. Izaguirre Department of Computer Science and Engineering Computational Biology and Bioinformatics at Notre Dame.
Potato Genomics and Bioinformatics
Bioinformatics: A New Frontier for Computer Scientists Ruth G. Alscher Lenwood S. Heath.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Gene expression analysis summary Where are we now?
The Golden Age of Biology DNA -> RNA -> Proteins -> Metabolites Genomics Technologies MECHANISMS OF LIFE Health Care Diagnostics Medicines Animal Products.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Using Bioinformatics to Make the Bio- Math Connection The Confessions of a Biology Teacher.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Scientific Data Mining: Emerging Developments and Challenges F. Seillier-Moiseiwitsch Bioinformatics Research Center Department of Mathematics and Statistics.
The Power of Microarray Technology Ruth G. Alscher.
December 14, 2001Slide 1 Some Biology That Computer Scientists Need for Bioinformatics Lenwood S. Heath Virginia Tech Blacksburg, VA 24061
The JigCell Problem Solving Environment (PSE) Marc Vass and Nick Allen Department of Computer Science Virginia Tech.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Computational Biology and Bioinformatics in Computer Science
Aguda & Friedman Chapter 6 The Eukaryotic Cell-Cycle Engine.
Modeling the Cell Cycle with JigCell and DARPA’s BioSPICE Software Departments of Computer Science* and Biology +, Virginia Tech Blacksburg, VA Faculty:
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Pathway Modeling and Problem Solving Environments Cliff Shaffer Department of Computer Science Virginia Tech Blacksburg, VA
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Bioinformatics.
The DARPA BioSPICE Project Clifford A. Shaffer Department of Computer Science Virginia Tech.
Ran Libeskind-Hadas, Department of Computer Science Thanks to Eliot Bush (Biology) and Zach Dodds (Computer Science) Bioinformatics Education at Harvey.
Genome-scale Metabolic Reconstruction and Modeling of Microbial Life Aaron Best, Biology Matthew DeJongh, Computer Science Nathan Tintle, Mathematics Hope.
Expresso and Chips Studying Drought Stress in Plants with cDNA Microarrays Lenwood S. Heath Department of Computer Science Virginia Tech, VA
What is Biotechnology?.
Reconstructing Gene Networks Presented by Andrew Darling Based on article  “Research Towards Reconstruction of Gene Networks from Expression Data by Supervised.
GTL Facilities Computing Infrastructure for 21 st Century Systems Biology Ed Uberbacher ORNL & Mike Colvin LLNL.
Converting Macromolecular Regulatory Models from Deterministic to Stochastic Formulation Pengyuan Wang, Ranjit Randhawa, Clifford A. Shaffer, Yang Cao,
1 Bio-Trac 40 (Protein Bioinformatics) October 8, 2009 Zhang-Zhi Hu, M.D. Associate Professor Department of Oncology Department of Biochemistry and Molecular.
November 16, 2001Slide 1 Opportunities in Bioinformatics for Computer Science Lenwood S. Heath Virginia Tech Blacksburg, VA University.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Bioinformatics Core Facility Guglielmo Roma January 2011.
BioSPICE and Problem-Solving Environments for Systems Biology Clifford A. Shaffer Department of Computer Science Virginia Tech.
1 Using JigCell and other BioSPICE Tools to Understand the Regulation of Cell Growth and Division John J. Tyson Virginia Polytechnic Institute and State.
Aggregation Connector: A Tool for Building Large Molecular Network Models from Components Thomas C. Jones Jr 1, Clifford A. Shaffer 1, Alida Palmisano.
User Interface Paradigms for Describing Pathway Models Cliff Shaffer Department of Computer Science Virginia Tech.
Master of Science in Biological Informatics PROGRAM DESCRIPTION The MS in Biological Informatics program program aims.
Data Mining and Decision Trees 1.Data Mining and Biological Information 2.Data Mining and Machine Learning Techniques 3.Decision trees and C5 4.Applications.
Central dogma: the story of life RNA DNA Protein.
EB3233 Bioinformatics Introduction to Bioinformatics.
May 23, 2002Slide 1 Networks in Bioinformatics Lenwood S. Heath Virginia Tech Blacksburg, VA, USA I-SPAN’02 Manila, Philippines May 23, 2002.
Systems Biology Markup Language Ranjit Randhawa Department of Computer Science Virginia Tech.
1 Bioinformatics at Norwegian University of Science and Technology Professor Finn Drabløs Department of Cancer Research and Molecular Medicine Finn Drabløs.
The Eukaryotic Cell Cycle : Molecules, Mechanisms, and Mathematical Models John J. Tyson Virginia Tech Bela Novak Tech Univ Budapest.
JigCell Nicholas A. Allen*, Kathy C. Chen**, Emery D. Conrad**, Ranjit Randhawa*, Clifford A. Shaffer*, John J. Tyson**, Layne T. Watson* and Jason W.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Graduate Research with Bioinformatics Research Mentors Nancy Warter-Perez, ECE Robert Vellanoweth Chem and Biochem Fellow Sean Caonguyen 8/20/08.
Dynamics of biological switches 1. Attila Csikász-Nagy King’s College London Randall Division of Cell and Molecular Biophysics Institute for Mathematical.
Learning gene regulatory networks in Arabidopsis thaliana
Department of Computer Science
Pathway Modeling and Problem Solving Environments
Understanding Stress Response Mechanisms with Expresso
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
Estimating Rate Constants in Cell Cycle Models
Presentation transcript:

1 Bioinformatics in the Department of Computer Science Lenwood S. Heath Department of Computer Science Blacksburg, VA College of Engineering Northern Virginia Engineering Showcase March 5, 2004

2 Bioinformatics Faculty Cliff Shaffer Adrian Sandu Alexey Onufriev Lenny HeathT. M. Murali Naren Ramakrishnan Eunice Santos Layne Watson Roger Ehrich Chris North Joao Setubal, CS and VBI

3/5/2004 Bioinformatics in Computer Science 3 Relevant Expertise Algorithms — Heath, Santos, Setubal, Shaffer, Watson Computational structural biology — Onufriev, Sandu Computational systems biology — Murali Data mining — Ramakrishnan Genomics — Heath, Murali, Ramakrishnan Human-omputer interaction, visualization — North Image processing — Ehrich, Watson High performance computing — Sandu, Santos, Watson Numerical analysis — Onufriev, Watson Optimization — Watson Problem solving environments — Ramakrishnan, Shaffer

3/5/2004 Bioinformatics in Computer Science 4 Selected Collaborations Virginia Tech: Biochemistry, Biology, Fralin Biotechnology Center, Plant Physiology, Veterinary Medicine, Virginia Bioinformatics Institute (VBI), Wood Science North Carolina State University: Forest Biotechnology Center Duke: Biology University of Illinois: Plant Biology

5 Selected Funding NSF IBN : ITR: Understanding Stress Resistance Mechanisms in Plants: Multimodal Models Integrating Experimental Data, Databases, and the Literature. L. S. Heath; R. Grene, B. I. Chevone, N. Ramakrishnan, L. T. Watson. $499,973. NSF EIA : A Microarray Experiment Management System. N. Ramakrishnan, L. S. Heath, L. T. Watson, R. Grene, J. W. Weller (VBI). $600,000. DARPA N : Dryophile Genes to Engineer Stasis-Recovery of Human Cells. M. Potts, L. S. Heath, R. F. Helm, N. Ramakrishnan, T. O. Sitz, F. Bloom, P. Price (Life Technologies), J. Battista (LSU). $4,532,622. NSF MCB : Biocomplexity---Incubation Activity: A Collaborative Problem Solving Environment for Computational Modeling of Eukaryotic Cell Cycle Controls. J. J. Tyson, L. T. Watson, N. Ramakrishnan, C. A. Shaffer, J. C. Sible. $99,965. NIH 1 R01 GM : ``Problem Solving Environment for Modeling the Cell Cycle. J. J. Tyson, J. Sible, K. Chen, L. T. Watson, C. A. Shaffer, N. Ramakrishnan, P. Mendes (VBI). 211,038. Air Force Research Laboratory F : The Eukaryotic Cell Cycle as a Test Case for Modeling Cellular Regulation in a Collaborative Problem Solving Environment. J. J. Tyson, J. C. Sible, K. C. Chen, L. T. Watson, C. A. Shaffer, N. Ramakrishnan. $1,650,000.

3/5/2004 Bioinformatics in Computer Science 6 Research Resources System X Third fastest computer on the planet Laboratory for Advanced Scientific Computing & Applications (LASCA) Parallel algorithms & math software Anantham Cluster Grid computing Bioinformatics Research LAN Linux, Mac OS X, Windows Bioinformatics databases and analysis

3/5/2004 Bioinformatics in Computer Science 7 JigCell: A PSE for Eukaryotic Cell Cycle Controls Marc Vass, Nick Allen, Jason Zwolak, Dan Moisa, Clifford A. Shaffer, Layne T. Watson, Naren Ramakrishnan, and John J. Tyson Departments of Computer Science and Biology

3/5/2004 Bioinformatics in Computer Science 8 Computational Molecular Biology DNA mRNA Protein Enzyme Reaction Network Cell Physiology …TACCCGATGGCGAAATGC... …AUGGGCUACCGCUUUACG... …Met - Gly - Tyr - Arg - Phe - Thr... ATP ADP -P XYZ E1E1 E2E2 E3E3 E4E4

9 Clb5 MBF P Sic1 SCF Sic1 Swi5 Clb2 Mcm1 Unaligned chromosomes Cln2 Clb2 Clb5 Cdc20 Cdh1 Cdc20 APC PPX Mcm1 SBF Esp1 Pds1 Cdc20 Net1 Net1P Cdc14 RENT Cdc14 Cdc15 Tem1 Bub2 CDKs Esp1 Mcm1 Mad2 Esp1 Unaligned chromosomes Cdc15 Lte1 Budding Cln2 SBF ? Cln3 Bck2 and growth Sister chromatid separation DNA synthesis Cell Cycle of Budding Yeast

3/5/2004 Bioinformatics in Computer Science 10 JigCell Problem-Solving Environment Experimental Database Wiring Diagram Differential EquationsParameter Values Analysis Simulation Visualization Automatic Parameter Estimation

3/5/2004 Bioinformatics in Computer Science 11 Why do these calculations? Is the model “yeast-shaped”? Bioinformatics role: the model organizes experimental information. New science: prediction, insight JigCell is part of the DARPA BioSPICE suite of software tools for computational cell biology.

3/5/2004 Bioinformatics in Computer Science 12 Expresso: A Next Generation Software System for Microarray Experiment Management and Data Analysis

3/5/2004 Bioinformatics in Computer Science 13 Integration of design, experimentation, and analysis Data mining; inductive logic programming (ILP) Closing the loop Drought stress experiments with pine trees and Arabidopsis Expresso: A Problem Solving Environment (PSE) for Microarray Experiment Design and Analysis

3/5/2004 Bioinformatics in Computer Science 14 Scenarios for Effects of Abiotic Stress on Gene Expression in Plants

3/5/2004 Bioinformatics in Computer Science 15 Data Mining with ILP ILP (inductive logic programming) is a data mining algorithm for inferring relationships or rules. ILP groups related data and chooses in favor of relationships having short descriptions. ILP can also flexibly incorporate a priori biological knowledge (e.g., categories and alternate classifications). Hybrid reasoning: Information Integration “Is there a relationship between genes in a given functional category and genes in a particular expression cluster?” ILP mines this information in a single step

3/5/2004 Bioinformatics in Computer Science 16 Rule Inference in ILP Infers rules relating gene expression levels to categories, both within a probe pair and across probe pairs, without explicit direction Example Rule: [Rule 142] [Pos cover = 69 Neg cover = 3] level(A,moist_vs_severe,not positive) :- level(A,moist_vs_mild,positive). Interpretation: “If the moist versus mild stress comparison was positive for some clone named A, it was negative or unchanged in the moist versus severe comparison for A, with a confidence of 95.8%.”

3/5/2004 Bioinformatics in Computer Science 17 ILP in the Expresso Pipeline Expresso is a next generation software system for microarray experiments that provides a database interface to ILP functionality.

3/5/2004 Bioinformatics in Computer Science 18 Status of Expresso Capabilities –Data capture and storage –Statistical analysis –Data mining by ILP –Microarray experiment design — GeneSieve –Expresso-assisted experiment composition –Closing the experimental loop Successful microarray experiment analysis –Pine, Norway spruce, yeast, Deinococcus radiodurans (an extremophile microorganism), human cell lines Planned microarray experiment analysis –Potato, Arabidopsis thaliana, tomato, rice, corn

3/5/2004 Bioinformatics in Computer Science 19 Networks in Bioinformatics Mathematical Model(s) for Biological Networks Representation: What biological entities and parameters to represent and at what level of granularity? Operations and Computations: What manipulations and transformations are supported? Presentation: How can biologists visualize and explore networks?

3/5/2004 Bioinformatics in Computer Science 20 Reconciling Networks Munnik and Meijer, FEBS Letters, 2001 Shinozaki and Yamaguchi- Shinozaki, Current Opinion in Plant Biology, 2000

3/5/2004 Bioinformatics in Computer Science 21 Multimodal Networks Nodes and edges have flexible semantics to represent: - Time - Uncertainty - Cellular decision making; process regulation - Cell topology and compartmentalization - Rate constants - Phylogeny Hierarchical

3/5/2004 Bioinformatics in Computer Science 22 Using Multimodal Networks Help biologists find new biological knowledge Visualize and explore Generating hypotheses and experiments Predict regulatory phenomena Predict responses to stress Incorporate into Expresso as part of closing the loop

3/5/2004 Bioinformatics in Computer Science 23 Conclusions Engaged faculty with the right expertise Numerous life science collaborations Federal research funding First-class computational resources A variety of cutting-edge bioinformatics research projects

3/5/2004 Bioinformatics in Computer Science 24 Bioinformatics Education Courses in Computer Science Courses in the Life Sciences Bioinformatics Option Doctoral Program in Genetics, Bioinformatics, and Computational Biology

3/5/2004 Bioinformatics in Computer Science 25 Doctoral Program in Genetics, Bioinformatics, and Computational Biology Multidisciplinary: biology, biochemistry, crop science, plant physiology, computer science, mathematics, statistics, veterinary medicine

3/5/2004 Bioinformatics in Computer Science 26 Anantham Cluster Previous cluster specs 200 AMD 1 GHz processors 1 GB RAM per processor 2 TB disk space 2.56 Gb/s Myrinet network Previous 200 processor cluster