1 Department of Cancer Research and Molecular Medicine Norwegian University of Science and Technology Trondheim, Norway Gastrointestinal systems biology.

Slides:



Advertisements
Similar presentations
Gene regulation in biological responses:
Advertisements

13:10:58 A New Tool for Mapping Microarray Data onto the Gene Ontology Structure ( Abstract e GOn (explore Gene Ontology) is a.
Biological pathway and systems analysis An introduction.
Microarray Data Analysis Day 2
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Gene Ontology John Pinney
Bioinformatics “Other techniques raise more questions than they answer. Bioinformatics is what answers the questions those techniques generate.” SheAvery
Learning rule-based models from gene expression time profiles annotated with Gene Ontology terms Jan Komorowski and Astrid Lägreid.
Bioinformatics at IU - Ketan Mane. Bioinformatics at IU What is Bioinformatics? Bioinformatics is the study of the inherent structure of biological information.
Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis Jonsson.
Genomic Signal Processing: Ensemble Dependence Model for Classification and Prediction of Cancer Based on Gene Expression Data Joseph DePasquale Engineering.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
COG and GO tutorial.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Functional annotation and network reconstruction through cross-platform integration of microarray data X. J. Zhou et al
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Gene Ontology and Functional Enrichment Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Pathways Database System: An Integrated System For Biological Pathways L. Krishnamurthy, J. Nadeau, G. Ozsoyoglu, M. Ozsoyoglu, G. Schaeffer, M. Tasan.
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Comprehensive Gene Expression Analysis of Prostate Cancer Reveals Distinct Transcriptional Programs Associated With Metastatic Disease Kevin Paiz-Ramirez.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Malignant Melanoma and CDKN2A
Genome of the week - Deinococcus radiodurans Highly resistant to DNA damage –Most radiation resistant organism known Multiple genetic elements –2 chromosomes,
Knowledgebase Creation & Systems Biology: A new prospect in discovery informatics S.Shriram, Siri Technologies (Cytogenomics), Bangalore S.Shriram, Siri.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Automated Explanation of Gene-Gene Relationships Wacek Kuśnierczyk.
Bioinformatics and medicine: Are we meeting the challenge?
Introduction to Bioinformatics Spring 2002 Adapted from Irit Orr Course at WIS.
1 Bio-Trac 40 (Protein Bioinformatics) October 8, 2009 Zhang-Zhi Hu, M.D. Associate Professor Department of Oncology Department of Biochemistry and Molecular.
Reconstruction of Transcriptional Regulatory Networks
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
Agent-based methods for translational cancer multilevel modelling Sylvia Nagl PhD Cancer Systems Science & Biomedical Informatics UCL Cancer Institute.
Cell Signaling Ontology Takako Takai-Igarashi and Toshihisa Takagi Human Genome Center, Institute of Medical Science, University of Tokyo.
Construction of cancer pathways for personalized medicine | Presented By Date Construction of cancer pathways for personalized medicine Predictive, Preventive.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
PaLS: Pathways and Literature Strainer Filtering common literature, ontology terms and pathway information. Andrés Cañada Pallarés Instituto Nacional de.
Analysis and Management of Microarray Data Previous Workshops –Computer Aided Drug Design –Public Domain Resources in Biology –Application of Computer.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Respective contributions of MIAME, GeneOntology and UMLS for transcriptome analysis Fouzia Moussouni, Anita Burgun, Franck Le Duff, Emilie Guérin, Olivier.
Cellular macromolecule catabolism cellular macromolecule metabolism cytoplasm organization and biogenesis establishment of cellular localization intracellular.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
Central dogma: the story of life RNA DNA Protein.
Clustering Algorithms to make sense of Microarray data: Systems Analyses in Biology Doug Welsh and Brian Davis BioQuest Workshop Beloit Wisconsin, June.
Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.
An overview of Bioinformatics. Cell and Central Dogma.
A collaborative tool for sequence annotation. Contact:
Bioinformatics and Computational Biology
A Report on CAMDA’01 Biointelligence Lab School of Computer Science and Engineering Seoul National University Kyu-Baek Hwang and Jeong-Ho Chang.
A literature network of human genes for high-throughput analysis of gene expression Speaker : Shih-Te, YangShih-Te, Yang Advisor : Ueng-Cheng, YangUeng-Cheng,
1 Bioinformatics at Norwegian University of Science and Technology Professor Finn Drabløs Department of Cancer Research and Molecular Medicine Finn Drabløs.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Effect of Alcohol on Brain Development NormalFetal Alcohol Syndrome.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
(1) Genotype-Tissue Expression (GTEx) Largest systematic study of genetic regulation in multiple tissues to date 53 tissues, 500+ donors, 9K samples, 180M.
1 Survey of Biodata Analysis from a Data Mining Perspective Peter Bajcsy Jiawei Han Lei Liu Jiong Yang.
Uncovering the Protein Tyrosine Phosphatome in Cattle
An Artificial Intelligence Approach to Precision Oncology
Dept of Biomedical Informatics University of Pittsburgh
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
9 Future Challenges for Bioinformatics
Lecture 7: Biological Network Crosstalk Y. Z
Fouzia Moussouni, Anita Burgun, Franck Le Duff,
Loyola Marymount University
Loyola Marymount University
Loyola Marymount University
Loyola Marymount University
Presentation transcript:

1 Department of Cancer Research and Molecular Medicine Norwegian University of Science and Technology Trondheim, Norway Gastrointestinal systems biology and the challenge of integrating biological background knowledge Astrid Lægreid

2 biomedical problem data analysis modeling hypothesis generation screening genome-wide system-wide hypothesis testing high throughput/ single gene Systems biology of the normal and diseased gastrointestinal system

3 biomedical problem gastric carcinoma gastric carcinoids gastrin-mediated gene expression modulation by CREB/CREM/ATF Systems biology of the normal and diseased gastrointestinal system hypergastrinemia

4 biomedical problem Systems biology of the normal and diseased gastrointestinal systemgastrin samples chronic transient hours gastrin-mediated gene expression

5 biomedical problem gastric carcinoma gastric carcinoids gastrin-mediated gene expression modulation by CREB/CREM/ATF Systems biology of the normal and diseased gastrointestinal system hypergastrinemia regulators effectors diagnosis prognosis treatment

6 Systems biology of the normal and diseased gastrointestinal system model for classification of gastric cancer modeling gene function with rule based methods modeling signal transduction from first principles retrieving information from databases, literature and ontologies modeling biological background information using case based reasoning data analysis modeling hypothesis generation

7 Model for classification of gastric cancer Nørsett K, Lægreid A, Midelfart H, Yadetie Y, Falkmer S, Grønbeck J, Waldum HL, Komorowski J, Sandvik AK. Gene Expression based Classification of Gastric Carcinoma. Cancer Letters, 210: , 2004 Herman Midelfart, Jan Komorowski, Kristin Nørsett, Fekadu Yadetie, Arne K. Sandvik and Astrid Lægreid. Learning Rough Set Classifiers from Gene Expression and Clinical Data. Fundamenta Informaticae. 53: , 2002.

8 Use of Gene Ontology in models predicting gene function from gene expression time profiles The Transcriptional Program in the Response of Human Fibroblasts to Serum. Iyer et al, Science, 283: 83, 1999 ~ 500 genes ~ 200 with unknown function

(Increasing) AND (Decreasing) AND (Constant) => GO (cell proliferation) 1. Mine functional classes from an ontology 2. Extract features for learning 3. Induce minimal decision rules using rough sets 4. Predict the function of unknown genes using the rules

10 w the model w incorporates background biological knowledge w predicts multiple functions per gene w can provide possible new functions of the known genes w can provide hypotheses about the function of unknown genes w experimental work needs to be done to confirm our predictions Lægreid A, Hvidsten T, Midelfart H, Komorowski J, Sandvik AK. Predicting Gene Ontology Biological Process from Temporal Gene Expression Patterns. Genome Research. 13: , 2003 Hvidsten TR, Lægreid A, Komorowski J. Learning rule-based models from gene expression time profiles annotated using Gene Ontology. Bioinformatics, 19: , 2003 conclusions

11 how to improve models for prediction of biological roles of genes/proteins? wmore genes/proteins wmore measurements per gene/protein (time points, cell types, tissues, states,...) wmore annotations (GO, sequence, protein structure, cell biology, physiology, pathology,…)  more training examples challenges  improved modeling strategies wimproved strategies to combine data from many different domains

12 Modeling Signal Transduction from first principles

13 Modeling Signal Transduction from first principles Gunnar Tufte. Development of Digital Circuits on a Virtual Sblock FPGA. PhD thesis. NTNU Towards Evolvable Hardware Modeling Search for models Evaluate/run model

14 how to improve signal transduction models? wmore genes/proteins wmore annotations (GO, sequence, protein structure, cell biology, physiology, pathology,…) wmore measurements per gene/protein (time points, cell types, tissues, states,...)  more signal transduction data challenges  improved modeling strategies

15 Systems biology of the normal and diseased gastrointestinal system model for classification of gastric cancer modeling gene function with rule based methods modeling signal transduction from first principles retrieving information from databases, literature and ontologies data analysis modeling hypothesis generation

16 Local Gene AnnotationDatabase AnnotationDatabase GeneOntologyGeneOntology LocusLinkLocusLinkUniGeneUniGene StatisticaltestsStatisticaltests Editable GO tree Editable File export InputDatabase Applicat ion UniGene Cluster ID`s UniGene GenBank Acc. Nr. GenBank Clone ID`s Homolo-GeneHomolo-GeneSwissProtSwissProt Output NMCAnnotationDatabaseNMCAnnotationDatabase e GOn GeneAnnotationsGeneAnnotations File export Retrieval and management of biological background information from public databases Local Gene AnnotationDatabase AnnotationDatabase GeneOntologyGeneOntology EntrezGeneEntrezGeneUniGeneUniGene StatisticaltestsStatisticaltests Editable GO tree Editable File export InputDatabaseApplication AffymetrixIDsAffymetrixIDs Clone IDs Homolo-GeneHomolo-GeneSwissProtSwissProt Output NMCAnnotationDatabaseNMCAnnotationDatabase e GOn GeneAnnotationsGeneAnnotations File export GenBank Acc. Nr. GenBank UniGene Cluster IDs UniGene

17 Local Gene AnnotationDatabase AnnotationDatabase GeneOntologyGeneOntology LocusLinkLocusLinkUniGeneUniGene StatisticaltestsStatisticaltests Editable GO tree Editable File export InputDatabase Applicat ion UniGene Cluster ID`s UniGene GenBank Acc. Nr. GenBank Clone ID`s Homolo-GeneHomolo-GeneSwissProtSwissProt Output NMCAnnotationDatabaseNMCAnnotationDatabase e GOn GeneAnnotationsGeneAnnotations File export Retrieval and management of biological background information from public databases Local Gene AnnotationDatabase AnnotationDatabase GeneOntologyGeneOntology EntrezGeneEntrezGeneUniGeneUniGene StatisticaltestsStatisticaltests Editable GO tree Editable File export InputDatabaseApplication AffymetrixIDsAffymetrixIDs Clone IDs Homolo-GeneHomolo-GeneSwissProtSwissProt Output NMCAnnotationDatabaseNMCAnnotationDatabase e GOn GeneAnnotationsGeneAnnotations File export GenBank Acc. Nr. GenBank UniGene Cluster IDs UniGene NMC

18 Retrieval of biological background information from unstructured sources using natural language processing Tveit A, Sætre R, Steigedal TS and Lægreid A. ProtChew: Automatic Extraction of Protein Names from Biomedical Literature. Proceedings of the International Workshop on Biomedical Data Engineering, BMDE2005, Tokyo Sætre R, Tveit A, Steigedal TS and Lægreid A. Semantic Annotation of Biomedical Literature using Google. Proceedings of the First International Workshop on Datamining and Bioninformatics, DMBIO2005, Singapore

19 Retrieval of biological background information from unstructured sources using natural language processing example output

20 Retrieval of biological background information from unstructured sources using natural language processing example output

21 Retrieval of biological background information from unstructured sources using natural language processing example output

22 Retrieval of biological background information from unstructured sources using natural language processing example output

23 New Gene Ontology structures for improved biological reasoning Original Gene Ontology MOLECULAR FUNCTION transcription regulator activity transcription termination factor activity RNA polymerase I transcription termination factor activity RNA polymerase II transcription termination factor activity RNA polymerase III transcription termination factor activity transcription termination from RNA polymerase I promoter transcription termination from RNA polymerase II promoter transcription termination from RNA polymerase III promoter transcription from RNA polymerase I promoter transcription from RNA polymerase II promoter transcription from RNA polymerase III promoter transcription, DNA dependent nucleobase, nucleoside, nucleotide and nucl e ic acid metabolism transcription cellular metabolism cellular physiological process cellular process BIOLOGICAL PROCESS intracellular membrane-bound organelle nucleus intracellular organelle intracellular cell CELLULAR COMPONENT GENE ONTOLOGY

24 New Gene Ontology structures for improved biological reasoning The Second Gene Ontology Layer offers relations between molecular function - biological process molecular function - cellular component (MOLECULAR FUNCTION) RNA polymerase I transcription termination factor activity RNA polymerase II transcription termination factor activity RNA polymerase III transcription termination factor activity transcription termination from RNA polymerase I promoter transcription termination from RNA polymerase II promoter transcription termination from RNA polymerase III promoter (BIOLOGICAL PROCESS ) nucleus (CELLULAR COMPONENT)

25 Second Layer contributes to a more complete and powerful biological knowledge network suitable for e.g. automatic reasoning. MOLECULAR FUNCTION transcription regulator activity transcription termination factor activity RNA polymerase I transcription termination factor activity RNA polymerase II transcription termination factor activity RNA polymerase III transcription termination factor activity transcription termination from RNA polymerase I promoter transcription termination from RNA polymerase II promoter transcription termination from RNA polymerase III promoter transcription from RNA polymerase I promoter transcription from RNA polymerase II promoter transcription from RNA polymerase III promoter transcription, DNA dependent nucleobase, nucleoside, nucleotide and nucleic acid metabolism transcription cellular metabolism cellular physiological process cellular process BIOLOGICAL PROCESS intracellular membrane-bound organelle nucleus intracellular organelle intracellular cell CELLULAR COMPONENT New Gene Ontology structures for improved biological reasoning

26 data set: 4223 genes differentially expressed in fibroblast serum response 7589 publicly available molecular function annotations new annotations biological process: 2623 cellular component:: 2112 Second Layer New Gene Ontology structures for improved biological reasoning (MOLECULAR FUNCTION) RNA polymerase I transcription termination factor activity RNA polymerase II transcription termination factor activity RNA polymerase III transcription termination factor activity transcription termination from RNA polymerase I promoter transcription termination from RNA polymerase II promoter transcription termination from RNA polymerase III promoter (BIOLOGICAL PROCESS ) nucleus (CELLULAR COMPONENT) Second Gene Ontology Layer complements existing annotation sets

27 New Gene Ontology structures for improved biological reasoning Second Gene Ontology Layer complements existing annotation sets

28 how to improve models for prediction of biological roles of genes/proteins? wmore genes/proteins wmore measurements per gene/protein (time points, cell types, tissues, states,...) wmore annotations (GO, sequence, protein structure, cell biology, physiology, pathology,…)  more training examples challenges  improved modeling strategies wimproved strategies to combine data from many different domains

29 Modeling biological backgound information using case based reasoning Kusnierczyk W, Aamodt A, Lægreid A. Knowledge-Based Support for Smart Pathway Building Tools. Proceedings of the International Conference on Case- Based Reasoning 2005, In Press

30 Modeling biological backgound information using case based reasoning Kusnierczyk W, Aamodt A, Lægreid A. Knowledge-Based Support for Smart Pathway Building Tools. Proceedings of the International Conference on Case-Based Reasoning 2005, In Press PathwayAssist networks can be used to generate cases for the case-based reasoning (CBR) tool

31 Arne K. Sandvik Helge L. Waldum Liv Thommesen Torunn Bruland Ola Ween Fekadu Yadetie Eva Hofsli Kristin Nørsett Kristine Misund Tonje S. Steigedal Norwegian Microarray Consortium Vidar Beisvåg Berit Doseth Eitrem Hallgeir Bergum Frode Jünge Lars Jølsum Vladimir Yankovski Jan Komorowski Torgeir Hvidsten Herman Midelfart Mette Langaas Turid Follestad Bjørn Alsberg Arnar Flatberg Lars Giskehaug Einar Ryeng Agnar Aamodt Waclaw Kusnierczyk Kjell Bratbergsengen Heri Ramampiaro Yan Hua Chen Tore Amble Rune Sætre Torulf Mollestad Henrik Tveit Simen Myhre Sonja Ylving Berit Johansen Katarina Jørgensen Pauline Haddow Gunnar Tufte