Department of Plant Systems Biology Research at the Bioinformatics & Computational Biology research groups.

2. Department of Plant Systems Biology (Yvan Saeys, Donostia 2004)
Headed by Prof. Dirk Inzé
– 203 people (179 research staff, 24 technical/administrative staff)
6 Research Divisions
– Biology (146)
    Molecular Genetics Division (87)
    Functional Genomics Division (19)
    Plant-Microbe Division (19)
    Genome Dynamics and Gene Regulation Division (19)
– (Bio)Informatics (33)
    Bioinformatics and Evolutionary Genomics Division (24)
    Computational Biology Division (9)

3. “Computational” research groups
Bioinformatics and Evolutionary Genomics (BEG)
– Mainly deal with sequence data
    Comparative Genomics (Yves Van de Peer)
    Gene prediction & Annotation (Pierre Rouzé)
Computational Biology Division (CBD)
– Explore biological systems (networks)
    Headed by Martin Kuiper

4. Group Leaders
Dr. Martin Kuiper, Prof. Yves Van de Peer, Dr. Pierre Rouzé

5. Research activities
– Comparative Genomics
– Gene Prediction & Genome Annotation
– Annotation of genomes
– Machine Learning
– Ancient large-scale gene duplications
– Functional divergence of duplicated genes
– Promoters and regulatory elements
– Transcription factors
– Bacterial comparative genomics
– Non coding RNAs
– Gene network modelling
– Heterosis

6. Ancient large-scale gene duplications
Investigate major events during the evolutionary past of genomes:
– Large-scale gene duplications
– Genome duplications
Research
– Algorithms to detect colinear regions
– Compare intra and inter species
– Arabidopsis: 3 whole genome duplications
– Comparisons between Arabidopsis and Rice
– Duplications in vertebrate genomes
Klaas Vandepoele, Cedric Simillion
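
The idea of detecting colinear regions can be illustrated with a minimal sketch. It is not the algorithm used by the group's tools: it assumes each genomic segment is given as a list of gene-family labels in chromosomal order (a hypothetical input format), and it only finds strictly consecutive, same-orientation runs, whereas real detection methods typically tolerate gaps and inversions. The function name `colinear_runs` and the `min_len` parameter are illustrative.

```python
def colinear_runs(seg_a, seg_b, min_len=3):
    """Return runs of homologous gene pairs that keep the same order in both segments.

    seg_a, seg_b: lists of gene-family labels in chromosomal order (assumed format).
    A run is a list of (i, j) index pairs with seg_a[i] == seg_b[j], where each
    pair advances by exactly one gene in both segments.
    """
    # Index every position of each family in the second segment.
    positions_b = {}
    for j, fam in enumerate(seg_b):
        positions_b.setdefault(fam, []).append(j)

    # All homologous pairs: the "dots" of a gene homology matrix.
    dots = {(i, j) for i, fam in enumerate(seg_a) for j in positions_b.get(fam, [])}

    runs = []
    for (i, j) in sorted(dots):
        if (i - 1, j - 1) in dots:
            continue  # this dot continues an earlier diagonal, not a run start
        run = [(i, j)]
        while (run[-1][0] + 1, run[-1][1] + 1) in dots:
            run.append((run[-1][0] + 1, run[-1][1] + 1))
        if len(run) >= min_len:
            runs.append(run)
    return runs


if __name__ == "__main__":
    a = ["f1", "f2", "f3", "f4", "f9"]
    b = ["f7", "f1", "f2", "f3", "f4"]
    print(colinear_runs(a, b))
```

Passing the same segment list twice with different chromosomes of one genome gives an intra-species comparison; passing segments from two genomes gives an inter-species one.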

7. Large-scale duplications
[Figure labels: synteny; colinearity; ancient duplication; recent duplication; HsaC1; HsaC9; C2; C4]

8. Ancient large-scale gene duplications
Building genomic profiles
[Figure labels: segments A, B and C; “Not significant!”]

9. Functional divergence of duplicated genes
Duplications stimulate biological novelties
– Investigate what happens to duplicated genes
– Study of models for gene evolution
– Genes are not individual entities, but members of gene families
Research
– Up to 65% of the genes in Arabidopsis belong to a gene family
– Divergence at the regulatory/expression level
– Divergence at the coding level
Tine Casneuf, Jeroen Raes

10. Functional divergence of duplicated genes

11. Bacterial comparative genomics
Investigation of multiple bacterial genomes
– Genomes evolve over time, changing in subtle or radical ways, constantly adapting to the surrounding environment
– Genomes can evolve gradually through vertical transmission of mutations, gene duplications, deletions, and rearrangements
– Alternatively, they can evolve more suddenly and sporadically via horizontal transfer of genetic information between different microbial species
Research
– Assess the contribution of gene duplications to genome evolution in prokaryotes
Dirk Gevers

12. Bacterial comparative genomics
Functional Landscape of the Paranome (FLOP): linking functional information to the paranome information
Allows us to determine whether paralog retention is biased towards specific functional classes for each of the bacterial strains
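
One common way to ask whether paralogs are over-represented in a functional class is a one-sided hypergeometric test. The slide does not say which statistic FLOP uses, so the sketch below is an assumption for illustration only; the function name and its four count parameters are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(total_genes, class_genes, paralogs, class_paralogs):
    """One-sided hypergeometric p-value for over-representation.

    Probability of seeing at least `class_paralogs` genes of a functional class
    among `paralogs` genes drawn without replacement from a genome of
    `total_genes` genes, `class_genes` of which belong to that class.
    """
    upper = min(class_genes, paralogs)
    denom = comb(total_genes, paralogs)
    tail = sum(
        comb(class_genes, k) * comb(total_genes - class_genes, paralogs - k)
        for k in range(class_paralogs, upper + 1)
    )
    return tail / denom


if __name__ == "__main__":
    # Toy counts, not real data: 10 genes, 5 in the class, 5 paralogs, all 5 in the class.
    print(hypergeom_enrichment_p(10, 5, 5, 5))
```

Running such a test once per functional class and per strain would require a multiple-testing correction, which is left out here.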

13. Transcription factors
Towards a better understanding of the link between evolution and development (evo-devo)
– Transcription factors play a major role in the regulation of gene expression
– Study the evolutionary and functional divergence of genes belonging to large transcription factor gene families
Research
– Structural and phylogenetic analyses of the MADS-box gene family
– Comprehensive view on the regulatory role of MADS-box genes in plant development
– Phylogenetic footprinting
Stefanie De Bodt

14. Transcription factors

15. Genome Annotation
Structural annotation of genes/genomes
– Locate genes in genomes
– Find the exact gene structures
– Investigation of particular gene families
Research
– Development of an automatic annotation platform that can be applied to different genomes
– Genomes: Arabidopsis, Poplar, Medicago, Ostreococcus tauri
Stephane Rombauts, Lieven Sterck, Steven Robbens

16. Genome Annotation platform
[Diagram: RepeatMasker, coding potential search, SplicePredictor, Netstart, NetGene2, Blastn and Blastx (intrinsic and extrinsic approaches) feed into EuGene, which outputs predicted genes (structural annotation)]

17. Dataset construction for Poplar
[Flowchart labels, in extraction order:]
– Let EuGene make prediction based on extrinsic data
– EuGene framework. Extrinsic approaches: Blastn, RepeatMasker, Blastx. Intrinsic approaches: IMM; Splicing: WAM; Start: const
– Blast against Arabidopsis proteins with full length, discard cDNAs that have no hit
– Training set of mapped cDNAs
– Poplar IMM, SpliceMachine, Start prediction
– Select predicted genes covered by FL cDNA
– Final prediction of EuGene

18. Annotation of core cell cycle genes in Ostreococcus tauri
The CDK gene family

19. Machine Learning (applied to genome annotation)
Computational techniques to identify structural elements
– Supervised classification methods
– Support Vector Machines
– Feature selection for knowledge extraction
Research
– New splice site prediction models
– New feature selection techniques for gene prediction
– Leads to more accurate gene models
Sven Degroeve, Yvan Saeys
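
To make the notion of feature selection concrete, here is a minimal filter-style sketch. It is not the group's method and it is not an SVM: it assumes candidate splice sites are given as equal-length DNA windows already labelled as true or false sites (hypothetical toy data), encodes each window as (position, nucleotide) features, and ranks features by how differently they occur in the two classes. All function names are illustrative.

```python
def one_hot_features(window):
    """Encode a DNA window as a set of (position, nucleotide) features."""
    return {(pos, nt) for pos, nt in enumerate(window.upper())}


def rank_features(positives, negatives):
    """Rank features by the absolute difference in their frequency between
    true-site windows and false-site windows (a simple filter criterion)."""

    def freq(windows):
        counts = {}
        for w in windows:
            for f in one_hot_features(w):
                counts[f] = counts.get(f, 0) + 1
        return {f: c / len(windows) for f, c in counts.items()}

    fp, fn = freq(positives), freq(negatives)
    scores = {f: abs(fp.get(f, 0.0) - fn.get(f, 0.0)) for f in set(fp) | set(fn)}
    # Highest score first; ties broken by feature for a stable order.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


if __name__ == "__main__":
    # Toy windows: two upstream bases followed by the AG dinucleotide.
    true_sites = ["TTAG", "CTAG", "TCAG"]
    false_sites = ["GAAG", "ACAG", "AGAG"]
    for feature, score in rank_features(true_sites, false_sites)[:3]:
        print(feature, round(score, 3))
```

In this toy set the AG positions score zero because every candidate shares them, so only the flanking positions carry discriminative signal. A ranked list like this could then be used to pick a reduced feature set for a supervised classifier.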

20. Splice Machine

21. Feature selection for acceptor prediction

22. Promoter prediction
Computational identification of promoter regions
– Signal elements
– Structural features
– Still many false positives
Research
– Develop new tools and approaches for the automatic delineation of promoters
– Motif detection
– Detecting cis-regulatory elements
– Phylogenetic footprinting
Kobe Florquin

23. Promoter prediction

24. Non coding RNAs
Many RNA molecules are not protein coding but instead function through their RNA form
– Known for a long time: transfer RNAs (tRNA), ribosomal RNAs (rRNA)
– Only recently discovered: small interfering RNAs (siRNA), micro RNAs (miRNA), …
– Regulate gene expression at the post-transcriptional level
Research
– Developing different computational tools and techniques to detect and characterize non-coding RNAs in Arabidopsis and other plant genomes
Jan Wuyts, Eric Bonnet

25. Non coding RNAs: MIRfinder

26. Comparison between plant species

27. Genetic networks
Integrate functional genomics data of all types in a global network that reflects the regulatory wiring and modularity of an organism
– Micro-array data from perturbation experiments
– Leaf development
Research
– Novel methods, based on combinatorial statistics and graph theory
– Unsupervised classification techniques (k-core clustering, Kohonen maps)
Steven Maere, Steven Vercruysse
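
The graph-theoretic notion behind k-core clustering can be sketched as follows. This shows only the standard k-core definition (iteratively prune nodes with fewer than k neighbours), not the group's full clustering procedure; the adjacency format, a symmetric dict mapping each gene to its set of neighbours, is an assumption for the example.

```python
def k_core(adjacency, k):
    """Return the nodes of the k-core of an undirected graph.

    adjacency: dict mapping each node to the set of its neighbours
    (assumed symmetric). The k-core is the maximal subgraph in which
    every node has at least k neighbours inside the subgraph.
    """
    alive = {node: set(neigh) for node, neigh in adjacency.items()}
    changed = True
    while changed:
        changed = False
        # Collect nodes that currently violate the degree requirement.
        for node in [n for n, neigh in alive.items() if len(neigh) < k]:
            for other in alive.pop(node):
                if other in alive:
                    alive[other].discard(node)
            changed = True
    return set(alive)


if __name__ == "__main__":
    # Toy co-expression graph: a triangle a-b-c plus a pendant gene d.
    graph = {
        "a": {"b", "c", "d"},
        "b": {"a", "c"},
        "c": {"a", "b"},
        "d": {"a"},
    }
    print(sorted(k_core(graph, 2)))
```

In a gene network built from significant profile similarities, the higher-k cores correspond to densely connected groups of genes.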

28. Genetic networks
[Figure labels: gene profiles; experiments; comb. p-value < 0.01; k-core clustering; GO labeling & visualization]

29. Genetic networks
Hierarchical clustering, self-organizing map, many other algorithms…
Goal: getting information about:
- Protein function (same profile => same biol. process?)
- Regulatory interactions

30. Heterosis
Modeling of “hybrid vigour”
– Improved performance of F1 hybrids with respect to the parents
– Dominance Model
– Over-dominance Model
– Epistatic Model
– Biometrics versus soft-computing approach
Research
– Additive versus dominance effects
– Estimation of the molecular phenotype of the hybrid
Jeroen Meeus, Elena Tsiporkova
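
The quantities behind "additive versus dominance effects" can be made concrete with textbook quantitative-genetics definitions for a single trait. The sketch below is not taken from the slides: it assumes one measured value per parent and per F1 hybrid, and that a higher trait value counts as better when picking the best parent. The function name and dictionary keys are illustrative.

```python
def heterosis_summary(parent1, parent2, hybrid):
    """Summarise hybrid performance relative to its two parents for one trait.

    Assumes positive trait values and that a higher value is better.
    """
    mid_parent = (parent1 + parent2) / 2
    best_parent = max(parent1, parent2)
    return {
        # Half the difference between the parents.
        "additive_effect": abs(parent1 - parent2) / 2,
        # Deviation of the hybrid from the mid-parent value.
        "dominance_deviation": hybrid - mid_parent,
        "mid_parent_heterosis_pct": 100 * (hybrid - mid_parent) / mid_parent,
        "best_parent_heterosis_pct": 100 * (hybrid - best_parent) / best_parent,
    }


if __name__ == "__main__":
    # Toy values, not real measurements (for example biomass in arbitrary units).
    for name, value in heterosis_summary(10.0, 14.0, 15.0).items():
        print(name, round(value, 2))
```

A hybrid that exceeds its better parent, as in the toy values, is the pattern the over-dominance model tries to explain; a hybrid between the mid-parent and the better parent is compatible with the dominance model.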

31. Heterosis: Biometrics Approach
[Diagram: Molecular Phenotypes (genes for 10 parents and 45 hybrids) and Morphological Phenotypes (biomass, leaf size, … for 10 parents and 45 hybrids; heterotic / non-heterotic)]
– Step 1: correlation hybrid-parents
– Step 2: correlation morphological-molecular phenotypes
– Step 3: prediction

32. Heterosis: Soft-Computing Approach
[Diagram: Molecular Phenotypes (genes for 10 parents and 45 hybrids) and Morphological Phenotypes (biomass, leaf size, … for 10 parents and 45 hybrids; heterotic / non-heterotic); labels: direct classification, simulation, association]

33. Databases
– European ribosomal RNA database /
– European Plant Promoter database (PlantCARE) PlantCARE/index.html
– European Federated Plant Database Network (Planet)
Software
– Tree construction: TreeCon
– Tools: ForCon, SPADS, ZT, AFLPinSilico
– Large-scale duplications: Adhore, i-Adhore, ASaturA
Website
Francis Dierick: databases, webmaster, support
Gert Sclep: CATMA and CAGE databases

34. “Part-time” PhD students
– Guy Baele: Modelling the covarion hypothesis
– Dirk Vandycke: Extrinsic gene prediction approaches
Secretary
– Ann Bostyn

35. Thanks to…