Fouzia Moussouni, Anita Burgun, Franck Le Duff,

Slides:



Advertisements
Similar presentations
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Advertisements

CAVEAT 1 MICROARRAY EXPERIMENTS ARE EXPENSIVE AND COMPLICATED. MICROARRAY EXPERIMENTS ARE THE STARTING POINT FOR RESEARCH. MICROARRAY EXPERIMENTS CANNOT.
Gene Ontology John Pinney
Genetics and Health Jennifer Eyvindson Epi 6181 November 2006.
Experimental pathology refers to the observation of the effects of manipulations on animal models or cell cultures regarding researches on human diseases.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Genetics and the Organism 10 Jan, Genetics Experimental science of heredity Grew out of need of plant and animal breeders for greater understanding.
Introduction of Cancer Molecular Epidemiology Zuo-Feng Zhang, MD, PhD University of California Los Angeles.
Chromosomes carry genetic information
More regulating gene expression. Fig 16.1 Gene Expression is controlled at all of these steps: DNA packaging Transcription RNA processing and transport.
Express yourself That darn ribosome Mighty Mighty Proteins Mutants RNA to the Rescue
Unit 1: The Language of Science  communicate and apply scientific information extracted from various sources (3.B)  evaluate models according to their.
AP Biology Control of Eukaryotic Genes.
DNA microarrays Each spot contains a picomole of a DNA ( moles) sequence.
1 Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 25 Image Slides.
Compare and contrast prokaryotic and eukaryotic cells.[BIO.4A] October 2014Secondary Science - Biology.
RNA and Protein Synthesis
RNA and Protein Synthesis
More regulating gene expression. Combinations of 3 nucleotides code for each 1 amino acid in a protein. We looked at the mechanisms of gene expression,
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Modeling of complex systems: what is relevant? Arno Knobbe, Marvin Meeng, Joost Kok Leiden Institute of Advanced Computer Science (LIACS)
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
INTRODUCTION TO PLANT MOLECULAR GENETICS. Genetics The study of heredity The study of heredity The study of how differences between individuals are transmitted.
Protein Synthesis. Learning Objectives By the end of this class you should understand: The purpose and mechanism of codons The two steps of protein synthesis.
Respective contributions of MIAME, GeneOntology and UMLS for transcriptome analysis Fouzia Moussouni, Anita Burgun, Franck Le Duff, Emilie Guérin, Olivier.
Proteomics Session 1 Introduction. Some basic concepts in biology and biochemistry.
Central dogma: the story of life RNA DNA Protein.
Complexities of Gene Expression Cells have regulated, complex systems –Not all genes are expressed in every cell –Many genes are not expressed all of.
Bioinformatics and Computational Biology
Genetics Review Honors Human Anatomy & Physiology Mr. Mazza
RNA Makin’ Proteins DNAMutations Show off those Genes!
Genes in ActionSection 2 Section 2: Regulating Gene Expression Preview Bellringer Key Ideas Complexities of Gene Regulation Gene Regulation in Prokaryotes.
Faculdade de Medicina da Universidade de Coimbra Curso de Medicina 1º Ano Ano lectivo 2009/2010.
KEY CONCEPT 8.5 Translation converts an mRNA message into a polypeptide, or protein.
KEY CONCEPT Gene expression is carefully regulated in both prokaryotic and eukaryotic cells. Chapter 11 – Gene Expression.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
DNA Microarray. Microarray Printing 96-well-plate (PCR Products) 384-well print-plate Microarray.
SC.912.L.16.3 DNA Replication. – During DNA replication, a double-stranded DNA molecule divides into two single strands. New nucleotides bond to each.
How to Use This Presentation
High-throughput data used in bioinformatics
Molecular Genetics Transcription & Translation
Lesson Four Structure of a Gene.
Variation among organisms
Lesson Four Structure of a Gene.
Transcription Translation
Concepts and Applications Eighth Edition
Biomedical Therapies Foundation Standard 1: Academic Foundation
Functional Genomics in Evolutionary Research
 The human genome contains approximately genes.  At any given moment, each of our cells has some combination of these genes turned on & others.
RNA and Protein Synthesis
Human Cells Genes and proteins in health and disease
more regulating gene expression
MICROBIAL GENETICS CHAPTER 7.
Eukaryotic Genome Control Mechanisms for Gene expression
Genome organization and Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Eukaryotic Genome Control Mechanisms for Gene expression
Complete Station Race Assignment…
Introduction to Bioinformatic
AH Biology: Unit 1 Proteomics and Protein Structure 1
RNA and Protein Synthesis
Eukaryotic Genome Control Mechanisms for Gene expression
From Mendel to Genomics
BIO307- Bioengineering principles SPRING 2019
Introduction to Bioinformatics
Pierre Brissot, Marie-Bérengère Troadec, Olivier Loréal 
Homework #2 is due 10/18 Bonus #1 is due 10/25 Exam key is online.
So how do we get from DNA to Protein?
Presentation transcript:

Respective contributions of MIAME, GeneOntology and UMLS for transcriptome analysis Fouzia Moussouni, Anita Burgun, Franck Le Duff, Emilie Guérin, Olivier Loréal INSERM U522 and Medical Informatics Laboratory, CHU Pontchaillou Rennes, FRANCE

Transcriptome & DNA microarray study of transcriptionnal response of the cell Normal Pathologic Response to chemics or foods treatment Response to a growth factor Response to genetic disturbances

? Pathological situations studied at INSERM U522 IRON overload  DNA mutation(s) Hemochromatosis…  Chronic liver diseases Mechanisms ?  Fibrosis  Cirrhosis  Hepatocarcinoma

One may deposit thousands of genes 1 measure 1 Expression Level 1 Spot intensity Intensive data generation 1 gene but multiple facets ! Available knowledge on the expressed genes, that need to be capturized and organized. Experimental Raw Data

One gene but multiple descriptions Nucleic Sequence components - promoters, introns, exons, transcripts, regulators, … Chromosomal localization, Functional proteins and known genes products, Tissue distribution, Known gene interactions, Expression level in physiologic and pathologic conditions, Known gene variations, Clinical Implications, Literature and bibliographic data on a gene.

Need of an integrated gene expression environment (for the liver!) External Sources ? ? ? Integration Data cleaning ! Gene Expression warehouse Analysis Clinical Data experimental data Micro-arrays Substractive banks SAGE

and controlled specification BIO KNOWLEDGE Gene Expression Warehouse Standardization and controlled specification ONTOLOGY DESIGN Knowledge extraction and data exchange

Respective contributions Standardization ONTOLOGY DESIGN Respective contributions MIAME GO UMLS

MIAME Experience Array Samples Hybridization Measures MIAME will provide a standard framework to represent the minimum information that must be reported about microarray experiments : Experience Array Samples Hybridization Measures Normalisation and control Work in progress ... Minimum information about a microarray experiment (MIAME) toward standards for microarray data', A. Brazma, at al., Nature Genetics, vol 29 (December 2001), pp 365 - 371.

GeneOntology (GO) GO is an ontology for molecular biology and Genomics, But GO is not populated with : GOA gene sequences gene products, ...

UMLS The Unified Medical Language System (UMLS) is intended to help health professionals and researchers to use biomedical information from different sources.

Examples from iron metabolism are studied How pathologic disease states related to iron metabolism alteration are described in GO and UMLS ?

Iron metabolism diseases BIOLOGICAL MODEL FOR IRON METABOLISM IRON METABOLISM GENES PATHOLOGIC STATES alteration Iron metabolism diseases Other diseases hyperferritinemia cataract Other diseases hyperferritinemia cataract Iron overload aceruloplasminemia Iron deficiency

Iron overload during Aceruloplasminemia Iron overload due to a gene alteration Iron overload during Aceruloplasminemia Ceruloplasmin Gene mutation Feroxydase activity in plasma Fe2+ Fe3+ NO Iron binding with plasmatic transferrin NO THE IRON STAYS INSIDE THE CELL !!

Iron metabolism diseases BIOLOGICAL MODEL FOR IRON METABOLISM IRON METABOLISM GENES PATHOLOGIC STATES alteration Other diseases hyperferritinemia cataract Iron metabolism diseases Iron overload aceruloplasminemia Iron deficiency

A second scenario related to iron metabolism genes alteration Cataract and hyperferritinemia L_Ferritin gene mRNA mutation L_Ferritin Translation in excess IRE IRP L_Ferritin protein in excess CATARACT and HYPERFERRITINEMIA !

Biologically Active Substance UMLS view Cataract and hyperferritinemia Iron compound Metalloprotein AA, Peptide or Prorein Biologically Active Substance Ferritin Cataract L_Ferritin H_Ferritin RNAbinding Protein Iron Sulfur Prot AA, Peptide or Protein Co-occurs In Medline IRE Co-occurs In Medline (freq 26) In the UMLS, Ferritin Is a child of iron compound, and Metalloprotein Is the parent of H-ferritin and L-ferritin Is categorized (is assigned to 2 Semantic Types) Amino-Acid, Peptide or protein, and Biologically Active Substance It can be noticed that Ferritin co-occurs in Medline with Cataract, co-occurrence relationships being represented in the UMLS (COC file) Similarly, Ferritin co-occurs in Medline with RNA binding Protein, and Iron sulfur protein, which are the parents of IRP IRE is not present in the UMLS Notice that Ferritin, L-ferritin, etc are represented as Proteins, not as genes Concl: all the terms but IRE are found in the UMLS. Hierarchical relationships are usable. Co-occurrence relationships are represented in the UMLS resource, they are interesting. However, they are not semantically defined, and precise representation of the pathological process is not provided IRP

GO/ GOAnnotations view Link in GO Annotations DB Cataract and hyperferritinemia Cell component Ferritin Ligand binding Prot or carrier Ferric iron binding Ferritin Light Chain Link in GO Annotations DB Iron homeostasis Ferritin Heavy Chain Iron transport Metabolism IRE IRP In the GO, only Ferritin is represented and classified Cell component. H-ferritin and L-ferritin are not present in GO. In GO Annotations, Ferritin hight chain is associated with 5 GO terms: Ferritin, Ligand binding protein or carrier, Ferric iron binding, Iron homeostasis, Iron transport IRP is associated with 2 GO terms: Metabolism, and Hydrolyase Concl: Only few concepts are represented in GO. GOA files provide associated relationships between gene products and GO concepts. However, they are not semantically defined, and precise representation of the pathological process is not provided Hydro-lyase Cataract

Target representation Cataract and hyperferritinemia Ligand binding Prot or carrier Ferric iron binding Iron homeostasis Iron transport Ferritin Light Chain Ferritin Heavy Chain Dynamic links Modeling of biological functions IRE Hyperferritinemia Genes Mutated genes IRP Cataract

? Recapitulative UMLS GOA MIAME And more generally … Recapitulative Information on disease states, clinical treatments and followups. Normal vs. pathologic UMLS DNA Chips ? Information on biological samples, experiments and results MIAME We need precise and dynamic models to get the whole picture Information on Roles of the genes in Biological and metabolic states GOA

Gene products for Iron metabolism, as they are actually described in GO and UMLS.