Respective contributions of MIAME, GeneOntology and UMLS for transcriptome analysis Fouzia Moussouni, Anita Burgun, Franck Le Duff, Emilie Guérin, Olivier.

Slides:



Advertisements
Similar presentations
Microarray statistical validation and functional annotation
Advertisements

Molecular Biomedical Informatics Machine Learning and Bioinformatics Machine Learning & Bioinformatics 1.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
CAVEAT 1 MICROARRAY EXPERIMENTS ARE EXPENSIVE AND COMPLICATED. MICROARRAY EXPERIMENTS ARE THE STARTING POINT FOR RESEARCH. MICROARRAY EXPERIMENTS CANNOT.
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
DNA microarray example: –As a scientist, you are interested in the differences in expression patterns between normal human cells and cancer cells –A DNA.
Basic Genomic Characteristic  AIM: to collect as much general information as possible about your gene: Nucleotide sequence Databases ○ NCBI GenBank ○
Minimum Information About a Microarray Experiment - MIAME MGED 5 workshop.
Genetics and Health Jennifer Eyvindson Epi 6181 November 2006.
Experimental pathology refers to the observation of the effects of manipulations on animal models or cell cultures regarding researches on human diseases.
1 MicroArray -- Data Analysis Cecilia Hansen & Dirk Repsilber Bioinformatics - 10p, October 2001.
Gene expression analysis summary Where are we now?
Additional Powerful Molecular Techniques Synthesis of cDNA (complimentary DNA) Polymerase Chain Reaction (PCR) Microarray analysis Link to Gene Therapy.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
10 Genomics, Proteomics and Genetic Engineering. 2 Genomics and Proteomics The field of genomics deals with the DNA sequence, organization, function,
Bacterial Physiology (Micr430)
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Review of important points from the NCBI lectures. –Example slides Review the two types of microarray platforms. –Spotted arrays –Affymetrix Specific examples.
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
DNA and Chromosome Structure. Chromosomal Structure of the Genetic Material.
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Genome of the week - Deinococcus radiodurans Highly resistant to DNA damage –Most radiation resistant organism known Multiple genetic elements –2 chromosomes,
A number of slides taken/modified from:
Unit 1: The Language of Science  communicate and apply scientific information extracted from various sources (3.B)  evaluate models according to their.
Microarrays (Gene Chips) Pioneered by Pat Brown in mid 1990’s To monitor thousands of mRNAs simultaneously Comparative Northern blot on thousands of genes.
AP Biology Control of Eukaryotic Genes.
DNA microarrays Each spot contains a picomole of a DNA ( moles) sequence.
DNA MICROARRAYS WHAT ARE THEY? BEFORE WE ANSWER THAT FIRST TAKE 1 MIN TO WRITE DOWN WHAT YOU KNOW ABOUT GENE EXPRESSION THEN SHARE YOUR THOUGHTS IN GROUPS.
Bioinformatics and medicine: Are we meeting the challenge?
CDNA Microarrays MB206.
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
Data Analysis Summary. Elephant in the room General Comments General understanding that informatics is integral in medical sequencing and other –omics.
1 MIAME The MIAME website: © 2002 Norman Morrison for Manchester Bioinformatics.
Name of the topic Author, group, General information about protein (we suggest to use the websites
More regulating gene expression. Combinations of 3 nucleotides code for each 1 amino acid in a protein. We looked at the mechanisms of gene expression,
Recombinant DNA technology
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Modeling of complex systems: what is relevant? Arno Knobbe, Marvin Meeng, Joost Kok Leiden Institute of Advanced Computer Science (LIACS)
Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.
Protein Synthesis. Learning Objectives By the end of this class you should understand: The purpose and mechanism of codons The two steps of protein synthesis.
Introduction of Pathology
Proteomics Session 1 Introduction. Some basic concepts in biology and biochemistry.
Epigenetic Modifications in Crassostrea gigas Claire H. Ellis and Steven B. Roberts School of Aquatic and Fishery Sciences, University of Washington, Seattle,
Genes and Genomic Datasets. DNA compositional biases Base composition of genomes: E. coli: 25% A, 25% C, 25% G, 25% T P. falciparum (Malaria parasite):
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
Central dogma: the story of life RNA DNA Protein.
Complexities of Gene Expression Cells have regulated, complex systems –Not all genes are expressed in every cell –Many genes are not expressed all of.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
Bioinformatics and Computational Biology
Telomerase, Immortalization and Cancer Eric Bankaitis Cancer Bio 169 March 9, 2006 Fig.[9]
A Report on CAMDA’01 Biointelligence Lab School of Computer Science and Engineering Seoul National University Kyu-Baek Hwang and Jeong-Ho Chang.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Modeling the cell cycle regulation by the RB/E2F pathway Laurence Calzone Service de Bioinformatique U900 Inserm / Ecoles de Mines / Institut Curie Collaborative.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Homework #2 is due 10/17 Bonus #1 is due 10/24 Exam key is online Office hours: M 10/ :30am 2-5pm in Bio 6.
Gene Expression Profile in Proliferation and Apoptosis of Human Hepatic Stellate Cell Using Microarray 신혜원 병리학교실.
High throughput biology data management and data intensive computing drivers George Michaels.
Faculdade de Medicina da Universidade de Coimbra Curso de Medicina 1º Ano Ano lectivo 2009/2010.
Human Genomics Higher Human Biology. Learning Intentions Explain what is meant by human genomics State that bioinformatics can be used to identify DNA.
Microarray: An Introduction
KEY CONCEPT Gene expression is carefully regulated in both prokaryotic and eukaryotic cells. Chapter 11 – Gene Expression.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Biomedical Therapies Foundation Standard 1: Academic Foundation
Human Cells Genes and proteins in health and disease
more regulating gene expression
Genome organization and Bioinformatics
Fouzia Moussouni, Anita Burgun, Franck Le Duff,
Pierre Brissot, Marie-Bérengère Troadec, Olivier Loréal 
Presentation transcript:

Respective contributions of MIAME, GeneOntology and UMLS for transcriptome analysis Fouzia Moussouni, Anita Burgun, Franck Le Duff, Emilie Guérin, Olivier Loréal INSERM U522 and Medical Informatics Laboratory, CHU Pontchaillou Rennes, FRANCE

Transcriptome & DNA microarray study of transcriptionnal response of the cell Normal Pathologic Response to a growth factor Response to genetic disturbances Response to chemics or foods treatment

Pathological situations studied at INSERM U522 IRON overload  DNA mutation(s) Hemochromatosis…  Chronic liver diseases  Fibrosis  Cirrhosis  Hepatocarcinoma Mechanisms

One may deposit thousands of genes 1 gene but multiple facets ! Intensive data generation 1 measure 1 Expression Level 1 Spot intensity Available knowledge on the expressed genes, that need to be capturized and organized. Experimental Raw Data

 Nucleic Sequence components - promoters, introns, exons, transcripts, regulators, …  Chromosomal localization,  Functional proteins and known genes products,  Tissue distribution,  Known gene interactions,  Expression level in physiologic and pathologic conditions,  Known gene variations,  Clinical Implications,  Literature and bibliographic data on a gene. One gene but multiple descriptions

External Sources Analysis Gene Expression warehouse Micro-arrays Substractive banks SAGE Clinical Data ? Need of an integrated gene expression environment (for the liver!) Integration Data cleaning ! experimental data

BIO KNOWLEDGE Gene Expression Warehouse Standardization and controlled specification ONTOLOGY DESIGN Knowledge extraction and data exchange

Standardization ONTOLOGY DESIGN Respective contributions MIAME UMLS GO

MIAME MIAME will provide a standard framework to represent the minimum information that must be reported about microarray experiments : Experience Array Samples Hybridization Measures Normalisation and control Work in progress... Minimum information about a microarray experiment (MIAME) toward standards for microarray data', A. Brazma, at al., Nature Genetics, vol 29 (December 2001), pp

GO is an ontology for molecular biology and Genomics, GeneOntology (GO) But GO is not populated with : GOA è gene sequences è gene products,...

è The Unified Medical Language System (UMLS) is intended to help health professionals and researchers to use biomedical information from different sources. UMLS

è Examples from iron metabolism are studied è How pathologic disease states related to iron metabolism alteration are described in GO and UMLS ?

BIOLOGICAL MODEL FOR IRON METABOLISM Iron metabolism diseases IRON METABOLISM GENES Iron overload aceruloplasminemia Iron deficiency Other diseases hyperferritinemia cataract PATHOLOGIC STATES alteration Other diseases hyperferritinemia cataract

Iron overload due to a gene alteration Iron overload during Aceruloplasminemia mutation Feroxydase activity in plasma Fe2+ Fe3+ Iron binding with plasmatic transferrin Ceruloplasmin Gene THE IRON STAYS INSIDE THE CELL !! NO

BIOLOGICAL MODEL FOR IRON METABOLISM Other diseases hyperferritinemia cataract IRON METABOLISM GENES PATHOLOGIC STATES alteration Iron metabolism diseases Iron overload aceruloplasminemia Iron deficiency

A second scenario related to iron metabolism genes alteration Cataract and hyperferritinemia mutation IRP IRE Translation in excess L_Ferritin gene L_Ferritin mRNA L_Ferritin protein in excess CATARACT and HYPERFERRITINEMIA !

UMLS view Cataract and hyperferritinemia AA, Peptide or Prorein Biologically Active Substance AA, Peptide or Protein Ferritin Iron compound L_FerritinH_Ferritin Metalloprotein RNAbinding Protein Iron Sulfur Prot Cataract Co-occurs In Medline (freq 26) Co-occurs In Medline IRP IRE

GO/ GOAnnotations view Cataract and hyperferritinemia Ferritin Heavy Chain Cell component Ferritin IRP Ferritin Light Chain IRE Ligand binding Prot or carrierFerric iron bindingIron homeostasisIron transport Hydro-lyase Metabolism Cataract Link in GO Annotations DB

Target representation Cataract and hyperferritinemia Cataract IRP Ligand binding Prot or carrier Ferric iron binding Iron homeostasis Iron transport Ferritin Light Chain Ferritin Heavy Chain Ferritin IRE Hyperferritinemia Genes Mutated genes Dynamic links Modeling of biological functions

DNA Chips And more generally … Recapitulative Information on disease states, clinical treatments and followups. Normal vs. pathologic Information on Roles of the genes in Biological and metabolic states ? We need precise and dynamic models to get the whole picture MIAMEMIAME Information on biological samples, experiments and results GOA UMLS

Gene products for Iron metabolism, as they are actually described in GO and UMLS.