Ontologies and vocabularies supporting data integration: emphasis on mouse phenotypes and disease model Control C3H/HeJ Homozygous Fasl gld /Fasl gld The.

Slides:



Advertisements
Similar presentations
Discovering Disease Associations using a Biomedical Semantic Web: Integration and Ranking One of the principal goals of biomedical research is to elucidate.
Advertisements

Annotation of Gene Function …and how thats useful to you.
Linkage and Genetic Mapping
The genetic dissection of complex traits
Why this paper Causal genetic variants at loci contributing to complex phenotypes unknown Rat/mice model organisms in physiology and diseases Relevant.
Using mouse genetics to understand human disease Mark Daly Whitehead/Pfizer Computational Biology Fellow.
Mouse Genome Informatics Online Resource Joanne Berghout, PhD Oct 13,
Using Ontologies to Annotate Phenotypic Data Janan T. Eppig December Mouse Genome Informatics.
Gene Ontology John Pinney
The Inheritance of Complex Traits
Alignment of Ontologies for Biological Research Judith A. Blake, Ph.D. Bioinformatics and Computational Biology The Jackson Laboratory.
Terry F. Hayamizu Mouse Genome Informatics, The Jackson Laboratory M OUSE A NATOMY O NTOLOGIES AND GXD.
2 March, 2005 Chapter 12 Mutational dissection Normal gene Altered gene with altered phenotype mutagenesis.
Quantitative Genetics
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
POC tutorial #1: Introduction This tutorial will run automatically in Quicktime. To run the tutorial at your own pace use the internal controllers within.
Mouse Genome Informatics November 2008 Paul Szauter MGI User Support.
Genetics Genetics. What is Genetics ? Genetics is the study of heredity and variation Examples of genetic variation 1. Domesticated species 2. Human genetics.
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
PATO An ontology for phenotypes. The development of PATO is the work of George Gkoutos, supported by the NCBO, working in Cambridge.
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Computational Biology and Informatics Laboratory Development of an Application Ontology for Beta Cell Genomics Based On the Ontology for Biomedical Investigations.
Insulin-like signaling pathway: flies and mammals
Managing Big Scientific Data Capturing, Integrating and Presenting Mouse Data at MGI Cynthia Smith Canberra April Mouse Genome.
Natural Variation in Arabidopsis ecotypes. Using natural variation to understand diversity Correlation of phenotype with environment (selective pressure?)
Linking Diseases and Genes through Informatics Knowledge Bases and Ontologies Joyce A. Mitchell, Ph.D. National Library of Medicine University of Missouri.
Bioinformatics and medicine: Are we meeting the challenge?
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
Multifactorial Traits
GENOME-CENTRIC DATABASES Daniel Svozil. NCBI Gene Search for DUT gene in human.
GENE ONTOLOGY FOR THE NEWBIES Suparna Mundodi, PhD The Arabidopsis Information Resources, Stanford, CA.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Genome-Scale Mutagenesis Introduction Model systems –Yeast –Mouse Implications for science.
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
Online Mendelian Inheritance in Man (OMIM): What it is & What it can do for you Knowledge Management & Eskind Biomedical Library January 27, 2012 helen.
The GUDMAP Database: An Online Resource for Genitourinary Research Dr. Simon Harding Stem Cells & Bioinformatics 22 nd September 2009.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Linking Animal Models and Human Diseases Supported by NIH P41 HG002659, U54 HG004028, & R01 HG Cambridge University & the University of Oregon.
PGA Workshop August 2003 Rat Genome Database an introduction Simon N. Twigger, Ph.D. Bioinformatics Research Center Medical College of Wisconsin, Milwaukee.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
The Plant Ontology Consortium Lincoln Stein 1, Susan McCouch 2, Elizabeth Kellogg 3, Seung Rhee 4, Pankaj Jaiswal 2, Doreen Ware 1, Peter Stevens 5 1 Cold.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Quantitative Genetics
QTL Mapping in Heterogeneous Stocks Talbot et al, Nature Genetics (1999) 21: Mott et at, PNAS (2000) 97:
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
Copyright OpenHelix. No use or reproduction without express written consent1.
Rice Proteins Data acquisition Curation Resources Development and integration of controlled vocabulary Gene Ontology Trait Ontology Plant Ontology
This tutorial will describe how to navigate the section of Gramene that provides descriptions of alleles associated with morphological, developmental,
Phenotype Curation Susan R. McCouch Department of Plant Breeding Cornell University.
Gene editing in embryos and germ line
 9 European Countries  1 Third Country  14 Research Centers of Excellence  5 Universities  4 SMEs  1 Venture Capital.
MGI and Phenotyping Projects Mouse Genome Informatics.
Phenotype And Trait Ontology (PATO) and plant phenotypes
Chapter 22 - Quantitative genetics: Traits with a continuous distribution of phenotypes are called continuous traits (e.g., height, weight, growth rate,
GENETICS Dr. Samar Saleh Assiss. Lecturer Mosul Medical College Pathology3 rd year.
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
生物資料庫搜尋 ( 第八組 ) 連威森 王鼎 黃智楹 張鈞淵
Brendan Burke and Kyle Steffen. Important New Tool in Genomic Medicine GWAS is used to estimate disease risk and test SNPs( the most common type of genetic.
Genetics Mendelian Genetics Genetic Engineering. Gregor Mendel Used pea plants to experiment on genetic traits Pea plants can self-pollinate, producing.
Networks and Interactions
The Human-Mouse: Disease Connection in MGI (BETA)
Quantitative traits Lecture 13 By Ms. Shumaila Azam
Functional Annotation of the Horse Genome
Genetic architecture of behaviour
Complex Traits Qualitative traits. Discrete phenotypes with direct Mendelian relationship to genotype. e.g. black or white, tall or short, sick or healthy.
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
QTL Annotation in MGI Susan M Bello, Ira Lu, Cynthia L Smith, Janan T Eppig, and the Mouse Genome Informatics Group.
Relationship between Genotype and Phenotype
Chapter 7 Multifactorial Traits
Presentation transcript:

Ontologies and vocabularies supporting data integration: emphasis on mouse phenotypes and disease model Control C3H/HeJ Homozygous Fasl gld /Fasl gld The mouse generalized lymphoproliferative disease (gld) mutation in the FAS ligand (TNF superfamily, member 6) gene. These mice model human Autoimmune Lymphoproliferative Syndrome; ALPS, type IB Janan T. Eppig PATO Meeting, Dec. 2006

The genetic tools for mouse provide an ideal platform for experimentation: genetic engineering techniques to specifically manipulate the genome sequenced genome Inbred strains high resolution maps Mammal : small, easy to breed and maintain, short lifespan Similar to human genetically & physiologically human disease model ES cell lines

short domed skull short-limbed dwarfism malocclusion bulging abdomen as adults respiratory problems shorted lifespan Achondroplasia Homozygous achondroplasia mouse mutant and control …facilitating the use of the mouse as a model for human biology by providing integrated access to data on the genetics, genomics, and biology of the laboratory mouse.

…make phenotype and disease model data robust and accessible to researchers and computational biologists semantically consistent search methods integrated access to all phenotypic variation sources (single-gene and genomic mutations, QTLs, strains) ability to query across sequence, orthology, expression, function, phenotype, disease data on human disease correlation access to mouse models from various approaches - Genetic - Phenotypic - Computational Objective

Existing Wealth of Mouse Phenotype Data in MGI >16,800 phenotypic alleles representing ≈6,830 unique genes. >71,000 annotations associating MP terms to genotypes. >6,550 phenotype records for 3,210 QTL. >9,000 strains catalogued.

A few of the challenges alleles can produce pleiotropic phenotypic effects non-allelic mutations can produce indistinguishable phenotypes modifiers and epistasis can influence mutant phenotypes alleles of different genes can interact to produce unique phenotypes genetic background can greatly influence mutant phenotypes imprinted genes/alleles influence phenotype quantitative trait loci (QTLs) can contribute unequally to phenotypes genomic mutations can delete or disrupt multiple genes strains (“whole-genome”) have characteristic phenotypes complex genetically engineered and multiple mutation stocks are often developed for disease models environmental influences and age can dramatically affect phenotype

Data Challenge Mouse phenotype data from publications electronic submissions mutagenesis (ENU centers) ( ≈ 300 new alleles; ≈ 700 publications per month on phenotypes) New initiatives to knock-out every gene in the mouse in next 5 years… Need for efficiency, accuracy, full description of complex observations, storage/analysis of individual and population data

Making semantic sense Controlled vocabularies/nomenclatures Strains Genes Alleles (phenotypic or variant) Classes of genetic markers Types of mutations Types of assays Developmental stages Tissues Clone libraries ES cell lines ….. organized as lists or simple hierarchies

Clone Library Names Inbred Strain Names Gene Symbols

Hbp1 (high mobility group box transcription factor 1) gene expression differences in Kit W-e /Kit W-e homozygotes vs wild-type Assay Gene nomenclature Results Specimen

Semantics plus relationship data Ontologies/structured vocabularies Gene Ontology (GO) Molecular function Biological process Cellular component Mouse Anatomy (MA) Embryonic Adult Mammalian Phenotype (MP) Sequence Ontology (SO) ….. organized as directed acyclic graphs (DAGs) DAGs

Phenotype detail, including genotypes for mouse models of human diseases Navigating the views of phenotypes & disease Human/mouse disease relationships 3.MP Ontology Summary: genotype, MP term, & ref 1.Gene Page Summary: phenotype classes & human disease associated 4.Disease vocabulary 2.Phenotype Query 5.Sequence (GBrowse)

enlarged brain ventricles L1cam tm1Mtei /Y129/SvEvnone affected C57BL/6Jhigh percentage affected postnatal deathGnas tm1Kel-pat /Gnas + 129/Sv * C57BL/6Jmost die by P2; all by P9 129/Sv * C57BL/6J * CD-1 most die by P9; 10-20% survive past P21 TMEV viral susceptibility Cd8a tm1Mak /Cd8a tm1Ma k C57BL/6Inflammation after infection resolves by 45 days; disease is absent by 10 mo. PL/Jviral infection persists Genotype = allele combinations carried in the context of a specific genetic background (strain)

Mammalian Phenotype Ontology Structured as DAG Over 4,500 terms covering physiological systems, behavior, development and survival Available in browser and OBO formats from MGI ftp and OBO sites Each term linked to all annotations to the term or its children

Summary Results Genotypes that are annotated to a term or children of the term References supporting annotation Links to allele detail pages for full mutant phenotype

Allele Detail Page full phenotype annotations (MP) for each genotype specific detail for MP terms each MP annotation referenced human diseases for which genotype is used as a model

Mouse model genotypes linked to phenotype details Genes associated with phenotypes characteristic of a disease in human, mouse, or both

osteopetrosis Human-mouse disease relationships OMIM terms6,113 Genotypes associated w/ OMIM1,847 OMIM associated w/ genotypes 720 to Human Disease and Mouse Model Page

Vocabularies in MGI DAGs Definition Synonyms MP:1956 Strain: AEJ Alleles:bd/bd Genotype Strain: C57BL/6 Alleles: Ppp1r3a tm1Adpt / Ppp1r3a tm1Adpt Terms … Respiratory failure Postnatal lethality Dilated renal tubules Growth retardation Vocabulary Note … J:65378TAS J:62648IDA J:65322EE Annotations

Making Mammalian Phenotype Ontology Work DAG accommodate bio-specific terms computationally useful human accessible practical for curation cross-reference to other ontologies

Terms in MP MP termEntityQualityOther Info microphthalmiaeyesmall size hydrocephalycerebro- spinal fluid increasedexcessive brainlarge size (dilated) traumaobservedbrain increased blood pressure ?increased

Future MP Ontology Development New terms from ongoing curation process Collaborative community efforts identify new terms revise organization of existing terms within particular branches Recruit domain experts for systematic review Cross-ref and comparison to other relevant ontologies (GO, Anatomy, Cell Type, Mpath, etc.)

Collaborators …currently annotating with MP and contributing to MP development Rat Genome Database (RGD) Mouse Mutagenesis Centers Human (NCBI/dbSNP) Online Mendelian Inheritance in Animals (OMIA) …under discussion Teratology Society Animal Traits

Summary Structured vocabularies and ontologies support semantic integration for the MGI system and promote broader integration of mouse knowledge To meet community needs, practical implementations parallel formal ontological development MGI has implemented a generalized structure for vocabularies and ontologies in MGI The Mouse Genome Informatics group continues its strong interest and participation in community bio-ontology efforts

Human FOXN1 forkhead box N1 T-CELL IMMUNODEFICIENY, CONGENITAL ALOPECIA, AND NAIL DYSTROPHY Frank J, et al. Nature 398, (1999) Mouse Foxn1 Homozygous “nude” mouse. One of eight known phenotypic mutations in mouse (6 spontaneous; 2 engineered) for the forkhead box N1 gene.