Human & Mouse Orthologous Gene Nomenclature (HUMOT) HUGO Gene Nomenclature Committee (HGNC) Matt Wright

Slides:



Advertisements
Similar presentations
1 Orthologs: Two genes, each from a different species, that descended from a single common ancestral gene Paralogs: Two or more genes, often thought of.
Advertisements

Nomenclature: The Language of Genomics Ruth Lovering HUGO Gene Nomenclature Committee
HUGO Gene Nomenclature Committee (HGNC), Department of Biology, University College London, Wolfson House, 4 Stephenson Way, London NW1 2HE, UK. The work.
Genomic Innovations- Orthology Paralogy. Genomic innovation.
GENE TREES Abhita Chugh. Phylogenetic tree Evolutionary tree showing the relationship among various entities that are believed to have a common ancestor.
Basics of Comparative Genomics Dr G. P. S. Raghava.
Comparative genomics Joachim Bargsten February 2012.
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
Peter Tsai, Bioinformatics Institute.  University of California, Santa Cruz (UCSC)  A rapid and reliable display of any requested portion of genomes.
M ulti P aranoid Automatic Clustering of Orthologs and Inparalogs Shared by Multiple Proteomes Andrey Alexeyenko Ivica Tamas Gang Liu Erik L.L. Sonnhammer.
Finding Orthologous Groups René van der Heijden. What is this lecture about? What is ‘orthology’? Why do we study gene-ancestry/gene-trees (phylogenies)?
Evolution at the DNA level …ACGGTGCAGTTACCA… …AC----CAGTCCACCA… Mutation SEQUENCE EDITS REARRANGEMENTS Deletion Inversion Translocation Duplication.
Lecture 7.11 The Ensembl Database Erin Pleasance Steven Jones Canada’s Michael Smith Genome Sciences Centre, Vancouver.
[Bejerano Aut08/09] 1 MW 11:00-12:15 in Beckman B302 Profs: Serafim Batzoglou, Gill Bejerano TA: Cory McLean.
Biological Databases Notes adapted from lecture notes of Dr. Larry Hunter at the University of Colorado.
Tree Pattern Matching in Phylogenetic Trees Automatic Search for Orthologs or Paralogs in Homologous Gene Sequence Databases By: Jean-François Dufayard,
CS273a Lecture 10, Aut 08, Batzoglou Multiple Sequence Alignment.
Bootcamp: Data Resources1 Paul Bain Reference and Education Services Librarian Countway Library of Medicine Countway.
Genomic Database - Ensembl Ka-Lok Ng Department of Bioinformatics Asia University.
CS273a Lecture 9/10, Aut 10, Batzoglou Multiple Sequence Alignment.
Protein Modules An Introduction to Bioinformatics.
How to access genomic information using Ensembl August 2005.
Finding Orthologous Groups René van der Heijden. What is this lecture about? What is ‘orthology’? Why do we study gene-ancestry/gene-trees (phylogenies)?
Bioinformatics Genome anatomy Comparisons of some eukaryotic genomes Allignment of long genomic sequences Comparative genomics Oxford Grid Reconstruction.
Arabidopsis Gene Project GK-12 April Workshop Karolyn Giang and Dr. Mulligan.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Genome Databases Computational Molecular Biology Biochem 218 – BioMedical Informatics.
Comparative Genomics of the Eukaryotes
Genome Annotation and Databases Genomic DNA sequence Genomic annotation BIO520 BioinformaticsJim Lund Reading Ch 9, Ch10.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Christian M Zmasek, PhD Burnham Institute for Medical Research Bioinformatics and Systems Biology
NCBI’s Genome Annotation: Overview Incremental processing Re-annotation ( batch ) Post-annotation review Case studies NOTE: limiting discussion to annotation.
Module 3 Sequence and Protein Analysis (Using web-based tools) Working with Pathogen Genomes - Uruguay 2008.
Alastair Kerr, Ph.D. WTCCB Bioinformatics Core An introduction to DNA and Protein Sequence Databases.
Future Plans for the HGNC Elspeth Bruford. HGNC Team Elspeth Bruford Susan Tweedie* Ruth Seal Kris Gray Welcome to Susan * starting Beth Yates.
Bioinformatic Tools for Comparative Genomics of Vectors Comparative Genomics.
NCBI FieldGuide NCBI Molecular Biology Resources March 2007 Using Entrez.
Orthology & Paralogy Alignment & Assembly Alastair Kerr Ph.D. [many slides borrowed from various sources]
Measures of Conserved Synteny Work was funded by the National Science Foundation’s Interdisciplinary Grants in the Mathematical Sciences All work is joint.
Using blast to study gene evolution – an example.
Generic Database. What should a genome database do? Search Browse Collect Download results Multiple format Genome Browser Information Genomic Proteomic.
Data Mining in Ensembl with BioMart Giulietta Spudich.
Cédric Notredame (08/12/2015) Molecular Evolution Cédric Notredame.
Copyright OpenHelix. No use or reproduction without express written consent1.
Orthology & Paralogy Alignment & Assembly Alastair Kerr Ph.D. WTCCB Bioinformatics Core [many slides borrowed from various sources]
Search Functions Simple Search Advanced Search.
Sequence Alignment Abhishek Niroula Department of Experimental Medical Science Lund University
S. pombe Unicellular archiascomycete Diverged from S. cerevisiae Ma Size ~14 Mb, 3 chromosomes No synteny Data stored in GeneDB.
1 of 28 Evaluating Genes and Transcripts (“Genebuild”)
Future Plans for the HGNC (Questions, Answers, and More Questions) Elspeth Bruford.
Need a solid base for analysis of future genomes Reference genome criteria: Sequenced genome MOD Functional genomics projects Adequate research community.
BLAST: Basic Local Alignment Search Tool Robert (R.J.) Sperazza BLAST is a software used to analyze genetic information It can identify existing genes.
Tools For Vertebrate Gene Naming
Basics of Comparative Genomics
Chromosome-level Mutation
Comparative Genomics.
Functional Annotation of the Horse Genome
Genome Annotation Continued
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Ensembl Genome Repository.
Comparative Genomics.
Basics of Comparative Genomics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Problems from last section
Welcome - webinar instructions
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Human & Mouse Orthologous Gene Nomenclature (HUMOT) HUGO Gene Nomenclature Committee (HGNC) Matt Wright

The Role of HGNC Assign unique approved gene names and symbols Assign unique approved gene names and symbols Co-ordinate with other databases and nomenclature groups Co-ordinate with other databases and nomenclature groups Collate and curate data (Genew) Collate and curate data (Genew)

Searchgenes

Gene Symbol Report

Why Use Approved Nomenclature? Standardisation Standardisation User friendly User friendly Communication Communication Manually curated Manually curated Consistency across species Consistency across species

Naming Criteria Stable MeaningfulMemorable Unique

Symbol Assignment Sequence comparison Sequence comparison Information from literature Information from literature Information from researchers Information from researchers Domain structures Domain structures Information from other databases Information from other databases

HUMOT Two dedicated HUMOT editors Approve the same gene symbol for each human/mouse ortholog pair Human gene symbols are written in all uppercase letters (e.g. human PON3) Mouse only the first letter is uppercase and the rest are lowercase (e.g. mouse Pon3).

Definitions Orthologs Orthologs Related by speciation (and preferably by function) Paralogs Paralogs Related by gene duplication Mouse Gene Rat Gene Human Gene 1 Human Gene 2 Common Ancestral Gene

Determining Orthology Best BLAST hit, Reciprocal BLAST Best BLAST hit, Reciprocal BLAST with approved symbol PON3

BLAST Problems Unequal rates of evolution Unequal rates of evolution Hidden paralogy Hidden paralogy Large numbers of genomes Large numbers of genomes Mouse Gene 1 Mouse Gene 2 Human Gene Rat Gene 2 Mouse Gene 2 Human Gene 2 Rat Gene 1 Mouse Gene 1 Human Gene 1 Unequal rates Hidden paralogy

Determining Orthology Best BLAST hit, Reciprocal BLAST Best BLAST hit, Reciprocal BLAST Conserved Synteny Conserved Synteny

Conserved Synteny HUMAN MOUSE (taken from doegenomes.org)

Conserved Synteny Human Symbol PON1 PON2 PON3 Mouse Symbol Pon1 Pon2 Pon3 Human Location 7q21.3 Mouse Location 6 A2 6 A1 Searched with approved symbol PON2

Determining Orthology Best BLAST hit, Reciprocal BLAST Best BLAST hit, Reciprocal BLAST Conserved Synteny Conserved Synteny Phylogeny Phylogeny

Phylogeny PON1 PON2 PON3 Searched with approved symbol PON1

Orthology Resources Provider HGNC: MGI: NCBI: Ensembl: JGI: ANGIS: Resource HUMOT Consensus Orthology Search Mammalian Orthology Query Oxford Grid Comparative Maps Human/Mouse Orthology Map Homologene MapViewer ‘Orthologue Predictions’ in Ensembl gene reports PhIGs Search PhIGs Synteny Viewer Oxford Grid

Consensus Orthology Search HGNC, MGI, Homologene, Ensembl and PhIGs orthology assertions Will include approved symbols and names, DNA sequences, database identifiers, aliases, and chromosomal locations for each putative ortholog pair.

Determining Orthology Best BLAST hit, Reciprocal BLAST Best BLAST hit, Reciprocal BLAST Conserved Synteny Conserved Synteny Phylogeny Phylogeny Orthology Resources Orthology Resources WE DO NOT MAKE AN ORTHOLOGOUS ASSERTION IF RESULTS ARE UNCLEAR WE DO NOT MAKE AN ORTHOLOGOUS ASSERTION IF RESULTS ARE UNCLEAR

HGNC HUMOT Activities 2,500 curated orthology assertions 7,000 consensus orthology assertions Examine human/mouse orthologs with different approved symbols Eradicating mis-mapped data

MGI Oxford Grid A section from the MGI Oxford Grid (September 2004) mouse chromosomes humanhuman 270 mouse chromosome 6 genes with orthologs on human chromosome 7

humanhuman mouse chromosomes Oxford Grid Singletons A section from the MGI Oxford Grid (April 2005) 279 mouse chromosome 6 genes with orthologs on human chromosome 7

Future Work Continue to assign orthologous gene nomenclature Provide common nomenclature for human/mouse orthologs with different symbols Utilise the large cDNA data sets such as those from the H-Invitational and RIKEN to rapidly identify and assign parallel nomenclature

cDNA Data Sets Gene structures Secondary/Tertiary Structure Possible functions Encoded functional domains Sub-cellular localizations Sub-cellular localizations

HGNC Team Dr Varsha Khodiyar Conover Talbot Jr. Fabrice Ducluzeau Prof Sue Povey (Chair) Dr Elspeth Bruford (Project Co-ordinator) Dr Matt Wright (HUMOT Editor) Dr Ruth Lovering (HUGOA Editor) Dr Michael Lush The work of the HGNC is supported by the Wellcome Trust, NHGRI grant P41 HG and the UK Medical Research Council. (Senior Bioinformatician) Dr Tam Sneddon Dr Tina Eyre Dr Kate Sneddon