Genomes School B&I TCD Bioinformatics May 2010


Genome sizes
Completed eukaryotic nuclear genomes

Type of organism          Species          Genome size (10^6 base pairs)
Primitive microsporidian  E. cuniculi      2.5
Fungi                     S. cerevisiae    12.1
                          Sc. pombe        13.8
                          N. crassa        40
Nematode worm             C. elegans       100
Insect: fruit fly         D. melanogaster  180
Insect: mosquito          A. gambiae       278
Malarial parasite         P. falciparum    22.8
Plants: thale cress       A. thaliana      116.8
Plants: rice              O. sativa        400
Human                     H. sapiens       3400
Mouse                     M. musculus      3454
Rat                       R. norvegicus    2556
Chicken                   G. gallus        1200
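To make the spread of genome sizes concrete, here is a minimal Python sketch that ranks the species and computes fold differences. The numbers are copied from the genome-size slide above; the function names `fold_difference` and `ranked_by_size` are made up for this illustration.

```python
# Genome sizes in millions of base pairs, copied from the genome-size slide.
GENOME_SIZE_MB = {
    "E. cuniculi": 2.5,
    "S. cerevisiae": 12.1,
    "Sc. pombe": 13.8,
    "N. crassa": 40,
    "C. elegans": 100,
    "D. melanogaster": 180,
    "A. gambiae": 278,
    "P. falciparum": 22.8,
    "A. thaliana": 116.8,
    "O. sativa": 400,
    "H. sapiens": 3400,
    "M. musculus": 3454,
    "R. norvegicus": 2556,
    "G. gallus": 1200,
}


def fold_difference(species_a, species_b, sizes=GENOME_SIZE_MB):
    """Return how many times larger the genome of species_a is than species_b."""
    return sizes[species_a] / sizes[species_b]


def ranked_by_size(sizes=GENOME_SIZE_MB):
    """Return species names sorted from smallest to largest genome."""
    return sorted(sizes, key=sizes.get)


if __name__ == "__main__":
    # Print the table ordered by size, then one example ratio.
    for name in ranked_by_size():
        print(f"{name:16s} {GENOME_SIZE_MB[name]:8.1f} Mb")
    ratio = fold_difference("H. sapiens", "S. cerevisiae")
    print(f"Human genome is about {ratio:.0f} times the size of budding yeast")
```

Sorting by size rather than by taxonomic group makes one point from the slide easier to see: genome size does not track organism complexity in any simple way (rice sits above the fruit fly, and the malarial parasite above both yeasts).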

What’s it all about?
With complete chromosome or big chunks
  – Can put genes in context: synteny, neighbours
With complete genome
  – Have all paralogs of a gene family
  – So can identify orthologs: genes similar by descent and so by function
Gene clusters
  – Operons or “operons”
  – Tissue expression
  – Positive selection / excessively variable regions
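The slide does not say how orthologs are identified once all paralogs are known. One widely used heuristic is the reciprocal best hit: two genes are called putative orthologs if each is the other's top-scoring match between the two genomes. The sketch below illustrates that heuristic only, as an assumption about what could be done, not as the method used in the lecture. The gene names and scores are invented toy values.

```python
def best_hits(scores):
    """Map each query gene to its highest-scoring subject gene.

    scores: dict mapping (query, subject) pairs to a similarity score.
    """
    best = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Return sorted (gene_a, gene_b) pairs that are each other's best hit."""
    best_ab = best_hits(a_vs_b)
    best_ba = best_hits(b_vs_a)
    return sorted((a, b) for a, b in best_ab.items() if best_ba.get(b) == a)


# Toy scores (hypothetical): H1 and H2 are paralogs in genome A,
# M1 and M2 are paralogs in genome B, H3 is a more distant family member.
A_VS_B = {("H1", "M1"): 95, ("H1", "M2"): 70,
          ("H2", "M1"): 72, ("H2", "M2"): 90,
          ("H3", "M1"): 60}
B_VS_A = {("M1", "H1"): 95, ("M1", "H2"): 72, ("M1", "H3"): 60,
          ("M2", "H1"): 70, ("M2", "H2"): 90}

if __name__ == "__main__":
    print(reciprocal_best_hits(A_VS_B, B_VS_A))
```

This is why the complete genome matters: if M1 were missing from an incomplete assembly, H1 would pair with its next-best match, the paralog M2, and the ortholog call would be wrong.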

Caron Human Genome Highly expressed genes are clustered (densely)

Tissue expression mammals

Where are tissue expressed genes clustered?

Mouse/Human Synteny

Three resources
Golden Path at UCSC
  – Jim Kent and his group at Santa Cruz
Ensembl
  – Ewan Birney, Wellcome Trust, EBI, Sanger
NCBI Genome Database
  – US government

UCSC Golden Path
Access to human, mouse, rat, chicken etc.
Two modes:
  – BLAT search: find sequences of >95% similarity and length >40 bases on the genome.
  – Genome browser: choose and display the data you want, e.g. repeats, SNPs, ESTs
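The two BLAT thresholds quoted on the slide (similarity above 95%, length above 40 bases) can be pictured as a filter over candidate alignments. The sketch below is not BLAT and does not call any UCSC service. The `Hit` record, its fields and the example values are hypothetical, and computing length from matches plus mismatches (ignoring gaps) is a simplification.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """One hypothetical alignment of a query sequence to the genome."""
    chrom: str
    start: int       # start position on the chromosome
    end: int         # end position on the chromosome
    matches: int     # aligned bases that are identical
    mismatches: int  # aligned bases that differ

    @property
    def length(self):
        # Simplification: aligned length ignores gaps.
        return self.matches + self.mismatches

    @property
    def percent_identity(self):
        return 100.0 * self.matches / self.length if self.length else 0.0


def filter_hits(hits, min_identity=95.0, min_length=40):
    """Keep hits strictly above both thresholds, as worded on the slide."""
    return [h for h in hits
            if h.percent_identity > min_identity and h.length > min_length]


if __name__ == "__main__":
    candidates = [
        Hit("chr1", 100, 200, matches=98, mismatches=2),   # long and similar
        Hit("chr2", 0, 30, matches=30, mismatches=0),      # identical but short
        Hit("chr3", 0, 100, matches=90, mismatches=10),    # long but divergent
    ]
    for hit in filter_hits(candidates):
        print(hit.chrom, hit.start, hit.end, f"{hit.percent_identity:.1f}%")
```

Both thresholds are parameters, so the same filter can be rerun with looser cut-offs when searching with a sequence from a more distant species.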

Golden Path UCSC
Vertebrate genomes available
  – Human
  – Chimp
  – Rhesus
  – Dog
  – Cow
  – Mouse
  – Rat
  – Cat
  – Opossum
  – Chicken
  – Xenopus
  – Zebrafish
  – Tetraodon
  – Fugu

Ensembl is a joint project between EMBL-EBI and the Sanger Institute to develop a software system that produces and maintains automatic annotation of eukaryotic genomes. It is continually updated and improved.

Ensembl genomes
Mammals
  – Homo sapiens
  – Pan troglodytes (chimp)
  – Macaca mulatta (monkey)
  – Mus musculus (mouse)
  – Rattus norvegicus (rat)
  – Oryctolagus cuniculus (rabbit)
  – Canis familiaris (dog)
  – Bos taurus (cow)
  – Dasypus novemcinctus (armadillo)
  – Loxodonta africana (elephant)
  – Echinops telfairi (tenrec)
  – Monodelphis domestica (opossum)
  – … and others
Not mammals
  – Gallus gallus (chicken)
  – Xenopus tropicalis (frog)
  – Danio rerio (zebrafish)
  – Tetraodon nigroviridis (puffer fish)
  – Ciona intestinalis (chordate)
  – Drosophila melanogaster (fly)
  – Anopheles gambiae (mosquito)
  – Aedes aegypti (mosquito 2)
  – Apis mellifera (bee)
  – Caenorhabditis elegans (worm)
  – Saccharomyces cerevisiae

NCBI Genome Center
Start here for any genome
  – Bacterial
  – Archaeal
  – Eukaryotic
Uniform arrangement of information

NCBI Genes and Disease
Resource for finding authoritative information about diseases.
It is one of the many NCBI online books.
Classifies diseases and syndromes by
  – Cancer
  – Immune system
  – Muscle and bone
  – Signals and transporters
  – Nervous system
  – Etc.

OMIM
Online Mendelian Inheritance in Man
Everything you need to know
  – Diseases and syndromes
  – But also quirky stuff
But only 2% of syndromes are simple Mendelian (single gene)

How to classify genes
What species?
What function?
  – What gene family
  – What domains/motifs
What pathway?
What genomic neighborhood/synteny?
What ligands / interactions?
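The classification questions above map naturally onto a record with one field per question. The following is a minimal sketch of such a data structure. The class name `GeneRecord`, the helper `group_by` and all example values are placeholders invented for illustration and do not come from any of the databases mentioned.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class GeneRecord:
    """One gene described along the axes listed on the slide."""
    name: str
    species: str                                       # What species?
    function: str = ""                                 # What function?
    family: str = ""                                   # What gene family
    domains: list = field(default_factory=list)        # What domains/motifs
    pathway: str = ""                                  # What pathway?
    neighbours: list = field(default_factory=list)     # Genomic neighborhood/synteny
    interactions: list = field(default_factory=list)   # Ligands / interactions


def group_by(records, attribute):
    """Group gene names by one classification axis, e.g. 'family'."""
    groups = defaultdict(list)
    for record in records:
        groups[getattr(record, attribute)].append(record.name)
    return dict(groups)


if __name__ == "__main__":
    # Placeholder records; real values would come from the resources above.
    genes = [
        GeneRecord("geneA", "species1", family="fam1", pathway="pathwayX"),
        GeneRecord("geneB", "species2", family="fam1", pathway="pathwayX"),
        GeneRecord("geneC", "species1", family="fam2", pathway="pathwayY"),
    ]
    print(group_by(genes, "family"))
    print(group_by(genes, "species"))
```

Grouping the same records by different attributes gives a different view of the same genes each time.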

Summary
Different ways/contexts of viewing data
Bioinformatics is integrative biology
Your task is …
  – To access available resources to maximise our understanding