Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.

Slides:



Advertisements
Similar presentations
Tracing Evolutionary History
Advertisements

The Organization of Cellular Genomes Complexity of Genomes Chromosomes and Chromatin Sequences of Genomes Bioinformatics As we have discussed for the last.
The DNA Story Germs, Genes, and Genomics 4. Heredity Genes DNA Manipulating DNA.
Comparative genomics: Overview & Tools + MUMmer algorithm Urmila Kulkarni-Kale Bioinformatics Centre University of Pune, Pune
Comparative genomics: Overview & Tools Urmila Kulkarni-Kale Bioinformatics Centre University of Pune, Pune
Phylogenetics workshop: Protein sequence phylogeny week 2 Darren Soanes.
Human Genome Project What did they do? Why did they do it? What will it mean for humankind? Animation OverviewAnimation Overview - Click.
The chemical Basis of Inheritance. Chromatin / Chromosomes.
Cell Structure and Evolutionary History Structure, p. 22.
Basics of Comparative Genomics Dr G. P. S. Raghava.
Duplication, rearrangement, and mutation of DNA contribute to genome evolution Chapter 21, Section 5.
Background About the Pufferfish: Fugu is a teleost fish belonging to the order Tetraodontiformes. Fugu rubripes, an eukaryota and vertebrate, more commonly.
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Alternative splicing and evolution Daniel Jeffares.
ASTR-3040: Astrobiology Day 11 The Nature of Life on Earth Chapter 5.
Genomes and Genetic Architecture. Life on Earth.
EVOLUTIONARY AND COMPUTATIONAL GENOMICS Shin-Han Shiu Plant Biology / CMB / EEBB / Genetics / QBMI.
The diversity of genomes and the tree of life
Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species.
Comparative Genomics of the Eukaryotes
What is genomics? Study of genomes. What is the genome? Entire genetic compliment of an organism.
Genome of Drosophila species Olga Dolgova UAB Barcelona, 2008.
Essential knowledge 1.A.4:
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
1. What is a gene? Definition: A gene is a discrete unit of DNA (or RNA in some viruses) that encodes a nucleic acid or protein product that contributes.
Chapter 5 Genome Sequences and Gene Numbers. 5.1Introduction  Genome size vary from approximately 470 genes for Mycoplasma genitalium to 25,000 for human.
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution Comparing whole genomes enhances – Our ability to understand.
Tentative definition of bioinformatics Bioinformatics, often also called genomics, computational genomics, or computational biology, is a new interdisciplinary.
1 Orthology and paralogy A practical approach Searching the primaries Searching the secondaries Significance of database matches DB Web addresses Software.
This presentation was originally prepared by C. William Birky, Jr. Department of Ecology and Evolutionary Biology The University of Arizona It may be used.
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
1 Gene Geography Dan Graur Department of Biology and Biochemistry 3c.
IGEM 101: Session 7 4/2/15Jarrod Shilts 4/5/15Ophir Ospovat.
Genome Organization & Evolution. Chromosomes Genes are always in genomic structures (chromosomes) – never ‘free floating’ Bacterial genomes are circular.
Chapter 21 Eukaryotic Genome Sequences
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution; Comparing whole genomes enhances – Our ability to understand.
Genome Analysis II Comparative Genomics Jiangbo Miao Apr. 25, 2002 CISC889-02S: Bioinformatics.
Comparative genomics Haixu Tang School of Informatics.
Genome annotation and search for homologs. Genome of the week Discuss the diversity and features of selected microbial genomes. Link to the paper describing.
David Sadava H. Craig Heller Gordon H. Orians William K. Purves David M. Hillis Biologia.blu B – Le basi molecolari della vita e dell’evoluzione The Eukaryotic.
Bailee Ludwig Quality Management. Before we get started…. ….Let’s see what you know about Genomics.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
Genomes & The Tree of Life
How many genes are there?
Lecture 21 – Genome Annotation & Sequenced Genomes Based on Chapther 8 Genomics: The Mapping and Sequencing of Genomes Copyright © 2010 Pearson Education.
Plant Genomes Houses of genetic materials Total genetic material within a cell Usually referred to a haploid cell [Basic set of genetic material (1x)]
MICROBIOLOGIA GENERALE Prokaryotic genomes. The prokaryotic genome.
Eukaryotic genes are interrupted by large introns. In eukaryotes, repeated sequences characterize great amounts of noncoding DNA. Bacteria have compact.
E VOLUTION OF E UKARYOTIC G ENOMES G ENE 342 Lecture 13 – Comparative genomics.
Phylogeny and Taxonomy. Phylogeny and Systematics The evolutionary history of a species or related species Reconstructing phylogeny is done using evidence.
Astronomy 3040 Astrobiology Spring_2016 Day-11. Cells: Basic Units of Life Cells – microscopic units separated from the outside by a “membrane.”  Basic.
MICROBIOLOGIA GENERALE Prokaryotic genomes. The Escherichia coli nucleoid.
Chromosome Organization & Molecular Structure. Chromosomes & Genomes Chromosomes complexes of DNA & proteins – chromatin Viral – linear, circular; DNA.
Genomes.
MCB 7200: Molecular Biology
Genetics and Evolutionary Biology
Basics of Comparative Genomics
PBIO 4500/5500: Biotechnology and Genetic Engineering
Genomes and their evolution
EL: To find out what a genome is and how gene expression is regulated
Today… Review a few items from last class
Genomes and Their Evolution
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
Evolution of eukaryote genomes
What do you with a whole genome sequence?
BIOL 433 Plant Genetics Term 2,
From Mendel to Genomics
Basics of Comparative Genomics
Computational genomics
Presentation transcript:

Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan

Genome projects and model organisms

Genome projects Completed genomes: Eubacteria (inc. Escherichia coli, Bacillis subtilis, Haemophilus influenzae, Synechocystis PCC6803) Archaea (inc. Methanococcus jannaschii, Methanobacterium thermoautotrophium) Eukarya: — Saccharomyces cerevisiae — Caenorhabditis elegans — Homo sapiens — Arabidopsis thaliana Partially sequenced genomes e.g. Drosophila melanogaster, Fugu rubripes, Oryza sativa

Relationships between model organisms H. sapiens C. elegans D. melanogaster S. cerevisiae A. thaliana Methanococcus Archaeglobus Synechocystis PCC6803 B. subtilis M. genitalium M. pneumoniae B. burgdorferi H. influenzae E. coli

Eubacterial genomes: Bacillus subtilis Genome 4,214,810 bp: protein sequences Average gene 890 bp — Density 1 gene / 1028 bp — 89% of total genome is protein-coding Protein coding genes: 53% single copy 47% paralogous gene families: — Mostly involved in transport — Genes are proximal i.e. have evolved through tandem duplication of single genes

Eubacterial genomes: Bacillus subtilis On the basis of homology with genes of known function, 58% of B. subtilis genes could be assigned to functional categories The B. subtilis genome contains remnants of 10 prophages, suggesting that horizontal transfer has played a significant role in evolution of the genome Orthologous counterparts in other bacteria: ~1000 genes (24%) have counterparts in E. coli (Gram -ve) More significantly, ~100 operons conserved as well ~800 genes (20%) have orthologues in Synechocystis PCC6803 (Cyanobacterium)

Eubacterial genomes: Mycoplasmas Obligate parasites Thought to be derived from Gram +ve bacteria similar to B. subtilis 312 genes of M. genitalium (66%) have homologues in Gram +ve bacteria Parasitic lifestyle has led to a dramatic reduction in genome size and content Smallest-known genome in a self-replicating organism

Eubacterial genomes: Mycoplasmas M. genitalium genome: Circular chromosome of 580,070 bp Only 470 predicted genes for DNA replication, transcription and translation, DNA repair, cellular transport and energy metabolism Coding regions comprise ~88% of the genome — Similar to H. influenzae (85%) — Suggests that genome reduction has been due to loss of genes and not reduction in gene size or increase in gene density M. pneumoniae genome: Larger than M. genitalium (816 kbp) All M. genitalium genes found in M. pneumoniae Not simply truncated - evidence of genome rearrangements

Eubacterial genomes: E. coli 4288 protein coding genes: Average ORF 317 amino acids Very compact: average distance between genes 118bp Numerous paralogous gene families: 38 – 45% of genes arisen through duplication Homologues: H. influenzae (1130 of 1703) Synechocystis (675 of 3168) M. jannaschii (231 of 1738) S. cerevisiae (254 of 5885)

The minimum genome and redundancy Minimum set of genes required for survival: Replication and transcription Translation (rRNA, ribosomal proteins, tRNAs etc.) Transport proteins to derive nutrients ATP synthesis Entire pathways eliminated in Mycoplasma: Amino acid biosynthesis (1 gene vs. 68 in H. influenzae) Metabolism (44 genes vs. 228 in H. influenzae) Comparison of M. genitalium and H. influenzae has identified a minimum set of 256 genes

Archaeal genomes: M. jannaschii Requires no organic nutrients for growth: has all biochemical pathways to use inorganic constituents Only 38% of genes could be assigned a known function Genes for translation, transcription and DNA replication similar to eukaryote genes: DNA polymerase Ribosomal proteins Translation initiation factors

Fungal genomes: S. cerevisiae First completely sequenced eukaryote genome Very compact genome: Short intergenic regions Scarcity of introns Lack of repetitive sequences Strong evidence of duplication: Chromosome segments Single genes Redundancy: non-essential genes provide selective advantage

Plant genomes: Arabidopsis thaliana Contains 25,498 genes from 11,000 families Cross-phylum matches: Vertebrates 12% Bacteria / Archaea 10% Fungi 8% 60% ESTs have no match in non-plant databases Evolution involved whole genome duplication followed by subsequent gene loss and extensive local gene duplications

Invertebrate genomes: C. elegans Genome even less compact than yeast: One gene every 7143 bp (2155 bp in yeast) Due mainly to introns in protein coding genes Much more compact than humans (One gene every 50,000 bp) Compactness due mainly to polycistronic arrangement: Trans-splicing Co-expression and co-regulation

Vertebrate genomes: Fugu rubripes Pufferfish genome (400 Mb) only four times larger than C. elegans and 7.5 times smaller than human genome Homologous genes in Fugu and mammals show conserved synteny: Same exon-intron organisation Introns much smaller Useful for identifying conserved essential elements in vertebrate genomes

The genome of the cenancestor Availability of complete genome sequences from the three domains of life creates an opportunity for the reconstruction of the complete genome of the common ancestor: Of minimal bacterial set (256 genes), 143 have orthologues in yeast (eukaryote) Universal translation apparatus suggests that cenancestor had a fully developed translation system Extreme differences in DNA replication apparatus Many fundamental metabolic processes are carried out by similar proteins in Archaea and eubacteria: — Suggests a universal, autotrophic ancestor — Not all central metabolism is universal (methanogenesis, photosynthesis etc.)