Conserved Gene Order and Expanded Inverted Repeats Characterize Plastid Genomes of Thalassiosirales Anna Mengjie Yu THE UNIVERSITY OF TEXAS —– AT AUSTIN.

Slides:



Advertisements
Similar presentations
1 Orthologs: Two genes, each from a different species, that descended from a single common ancestral gene Paralogs: Two or more genes, often thought of.
Advertisements

Gene an d genome duplication Nadia El-Mabrouk Université de Montréal Canada.
1 3. genome analysis. 2 The first DNA-based genome to be sequenced in its entirety was that of bacteriophage Φ-X174; (5,368 bp), sequenced by Frederick.
Sizes of (plastid) cpDNA Range is 70,000 bp (70 kb) to ~2,000,000 bp (2,000 kb), but most are less than 250,000 bp (250 kb) Land plants typically 120 –
Chapter 5: History of Life on Earth. Dates of origins 1. Everything (Universe):14 Bya 2. Solar System (Earth):4.6 Bya Conservative estimates 3. Prokaryotic.
Phylogenetic reconstruction
Current Approaches to Whole Genome Phylogenetic Analysis Hongli Li.
Mother of Green Phylogenomics of the P
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Chloroplast DNA in Molecular Systematics. -organelle found in plant cells and eukaryotic algae -Photosynthesis Chloroplast.
Reminder: Class on Friday, Discussion of Li et al. Proposal/Projects CAMERA feedback?
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Subsystem Approach to Genome Annotation National Microbial Pathogen Data Resource Claudia Reich NCSA, University of Illinois, Urbana.
Plant of the Day! Rafflesia arnoldii (Euphorbiaceae)
CHAPTER 28 THE ORIGINS OF EUKAYOTIC DIVERSITY Copyright © 2002 Pearson Education, Inc., publishing as Benjamin Cummings Section B: The Origin and Early.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
Binary Encoding and Gene Rearrangement Analysis Jijun Tang Tianjin University University of South Carolina (803)
RNA and Protein Synthesis
발표자 석사 2 년 김태형 Vol. 11, Issue 3, , March 2001 Comparative DNA Sequence Analysis of Mouse and Human Protocadherin Gene Clusters 인간과 마우스의 PCDH 유전자.
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
Anatomy of a Genome Project A.Sequencing 1. De novo vs. ‘resequencing’ 2.Sanger WGS versus ‘next generation’ sequencing 3.High versus low sequence coverage.
Current Challenges in Metagenomics: an Overview Chandan Pal 17 th December, GoBiG Meeting.
Fea- ture Num- ber Feature NameFeature description 1 Average number of exons Average number of exons in the transcripts of a gene where indel is located.
Introduction: It has long been thought that, while the hemizygous Y chromosome changes drastically over time, the X chromosome conserves the ancestral.
Orthology & Paralogy Alignment & Assembly Alastair Kerr Ph.D. WTCCB Bioinformatics Core [many slides borrowed from various sources]
.1Sources of DNA and Sequencing Methods.1Sources of DNA and Sequencing Methods 2 Genome Assembly Strategy and Characterization 2 Genome Assembly.
MPL The DNA Sequence of chimpanzee chromosome 22 and comparative analysis with its human ortholog, chromosome 21 Bioinformatics Dae-Soo Kim.
Phylogenetics.
Genomes & The Tree of Life
How many genes are there?
BLAST Sequences queried against the nr or grass databases. GO ANALYSIS Contigs classified based on homology to known plant or fungal genes Next.
Chloroplast Genome Evolution Level 3 Molecular Evolution and Bioinformatics Jim Provan.
Plant Genomes Houses of genetic materials Total genetic material within a cell Usually referred to a haploid cell [Basic set of genetic material (1x)]
When the next-generation sequencing becomes the now- generation Lisa Zhang November 6th, 2012.
De Novo Assembly of Mitochondrial Genomes from Low Coverage Whole-Genome Sequencing Reads Fahad Alqahtani and Ion Mandoiu University of Connecticut Computer.
Phylogeny and the Tree of Life
University of Bucharest Collage of Engineering
Introduction to Bioinformatics Resources for DNA Barcoding
3. genome analysis.
BME 130 – Genomes Lecture 8 Genome Anatomies.
House spider genome uncovers evolutionary shifts in the diversity and expression of black widow venom proteins associated with extreme toxicity Gendreau.
Organelle genome evolution
Gapless genome assembly of Colletotrichum higginsianum reveals chromosome structure and association of transposable elements with secondary metabolite.
From: Is ftsH the Key to Plastid Longevity in Sacoglossan Slugs?
Very important to know the difference between the trees!
Workshop on the analysis of microbial sequence data using ARB
Henrik Lantz - NBIS/SciLife/Uppsala University
Genome Projects Maps Human Genome Mapping Human Genome Sequencing
Genome organization and Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Volume 21, Issue 3, Pages (October 2017)
Genomics Workshop Hartnell College March 2018
Conservation in Evolution
Volume 21, Issue 3, Pages (October 2017)
Volume 27, Issue 24, Pages e6 (December 2017)
Maximum likelihood (ML) unrooted tree based on the full-length 16S rRNA genes (A) and 31 conserved single-copy genes (B) showing the phylogenetic position.
Gautam Dey, Tobias Meyer  Cell Systems 
(A, left) Radial cladogram based on RAxML-based maximum-likelihood phylogeny (500 bootstraps, gamma distribution model, and LG+F substitution model) constructed.
Genome-wide binding sites of OsMADS1 and the distribution of binding sites in different regions of annotated genes. Genome-wide binding sites of OsMADS1.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
.1Sources of DNA and Sequencing Methods 2 Genome Assembly Strategy and Characterization 3 Gene Prediction and Annotation 4 Genome Structure 5 Genome.
Matthew A. Campbell, Piotr Łukasik, Chris Simon, John P. McCutcheon 
Core genome phylogeny of V. anguillarum strains.
Volume 14, Issue 3, Pages (February 2004)
Comparison of species and function profiles with ultradeep sequencing data. Comparison of species and function profiles with ultradeep sequencing data.
Qian Cong, Dominika Borek, Zbyszek Otwinowski, Nick V. Grishin 
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Volume 21, Issue 23, Pages (December 2011)
Presentation transcript:

Conserved Gene Order and Expanded Inverted Repeats Characterize Plastid Genomes of Thalassiosirales Anna Mengjie Yu THE UNIVERSITY OF TEXAS —– AT AUSTIN –— Theriot Lab

Keeling, 2004, Am. J. Bot. 9:1481

Introduction 15 diatom plastid genome sequenced Gene loss/ duplication/ transfer Variable plastid genome sizes Extensive genome rearrangement

Why Thalassiosirales? Reasonably dense phylogeny known Environmentally –driven petF transfer from plastid to nucleus reported in T. oceanica (Lommer et al, 2010) 2 plastid genomes published Cyclotella nana (formerly T. pseudonana) & T. oceanica Gene content and genome size more similar within Thalassiosirales? Gene order more conserved within Thalassiosirales?

Rhizosolenia imbricata Roundia cardiophora Cerataulina daemon Chaetoceros simplex Thalassiosira weissflogii Cyclotella sp. WC03_2 Cyclotella sp. L04_2

Method Genomic DNA Extraction Illumina HiSeq 2000 PE sequencing Velvet v de novo Assembly Blast Plastid Contig Genome Annotation ( DOGMA, tRNAscan-SE 1.21) Plastid Genome Assembly Genome Finishing Sanger SequencingBowtie2 mapping

Results General Features of Plastid Genome of Thalassiosirales and Other Three Sequenced Diatoms Gene Loss/Gain/Pseudonization and Functional Gene Transfer from Plastid to Nucleus Expanded IR and Conserved IR boundary in Thalassiosirales Conserved Gene Order Within Thalassiosirales Compared to Other Three Sequenced Diatoms

Circular, k bp Compact and lack introns Four overlapping genes : psbC – psbD : 53bp sufC – sufB: 1bp atpD – atpF: 4bp ( 1bp in Rh. imbricata ) rpl4 – rpl23: 8bp (17 bp in Cy. sp.L04_2 and Cy. sp.WC03_2)

Results General Features of Plastid Genome of Thalassiosirales and Other Three Sequenced Diatoms Gene Loss/Gain/Pseudonization and Functional Gene Transfer from Plastid to Nucleus Expanded IR and Conserved IR boundary in Thalassiosirales Conserved Gene Order Within Thalassiosirales Compared to Other Three Sequenced Diatoms

Phylogenetic Distribution of acpP (acyl carrier protein) Maximum Likelihood Tree of Thalassiosirales and other diatom species based on 20 plastid genes acpP1 lost in Thalassiosirales

Schematic structure of nuclear-encoded plastid-targeted diatom protein precursor SPase: Signal peptidase SPP: Stromal processing peptidase Huesgen et al PLOS One

Over 84% identity Plastid Nucleus acpP

Phylogenetic distribution of syfB (Phe tRNA synthase) syfB lost in Thalassiosirales

Loss of tufA (translation elongation factor Tu) and psaE,I,M (photosystem I protein ) in Rhizosolenia Gene loss due to endosymbiont bacterium Richelia intracellularis?

More similar gene content within Thalassiosirales Phylogenetic Distribution of gene loss/gain across diatom plastid genomes

Results General Features of Plastid Genome of Thalassiosirales and Other Three Sequenced Diatoms Gene Loss/Gain/Pseudonization and Functional Gene Transfer from Plastid to Nucleus Expanded IR and Conserved IR boundary in Thalassiosirales Conserved Gene Order Within Thalassiosirales Compared to Other Three Sequenced Diatoms

Thalassiosirales Comparison of inverted repeat boundaries in Thalassiosirales and other three sequenced diatoms

Results General Features of Plastid Genome of Thalassiosirales and Other Three Sequenced Diatoms Gene Loss/Gain/Pseudonization and Functional Gene Transfer from Plastid to Nucleus Conserved IR boundary and Expanded IR in Thalassiosirales Conserved Gene Order Within Thalasisosirales Compared to Other Three Sequenced Diatoms

Gene order comparison of Thalassiosirales and other three sequenced diatoms using mauveAligner

Conclusion Gene content more conserved within the Thalassiosirales plastid genomes Gene order within Thalassiosirales highly conserved, except for extensive genome rearrangement in T.oceanica. Cy. nana, T. weissflogii and Ro. cardiophora share an identical gene order, which is inferred to be the ancestral order for the Thalassiosirales The larger size of the Thalassiosirales plastid genome is largely due to expansion of inverted repeat. Missing of psaE, psaI, psaM represents first documented instance of the loss of photosynthetic genes in diatom plastid genome

Land plant 3 : 1 Green algae 1 : 1 Red algae 1 : 3 Cp Mt Comparison of nucleotide substitution rates

Land plant 3 : 1 Green algae 1 : 1 Red algae 1 : 3 Diatoms ? Cp Mt Comparison of nucleotide substitution rates

Acknowledgement All Theriot and Jansen Lab members Society Student Travel Grant Sponsor rd International Diatom Symposium Student Grant 2013 International Phycological Society Paul C. Silva Travel Award th International Phycological Congress Financial Support

Work hard, play harder!

Be a hard worker, Also have a life!