Differential insertion of transposable elements in Anopheles gambiae M & S genomes Jenica L. Abrudan, Ryan C. Kennedy, Maria F. Unger, Michael R. Olson,

Slides:



Advertisements
Similar presentations
SHI Meng. Abstract The genetic basis of gene expression variation has long been studied with the aim to understand the landscape of regulatory variants,
Advertisements

Combined analysis of ChIP- chip data and sequence data Harbison et al. CS 466 Saurabh Sinha.
Duplication, rearrangement, and mutation of DNA contribute to genome evolution Chapter 21, Section 5.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Microbial Genetics (Micr340)
Human Genetics Overview.
Transposons First discovered in plants. Transposition to a new site Transposons *Three classes of transposons Class I : Retrotransposons Class II: Transposons.
ABSTRACT We have conducted an extensive computational analysis of the Culex quinquefasciatus genome to find and annotate a specific subfamily of the TEs:
DNA and Chromosome Structure. Chromosomal Structure of the Genetic Material.
Comparison of Drosophila Genomes Li-Lun, Ho. D. melanogaster vs. D. yakuba D. yakuba genome is assembled in Apr, D. yakuba genome has 14 times higher.
Bacterial Genetics Xiao-Kui GUO PhD.
Scott Emrich Assistant Professor, Computer Science and Engineering Scientific Manager, VectorBase University of Notre Dame A flexible, scalable genomics.
VectorBase A Resource Centre for Invertebrate Hosts of Human Pathogens Bob MacCallum Imperial College London.
Abstract Although transposable elements (TEs) were discovered over 50 years ago, the robust discovery of them in newly sequenced genomes remains a difficult.
Homework #2 is due now Bonus #1 is due 10/24. deogr[Xpter:Xqter],genes[1.00: ]
Selfish DNA Honors Genetics.
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
Transposon and Mechanisms of Transposition
Translesion DNA Synthesis Cells bypass lesions encountered at the replication fork during DNA synthesis and correct them after replication is finished.
Development: differentiating cells to become an organism.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
BACTERIAL TRANSPOSONS
Viruses Gene Regulation results in differential Gene Expression, leading to cell Specialization.
DNA TECHNOLOGY AND GENOMICS CHAPTER 20 P
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
Vectorbase and Galaxy Jarek Nabrzyski On behalf of VectorBase Center for Research Computing University of Notre Dame VectorBase Bioinformatics Resource.
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Lettuce/Sunflower EST CGPDB project. Data analysis, assembly visualization and validation. Alexander Kozik, Brian Chan, Richard Michelmore. Department.
Mobile DNA  Transposons By Anna Purna
Transposable Elements DNA Sequences That Change Positions in the Genome.
The Human Genome The sequence of the nitrogenous bases found on the 23 pairs of chromosomes There are roughly 3 billion nitrogenous bases in one complete.
MULTIPLE POPULATIONS OF ARTEMISININ-RESISTANT PLASMODIUM FALCIPARUM IN CAMBODIA MIOTTO ET. AL Presented by Josie Benson.
Ontology Driven Data Collection for EuPathDB Jie Zheng, Omar Harb, Chris Stoeckert Center for Bioinformatics, University of Pennsylvania.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Shotgun sequencing reveals transkingdom alterations in immunodeficiency associated enteropathy Xiaoxi Dong (Oregon State University), Jialu Hu (Oregon.
Comparative Genome Analysis and Genome Evolution of Members of the Magnaporthaceae Family of Fungi.
Homework #2 is due 10/18 Bonus #1 is due 10/25. The order of Hox genes parallels the order of body parts in which they are expressed Fig
Objective: I can explain how genes jumping between chromosomes can lead to evolution. Chapter 21; Sections ; Pgs Genomes: Connecting.
The evolution of selfishness HW#5: FASTA sequence Function.
Genomes and their evolution
Genomes and Their Evolution
Transposable Elements
Chapter 13: transposable elements
Genomes and Their Evolution
SGN23 The Organization of the Human Genome
Genomes and Their Evolution
Genomes and their evolution
Genomes and their evolution
Transposons and Mechanisms of Transposition
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Introduction to Bioinformatics II
Genomes and Their Evolution
Genomes and Their Evolution
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Transcriptional response to mating in whole A. gambiae females.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
BSC1010: Intro to Biology I K. Maltz Chapter 21.
Genomes and Their Evolution
Extra chromosomal Agents Transposable elements
Presented by, Jeremy Logue.
Genomes and Their Evolution
Clustered Organization of Reproductive Genes in the C. elegans Genome
Genomes and Their Evolution
Haplotypes When the presence of two or more polymorphisms on a single chromosome is statistically correlated in a population, this is a haplotype Example.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presented by, Jeremy Logue.
Genomes and Their Evolution
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Recently Mobilized Transposons in the Human and Chimpanzee Genomes
Presentation transcript:

Differential insertion of transposable elements in Anopheles gambiae M & S genomes Jenica L. Abrudan, Ryan C. Kennedy, Maria F. Unger, Michael R. Olson, Scott J. Emrich, Frank H. Collins, Nora J. Besansky Eck Institute of Global Health, University of Notre Dame Abstract : Mosquitoes in the Anopheles gambiae species complex are the major vectors of malaria in Africa. The original A. gambiae genome sequenced was the PEST strain, which was later discovered to be a composite of the A. gambiae M and S forms. These 2 sympatric forms demonstrate reproductive isolation and are believed to be incipient or different species. They have been individually sequenced recently, so we are performing computational analysis of the three genomes to identify sequence differences. We hypothesize that transposable elements may be influencing the speciation of A. gambiae. Insertions of transposable elements have been associated with alterations in chromosome structure, recombination, replication, and gene regulation. Recent studies have indicated the existence of “speciation islands” and numerous genes differentially expressed across multiple developmental stages between the M and S forms though many of those genes lie outside of the “speciation islands” implying there are more causal factors to be discovered. We have identified sequences differently inserted between the M & S genomes relative to PEST. We then identified the subset of those sequences that contain transposable elements using a discovery pipeline we have developed. We are currently using this subset of data to identify those sequences that are in close proximity (~1kb) to gene elements, and will perform experiments designed to measure the expression levels of those genes. We hope to find a correlation between the differentially inserted transposons and the observed gene expression differences. Input data: M genome assembly S genome assembly PEST genome assembly DNA sequences from RepBase and TEFam databases for the known transposable elements in Anopheles gambiae Bioperl Generating the start up data Indels between M/S and PEST were determined by mapping the reads of M/S to the PEST assembly and comparing the distance between the mate pairs. Sequences that are inserted into the S genome assembly, but not in the M or PEST assembly - 6,767 sequences Sequences that are inserted into the M genome assembly, but not in the S or PEST assembly - 6,792 sequences Sequences that are inserted into the M and PEST genome assemblies but not in S 1,301 sequences Sequences that are inserted into the S and PEST genome assemblies but not in M 2,128 sequences Steps in analyzing the potential differential insertions between M ans S Putative insertion in M relative to S Putative insertion in S relative to M Transposable elements compiled database 1,075 transposable element related putative insertions into M relative to S 1,146 transposable element related putative insertions into S relative to M Future plans: Verify that the insertions are fixed between M & S Look at the insertion site relative to genes Look at the possible influence of transposable elements related sequences on gene expression The distribution of the families of transposable elements derived indels between the two genomes seems to be highly similar References: D. Lawson, et al., VectorBase: a data resource for invertebrate vector genomics. Nucleic Acids Research, 37:D58307, Repbase. VectorBase. TEfam. Bioperl. The VectorBase project is funded by the US National Institute of Allergy and Infectious Diseases (NIAID), contract HHSN C. The differentially inserted sequences are computed from the two genome assemblies by mapping the M and S reads to the PEST genome and measuring the distance between the mate pairs and comparing it to where the mate pair would map to PEST. Suspicious sequences fell into two categories : Sequences present in only one assembly (either M or S) Sequences present in either M or S and PEST but not in all three assemblies Sequences that were computationally found to be different between the M and S assemblies were further analyzed for presence of transposable elements. For this purpose a database was computed out of the transposable element known to be present in the A. gambiae. The sequences were blasted (blastn) against this database and only those with an e-value of or less were further considered. While the numbers differ between the two genomes, Mariner seems to be the most abundant DNA transposon for both, with 212 sequence for S and 189 for M. The next most abundant element from the same class in M, Tc1 has a much lower number for S (151 vs 94) Class I Non-LTR retrotransposons and Class II DNA transposons seem to have the highest representation among the potentially different insertions between the two genomes both as diversity of sequence and as number of sequences present.