Recently Mobilized Transposons in the Human and Chimpanzee Genomes

Slides:



Advertisements
Similar presentations
A new method of finding similarity regions in DNA sequences Laurent Noé Gregory Kucherov LORIA/UHP Nancy, France LORIA/INRIA Nancy, France Corresponding.
Advertisements

Differential insertion of transposable elements in Anopheles gambiae M & S genomes Jenica L. Abrudan, Ryan C. Kennedy, Maria F. Unger, Michael R. Olson,
Biotechnology. Comparative genomics also has been used to identify recently mobilized transposons in genetically diverse humans. For example, over 600.
Homework #2 is due now Bonus #1 is due 10/24. deogr[Xpter:Xqter],genes[1.00: ]
Basic Local Alignment Search Tool BLAST Why Use BLAST?
Pairwise Sequence Alignment Part 2. Outline Summary Local and Global alignments FASTA and BLAST algorithms Evaluating significance of alignments Alignment.
Genomewide Comparison of DNA Sequences between Humans and Chimpanzees Ingo Ebersberger, Dirk Metzler, Carsten Schwarz, Svante Pääbo The American Journal.
Welcome to the combined BLAST and Genome Browser Tutorial.
Homework #2 is due 10/18 Bonus #1 is due 10/25. The order of Hox genes parallels the order of body parts in which they are expressed Fig
Introduction to Genes and Genomes with Ensembl
Genomes and Their Evolution
Genomes and Their Evolution
SGN23 The Organization of the Human Genome
The Chimpanzee Genome Motivation for sequencing
Natural Mutagenesis of Human Genomes by Endogenous Retrotransposons
Multiplex Detection of Ehrlichia and Anaplasma Species Pathogens in Peripheral Blood by Real-Time Reverse Transcriptase-Polymerase Chain Reaction  Kamesh.
Novel PMS2 Pseudogenes Can Conceal Recessive Mutations Causing a Distinctive Childhood Cancer Syndrome  Michel De Vos, Bruce E. Hayward, Susan Picton,
Somatic Mutation Screening Using Archival Formalin-Fixed, Paraffin-Embedded Tissues by Fluidigm Multiplex PCR and Illumina Sequencing  Ming Wang, Leire.
Long-read sequence assembly of the gorilla genome
Jacek Majewski  The American Journal of Human Genetics 
Genetic Consequences of Programmed Genome Rearrangement
Cell Lineage Analysis in Human Brain Using Endogenous Retroelements
Michael Cullen, Stephen P
Volume 146, Issue 6, Pages (September 2011)
Basic Local Alignment Search Tool
Eric Samorodnitsky, Jharna Datta, Benjamin M
The Fine-Scale and Complex Architecture of Human Copy-Number Variation
Next-Generation Sequencing Strategies Enable Routine Detection of Balanced Chromosome Rearrangements for Clinical Diagnostics and Genetic Research  Michael E.
ATLAS: A System to Selectively Identify Human-Specific L1 Insertions
Thomas Willems, Melissa Gymrek, G
Volume 3, Issue 4, Pages (April 2013)
Evolutionary Rewiring of Human Regulatory Networks by Waves of Genome Expansion  Davide Marnetto, Federica Mantica, Ivan Molineris, Elena Grassi, Igor.
Decoding NF1 Intragenic Copy-Number Variations
Evidence for Widespread Reticulate Evolution within Human Duplicons
Volume 133, Issue 3, Pages (May 2008)
Basic Local Alignment Search Tool (BLAST)
The Dual Origin of the Malagasy in Island Southeast Asia and East Africa: Evidence from Maternal and Paternal Lineages  Matthew E. Hurles, Bryan C. Sykes,
Jessica Yingling, Kazuhito Toyo-oka, Anthony Wynshaw-Boris 
Figure 3 Stepwise strategy of bioinformatic analysis of sequencing results We obtained 7,292,715 and 303,698 DNA sequence reads from the affected brain.
Towfique Raj, Manik Kuchroo, Joseph M
Lauren M. Mathews, Susan Y
Volume 32, Issue 6, Pages (March 2015)
Rare Sequence Variation in the Genome Flanking a Short Tandem Repeat Locus Can Lead to a Question of “Nonmaternity”  Anne Deucher, Tsoyu Chiang, Iris.
Jeffrey Staples, Dandi Qiao, Michael H. Cho, Edwin K
Human Genomic Deletions Mediated by Recombination between Alu Elements
A Comprehensive Analysis of Recently Integrated Human Ta L1 Elements
High-Throughput Identification and Quantification of Candida Species Using High Resolution Derivative Melt Analysis of Panfungal Amplicons  Tasneem Mandviwala,
Beth Elliott, Christine Richardson, Maria Jasin  Molecular Cell 
Gene Conversion between the X Chromosome and the Male-Specific Region of the Y Chromosome at a Translocation Hotspot  Zoë H. Rosser, Patricia Balaresque,
Accurate Non-parametric Estimation of Recent Effective Population Size from Segments of Identity by Descent  Sharon R. Browning, Brian L. Browning  The.
Volume 8, Issue 1, Pages (July 2003)
Erratum The American Journal of Human Genetics
E.J. Hollox, J.A.L. Armour, J.C.K. Barber 
Complete Haplotype Sequence of the Human Immunoglobulin Heavy-Chain Variable, Diversity, and Joining Genes and Characterization of Allelic and Copy-Number.
Human Transposon Tectonics
Long Homozygous Chromosomal Segments in Reference Families from the Centre d'Étude du Polymorphisme Humain  Karl W. Broman, James L. Weber  The American.
Novel PMS2 Pseudogenes Can Conceal Recessive Mutations Causing a Distinctive Childhood Cancer Syndrome  Michel De Vos, Bruce E. Hayward, Susan Picton,
A Fast, Powerful Method for Detecting Identity by Descent
A HERV-K provirus in chimpanzees, bonobos and gorillas, but not humans
Characterization and Mutation Analysis of Human LEFTY A and LEFTY B, Homologues of Murine Genes Implicated in Left-Right Axis Development  K. Kosaki,
Volume 78, Issue 7, Pages (October 2010)
Efficient Computation of Significance Levels for Multiple Associations in Large Studies of Correlated Data, Including Genomewide Association Studies 
Enhanced Localization of Genetic Samples through Linkage-Disequilibrium Correction  Yael Baran, Inés Quintela, Ángel Carracedo, Bogdan Pasaniuc, Eran Halperin 
Haplotype Block Partition with Limited Resources and Applications to Human Chromosome 21 Haplotype Data  Kui Zhang, Fengzhu Sun, Michael S. Waterman,
Next-Generation Sequencing of Duplication CNVs Reveals that Most Are Tandem and Some Create Fusion Genes at Breakpoints  Scott Newman, Karen E. Hermetz,
Kung-Yee Liang, Fang-Chi Hsu, Terri H. Beaty, Kathleen C. Barnes 
Haplotypes in the Dystrophin DNA Segment Point to a Mosaic Origin of Modern Human Diversity  Ewa Ziętkiewicz, Vania Yotova, Dominik Gehl, Tina Wambach,
Genomewide Comparison of DNA Sequences between Humans and Chimpanzees
Bruce Rannala, Jeff P. Reeve  The American Journal of Human Genetics 
The Size Distribution of Homozygous Segments in the Human Genome
Presentation transcript:

Recently Mobilized Transposons in the Human and Chimpanzee Genomes Ryan E. Mills, E. Andrew Bennett, Rebecca C. Iskow, Christopher T. Luttig, Circe Tsui, W. Stephen Pittard, Scott E. Devine  The American Journal of Human Genetics  Volume 78, Issue 4, Pages 671-679 (April 2006) DOI: 10.1086/501028 Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 1 Overview of our transposon insertion–discovery pipeline. A, The time line for speciation of humans and chimpanzees is compared with the time line for the generation of transposon insertions. Common insertions occurred a very long time ago and are fixed in both species. “Species-specific” insertions are differentially present in the two species and occurred mostly during the past ∼6 million years. MYA = million years ago. B, Our strategy for identifying new transposon insertions in humans and chimpanzees. Recently mobilized transposons are flanked by TSDs and are precisely absent from one of the two genomes. One of the two copies of the TSD is actually found within the indel. Thus, the transposon plus one TSD copy equals the “fill.”C, Our computational pipeline. The five sequential steps of our computational pipeline for discovering species-specific transposon insertions in humans and chimpanzees are depicted. The draft chimpanzee-genome (build panTro1) and human-genome (build hg17) sequences were obtained from the University of California Santa Cruz browser (Kent et al. 2002). BAC clone sequences for the chimpanzee genome were obtained from GenBank (National Center for Biotechnology Information [NCBI]). BLAST programs also were obtained from NCBI. Repeatmasker was obtained from Arian Smit (Institute for Systems Biology). RepBase version 10.02 and the consensus sequence for the L1-Hs element were obtained from Jurzy Jurka (Jurka 2000). Full-length consensus sequences for L1-PA2, L1-PA3, L1-PA4, and L1-PA5 were obtained from GenBank (Boissinot et al. 2000). Custom MySQL databases and PERL scripts were generated as necessary. All analysis was performed locally on SUN SunFire v40z or Dell Power Edge 2500 servers running Linux operating systems. Our computational pipeline began with identification of all indels in humans versus chimpanzees using genomic alignments that were generated with BLASTz. Next, indels containing transposons were identified using Repeatmasker (A. Smit, unpublished material) and RepBase version 10.02 (Jurka 2000). RepBase libraries for humans and chimpanzees were modified to include full-length L1-PA2, L1-PA3, L1-PA4, L1-PA5 consensus sequences (Boissinot et al. 2000). TSDs were identified using a Smith-Waterman local alignment algorithm on the regions flanking each indel junction. The algorithm was restricted to require the optimum alignment to be located within 5 bp of the indel junction. Aligned sequences smaller than 4 bp or having an identity <90% were not scored as TSDs. A probability scoring system was developed to determine the likelihood that a given indel was caused by a single transposon insertion plus its TSD. This score was obtained by adding together the fraction of the indel that was accounted for by the transposon, its TSD, and a poly (A) tail (if present). A score of 1.0 indicated that the gap was fully accounted for by the transposon and associated sequences. We empirically determined that a lower cutoff of 0.85 provided accurate results while eliminating few, if any, true positives. SVA elements initially were annotated poorly by Repeatmasker. This program often split SVA elements into 2–3 segments (and thus counted most elements more than once). We developed a new method to reassemble these segments into a single element, where appropriate. The American Journal of Human Genetics 2006 78, 671-679DOI: (10.1086/501028) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 2 Classes of species-specific transposons in humans and chimpanzees. A, The overall composition of species-specific insertions in humans and chimpanzees. Note that 97.2% of all insertions in humans and 95.6% of all insertions in chimpanzees are Alu, L1, and SVA insertions. B, The distributions of Alu and L1 subfamilies for humans. C, The distributions of Alu and L1 subfamilies for chimpanzees. Note that different Alu and L1 subfamilies were amplified in humans (B) and chimpanzees (C). The American Journal of Human Genetics 2006 78, 671-679DOI: (10.1086/501028) Copyright © 2006 The American Society of Human Genetics Terms and Conditions

Figure 3 Genomic distributions of transposon insertions. A, Genomic distribution of Alu, L1, SVA, and other elements in the human genome. B, Genomic distribution of Alu, L1, SVA and other elements in the chimpanzee genome. For both genomes, the number of insertions in each chromosome is generally proportional to the amount of DNA present. Note that the Y-axis is the same for both charts. Thus, many more transposon insertions are present throughout the human genome than the chimpanzee genome (compare the number of insertions depicted in panels A and B). The American Journal of Human Genetics 2006 78, 671-679DOI: (10.1086/501028) Copyright © 2006 The American Society of Human Genetics Terms and Conditions