Third generation long-read sequencing of HIV-1 transcripts discloses cell type specific and temporal regulation of RNA splicing Frederic Bushman International.

Presentation transcript:

Third generation long-read sequencing of HIV-1 transcripts discloses cell-type-specific and temporal regulation of RNA splicing
Frederic Bushman, International AIDS Meeting, Washington DC, 2012

Why Study HIV Splicing?
Splicing factors are prominent in genome-wide siRNA screens.
HIV RNAs are spliced to yield at least 40 mRNAs.
Sensitivity suggests an unexploited opportunity for intervention?
Relevant ORFs remain to be discovered?
Bushman et al., PLoS Path

Approach
Amplification: 18 primer pairs
Targets: canonical splicing, rare splicing, new splicing

RainDance Technologies: Single Molecule Droplet PCR (Tewhey et al., Nature Biotechnology, 2009)
Overlapping primer pairs amplify cDNA while maintaining ratios.
Figure panels: (a) primer library; (b) cDNA prep from infected cells.
Figure labels: cDNA template mix, primer library, PCR, break emulsion, sequence.

Pacific Biosciences: Single molecule sequencing
Fixed polymerase; phosphate-labeled nucleotides
High-throughput single-molecule real-time sequencing provides long reads, maintaining linkage between exons.
Error mitigated by:
1. Alignment to the 10 kb HIV genome
2. SMRTbell approach…
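The point about long reads maintaining linkage between exons can be illustrated with a minimal sketch. It assumes each read has already been aligned to the HIV reference as a set of blocks, so that the gaps between consecutive blocks are the spliced-out introns. The function name `junctions` and the coordinates are hypothetical and are not taken from the talk.

```python
from typing import List, Tuple

# (start, end) of one aligned block, 0-based half-open reference coordinates
Block = Tuple[int, int]

def junctions(blocks: List[Block]) -> List[Tuple[int, int]]:
    """Return (donor, acceptor) pairs for one read.

    The donor is the end of an aligned block and the acceptor is the
    start of the next block; blocks that touch or overlap are skipped.
    """
    ordered = sorted(blocks)
    return [
        (left_end, right_start)
        for (_, left_end), (right_start, _) in zip(ordered, ordered[1:])
        if right_start > left_end
    ]

# Hypothetical read that aligns as three exon blocks
read_blocks = [(0, 300), (5000, 5200), (8000, 8400)]
print(junctions(read_blocks))
```

Because both junctions come from the same read, the donor and acceptor choices are known to occur together in one message, which short reads covering a single junction cannot show.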

Pacific Biosciences: Sequence Output
930,294 HIV sequences of up to 2629 bp

Cell type | Mappable reads | Median raw read length | Longest HIV sequence
HOS (18, 24, 48 hpi) | 88,[...] | [...] bp | 2105 bp
Primary CD4 T (7 donors, triplicate, 48 hpi) | 841,[...] | [...] bp | 2629 bp

2 Novel Splice Donors (Scott Sherrill-Mix)

11 Novel Splice Acceptors (Scott Sherrill-Mix)

Novel Splice Sites
Figure: genetic map and exons.
Legend: SD = splice donor; SA = splice acceptor; * = site does not adhere to consensus.
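The slide does not say which consensus the asterisk refers to. As a rough illustration only, the sketch below assumes the common GT...AG rule, under which an intron begins with GT immediately after the donor and ends with AG immediately before the acceptor. The function name `is_canonical_intron` and the toy sequences are hypothetical.

```python
def is_canonical_intron(genome: str, donor: int, acceptor: int) -> bool:
    """Check the GT...AG rule for the intron genome[donor:acceptor].

    donor is the 0-based index of the first intron base and acceptor is
    the 0-based index of the first base of the downstream exon.
    """
    intron = genome[donor:acceptor].upper()
    return len(intron) >= 4 and intron.startswith("GT") and intron.endswith("AG")

# Toy sequences: exon "AAA", an 8-base intron, exon "TTT"
canonical = "AAAGTCCCCAGTTT"
noncanonical = "AAAGCCCCCAGTTT"
print(is_canonical_intron(canonical, 3, 11))
print(is_canonical_intron(noncanonical, 3, 11))
```

A site flagged with an asterisk would be one for which a check of this kind fails, although the authors may have applied a longer consensus motif than the two dinucleotides used here.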

Complete message population of HIV in CD4+ T cells (Scott Sherrill-Mix)
77 complete message structures
Evidence for 36 additional transcripts from partial reads
Total: 113 mRNAs
19 novel transcripts, including a new completely spliced class (~1 kb)

Novel Acceptor A8c
Novel splice acceptor A8c creates new ORFs in HIV.

Dynamic Transcript Populations
Mutually exclusive acceptors

Dynamic Transcript Populations
Temporal, cell-type and intra-human variability

Conclusions
Long-read single-molecule sequencing works well to delineate HIV message populations.
At least 113 different HIV-1 transcripts.
A 1 kb class of RNAs is prominent in HIV 89.6.
Differential splicing by cell type, time after infection, and among cells from human subjects.

Credits
Bushman laboratory, former Bushman lab, and collaborators:
Troy Brady, Gary Wang, Charles Berry, Kyle Bittinger, Brett Beitzel, Sumit Chanda, Rohini Sinha, Mary Lewinski, John Young, Scott Sherrill-Mix, Astrid Schroder, Renate Koenig, Frances Male, Angela Ciuffi, Joe Ecker, Christian Hoffmann, Heather Marshall Rose, Craig Hyde, Nirav Malani, Jeremy Leipzig, Mark Yeager, Brendan Kelly, Matt Culyba, Kushol Gupta, Young Hwang, Rick Mitchell, Greg Van Duyne, Stephanie Grunberg, Tracy Diamond, Masahiro Yamashita, Serena Dollive, Emily Charlson, Mike Emerman, Alexandra Bryson, Shannah Roth, Francis Collins, Sam Minot, Karen Ocwieja, Philippe Leboulch, Spencer Barton, Keshet Ronen, Alain Fischer, Aubrey Bailey, Greg Peterfreund, Marina Cavazzana-Calvo, Rithun Mukherjee, Salima Hacien-Bey-Abina, Jennifer Hwang, Rik Gijsbers, Kristine Yoder, Zeger Debyser, Rebecca Custers-Allen

RNA in infected cells is 14% viral.
Ratios among HIV message forms
HIV infection associated with intron retention in cellular genes
Solexa/Illumina HiSeq, 100-base paired-end reads
2 uninfected samples, 3 infected samples
HIV 89.6 in human T cells
~1 billion sequence reads, both human and HIV