Towards Personalized Genomics-Guided Cancer Immunotherapy Ion Mandoiu Department of Computer Science & Engineering Joint work with Sahar Al Seesi (CSE)

Slides:



Advertisements
Similar presentations
RNA-Seq as a Discovery Tool
Advertisements

Imputation for GWAS 6 December 2012.
Functional Genomics with Next-Generation Sequencing
An Introduction to Studying Expression Data Through RNA-seq
Marius Nicolae Computer Science and Engineering Department
RNA-Seq based discovery and reconstruction of unannotated transcripts
Alex Zelikovsky Department of Computer Science Georgia State University Joint work with Serghei Mangul, Irina Astrovskaya, Bassam Tork, Ion Mandoiu Viral.
Vanderbilt Center for Quantitative Sciences Summer Institute Sequencing Analysis Yan Guo.
 Experimental Setup  Whole brain RNA-Seq Data from Sanger Institute Mouse Genomes Project [Keane et al. 2011]  Synthetic hybrids with different levels.
BIOINFORMATICS GENE DISCOVERY BIOINFORMATICS AND GENE DISCOVERY Iosif Vaisman 1998 UNIVERSITY OF NORTH CAROLINA AT CHAPEL HILL Bioinformatics Tutorials.
RNAseq.
“BIG DATA” from RNA-Seq Experiments. Significance of RNA-Seq Approaches  Reveals which genes are expressed and the levels at which they are expressed;
CSCE555 Bioinformatics Lecture 3 Gene Finding Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu Course page:
Transcriptome Sequencing with Reference
(A) Mutations within neoepitopes lead to structural alterations across the peptide backbone, as illustrated with structural snapshots from the simulations.
Next-generation sequencing and PBRC. Next Generation Sequencer Applications DeNovo Sequencing Resequencing, Comparative Genomics Global SNP Analysis Gene.
Bioinformatics pipeline for detection of immunogenic cancer mutations by high throughput mRNA sequencing Jorge Duitama 1, Ion Mandoiu 1, and Pramod Srivastava.
Estimation of alternative splicing isoform frequencies from RNA-Seq data Ion Mandoiu Computer Science and Engineering Department University of Connecticut.
Bioinformatics Methods for Diagnosis and Treatment of Human Diseases Jorge Duitama Dissertation Defense for the Degree of Doctorate in Philosophy Computer.
Bioinformatics Pipeline for Fosmid based Molecular Haplotype Sequencing Jorge Duitama1,2, Thomas Huebsch1, Gayle McEwen1, Sabrina Schulz1, Eun-Kyung Suk1,
Bioinformatics Tools for Personalized Cancer Immunotherapy
Estimation of alternative splicing isoform frequencies from RNA-Seq data Ion Mandoiu Computer Science and Engineering Department University of Connecticut.
Next-Generation Sequencing: Challenges and Opportunities Ion Mandoiu Computer Science and Engineering Department University of Connecticut.
Bioinformatics Methods for Diagnosis and Treatment of Human Diseases Jorge Duitama Dissertation Proposal for the Degree of Doctorate in Philosophy Computer.
Towards accurate detection and genotyping of expressed variants from whole transcriptome sequencing data Jorge Duitama 1, Pramod Srivastava 2, and Ion.
Reconstruction of Haplotype Spectra from NGS Data Ion Mandoiu UTC Associate Professor in Engineering Innovation Department of Computer Science & Engineering.
Whole Exome Sequencing for Variant Discovery and Prioritisation
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
Variables: – T(p) - set of candidate transcripts on which pe read p can be mapped within 1 std. dev. – y(t) -1 if a candidate transcript t is selected,
Next Generation DNA Sequencing
Computational methods for genomics-guided immunotherapy
CSCI 6900/4900 Special Topics in Computer Science Automata and Formal Grammars for Bioinformatics Bioinformatics problems sequence comparison pattern/structure.
The iPlant Collaborative
1 Transcript modeling Brent lab. 2 Overview Of Entertainment  Gene prediction Jeltje van Baren  Improving gene prediction with tiling arrays Aaron Tenney.
Serghei Mangul Department of Computer Science Georgia State University Joint work with Irina Astrovskaya, Marius Nicolae, Bassam Tork, Ion Mandoiu and.
Sahar Al Seesi and Ion Măndoiu Computer Science and Engineering
Introduction to RNAseq
Geuvadis Analysis Meeting 16/02/2012 Micha Sammeth CNAG – Barcelona.
Computational methods for genomics-guided immunotherapy Sahar Al Seesi Computer Science & Engineering Department, UCONN Immunology Department, UCONN Health.
Scalable Algorithms for Next-Generation Sequencing Data Analysis Ion Mandoiu UTC Associate Professor in Engineering Innovation Department of Computer Science.
TOX680 Unveiling the Transcriptome using RNA-seq Jinze Liu.
Lesson Four Structure of a Gene. Gene Structure What is a gene? Gene: a unit of DNA on a chromosome that codes for a protein(s) –Exons –Introns –Promoter.
Scalable Algorithms for Next-Generation Sequencing Data Analysis Ion Mandoiu UTC Associate Professor in Engineering Innovation Department of Computer Science.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Accessing and visualizing genomics data
Reliable Identification of Genomic Variants from RNA-seq Data Robert Piskol, Gokul Ramaswami, Jin Billy Li PRESENTED BY GAYATHRI RAJAN VINEELA GANGALAPUDI.
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
From Reads to Results Exome-seq analysis at CCBR
Cancer Vaccine Design Ion Mandoiu
Lesson Four Structure of a Gene.
Gil McVean Department of Statistics
Lesson Four Structure of a Gene.
Statistical Applications in Biology and Genetics
Computational methods for genomics-guided immunotherapy
Overview of next-generation sequencing, neoantigen prediction, and functional T-cell analyses. Overview of next-generation sequencing, neoantigen prediction,
Gene expression estimation from RNA-Seq data
Sequencing Data Analysis
Sahar Al Seesi University of Connecticut CANGS 2017
Proteomics Informatics David Fenyő
Genomic alterations in breast cancer cell line MDA-MB-231.
Pairing T-cell Receptor Sequences using Pooling and Min-cost Flows
RNA sequencing (RNA-Seq) and its application in ovarian cancer
Diverse abnormalities manifest in RNA
Fig. 1 Cancer exome–based identification of neoantigens.
Dec. 22, 2011 live call UCONN: Ion Mandoiu, Sahar Al Seesi
Sequence Analysis - RNA-Seq 2
Schematic representation of a transcriptomic evaluation approach.
Sequencing Data Analysis
Fig. 1 Cancer exome–based identification of neoantigens.
Presentation transcript:

Towards Personalized Genomics-Guided Cancer Immunotherapy Ion Mandoiu Department of Computer Science & Engineering Joint work with Sahar Al Seesi (CSE) Jorge Duitama (CIAT) Fei Duan, Tatiana Blanchard, Pramod K. Srivastava (UCHC)

2 Mandoiu Lab Main Research Areas: Bioinformatics Algorithms Development of Computational Methods for Next-Gen Sequencing Data Analysis Ongoing Projects RNA-Seq Analysis (NSF, NIH, Life Technologies) -Novel transcript reconstruction -Allele-specific isoform expression Viral quasispecies reconstruction (USDA) -IBV evolution and vaccine optimization Genome assembly and scaffolding, LD-based genotype calling, local ancestry inference, metabolomics, … -More info & software at -Computational deconvolution of heterogeneous samples

Genomics-Guided Cancer Immunotherapy CTCAATTGATGAAATTGTTCTGAAACT GCAGAGATAGCTAAAGGATACCGGGTT CCGGTATCCTTTAGCTATCTCTGCCTC CTGACACCATCTGTGTGGGCTACCATG … AGGCAAGCTCATGGCCAAATCATGAGA mRNA Sequencing SYFPEITHI ISETDLSLL CALRRNESL … Tumor Specific Epitopes Peptide Synthesis Immune System Stimulation Mouse Image Source: Tumor Remission T-Cell Response

Bioinformatics Pipeline

Hybrid Read Alignment Approach mRNA reads Transcript Library Mapping Genome Mapping Read Merging Transcript mapped reads Genome mapped reads Mapped reads More efficient compared to spliced alignment onto genome Stringent filtering: reads with multiple alignments are discarded

Clipping Alignments

Removal of PCR Artifacts

Variant Detection and Genotyping AACGCGGCCAGCCGGCTTCTGTCGGCCAGCAGCCAGGAATCTGGAAACAATGGCTACAGCGTGC AACGCGGCCAGCCGGCTTCTGTCGGCCAGCCGGCAG CGCGGCCAGCCGGCTTCTGTCGGCCAGCAGCCCGGA GCGGCCAGCCGGCTTCTGTCGGCCAGCCGGCAGGGA GCCAGCCGGCTTCTGTCGGCCAGCAGCCAGGAATCT GCCGGCTTCTGTCGGCCAGCAGCCAGGAATCTGGAA CTTCTGTCGGCCAGCCGGCAGGAATCTGGAAACAAT CGGCCAGCAGCCAGGAATCTGGAAACAATGGCTACA CCAGCAGCCAGGAATCTGGAAACAATGGCTACAGCG CAAGCAGCCAGGAATCTGGAAACAATGGCTACAGCG GCAGCCAGGAATCTGGAAACAATGGCTACAGCGTGC Reference genome Locus i RiRi

Variant Detection and Genotyping Pick genotype with the largest posterior probability

Accuracy as Function of Coverage

Haplotyping Somatic cells are diploid, containing two nearly identical copies of each autosomal chromosome – Novel mutations are present on only one chromosome copy – For epitope prediction we need to know if nearby mutations appear in phase LocusMutationAlleles 1SNVC,T 2DeletionC,- 3SNVA,G 4Insertion-,GC LocusMutationHaplotype 1 Haplotype 2 1SNVTC 2DeletionC- 3SNVAG 4Insertion-GC

RefHap Algorithm Reduce the problem to Max-Cut Solve Max-Cut Build haplotypes according with the cut Locus12345 f1f1 *0110 f2f2 110*1 f3f3 1**0* f4f4 *00*1 3 f1f1 1 1 f4f4 f2f2 f3f3 h h

Epitope Prediction J.W. Yedell, E Reits and J Neefjes. Making sense of mass destruction: quantitating MHC class I antigen presentation. Nature Reviews Immunology, 3: , 2003 C. Lundegaard et al. MHC Class I Epitope Binding Prediction Trained on Small Data Sets. In Lecture Notes in Computer Science, 3239: , 2004 Profile weight matrix (PWM) model

Results on Tumor Data Tumor TypeMethACMS5 RNA-Seq Reads (Million) Genome Mapped75%54% Transcriptome Mapped83%59% HardMerge Mapped50%36% HardMerge Mapped Bases (Gb) High-Quality Heterozygous SNVs in CCDS Exons 1, Non-synonymous 1, Missense 1, Nonsense 63 4 No-stop 1 - NetMHC Predicted Epitopes Mean Tumor Diameter (mm) Days after tumor challenge AUC (mm 2 ) P < Tumor rejection potential of identified epitopes currently evaluated experimentally in the Srivastava lab

Ongoing Work Sequencing of spontaneous tumors (TRAMP mice) Detecting other forms of variation: indels, gene fusions, novel transcripts Incorporating predictions of TAP transport efficiency and proteasomal cleavage in epitope prediction Integration of mass-spectrometry data Monitoring immune response by TCR sequencing