The Challenges Of Sequencing FFPE DNA Using NGS

Slides:



Advertisements
Similar presentations
Next-Generation Sequencing: Methodology and Application
Advertisements

Sequence Capture and Targeted Re-sequencing
RNA-seq library prep introduction
Analysis of the Epidermal Growth Factor Receptor and K-Ras genes in patients with Non-small Cell Lung Cancer H. Mugalaasi1, J. Davies2, L Medley2, D Talbot2,
Development of a BRCA2 screening service – Introduction of high resolution MELT analysis A Grade Trainee Project Nick Camm Yorkshire Regional Genetics.
The Use of Circulating Tumour DNA as a Liquid Biopsy
Reference mapping and variant detection Peter Tsai Bioinformatics Institute, University of Auckland.
We processed six samples in triplicate using 11 different array platforms at one or two laboratories. we obtained measures of array signal variability.
Next–generation DNA sequencing technologies – theory & practice
From the homework: Distribution of DNA fragments generated by Micrococcal nuclease digestion mean(nucs) = bp median(nucs) = 110 bp sd(nucs+ = 17.3.
Next-generation sequencing
Targeted Data Introduction  Many mapping, alignment and variant calling algorithms  Most of these have been developed for whole genome sequencing and.
The 454 and Ion PGM at the Genomics Core Facility Dr. Deborah Grove, Director for Genetic Analysis Genomics Core Facility Huck Institutes of the Life Sciences.
Bioinformatics pipeline for detection of immunogenic cancer mutations by high throughput mRNA sequencing Jorge Duitama 1, Ion Mandoiu 1, and Pramod Srivastava.
I inherited What??? You and Your Genes: The Explosive New World of Genetics David Finegold, M.D.
NGS genetic analysis for National Lung Matrix Trial
Molecular Testing of lung cancer in routine practice
EGFR gene mutation testing in NSCLC
High Throughput Sequencing
Considerations for Analyzing Targeted NGS Data BRCA Tim Hague,CTO.
Considerations for Analyzing Targeted NGS Data BRCA Tim Hague,CTO.
Dr Katie Snape Specialist Registrar in Genetics St Georges Hospital
Considerations for Analyzing Targeted NGS Data Introduction Tim Hague, CTO.
Transmissibility of B19 by transfusion: evaluation of donor viremia prior to performing a linked donor-recipient study M Busch, representing REDS-II Parvo.
Genomics – Next-Gen sequencing and Microarrays
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
HaloPlexHS Get to Know Your DNA. Every Single Fragment.
Development of Molecular Methodologies for the Enhanced Detection of Tumour Biomarkers Michelle Wood, Hood Mugalassi, Justyna Tull, Linda Meredith, Rachel.
Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.
E XOME SEQUENCING AND COMPLEX DISEASE : practical aspects of rare variant association studies Alice Bouchoms Amaury Vanvinckenroye Maxime Legrand 1.
PCR assay of intragenic mutation lesions induced by monoenergetic fission neutrons and gamma rays in Drosophila Part I: Gamma rays Nanette Brand 1 Nonhlanhla.
High density array comparative genomic hybridisation (aCGH) for dosage analysis and rapid breakpoint mapping in Duchenne Muscular Dystrophy (DMD) Victoria.
Supplemental Figure 1. Bias-corrected NGS bioinformatics strategies. Paired-end DNA sequencing reveals the sequence of the genomic clone, the sample ID.
INTERPRETING GENETIC MUTATIONAL DATA FOR CLINICAL ONCOLOGY Ben Ho Park, M.D., Ph.D. Associate Professor of Oncology Johns Hopkins University May 2014.
CyVerse Workshop Transcriptome Assembly. Overview of work RNA-Seq without a reference genome Generate Sequence QC and Processing Transcriptome Assembly.
Reliable Identification of Genomic Variants from RNA-seq Data Robert Piskol, Gokul Ramaswami, Jin Billy Li PRESENTED BY GAYATHRI RAJAN VINEELA GANGALAPUDI.
Library QA & QC Day 1, Video 3
Canadian Bioinformatics Workshops
A comparison of somatic mutation callers in breast cancer samples and matched blood samples THOMAS BRETONNET BIOINFORMATICS AND COMPUTATIONAL BIOLOGY UNIT.
Next Generation Sequencing
Quality Control Metrics for DNA Sequencing
Interpreting exomes and genomes: a beginner’s guide
Ultra-Deep Sequencing of Multiplex-PCR Enriched Hotspot and
NextGenBUG 14 August 2015 Alison Meynert
Development of a next-generation sequencing gene panel for neurogenetic disorders J E Martindale, R Crookes, L Crooks, N J Beauchamp, A Dalton Sheffield.
Comparison of Clinical Targeted Next-Generation Sequence Data from Formalin-Fixed and Fresh-Frozen Tissue Specimens  David H. Spencer, Jennifer K. Sehn,
Implementation of a Reliable Next-Generation Sequencing Strategy for Molecular Diagnosis of Dystrophinopathies  Melissa Alame, Delphine Lacourt, Reda.
DNA Extraction of Lung Cancer Samples for Advanced Diagnostic Testing
Reverse Complement PCR: fast, low cost amplicon based NGS
Carol Beadling, Tanaya L. Neff, Michael C
Content and Labeling of Tests Marketed as Clinical “Whole-Exome Sequencing” Perspectives from a cancer genetics clinician and clinical lab director Allen.
A Targeted High-Throughput Next-Generation Sequencing Panel for Clinical Screening of Mutations, Gene Amplifications, and Fusions in Solid Tumors  Rajyalakshmi.
Reliable Next-Generation Sequencing of Formalin-Fixed, Paraffin-Embedded Tissue Using Single Molecule Tags  Astrid Eijkelenboom, Eveline J. Kamping, Annemiek.
Validation of a Next-Generation Sequencing Pipeline for the Molecular Diagnosis of Multiple Inherited Cancer Predisposing Syndromes  Paula Paulo, Pedro.
Screening for Mutations in Kidney-Related Genes Using SURVEYOR Nuclease for Cleavage at Heteroduplex Mismatches  Konstantinos Voskarides, Constantinos.
Somatic Mutation Screening Using Archival Formalin-Fixed, Paraffin-Embedded Tissues by Fluidigm Multiplex PCR and Illumina Sequencing  Ming Wang, Leire.
Targeted, High-Depth, Next-Generation Sequencing of Cancer Genes in Formalin- Fixed, Paraffin-Embedded and Fine-Needle Aspiration Tumor Specimens  Andrew.
Robustness of Amplicon Deep Sequencing Underlines Its Utility in Clinical Applications  Vera Grossmann, Andreas Roller, Hans-Ulrich Klein, Sandra Weissmann,
Hybrid Capture and Next-Generation Sequencing Identify Viral Integration Sites from Formalin-Fixed, Paraffin-Embedded Tissue  Eric J. Duncavage, Vincent.
Molecular Analysis of Circulating Cell-Free DNA from Lung Cancer Patients in Routine Laboratory Practice  Stephan Bartels, Sascha Persing, Britta Hasemeier,
Application of Single-Molecule Amplification and Resequencing Technology for Broad Surveillance of Plasma Mutations in Patients with Advanced Lung Adenocarcinoma 
Molecular Diagnosis of Autosomal Dominant Polycystic Kidney Disease Using Next- Generation Sequencing  Adrian Y. Tan, Alber Michaeel, Genyan Liu, Olivier.
Eric Samorodnitsky, Jharna Datta, Benjamin M
Catherine E. Cottrell, Hussam Al-Kateb, Andrew J. Bredemeyer, Eric J
Validation and Implementation of a Custom Next-Generation Sequencing Clinical Assay for Hematologic Malignancies  Michael J. Kluk, R. Coleman Lindsley,
Cancer WGS Analytical Pipeline Validation
Clinical Validation of a Next-Generation Sequencing Screen for Mutational Hotspots in 46 Cancer-Related Genes  Rajesh R. Singh, Keyur P. Patel, Mark J.
TS Tumor Panel (15 Genes) Overview
Genomic & RNA Profiling Core Facility
Development of a Novel Next-Generation Sequencing Assay for Carrier Screening in Old Order Amish and Mennonite Populations of Pennsylvania  Erin L. Crowgey,
Presentation transcript:

The Challenges Of Sequencing FFPE DNA Using NGS Hazel Ingram

Problems with FFPE Poor quality/fragmented DNA Fixation artefacts Insufficient tumour material Low tumour content Measuring Quality Nanodrop vs Qubit Fragmentation I’m sure anyone who has worked with FFPE samples before knows how difficult they are to work with. But they are the most widely practiced method for clinical sample preservation and archiving. It is often poor quality/fragmented DNA  due to the fixation process – fragmented and cross-linked DNA, with high conc of contaminants, the number of amplifiable copies of DNA are reduced by strand breaks and other DNA damage. A common problem is fixation artefacts, formalin causes C>U deamination which can appear like a genuine C>T mutation – false positives This isn’t always the case, but often we can have Insufficient tumour material – particularly if the sample is a biopsy In addition to that the sample could have a low tumour content  we need to think about the sensitivity of our tests. To try and help with this we macrodissect, we ask a histopathologist to mark on an H&E slide where the tumour cells are found, and what percentage of tumour cells are found in that area. Measuring quality is also important, we use the Qubit to measure concentration for NGS work, as it is much more accurate than the Nanodrop. It would also be useful to measure the fragmentation.

Projects CRUK SMP1 FOCUS 4 WCB Sample Duplicate Single Library Prep   CRUK SMP1 FOCUS 4 WCB Sample Duplicate Single Library Prep TSCA Haloplex Sequencing MiSeq Analysis MiSeq Reporter NextGENe HGMD – custom pipeline Thresholds for variant calling 5% sensitivity 150x coverage per replicate 150x coverage 50x coverage We currently run 3 research based studies using FFPE tissue for a variety of different tumour types CRUK SMP (KRAS, PIK3CA, BRAF, NRAS, TP53, PTEN, EGFR, DDR2) –I’ll be talking mainly about this project, as it’s what I’ve been working on. FOCUS4 (KRAS, PIK3CA, BRAF, NRAS, PIK3R1) WCB (KRAS, PIK3CA, BRAF, NRAS, TP53, PTEN) Each of these 3 research projects involves investigation of a similar panel of genes, including KRAS, NRAS and BRAF. Initially, pyro and Sanger sequencing was used and NGS was implemented to streamline the testing and analysis process

Validation Determine acceptable coverage for regions of interest To match or increase sensitivity compared to original methods (pyro and sanger) Low level of false positives Must have at least 90% concordance The validation process is the same for all new panels and involves testing the panel with samples that have known mutations. We need to detemine what is an acceptable level of coverage for the ROI To at least match the sensitivity of the test compared to the original methods We need to check we are maintaining a low level of false positives And we must have at least 90% concordance

CRUK SMP1 Validation Overview Illumina validation: 42 samples sent: 37/42 matched (88%) 3/42 non concordant low coverage 2/42 non concordant design issue In House validation: 45 samples 43 matched (95%) 2/45 Overall, between hubs 18 extra variants previously undetected in 124 samples Technological fail of pyro/sanger Higher sensitivity of NGS Large number of artefacts detected Failure rate: approx 10% Artefacts Low Coverage Technological Design Higher Sensitivity of NGS So just a little bit of background on CRUK Phase I SMP trial – there are 3 TH doing the lab work, and 9 CHs. Looked at a variety of tumour types and genes. The NGS panel was designed by Illumina, so we were obliged to use their TSCA library prep kit 1st part of validation was sending samples to Illumina to test their panel, a total of 42 samples were sent, each with at least one mutation– 37/42 (88%), of the 5 that were non concordant, 3 of those missed mutations were due to low coverage, 2 of these were due to design, so the panel was tweaked, and we moved onto the second part of validation 2nd part of the validation, was in house, and this is the sort of validation that was done for the WCB and FOCUS-4. So we tested 45 old samples, and had 95% concordance, again the 2 that didn’t match were due to low coverage. Between the 3 THs, in house 124 samples were tested and 18 additional variants were detected – either through failure of original tech or higher sensitivity of NGS It was also apparent that a large number of artefacts were being produced. TSCA gave a similar failure rate to the original methods (pyro and sanger seq) of approx 10%, but because the TSCA panel extended outside of the original regions, suggesting overall failure rate is lower. The important points here are: There were lots of artefacts low NGS coverage means missed mutations Technological design We proved that there NGS had higher sensitivity

Artefacts  Duplicate Testing True variants seen in both replicates Pipeline - only sequence variants observed in both replicates retained for analysis Solution = testing of each sample in duplicate Sequencing artefacts only seen in 1 replicate Strategy Average number of variants per sample Range in number of Singlicate testing 113 3-546 Duplicate testing 30 0-60 When we were doing the CRUK validation, we soon noticed that there were a very large number of artefacts per sample. This was a big problem, as we had limited resources and timescale for analysing and interpreting this volume of data. We moved towards duplicate testing, and discovered on most occasions, seq artefacts were only seen in 1 variant and true variants seen in replicates, we used our pipeline to filter so that only those seen in both replicates were retained for analysis * Duplication used for CRUK panel only

Artefacts - Sanger Seq False Positive (deamination) KIT c.1745G>A p.Trp582X – CRUK SMP Mutation not detected by NGS (single testing: >4000x) Mean coverage across panel: 4840x KIT re-tested by Sanger…no mutation detected C>T deamination artefact caused by formalin fixation …duplicate testing solves this issue UDG treatment prior to PCR may help  removes uracil lesions by hydrolysing N-glycosidic bond Just as another point to mention, one of the non-concordant results from CRUK validaton was found to be the result of a false positive in Sanger sequencing, rather than a false negative in NGS analysis. -In this case, a melanoma FFPE sample was identified as having a low level nonsense mutation in the KIT gene by Sanger, as shown in the trace. -But no KIT mutation was identified by NGS. Re-analysis by Sanger sequencing was performed and showed WT sequence. Therefore this ‘mutation’ was concluded as being a C>T deamination artefact caused by formalin fixation of the FFPE sample. Steps can be taken to help these sort of artefacts, and there have been papers on the effect of UDG treatment prior to PCR which may help. (UDG is a DNA repair enzyme that removes the uracil lesions by hydrolysing N-glycosidic bond)

Minimum Coverage: CRUK & FOCUS4 Known variants became undetectable when coverage for region of interest <150x 150x minimum coverage cut-off implemented For CRUK: <150x coverage in either replicate = failed exon Avoided reporting false negative results Failure rates were still comparable to original testing There needs to be a minimum coverage for QC purposes. Looking at our validation data known variants became undetectable when coverage for ROI dropped below 150x, so this is much higher than we would say for Germline – but we are working with much smaller percentage mutant. So for CRUK and FOCUS4 150x minimum coverage cut off was implemented, for CRUK, if there was less than 150x coverage in either replicate the exon was failed. This meant we avoided reporting false negative results, and maintained reasonable failure rates comparable to original testing.

Design CRUK (TSCA) FOCUS4 (TSCA) WCB (Haloplex) PTEN 27bp del missed Deletion removed probe binding site FOCUS4 (TSCA) NRAS c.182A>G p.Q61R Region had zero reads  poor amplification WCB (Haloplex) PIK3CA ex9 c.1634A>G, p.E545G missed I’ll just give an example of a design issue in each of the panels CRUK – looked into this, there was good coverage (>4000 reads), discovered that the deletion removed the probe binding site for the 2 probes in this region, so there was a redesign and a different probe set was included in the panel FOCUS 4 – missed this NRAS mutation, when we investigated we noticed that the region has no reads for this patients and half of the other validaiton patients had no coverage for the same region, so did an assay redesign to get better amplification. WCB – used a different library prep method, which involves an enzyme digest. again a redeisgn was necessary… these are just some of the reasons why design and validation is very important.

Library Prep Methods: TSCA vs Haloplex Both panels designed to cover same ROI Same samples run on both panels Run on MiSeq and analysed using NextGENe for direct comparison Number of samples Number of expected mutations Number of mutations found Percentage Haloplex – NextGENe 11 28 100% TSCA - NextGENe 26 93% For some projects we were restricted by what library prep to use, but for the WCB project we have a lovely lady Alex who researched into different panels. A comparison was done, by our researcher Alex, into TSCA and haloplex. Both panels were designed to cover the same regions of interest- which included KRAS, BRAF, NNRAS, PIK3CA, and samples were set up and then run on the MiSeq, both were analysed with NextGene software for a direct comparison So as you can see the haloplex performed slightly better to the TSCA, there were 2 missed mutations.

TSCA vs Haloplex As another comparison, some of you may be familiar with IGV- a viewing software, so this is the same region for the same patient, all of the greys indicate matching bases to the ref seq and the coloured lines represent mis-matches. The top picture shows Haloplex data, and the bottom TSCA, so much more artefacts in this run. So, haloplex kit was picked for use with the WCB project

Projects CRUK SMP FOCUS 4 WCB Sample Duplicate Single Library Prep   CRUK SMP FOCUS 4 WCB Sample Duplicate Single Library Prep TSCA Haloplex Sequencing MiSeq Analysis MiSeq Reporter NextGENe HGMD – custom pipeline Thresholds for variant calling 5% sensitivity 150x coverage per duplicate 150x coverage 50x coverage And it was partly down to this cleaner data that the minimum coverage level was put much lower than the other projects

Conclusions and Future Work DNA quality from FFPE tissue is a major challenge Help interpretation by: Running in duplicate being careful with assay design and minimum coverage Additional steps e.g. UDG treatment can help minimise deamination artefacts Need a more robust NGS technology for low quality DNA from FFPE Alternative NGS platforms / providers Alternative enrichment methods e.g. target capture as alternative to PCR based So the DNA quality from FFPE tissue is a major challenge, but it is unlikely that we will move from this method of preservation in the near future, because it maintains tissue morphology allowing for macrodissection, and is easy and cheap to store in comparison to fresh frozen tissue. we’ve discovered that running in duplicate and being careful with assay design and minimum coverage cut off helps with the interpretation of these samples. Adding steps, e.g. UDG treatment prior to PCR step can help minimise deamination artefacts For the future, we need a more robust NGS technology for low quality DNA from FFPE – that could be different NGS platforms or different enrichment methods.

Acknowledgements Alex Stretton James Eden Helen Roberts Rachel Butler Matt Mort (HGMD) The All Wales Medical Genetics Service Hazel.ingram@wales.nhs.uk