Performance of Common Analysis Methods for Detecting Low-Frequency Single Nucleotide Variants in Targeted Next-Generation Sequence Data  David H. Spencer,

Slides:



Advertisements
Similar presentations
Clinical Laboratory Analysis of Immunoglobulin Heavy Chain Variable Region Genes for Chronic Lymphocytic Leukemia Prognosis  Philippe Szankasi, David.
Advertisements

Detection of FLT3 Internal Tandem Duplication in Targeted, Short-Read-Length, Next- Generation Sequencing Data  David H. Spencer, Haley J. Abel, Christina.
Lawrence J. Jennings, Maria E
Ane Y. Schmidt, Thomas v. O. Hansen, Lise B
Comparison of Clinical Targeted Next-Generation Sequence Data from Formalin-Fixed and Fresh-Frozen Tissue Specimens  David H. Spencer, Jennifer K. Sehn,
Development of a Next-Generation Sequencing Method for BRCA Mutation Screening  Maurice Chan, Shen Mo Ji, Zhen Xuan Yeo, Linda Gan, Eric Yap, Yoon Sim.
Carol Beadling, Tanaya L. Neff, Michael C
A Targeted High-Throughput Next-Generation Sequencing Panel for Clinical Screening of Mutations, Gene Amplifications, and Fusions in Solid Tumors  Rajyalakshmi.
Emily M. Kudalkar, Naif A. M
Haley J. Abel, Hussam Al-Kateb, Catherine E. Cottrell, Andrew J
In Silico Proficiency Testing for Clinical Next-Generation Sequencing
Detection of FLT3 Internal Tandem Duplication in Targeted, Short-Read-Length, Next- Generation Sequencing Data  David H. Spencer, Haley J. Abel, Christina.
A Method to Evaluate the Quality of Clinical Gene-Panel Sequencing Data for Single- Nucleotide Variant Detection  Chung Lee, Joon S. Bae, Gyu H. Ryu, Nayoung.
Validation of a Next-Generation Sequencing Pipeline for the Molecular Diagnosis of Multiple Inherited Cancer Predisposing Syndromes  Paula Paulo, Pedro.
Robustness of Amplicon Deep Sequencing Underlines Its Utility in Clinical Applications  Vera Grossmann, Andreas Roller, Hans-Ulrich Klein, Sandra Weissmann,
Improving Mutation Screening in Patients with Colorectal Cancer Predisposition Using Next-Generation Sequencing  Jean-Marc Rey, Vincent Ducros, Pascal.
Annotation of Sequence Variants in Cancer Samples
Application of Single-Molecule Amplification and Resequencing Technology for Broad Surveillance of Plasma Mutations in Patients with Advanced Lung Adenocarcinoma 
Application of Single-Molecule Amplification and Resequencing Technology for Broad Surveillance of Plasma Mutations in Patients with Advanced Lung Adenocarcinoma 
Assessment of Capture and Amplicon-Based Approaches for the Development of a Targeted Next-Generation Sequencing Pipeline to Personalize Lymphoma Management 
A Method for Next-Generation Sequencing of Paired Diagnostic and Remission Samples to Detect Mitochondrial DNA Mutations Associated with Leukemia  Ilaria.
Maxim B. Freidin, Neesa Bhudia, Eric Lim, Andrew G
Single Nucleotide Polymorphism-Based System Improves the Applicability of Quantitative PCR for Chimerism Monitoring  Egle Gineikiene, Mindaugas Stoskus,
Comparison of High-Resolution Melting Analysis, TaqMan Allelic Discrimination Assay, and Sanger Sequencing for Clopidogrel Efficacy Genotyping in Routine.
Annotation of Sequence Variants in Cancer Samples
Comparison of Allelic Discrimination by dHPLC, HRM, and TaqMan in the Detection of BRAF Mutation V600E  Pablo Carbonell, María C. Turpin, Daniel Torres-Moreno,
Utilization of Whole-Exome Next-Generation Sequencing Variant Read Frequency for Detection of Lesion-Specific, Somatic Loss of Heterozygosity in a Neurofibromatosis.
Christopher R. Cabanski, Vincent Magrini, Malachi Griffith, Obi L
A Clinical Grade Sequencing-Based Assay for CEBPA Mutation Testing
Donavan T. Cheng, Talia N. Mitchell, Ahmet Zehir, Ronak H
Cornelis J. J. Huijsmans, Jeroen Poodt, Paul H. M. Savelkoul, Mirjam H
A Rapid and Sensitive Next-Generation Sequencing Method to Detect RB1 Mutations Improves Care for Retinoblastoma Patients and Their Families  Wenhui L.
Design and Validation of a Conformation Sensitive Capillary Electrophoresis-Based Mutation Scanning System and Automated Data Analysis of the More than.
Eric Samorodnitsky, Jharna Datta, Benjamin M
Clinical Laboratory Analysis of Immunoglobulin Heavy Chain Variable Region Genes for Chronic Lymphocytic Leukemia Prognosis  Philippe Szankasi, David.
Catherine E. Cottrell, Hussam Al-Kateb, Andrew J. Bredemeyer, Eric J
Detection of Dual IDH1 and IDH2 Mutations by Targeted Next-Generation Sequencing in Acute Myeloid Leukemia and Myelodysplastic Syndromes  Mia Y. Platt,
Validation and Implementation of a Custom Next-Generation Sequencing Clinical Assay for Hematologic Malignancies  Michael J. Kluk, R. Coleman Lindsley,
Development and Clinical Implementation of a Combination Deletion PCR and Multiplex Ligation-Dependent Probe Amplification Assay for Detecting Deletions.
Comprehensive Diagnostic Testing for Stereocilin
Development of a Next-Generation Sequencing Method for BRCA Mutation Screening  Maurice Chan, Shen Mo Ji, Zhen Xuan Yeo, Linda Gan, Eric Yap, Yoon Sim.
Jamal H. Carter, Samantha N. McNulty, Patrick J. Cimino, Catherine E
Next-Generation Sequencing for Simultaneous Determination of Human Papillomavirus Load, Subtype, and Associated Genomic Copy Number Changes in Tumors 
Analytical Validation of Clinical Whole-Genome and Transcriptome Sequencing of Patient-Derived Tumors for Reporting Targetable Variants in Cancer  Kazimierz.
Detection of Mutations in Myeloid Malignancies through Paired-Sample Analysis of Microdroplet-PCR Deep Sequencing Data  Donavan T. Cheng, Janice Cheng,
Utility of NIST Whole-Genome Reference Materials for the Technical Validation of a Multigene Next-Generation Sequencing Test  Bennett O.V. Shum, Ilya.
Development and Validation of a Template-Independent Next-Generation Sequencing Assay for Detecting Low-Level Resistance-Associated Variants of Hepatitis.
A Systematic Comparison of Traditional and Multigene Panel Testing for Hereditary Breast and Ovarian Cancer Genes in More Than 1000 Patients  Stephen.
Genotyping Single Nucleotide Polymorphisms in Human Genomic DNA with an Automated and Self-Contained PCR Cassette  Dammika P. Manage, Lucy Ma, Jana Lauzon,
Report of New Haplotype for ABCC2 Gene
Technical Validation of a Next-Generation Sequencing Assay for Detecting Clinically Relevant Levels of Breast Cancer–Related Single-Nucleotide Variants.
Deamination Effects in Formalin-Fixed, Paraffin-Embedded Tissue Samples in the Era of Precision Medicine  Seokhwi Kim, Charny Park, Yongick Ji, Deok G.
Sanger Confirmation Is Required to Achieve Optimal Sensitivity and Specificity in Next- Generation Sequencing Panel Testing  Wenbo Mu, Hsiao-Mei Lu, Jefferey.
Multiplexed High Resolution Melting Assay for Versatile Sample Tracking in a Diagnostic and Research Setting  Céline Helsmoortel, R. Frank Kooy, Geert.
Larissa V. Furtado, Helmut C. Weigelin, Kojo S. J
Analytical Validation and Application of a Targeted Next-Generation Sequencing Mutation-Detection Assay for Use in Treatment Assignment in the NCI-MPACT.
External Quality Assessment for Detection of Fetal Trisomy 21, 18, and 13 by Massively Parallel Sequencing in Clinical Laboratories  Rui Zhang, Hongyun.
A Variant Detection Pipeline for Inherited Cardiomyopathy–Associated Genes Using Next-Generation Sequencing  Théo G.M. Oliveira, Miguel Mitne-Neto, Louise.
Rondell P. Graham, Michelle A. Dina, Sarah C. Howe, Malinda L
Genomic Technologies and the New Era of Genomic Medicine
A Clinically Validated Diagnostic Second-Generation Sequencing Assay for Detection of Hereditary BRCA1 and BRCA2 Mutations  Ian E. Bosdet, T. Roderick.
Lawrence J. Jennings, Maria E
Rapid and Sensitive Real-Time Polymerase Chain Reaction Method for Detection and Quantification of 3243A>G Mitochondrial Point Mutation  Rinki Singh,
Katy Hanlon, Lorna W. Harries, Sian Ellard, Claudius E. Rudin 
A Real-Time PCR Assay for the Simultaneous Detection of Functional N284I and L412F Polymorphisms in the Human Toll-Like Receptor 3 Gene  Robert A. Brown,
Jared R. Kohler, David J. Cutler 
Nathan D. Montgomery, Sara R. Selitsky, Nirali M. Patel, D
The History and Impact of Molecular Coding Changes on Coverage and Reimbursement of Molecular Diagnostic Tests  Susan J. Hsiao, Mahesh M. Mansukhani,
Development of a Novel Next-Generation Sequencing Assay for Carrier Screening in Old Order Amish and Mennonite Populations of Pennsylvania  Erin L. Crowgey,
Lawrence J. Jennings, Maria E
Presentation transcript:

Performance of Common Analysis Methods for Detecting Low-Frequency Single Nucleotide Variants in Targeted Next-Generation Sequence Data  David H. Spencer, Manoj Tyagi, Francesco Vallania, Andrew J. Bredemeyer, John D. Pfeifer, Rob D. Mitra, Eric J. Duncavage  The Journal of Molecular Diagnostics  Volume 16, Issue 1, Pages 75-88 (January 2014) DOI: 10.1016/j.jmoldx.2013.09.003 Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions

Figure 1 A: Theoretical probability of sampling variant alleles of differing frequencies (25%, 10%, 5%, and 2.5%) two or more times versus coverage depth, based on binominal sampling statistics. B: Distribution of observed minor gold standard VAFs for all mixed samples. Each curve represents the distribution for a single sample with mixture proportions of 50%, 20%, 10%, and 5%, which were expected to have VAFs of 25%, 10%, 5%, and 2.5%, respectively. Gray bars show the middle 95% for each expected distribution. The Journal of Molecular Diagnostics 2014 16, 75-88DOI: (10.1016/j.jmoldx.2013.09.003) Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions

Figure 2 Performance of GATK, SAMtools, VarScan2, and SPLINTER for detecting low-frequency variants in mixed samples at positions with coverage ≥100×. A: Sensitivity for detecting all heterozygous minor gold standard variants in samples with mix proportions of 50%, 20%, 10%, and 5% (mean observed gold standard VAFs, 25.5%, 11.2%, 6.8%, and 4.2%, respectively). Sensitivities (True positive/True positive + False negative) are point estimates based on detection of all minor gold standard variants at positions with ≥100× coverage in each set of mixed samples (n = 409, 406, 409, and 411, respectively). Error bars show the 95% binomial CI for each point estimate. B: Sensitivity for detecting homozygous and heterozygous gold standard variants in the major sample, which have estimated VAFs of >25%. Error bars show the 95% binomial CI for each point estimate. C: Mean number of false positive SNV calls per sample made by each program at the indicated mix proportion across the entire target region, encompassing coding and noncoding sequence of 26 genes (306,336 bp). Indel calls were excluded, as were positions with low coverage or discordant calls in the gold standard variant analysis (see Materials and Methods). Error bars show the SD across all samples with the indicated mix proportion (n = 4 for each mix proportion). D: PPV (True positive/True positive + False positive) for SNV calls by each program across the mix proportions. Error bars show the 95% binomial CI for each point estimate. The Journal of Molecular Diagnostics 2014 16, 75-88DOI: (10.1016/j.jmoldx.2013.09.003) Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions

Figure 3 Sensitivity of GATK, SAMtools, VarScan2, and SPLINTER for low-frequency variants as a function of observed coverage at variant positions. Sequencing reads from mixed samples were randomly sampled (see Materials and Methods) to obtain datasets with estimated mean coverage depths of 1500, 1250, 1000, 750, 500, 400, 200, and 100 across the entire target region for each of the mixed samples. The observed coverage depths were determined for all minor gold standard variants, and variant detection was performed using each of the four programs. Panels show the overall sensitivity for all variants from each mixed sample in the observed coverage in bins of 100 (eg, the 100 bin contains all gold standard variants with coverage depths between 0 and 100) for GATK (A), SAMtools (B), VarScan2 (C), and SPLINTER (D). Error bars show the 95% binomial CI for each point estimate. The Journal of Molecular Diagnostics 2014 16, 75-88DOI: (10.1016/j.jmoldx.2013.09.003) Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions

Figure 4 False positives (mean per sample) called by GATK, SAMtools, VarScan2, and SPLINTER as a function of mean target coverage and mix proportion across the entire target region encompassing coding and noncoding sequences of 26 genes (306,336 bp). Variant SNV calls in down-sampled data that were not among the gold standard SNVs were called as false positive, after excluding indel calls and positions that were low coverage (<50) or discordant in the gold standard analysis. The number of false positive calls for GATK (A), SAMtools (B), VarScan2 (C), and SPLINTER (D) for each down-sampled coverage level and the mix proportion indicated by the legend. Error bars show the SD for each coverage level and mix proportion. The Journal of Molecular Diagnostics 2014 16, 75-88DOI: (10.1016/j.jmoldx.2013.09.003) Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions

Figure 5 Sensitivity and false positive calls made using only filtered reads compared with using all reads. Filtered reads included only those with a mapping quality >20, a minimum base quality of 20 for all bases, and no more than four discrepancies. A: Sensitivity for minor gold standard SNVs across all four programs after filtering of low-quality and multiple-discrepancy reads. Error bars show the 95% binomial CI for each point estimate. B: False positive SNV calls (mean per sample) using high-quality reads compared with all reads. Error bars show the SD across all samples at each mix proportion (n = 4). The Journal of Molecular Diagnostics 2014 16, 75-88DOI: (10.1016/j.jmoldx.2013.09.003) Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions