Download presentation
Presentation is loading. Please wait.
1
Software-Assisted Manual Review of Clinical Next-Generation Sequencing Data
Dale Muzzey, Shera Kash, Jillian I. Johnson, Laura M. Melroy, Piotr Kaleta, Kelly A. Pierce, Kaylene Ready, Hyunseok P. Kang, Kevin R. Haas The Journal of Molecular Diagnostics Volume 21, Issue 2, Pages (March 2019) DOI: /j.jmoldx Copyright © 2019 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions
2
Figure 1 Overview of assay design, quality control measures, and variant calling pipeline that populate the manual call review optimization interface. A: Batch assembly: Each sequencing batch combined patient samples, cell-line controls, and no-template wells, all in randomized positions. B: Sequencing: Hybrid capture enabled targeted sequencing of genes of interest by multiplexed paired-end next-generation sequencing (NGS). C: Bioinformatics: In the bioinformatics pipeline, alignment preceded variant calling, which was followed by both the loading of interpretations and calculation of calling metrics. D: The Manual Call Review Optimization (MaCRO) interface for individual calls rendered information in tiers, with deeper tiers revealed to the user on clicking. Both basic and complex metrics were accessible by the interface, and the raw NGS data centered on the variant of interest were viewable directly in text or by a link to a genome browser. The Journal of Molecular Diagnostics , DOI: ( /j.jmoldx ) Copyright © 2019 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions
3
Figure 2 Ambiguous heterozygous variants with low allele fraction could be resolved by Manual Call Review Optimization (MaCRO) confirmation. A: Top panel: A histogram of variant calls based on allele fraction alone. Bottom panel: The distribution of putative positive calls (N = 11,099) as a function of allele fraction and the base-10 logarithm of next-generation sequencing (NGS) read depth. A subset of calls populated an ambiguous zone where allele fraction was <30%. Each point in the scatter plot represents a single variant call. B: MaCRO confirmation applied to the variant calls in A revealed a diversity of true genotypes in the ambiguous zone. Sanger sequencing for samples selected from all unique, ambiguous variant sites upheld the MaCRO findings (large points in the scatter plot). Some MaCRO-confirmed negative calls have allele fraction >30%, illuminating how test accuracy could suffer if variant confirmation were constrained only to putative calls below a selected allele fraction threshold. SNV, single-nucleotide variant. The Journal of Molecular Diagnostics , DOI: ( /j.jmoldx ) Copyright © 2019 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions
4
Figure 3 Examples of ambiguous positive variant calls resolved by Manual Call Review Optimization (MaCRO) confirmation. A: An excerpt of the next-generation sequencing (NGS) read pileup in a sample with a 40nt deletion in BRCA1 with 12% allele fraction revealed two reads spanning the deletion (reads with a purple gap). Of the 434× depth at this site, only 10× was depicted. The three pastel-shaded reads have mismatched bases at their 3′ ends, where the mismatched bases coincided with the deletion boundary, but the reads did not contribute to the allele fraction because the aligner did not recognize them as harboring the deletion. B: MaCRO inspection of the pileup indicated that reads with mismatches were alignment artifacts and actually have the 40nt deletion; properly viewing the reads as such effectively increased the allele fraction to 25% and raised the confidence of the heterozygous call. C: Retrospective Sanger sequencing confirmed the existence of a heterozygous 40nt deletion; the asterisk indicates the deletion boundary. D: A 10× excerpt from a 214× pileup of a putative SNV in STK11 with 8% allele fraction showed that reads with the alternate base occur on the same strand and have the same end point. Alternate bases were not observed at any other location on either strand, suggestive of a PCR error (not shown is the fact that different probes captured fragments with the alternate base, consistent with the PCR error occurring during library preparation, not during the capture or sequencing process). E: Suppressing the spurious reads during MaCRO evaluation revealed an otherwise clearly homozygous site. F: Retrospective Sanger sequencing confirmed the homozygous reference assertion from MaCRO (asterisk). SNP, single nucleotide polymorphism. The Journal of Molecular Diagnostics , DOI: ( /j.jmoldx ) Copyright © 2019 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions
5
Figure 4 Sanger sequencing and Manual Call Review Optimization (MaCRO) confirmation were concordant at all tested sites. The number of samples tested by both Sanger and MaCRO is shown in bold, and in parentheses is the number of unique sites. The Journal of Molecular Diagnostics , DOI: ( /j.jmoldx ) Copyright © 2019 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions
6
Figure 5 Time to perform Manual Call Review Optimization (MaCRO) confirmation. A: For 299 batches, the histogram shows the time between passing each batch's first control sample and passing the third-to-last patient sample. B: The cumulative density plot shows the fraction of batches for which MaCRO confirmation occurred within the number of hours indicated (x axis) after completion of next-generation sequencing (NGS) and bioinformatics processing. The Journal of Molecular Diagnostics , DOI: ( /j.jmoldx ) Copyright © 2019 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions
7
Supplemental Figure S1 Sanger sequencing and next-generation sequencing (NGS) data for mosaic variants. For the six variants identified by Manual Call Review Optimization (MaCRO) as being mosaic, the gene, position, NGS depth, NGS allele fraction, and Sanger sequencing data are shown. The Sanger sequencing trace shows the mosaic position with arrows and also indicates which reaction (forward strand or reverse strand) is plotted. For reverse strand reactions, the Sanger sequencing data reflects the reverse complement of the NGS data. A is green, C is blue, G is brown, and T is red. Rows are ordered by chromosome. The Journal of Molecular Diagnostics , DOI: ( /j.jmoldx ) Copyright © 2019 American Society for Investigative Pathology and the Association for Molecular Pathology Terms and Conditions
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.