Genotype Information Criteria for Forensic DNA Databases

Slides:



Advertisements
Similar presentations
Forensic DNA Inference ICFIS 2008 Lausanne, Switzerland Mark W Perlin, PhD, MD, PhD Joseph B Kadane, PhD Robin W Cotton, PhD Cybergenetics ©
Advertisements

DNA Mixture Interpretations and Statistics – To Include or Exclude Cybergenetics © Prescription for Criminal Justice Forensics ABA Criminal Justice.
Finding Truth in DNA Mixture Evidence Innocence Network Conference April, 2013 Charlotte, NC Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA.
How Inclusion Interpretation of DNA Mixture Evidence Reduces Identification Information American Academy of Forensic Sciences February, 2013 Washington,
Creating informative DNA libraries using computer reinterpretation of existing data Northeastern Association of Forensic Scientists November, 2011 Newport,
Attaching statistical weight to DNA test results 1.Single source samples 2.Relatives 3.Substructure 4.Error rates 5.Mixtures/allelic drop out 6.Database.
STATE AND NATIONAL DNA DATABASES: CODIS Division Scientific Services CT Forensic Science Laboratory Dr. Michael Bourke
DNA Analysis Chapter 12. General DNA Information Double helix—two coiled DNA strands Composed of nucleotides—units containing a sugar molecule (deoxyribose),
DNA Forensics MUPGRET Workshop. “DNA evidence…offers prosecutors important new tools for the identification and apprehension of some of the most violent.
The Basics of CODIS Combined DNA Index System
Forensic Statistics From the ground up…. Basics Interpretation Hardy-Weinberg equations Random Match Probability Likelihood Ratio Substructure.
Kern Regional Crime Laboratory Laboratory Director: Dr. Kevin W. P. Miller TRUEALLELE® WORK AND WORKFLOW: KERN COUNTY’S FIRST CASES APRIL 23, 2014.
Expert Systems for Automated STR Analysis SWGDAM Quantico, VA Mark W. Perlin January, 2003.
Forensic Databases Integrated Automated Fingerprint Identification System (IAFIS) Fingerprint and criminal history system for 50 million subjects Operational.
Tues 9/24/13 Bellringer: ◦ Take out your notes and respond to the following:  Would sap taken from a tree be class evidence or individual evidence? Why?
Stephen Gresko Senior Criminalist Nevada State CODIS Administrator WASHOE COUNTY SHERIFF’S OFFICE FORENSIC SCIENCE DIVISION.
Scientific Validation of Mixture Interpretation Methods 17th International Symposium on Human Identification Sponsored by the Promega Corporation October,
Nederlands Forensisch Instituut Observed and expected numbers of (partially) randomly matching profiles in the Dutch DNA database,
Computer Interpretation of Uncertain DNA Evidence National Institute of Justice Computer v. Human June, 2011 Arlington, VA Mark W Perlin, PhD, MD, PhD.
Forensics and CS Philip Chan. CSI: Crime Scene Investigation high tech forensics tools DNA profiling Use.
Presented by: Jamie L. Belrose NYS DNA Sub-committee Meeting; New York, NY March 5, 2010 New York State Police Validation of TrueAllele: a Statistical.
Using Expert Systems (TrueAllele ® ) for Forensic STR DNA Analysis National Institute of Justice Grantees Meeting 2011 Using Expert Systems (TrueAllele.
TrueAllele ® Genetic Calculator: Implementation in the NYSP Crime Laboratory NYS DNA Subcommittee May 19, 2010 Barry Duceman, Ph.D New York State Police.
Cybergenetics Webinar January, 2015 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © How TrueAllele ® Works (Part 4)
An Expert System for Scoring DNA Database Profiles Dr. Mark W. Perlin Cybergenetics Pittsburgh, PA.
Unleashing Forensic DNA through Computer Intelligence Forensics Europe Expo Forensic Innovation Conference April, 2013 London, UK Mark W Perlin, PhD, MD,
Compute first, ask questions later: an efficient TrueAllele ® workflow Midwestern Association of Forensic Scientists October, 2014 St. Paul, MN Martin.
Implications of database searches for DNA profiling statistics Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton,
Chapter 22: The DNA Databases.  Developed to assist in solving violent crimes by creating a network for the exchange of information  Compare DNA profiles.
Open Access DNA Database Duquesne University March, 2013 Pittsburgh, PA Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
Jack Ballantyne and Mark Perlin International Conference on Inference and Statistics, July , Seattle, WA.
Exploring Forensic Scenarios with TrueAllele ® Mixture Automation 59th Annual Meeting American Academy of Forensic Sciences February, 2007 Mark W Perlin,
Forensics (DNA Technology). What is DNA? DNA is the carrier of genetic information and provides a structural plan for proteins. It consists of linear.
DNA Identification: Quantitative Data Modeling Cybergenetics © Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA TrueAllele ® Lectures.
Data summary – “alleles” Threshold Over threshold, peaks are labeled as allele events All-or-none allele peaks, each given equal status Allele Pair 8,
Marjan Sjerps Kees van der Beek Ate Kloosterman
Four person DNA mixture
All rights Reserved Cengage/NGL/South-Western © 2016.
DNA: TrueAllele® Statistical Analysis, Probabilistic Genotyping
A Match Likelihood Ratio for DNA Comparison
Statistical Weights of DNA Profiles
Validating TrueAllele® genotyping on ten contributor DNA mixtures
Error in the likelihood ratio: false match probability
All rights Reserved Cengage/NGL/South-Western © 2016.
Explaining the Likelihood Ratio in DNA Mixture Interpretation
DNA identification pathway
On the threshold of injustice: manipulating DNA evidence
“Using Computer Technology to Overcome Bottlenecks in the Forensic DNA Testing Process and Improve Data Recovery from Complex Samples”
Suffolk County TrueAllele® Validation
Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence
Investigative DNA Databases that Preserve Identification Information
DNA Identification: Inclusion Genotype and LR
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA, USA
DNA Identification: Biology and Information
Forensic match information: exact calculation and applications
severed carotid artery
Laser Microdissection
DNA identification pathway
2018 AAFS Annual Scientific Meeting February 22, 2018
Probabilistic Genotyping to the Rescue for Pinkins and Glenn
Laser Microdissection
DNA Identification: Mixture Interpretation
Exonerating the Innocent with Probabilistic Genotyping
Testifying about probabilistic genotyping results
David W. Bauer1, PhD Nasir Butt2, PhD Jeffrey Oblock2
Using probabilistic genotyping to distinguish family members
Unit 3: Physical Evidence
Reporting match error: casework, validation & language
Presentation transcript:

Genotype Information Criteria for Forensic DNA Databases 2019 AAFS Annual Scientific Meeting February 23, 2018

CODIS Searches Allele to Allele Comparison High certainty genotypes - good Low certainty genotypes?

Moderate Match Estimate NDIS Upload Criteria Specimen Category Max Alleles per Locus Moderate Match Estimate Forensic Mixture 4 1 in 10 million

Falling Through the Cracks What about mixtures that have high information but do not meet upload criteria? How do you identify mixtures with high match information?

Probabilistic Genotyping Kullback-Leibler Divergence (KL) Measure of information gain in Bayesian inference Posterior (after computer inference) vs. prior (before computer inference)

Advantages of KL for Databases Estimate of match information (LR) in absence of reference sample Calculated over entire profile, not a subset of loci Can be used to identify high information mixtures

Methods Process all samples in TrueAllele® Assess KL of inferred genotypes Determine viability for CODIS KL <10 = typically insufficient for CODIS KL >10 = sufficient but specimen category varies

Case Example Assault case – female victim, male suspect Three questioned swabs No victim sample submitted

Case Example Item Quantity Amplified? Result Weight #1 ~300 pg Yes Mixture (2) 57 / 43 #2 ~170 pg Minimal N/A #3 ~10 pg No

Case Example Examiner focused on Item 1 for CODIS entry Manual data review – 14 loci (8 core) entered Insufficient MME for NDIS; uploaded to SDIS

SDIS Search #1 June 2017 28 candidate matches Human review (>1 hour) 25 eliminated 3 candidates requested for additional testing (expanded loci)

SDIS Search #1 July 2017 Expanded loci testing complete Human review All three excluded Summary – 28 candidates, 0 hits, ~1.5 hours human time (entry & review)

Additional Testing October 2017 Victim standard submitted TrueAllele® victim match, log LR 14.8 (630 trillion) Examiner manually refines CODIS profile MME still insufficient for NDIS Uploads to SDIS

SDIS Search #2 October 2 new candidate matches One eliminated One identified and confirmed Summary – 2 candidates, 1 hits, ~15 minutes human time

SDIS Searches Summary 30 candidates 1 hit ~2 hours human analysis/review Four months searching Offender was not identified in initial search!

Use TrueAllele® at candidate match review? Could We Improve? Use TrueAllele® at candidate match review?

TrueAllele® Match Review

Use TrueAllele® to select alleles for CODIS entry? Could We Improve? Use TrueAllele® to select alleles for CODIS entry?

TrueAllele® for CODIS Entry Victim Candidate Sample Review Method Searches Matches Needed Time for Hit? Human 30 2 Yes ~2 hrs ~15 TrueAllele ® 3 1 No minutes

TrueAllele® for CODIS Entry SC has created Probabilistic Genotyping index No MME, max of 8 alleles per locus Minimum 7 original core loci

TrueAllele® for CODIS Entry Search against single source profiles only Mixtures that were previously ineligible for searching Several hits to date

Summary MME is good predictor for number of matches but not quality of data KL is good predictor of match strength High information mixture data is not being searched Probabilistic mechanism is needed

Thank you!