UNCLASSIFIED Defense Forensic Science Center Evaluating Probabilistic Genotyping Results Joel Sutton, Technical Leader USACIL DNA Casework Branch 7 th.

Slides:



Advertisements
Similar presentations
Overcoming DNA Stochastic Effects 2010 NEAFS & NEDIAI Meeting November, 2010 Manchester, VT Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics.
Advertisements

Forensic DNA Inference ICFIS 2008 Lausanne, Switzerland Mark W Perlin, PhD, MD, PhD Joseph B Kadane, PhD Robin W Cotton, PhD Cybergenetics ©
How Inclusion Interpretation of DNA Mixture Evidence Reduces Identification Information American Academy of Forensic Sciences February, 2013 Washington,
Creating informative DNA libraries using computer reinterpretation of existing data Northeastern Association of Forensic Scientists November, 2011 Newport,
The statistical weight of mixed samples with allelic drop out First serious attempt by Gill et al. 2006, Forensic Science International 160:90 An important.
Attaching statistical weight to DNA test results 1.Single source samples 2.Relatives 3.Substructure 4.Error rates 5.Mixtures/allelic drop out 6.Database.
Tests of Hypotheses Based on a Single Sample
3 Person Example #2 Suspect Boxer Shorts (The Ladies Man)
Copyright © 2011 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 12 Measures of Association.
USING AND PROMOTING REFLECTIVE JUDGMENT AS STUDENT LEADERS ON CAMPUS Patricia M. King, Professor Higher Education, University of Michigan.
Fundamentals of Forensic DNA Typing Slides prepared by John M. Butler June 2009 Appendix 3 Probability and Statistics.
Data Structures Hash Tables
TrueAllele ® Casework Validation on PowerPlex ® 21 Mixture Data Australian and New Zealand Forensic Science Society September, 2014 Adelaide, South Australia.
Using TrueAllele ® Casework to Separate DNA Mixtures of Relatives California Association of Criminalists October, 2014 San Francisco, CA Jennifer Hornyak,
Marketing Research and Information Systems
EngageNY.org Common Core: What can Parents Do? Presented by: Mrs. A. Cassidy Ms. M. Florio.
Example 10.1 Experimenting with a New Pizza Style at the Pepperoni Pizza Restaurant Concepts in Hypothesis Testing.
Forensic Statistics From the ground up…. Basics Interpretation Hardy-Weinberg equations Random Match Probability Likelihood Ratio Substructure.
Kern Regional Crime Laboratory Laboratory Director: Dr. Kevin W. P. Miller TRUEALLELE® WORK AND WORKFLOW: KERN COUNTY’S FIRST CASES APRIL 23, 2014.
March  There is a maximum of one obtuse angle in a triangle, but can you prove it?  To prove something like this, we mathematicians must do a.
STT 315 This lecture is based on Chapter 6. Acknowledgement: Author is thankful to Dr. Ashok Sinha, Dr. Jennifer Kaplan and Dr. Parthanil Roy for allowing.
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
2 Person Mixture #2 Vaginal swab of Victim. Case Scenario Assault occurred in dorm room Suspect says it was consensual No other parties heard or saw anything.
3 Person Mixture #4 (Or is it 2?) The Hardest Mixture I Know.
The Method Skeptic Debate For and Against. Forensic Concepts The nature of expert testimony Admissibility is determined by legal statute and court precedent;
You don’t know what you don’t know But does it matter? Or is everything inconclusive?
UNCLASSIFIED Defense Forensic Science Center DNA Mixture Interpretation Study: Inter- and Intra-laboratory Variation Roman Aranda IV 1, Emily Rogers 1,
Evidence-Based Medicine Presentation [Insert your name here] [Insert your designation here] [Insert your institutional affiliation here] Department of.
National Guidelines for the Curriculum in kindergarten and the first cycle of' Education (2012 September) Mathematics contributes to the cultural formation.
Creating a Virtual Library in the Age of Closings Jane Killian Defense Forensic Science Center December 10, 2013 SLA Military Library Division Workshop.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
TrueAllele ® Genetic Calculator: Implementation in the NYSP Crime Laboratory NYS DNA Subcommittee May 19, 2010 Barry Duceman, Ph.D New York State Police.
Separating Familial Mixtures, One Genotype at a Time Northeastern Association of Forensic Scientists November, 2014 Hershey, PA Ria David, PhD, Martin.
Uncertainty Management in Rule-based Expert Systems
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
AJ 104 Crime Scene Evidence, Experiments, and Models.
Slide 1-1 Copyright © 2004 Pearson Education, Inc. Stats Starts Here Statistics gets a bad rap, and Statistics courses are not necessarily chosen as fun.
Cross examination Is the DNA a mixture of two or more people? How did you calculate the match statistic? What is the scientific basis of that calculation?
Rapid DNA Response: On the Wings of TrueAllele Mid-Atlantic Association of Forensic Scientists May, 2015 Cambridge, Maryland Martin Bowkley, Matthew Legler,
Getting Past First Bayes with DNA Mixtures American Academy of Forensic Sciences February, 2014 Seattle, WA Mark W Perlin, PhD, MD, PhD Cybergenetics,
Slide 1 UCL JDI Centre for the Forensic Sciences 21 March 2012 Norman Fenton Queen Mary University of London and Agena Ltd Bayes and.
Virginia TrueAllele ® Validation Study: Casework Comparison Presented at AAFS, February, 2013 Published in PLOS ONE, March, 2014 Mark W Perlin, PhD, MD,
Open Access DNA Database Duquesne University March, 2013 Pittsburgh, PA Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
Objective DNA Mixture Information in the Courtroom: Relevance, Reliability & Acceptance NIST International Symposium on Forensic Science Error Management:
CHAPTER 15: Tests of Significance The Basics ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
Data summary – “alleles” Threshold Over threshold, peaks are labeled as allele events All-or-none allele peaks, each given equal status Allele Pair 8,
Seventh Annual Prescriptions for Criminal Justice Forensics Program Fordham University School of Law June 3, 2016 DNA Panel.
UNCLASSIFIED Defense Forensic Science Center Development and Evaluation of a Model to Quantify the Weight of Fingerprint Evidence *Henry Swofford; Koertner.
Chapter Nine Hypothesis Testing.
Discussion of the paper: The peer performance ratios of hedge funds
A Match Likelihood Ratio for DNA Comparison
Statistical Weights of DNA Profiles
Overcoming Bias in DNA Mixture Interpretation
Unit 5: Hypothesis Testing
Validating TrueAllele® genotyping on ten contributor DNA mixtures
How to Defend Yourself Against DNA Mixtures
Explaining the Likelihood Ratio in DNA Mixture Interpretation
Distorting DNA evidence: methods of math distraction
On the threshold of injustice: manipulating DNA evidence
“Using Computer Technology to Overcome Bottlenecks in the Forensic DNA Testing Process and Improve Data Recovery from Complex Samples”
Virginia TrueAllele® Validation Study: Casework Comparison
Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence
Summary and Recommendations
Gerald Dyer, Jr., MPH October 20, 2016
Significance Tests: The Basics
Summary and Recommendations
Exonerating the Innocent with Probabilistic Genotyping
David W. Bauer1, PhD Nasir Butt2, PhD Jeffrey Oblock2
Using probabilistic genotyping to distinguish family members
Presentation transcript:

UNCLASSIFIED Defense Forensic Science Center Evaluating Probabilistic Genotyping Results Joel Sutton, Technical Leader USACIL DNA Casework Branch 7 th Annual Prescriptions for Criminal Justice Forensics, 3 June 2016

UNCLASSIFIED Disclaimer The opinions or assertions contained herein are the private views of the author and are not to be construed as official or as reflecting the views of the Department of the Army or the Department of Defense. Names of commercial manufacturers or products included are incidental only, and inclusion does not imply endorsement by the authors, DFSC, OPMG, DA or DoD. Unless otherwise noted, all figures, diagrams, media, and other materials used in this presentation are created by the respective author(s) and contributor(s) of the presentation and research. 2

UNCLASSIFIED Background Various methods for interpreting/reporting results Testing different LR propositions Testing multiple persons of interest Verbal scale Uninformative – the “new inconclusive” Admissibility and testimony strategies 3 Outline

UNCLASSIFIED The Defense Forensic Science Center has been online using probabilistic genotyping (PG) exclusively in casework since Nov 2014 Our primary casework is sexual assault evidence At the current time, we have testified in 18 court- martials using PG results One thing we have learned is that the validation of the software turned out to be the “easy” part Interpreting and reporting the results has actually been the real challenge It’s not even the “PG” part that is the challenge, but more with the likelihood ratio we are reporting 4 Background

UNCLASSIFIED As laboratories continue to move towards interpreting and reporting DNA results with probabilistic genotyping, there are considerations the legal community needs to be aware of  Training practitioners  Training stakeholders The following slides give examples for some of these considerations 5 Practical Considerations

UNCLASSIFIED DNA results traditionally have one of three conclusions:  inclusion  exclusion  inconclusive PG is based on probabilities, most use a traditional LR framework Some have advocated no longer using terms like “inclusion” or “exclusion” for PG conclusions, but instead simply report out the LR and let the number speak for itself  An LR > 1 favors the prosecution hypothesis  An LR < 1 favors the defense hypothesis  An LR =1 is considered neutral 6 Methods For Interpreting/Reporting Results This is the basic LR formula w/o adding in the additional probabilities for drop

UNCLASSIFIED This has led to two general schools of thought with interpreting PG results based on an LR  Run all mixtures through the program  Evaluate all the data first to determine suitability and try to exclude the POI first 7 Methods For Interpreting/Reporting Results There are pros and cons to each approach

UNCLASSIFIED Methods For Interpreting/Reporting Results Run everything through Pro: considered less subjective Pro: with complex mixtures, can you really ever “include” or “exclude” Con: “junk in, junk out” Con: who’s the expert Interpret first Pro: if one can interpret first, you can “weed out” some of the more difficult mixtures as being uninterpretable Pro: The analyst still serves as the expert for determining inclusions/exclusions Con: There is some subjectivity with this in terms of interpreting complex mixtures and making decisions up front 8

UNCLASSIFIED Hypotheses are also referred to as propositions Assumptions considered for proposition  Number of contributors in the mixture  Any known contributors Depending on these propositions, you can get very different LRs When setting propositions, there are a few things to keep in mind:  We generally know the prosecution hypothesis  We never know nor should we predict what the defense hypothesis will be However, knowing the math behind the LR, we can predict:  What is most reasonable  What is most informative 9 Testing Different Hypotheses

UNCLASSIFIED Sexual Assault case – evidence is semen on a vaginal swab from Victim (V) DNA results - mixture of two individuals, with all the alleles consistent with the V and Suspect (S) Prosecution hypothesis – V + S Defense hypothesis options – unknown individual (U) contributed the semen  V+U  U+U 10 LR Example

UNCLASSIFIED (V+S/U+U) – this LR would be large  The more U on bottom compared to top LR goes “waaay” up  Has to do with the math (V+S/V+U) is smaller  The ratio of U in the LR has been minimized  Reasonable to assume the V on her own swab  Genotype possibilities have been constrained to the foreign DNA contributor which makes the LR lower but also more informative (less genotype possibilities promote better exclusionary power) 11 LR Example

UNCLASSIFIED It is conventional wisdom to try and minimize the ratio of unknowns with the LR where possible This reduces the LR (gives a more conservative estimate) Otherwise the LR can blow up based on the math which does not benefit the accused Making reasonable assumptions for known contributors and keeping the number of contributors the same between propositions is generally preferred If unsure, test and include multiple propositions to assess at trial 12 Which Proposition To Use

UNCLASSIFIED Two choices to make with your proposition:  Evaluate each individual separately in the mixture  Evaluate them together in the mixture If your software can, better to evaluate together Case circumstances and pre-trial conferences will dictate which proposition will be reasonable for the defense 13 Multiple Persons Of Interest

UNCLASSIFIED Often these numbers associated with the LRs being reported will be drastically different What does “50 times more likely” mean compared to “50 septillion times more likely” The differences in the numbers speak to the weight A verbal scale is mostly arbitrary, but may help in verbalizing what these differences mean 14 Verbal Scale

UNCLASSIFIED LR =Verbalized as… 10 billion or greater“Very strong support” 10 million to 10 billion“Strong support” 10,000 to 10 million“Moderate support” 1000 to 10,000“Weak support” >10 to 1000“Very weak support” 1 to 10“Uninformative” 0 to 1“Supports an exclusion” 15 USACIL DNA Verbal Scale Using STRmix™ This scale is data dependent, based on the deconvolution, and used when requested to put the numbers in some sort of general context

UNCLASSIFIED 16 What Does “Uninformative” Mean? This paper demonstrates that LR distributions for competing propositions are very well separated with good quality data, but converge to 1 with low quality data. At that point, the LR becomes uninformative for inclusions/exclusions, essentially inconclusive for either proposition

UNCLASSIFIED 17 Admissibility And Courtroom Testimony This paper drives home some important points on this subject and introduces the concept of a software-expert pair (SEP)

UNCLASSIFIED From the authors’ point of view PG is reliable if properly validated  ISFG recommendations  SWGDAM validation guidelines  OSAC is writing validation standards Peer-reviewed papers from the developers is crucial – the software may be new, but not the concepts The concepts of an LR, MCMC, modeling stutter and peak height variability, and calculating match probabilities are not novel and well-published 18 Admissibility

UNCLASSIFIED The courts expect the results and any software not only be properly validated, but also that it is appropriately applied “Appropriately applied” relates to the expert using it and testifying to it. The software and expert work together (SEP) “It is imperative to note that to have a successful SEP in use for forensic casework requires not only a validated software tool, but a fully trained and competent expert that uses the chosen software” 19 Courtroom Testimony

UNCLASSIFIED FRE 702 speaks to experts and the right for the defense to confront these witnesses It’s the expert after all, not the software that will have to testify “It is accepted that no analyst is required to understand the mathematics and computer program to the extent that they could recreate the system, except the developers themselves. However it is an expectation that analysts at least understand the workings of any system they use to be able to understand and explain the results.” 20 Admissibility And Courtroom Testimony

UNCLASSIFIED Summary Lots of PG systems out there, all slightly different Individual laboratory protocols may use the same system differently Read the literature to better understand the fundamental concepts – training staff is key Understand the LR framework These are not expert systems – do not treat it as a “black box” Work with your customers and stakeholders – they need to be trained as well 21

UNCLASSIFIED Joel Sutton USACIL DNA Technical Leader Questions? Thank you for your time!