Validating TrueAllele® genotyping on ten contributor DNA mixtures

Slides:



Advertisements
Similar presentations
Overcoming DNA Stochastic Effects 2010 NEAFS & NEDIAI Meeting November, 2010 Manchester, VT Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics.
Advertisements

Forensic DNA Inference ICFIS 2008 Lausanne, Switzerland Mark W Perlin, PhD, MD, PhD Joseph B Kadane, PhD Robin W Cotton, PhD Cybergenetics ©
DNA Mixture Interpretations and Statistics – To Include or Exclude Cybergenetics © Prescription for Criminal Justice Forensics ABA Criminal Justice.
Finding Truth in DNA Mixture Evidence Innocence Network Conference April, 2013 Charlotte, NC Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA.
How Inclusion Interpretation of DNA Mixture Evidence Reduces Identification Information American Academy of Forensic Sciences February, 2013 Washington,
Creating informative DNA libraries using computer reinterpretation of existing data Northeastern Association of Forensic Scientists November, 2011 Newport,
How TrueAllele ® Works (Part 3) Kinship, Paternity and Missing Persons Cybergenetics Webinar December, 2014 Mark W Perlin, PhD, MD, PhD Cybergenetics,
Preventing rape in the military through effective DNA computing Forensics Europe Expo Forensics Seminar Theatre April, 2014 London, UK Mark W Perlin, PhD,
TrueAllele ® Interpretation of DNA Mixture Evidence 9 th International Conference on Forensic Inference and Statistics August, 2014 Leiden University,
TrueAllele ® Casework Validation on PowerPlex ® 21 Mixture Data Australian and New Zealand Forensic Science Society September, 2014 Adelaide, South Australia.
Using TrueAllele ® Casework to Separate DNA Mixtures of Relatives California Association of Criminalists October, 2014 San Francisco, CA Jennifer Hornyak,
Kern Regional Crime Laboratory Laboratory Director: Dr. Kevin W. P. Miller TRUEALLELE® WORK AND WORKFLOW: KERN COUNTY’S FIRST CASES APRIL 23, 2014.
Revolutionising DNA analysis in major crime investigations The Investigator Conferences Green Park Conference Centre May, 2014 Aylesbury, Buckinghamshire.
Revolutionising DNA analysis in major crime investigations The Investigator Conferences Green Park Conference Centre May, 2014 Aylesbury, Buckinghamshire.
No DNA Left Behind: When "inconclusive" really means "informative" Schenectady County District Attorney’s Office January, 2014 Mark W Perlin, PhD, MD,
DNA Mixture Statistics Cybergenetics © Spring Institute Commonwealth's Attorney's Services Council Richmond, Virginia March, 2013 Mark W.
Scientific Validation of Mixture Interpretation Methods 17th International Symposium on Human Identification Sponsored by the Promega Corporation October,
Computer Interpretation of Uncertain DNA Evidence National Institute of Justice Computer v. Human June, 2011 Arlington, VA Mark W Perlin, PhD, MD, PhD.
Challenging DNA Evidence The Fifth Judicial District of Pennsylvania Criminal Division February, 2015 Pittsburgh, PA Mark W Perlin, PhD, MD, PhD Cybergenetics,
How TrueAllele ® Works (Part 2) Degraded DNA and Allele Dropout Cybergenetics Webinar November, 2014 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh,
TrueAllele ® Genetic Calculator: Implementation in the NYSP Crime Laboratory NYS DNA Subcommittee May 19, 2010 Barry Duceman, Ph.D New York State Police.
Separating Familial Mixtures, One Genotype at a Time Northeastern Association of Forensic Scientists November, 2014 Hershey, PA Ria David, PhD, Martin.
Cybergenetics Webinar January, 2015 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © How TrueAllele ® Works (Part 4)
Cracking the DNA mixture code – computer analysis of UK crime cases Forensics Europe Expo Forensic Innovation Conference April, 2014 London, UK Mark W.
Unleashing Forensic DNA through Computer Intelligence Forensics Europe Expo Forensic Innovation Conference April, 2013 London, UK Mark W Perlin, PhD, MD,
Rapid DNA Response: On the Wings of TrueAllele Mid-Atlantic Association of Forensic Scientists May, 2015 Cambridge, Maryland Martin Bowkley, Matthew Legler,
Getting Past First Bayes with DNA Mixtures American Academy of Forensic Sciences February, 2014 Seattle, WA Mark W Perlin, PhD, MD, PhD Cybergenetics,
Compute first, ask questions later: an efficient TrueAllele ® workflow Midwestern Association of Forensic Scientists October, 2014 St. Paul, MN Martin.
Virginia TrueAllele ® Validation Study: Casework Comparison Presented at AAFS, February, 2013 Published in PLOS ONE, March, 2014 Mark W Perlin, PhD, MD,
TrueAllele ® Computing: All the DNA, all the time Continuing Professional Development Sydney, Australia March, 2014 Mark W Perlin, PhD, MD, PhD Cybergenetics,
Murder in McKeesport October 25, 2008 Tamir Thomas.
When Good DNA Goes Bad International Conference on Forensic Research & Technology October, 2012 Chicago, Illinois Mark W Perlin, PhD, MD, PhD Cybergenetics,
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © Duquesne University October, 2015 Pittsburgh, PA What’s in a Match?
Open Access DNA Database Duquesne University March, 2013 Pittsburgh, PA Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
Objective DNA Mixture Information in the Courtroom: Relevance, Reliability & Acceptance NIST International Symposium on Forensic Science Error Management:
DNA Identification: Quantitative Data Modeling Cybergenetics © Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA TrueAllele ® Lectures.
Separating DNA Mixtures by Computer to Identify and Convict a Serial Rapist Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Garett Sugimoto,
Four person DNA mixture
A Match Likelihood Ratio for DNA Comparison
Forensic Stasis in a World of Flux
Overcoming Bias in DNA Mixture Interpretation
How to Defend Yourself Against DNA Mixtures
Error in the likelihood ratio: false match probability
Explaining the Likelihood Ratio in DNA Mixture Interpretation
Distorting DNA evidence: methods of math distraction
On the threshold of injustice: manipulating DNA evidence
Machines can work it out: Automated TrueAllele® workflow
Virginia TrueAllele® Validation Study: Casework Comparison
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA, USA
Suffolk County TrueAllele® Validation
American Academy of Forensic Sciences Criminalistics Section
Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence
Investigative DNA Databases that Preserve Identification Information
DNA Identification: Inclusion Genotype and LR
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA, USA
DNA Identification: Biology and Information
DNA Identification: Biology and Information
Forensic match information: exact calculation and applications
severed carotid artery
TrueAllele® computer technology
The Triumph and Tragedy of DNA Evidence
Probabilistic Genotyping to the Rescue for Pinkins and Glenn
Forensic validation, error and reporting: a unified approach
DNA Identification: Biology and Information
DNA Identification: Mixture Interpretation
Exonerating the Innocent with Probabilistic Genotyping
Testifying about probabilistic genotyping results
David W. Bauer1, PhD Nasir Butt2, PhD Jeffrey Oblock2
Using probabilistic genotyping to distinguish family members
Reporting match error: casework, validation & language
Presentation transcript:

Validating TrueAllele® genotyping on ten contributor DNA mixtures 1David W. Bauer, PhD 2Nasir Butt, PhD 1Mark W. Perlin, PhD, MD, PhD  1Cybergenetics Pittsburgh, PA  2Cuyahoga County Forensic Laboratory Cleveland, OH Cybergenetics © 2003-2017

DNA evidence DATA Cold cases Corroborate answers Create new questions

What is an STR? Cell locus DNA Mother allele Father allele ACGT 1 2 3 4 5 6 7 8 9 Father allele 1 2 3 4 5 6 7 8 9 10 11 12

STR data Task: to infer genotypes from peaks Data eliminates possibilities peak size before after 9 Single source … peak height 12 … 9, 9 9, 9 9, 10 9, 10 9, 11 9, 11 RFU 9, 12 9, 12 10, 10 10, 10 10, 11 10, 11 10, 12 11 10, 12 8 … … Size (bp)

STR mixture data more contributors more complexity more uncertainty 20 2 person mixture No single answer 18 19 18, 18 18, 19 18, 20 RFU Genotypes separated 19, 19 19, 20 17 20, 20 Size (bp)

Too complex? – Yes and No 3 person mixture RFU Size (bp) Reported conclusions: “Mixture of at least 3 individuals…” “Number of contributors cannot reasonably be assumed…” “…therefore, no further conclusions can be drawn.” 3 person mixture 11 13 9 12 RFU 10 8 Size (bp)

How TrueAllele works - Computer tries thousands of possibilities - Compares with data to calculate probability - Good explanations higher probability 3 person mixture 11 13 9 12 major minor RFU middle 10 8 Size (bp)

How TrueAllele works - Handles uncertainty with probability Prob(evidence match) Probability Genotype for the major contributor 9,11 10,11 9,12 10,12 11,12 9,13 10,13 11,13 12,13 Allele pair

Degree of match Match statistics expressed as the numbers of zeros Likelihood ratio (LR) = Prob(evidence match) Prob(coincidental match) Probability 9,11 10,11 9,12 10,12 11,12 9,13 10,13 11,13 12,13 Allele pair

Degree of match Match statistics expressed as the numbers of zeros Likelihood ratio (LR) = Prob(evidence match) Prob(coincidental match) Probability 7x 14% 2% 9,11 10,11 9,12 10,12 11,12 9,13 10,13 11,13 12,13 Allele pair log(LR) = log(7) = 0.85 log(trillion) = 12 15 loci

minors difficult to ‘see’ Validation samples Laboratory created mixtures (2 - 10 person) Don’t know real contributor number in casework 20 10 person mixture minors difficult to ‘see’ 19 18 23 RFU 21 17 15 22 16 25 Size (bp)

Sensitivity: simplicity from complexity Match statistics to actual contributors 2 person Average match strength decreases 3 person 5 person Max log(LR) nearly constant 6 person -6 6 12 18 24 30 log(LR)

Sensitivity: More or less log(LR) depends on contributor weight Match statistics for all samples 30 24 18 stronger match log(LR) 12 6 more DNA 20 40 60 80 100 -6 contributor weight (%)

Sensitivity: More or less less DNA more contributors 30 weaker match 24 2-3 person 18 log(LR) 12 4 person 6 6 person 20 40 60 80 100 -6 contributor weight (%)

Specificity Sensitivity is important, but not sufficient 2 person counts -42 -36 -30 -24 -18 -12 -6 log(LR) compare Each evidence genotype 10,000 random profiles Match statistic distribution for non-contributors

non-contributors excluded Specificity Sensitivity is important, but not sufficient 2 person counts non-contributors excluded 3 person counts 4 person More contributors counts lower specificity 5 - 6 person counts -42 -36 -30 -24 -18 -12 -6 log(LR)

What limits specificity Specificity not lost to complexity minors less specific counts 0 - 5 % counts 10 - 15% Information depends on DNA amount 30 - 50 % counts 5 observed Majors more specific counts ≥ 50% -42 -36 -30 -24 -18 -12 -6 log(LR)

Reproducibility - Repeatable results using a random search algorithm? - Compare independnet results for each individual contribs within group std dev 2 0.08 3 0.71 4 2.20 5 1.65 6 1.77 log(LR2) Software uses a random search algorithm log(LR1) more complexity more variation

Reproducibility - Software uses a random search algorithm - How repeatable are the results? contribs within group std dev 2 0.08 3 0.71 4 2.20 5 1.65 6 1.77 log(LR2) log(LR1) more complexity more variation less DNA

How many contributors? Don’t know actual contributor number in casework Does assumed contributor number matter? 20 10 person mixture Actual: 10 19 18 23 RFU compare 21 17 Observed: 5 15 22 16 25 Size (bp)

Never too many contributors Matches to true contributors counts -6 6 12 18 24 30 log(LR) Number of assumed contributors does not change results min mean std dev max N = False Exclusions Actual (2-10) -2.1 7.8 8.0 29.3 78 9 Observed (2-6) -9.1 7.9 8.9 29.4

Identification comes from information Observed Actual counts -42 -36 -30 -24 -18 -12 -6 log(LR) Counts   Observed Actual log(LR) 6,900 13,043 1 1,866 3,334 2 327 509 3 39 42 4 12 9 5 ‘Extra’ genotypes: - don’t affect chance of false inclusion 1 million comparisons - low inclusionary & exclusionary power Weak matches to low-information genotypes

Information grows Peeling matched references increases information log(LR) DATA 1 4 1 3 2 1 2 assume known 2 2 2 2 2 3 3 match 2 1 1 2 3 3 4 15 3 3 4 4 6 8 weight (%) 13 4 2 3 3 6 16 4 4 5 6 12 5 5 5 22 6 7 13 7

Information grows Peeling matched-references increases information log(LR) DATA 1 4 1 3 2 1 2 assume known 2 2 2 2 2 3 3 match 2 1 1 2 3 3 4 15 3 3 4 4 6 8 weight (%) more information 13 4 2 3 3 6 16 4 4 5 6 Higher match to minors 12 5 5 5 22 6 7 ‘Unseen’ minors not lost 13 7

Information includes AND excludes Peeling matched-references increases information -7 -3 minor genotypes Before peeling After peeling counts -30 -24 -18 -12 -6 log(LR) Genotypes for minors more exclusionary

Complexity in casework more probative weight more contributors more complexity Crime: Quintuple homicide in Sydney Australia Father, mother, 2 sons, aunt Suspect: Brother in-law Evidence: Blood stain in suspect’s garage

Complexity in casework Blood mixture of 5 victims (blood-related) victim weight LR Son 40% 50 quadrillion Son 30% 2.2 billion Father 20% 230 thousand Aunt 7% 29 thousand Mother 3% 290

{ Conclusions Tested TrueAllele on ten-contributor mixtures • Sensitive Included true contributors DNA amount • Specific not Excluded non-contributors contributor number • Reproducible Concordance between runs • Objective Data dictate results, not user assumptions

More information • Courses • Newsletters • Newsroom • Presentations http://www.cybgen.com/information • Courses • Newsletters • Newsroom • Presentations • Publications • Webinars TrueAllele YouTube channel: http://www.youtube.com/user/TrueAllele dave@cybgen.com