Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence

Slides:



Advertisements
Similar presentations
DNA Identification: Mixture Weight & Inference
Advertisements

Overcoming DNA Stochastic Effects 2010 NEAFS & NEDIAI Meeting November, 2010 Manchester, VT Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics.
DNA Mixture Interpretations and Statistics – To Include or Exclude Cybergenetics © Prescription for Criminal Justice Forensics ABA Criminal Justice.
Finding Truth in DNA Mixture Evidence Innocence Network Conference April, 2013 Charlotte, NC Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA.
How Inclusion Interpretation of DNA Mixture Evidence Reduces Identification Information American Academy of Forensic Sciences February, 2013 Washington,
Preventing rape in the military through effective DNA computing Forensics Europe Expo Forensics Seminar Theatre April, 2014 London, UK Mark W Perlin, PhD,
Coding a Safer Society through Computer Interpretation of DNA Evidence Cybergenetics © MATLAB Virtual Conference March, 2014 Mark W Perlin, PhD,
Probabilistic Software Workshop September 29, 2014 TrueAllele® Casework Mark W. Perlin, PhD, MD, PhD.
TrueAllele ® Interpretation of DNA Mixture Evidence 9 th International Conference on Forensic Inference and Statistics August, 2014 Leiden University,
How TrueAllele® Works (Part 1)
Probabilistic Software Workshop September 29, 2014
TrueAllele ® Casework Validation on PowerPlex ® 21 Mixture Data Australian and New Zealand Forensic Science Society September, 2014 Adelaide, South Australia.
Using TrueAllele ® Casework to Separate DNA Mixtures of Relatives California Association of Criminalists October, 2014 San Francisco, CA Jennifer Hornyak,
Revolutionising DNA analysis in major crime investigations The Investigator Conferences Green Park Conference Centre May, 2014 Aylesbury, Buckinghamshire.
Revolutionising DNA analysis in major crime investigations The Investigator Conferences Green Park Conference Centre May, 2014 Aylesbury, Buckinghamshire.
No DNA Left Behind: When "inconclusive" really means "informative" Schenectady County District Attorney’s Office January, 2014 Mark W Perlin, PhD, MD,
DNA Mixture Statistics Cybergenetics © Spring Institute Commonwealth's Attorney's Services Council Richmond, Virginia March, 2013 Mark W.
Scientific Validation of Mixture Interpretation Methods 17th International Symposium on Human Identification Sponsored by the Promega Corporation October,
Computer Interpretation of Uncertain DNA Evidence National Institute of Justice Computer v. Human June, 2011 Arlington, VA Mark W Perlin, PhD, MD, PhD.
TrueAllele ® Modeling of DNA Mixture Genotypes California Association of Crime Laboratory Directors October, 2014 San Francisco, CA Mark W Perlin, PhD,
How TrueAllele ® Works (Part 2) Degraded DNA and Allele Dropout Cybergenetics Webinar November, 2014 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh,
TrueAllele ® Genetic Calculator: Implementation in the NYSP Crime Laboratory NYS DNA Subcommittee May 19, 2010 Barry Duceman, Ph.D New York State Police.
Separating Familial Mixtures, One Genotype at a Time Northeastern Association of Forensic Scientists November, 2014 Hershey, PA Ria David, PhD, Martin.
Cybergenetics Webinar January, 2015 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © How TrueAllele ® Works (Part 4)
Cracking the DNA mixture code – computer analysis of UK crime cases Forensics Europe Expo Forensic Innovation Conference April, 2014 London, UK Mark W.
Unleashing Forensic DNA through Computer Intelligence Forensics Europe Expo Forensic Innovation Conference April, 2013 London, UK Mark W Perlin, PhD, MD,
Rapid DNA Response: On the Wings of TrueAllele Mid-Atlantic Association of Forensic Scientists May, 2015 Cambridge, Maryland Martin Bowkley, Matthew Legler,
Getting Past First Bayes with DNA Mixtures American Academy of Forensic Sciences February, 2014 Seattle, WA Mark W Perlin, PhD, MD, PhD Cybergenetics,
Compute first, ask questions later: an efficient TrueAllele ® workflow Midwestern Association of Forensic Scientists October, 2014 St. Paul, MN Martin.
TrueAllele ® interpretation of Allegheny County DNA mixtures Cybergenetics © Continuing Legal Education Allegheny County Courthouse February,
Virginia TrueAllele ® Validation Study: Casework Comparison Presented at AAFS, February, 2013 Published in PLOS ONE, March, 2014 Mark W Perlin, PhD, MD,
TrueAllele ® Computing: All the DNA, all the time Continuing Professional Development Sydney, Australia March, 2014 Mark W Perlin, PhD, MD, PhD Cybergenetics,
Murder in McKeesport October 25, 2008 Tamir Thomas.
When Good DNA Goes Bad International Conference on Forensic Research & Technology October, 2012 Chicago, Illinois Mark W Perlin, PhD, MD, PhD Cybergenetics,
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © Duquesne University October, 2015 Pittsburgh, PA What’s in a Match?
Open Access DNA Database Duquesne University March, 2013 Pittsburgh, PA Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
Objective DNA Mixture Information in the Courtroom: Relevance, Reliability & Acceptance NIST International Symposium on Forensic Science Error Management:
DNA Identification: Quantitative Data Modeling Cybergenetics © Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA TrueAllele ® Lectures.
DNA-led investigation through computer interpretation of evidence Pennsylvania State Police Training Seminar Hershey, PA April, 2014 Mark W Perlin, PhD,
Data summary – “alleles” Threshold Over threshold, peaks are labeled as allele events All-or-none allele peaks, each given equal status Allele Pair 8,
Separating DNA Mixtures by Computer to Identify and Convict a Serial Rapist Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Garett Sugimoto,
Four person DNA mixture
A Match Likelihood Ratio for DNA Comparison
Overcoming Bias in DNA Mixture Interpretation
Validating TrueAllele® genotyping on ten contributor DNA mixtures
Shedding Light on Inconclusive DNA: TrueAllele® Computer Analysis
Error in the likelihood ratio: false match probability
Explaining the Likelihood Ratio in DNA Mixture Interpretation
Distorting DNA evidence: methods of math distraction
On the threshold of injustice: manipulating DNA evidence
Virginia TrueAllele® Validation Study: Casework Comparison
Suffolk County TrueAllele® Validation
American Academy of Forensic Sciences Criminalistics Section
Investigative DNA Databases that Preserve Identification Information
DNA Identification: Inclusion Genotype and LR
DNA Identification: Stochastic Effects
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA, USA
DNA Identification: Biology and Information
Forensic match information: exact calculation and applications
severed carotid artery
Question What is a threshold? Cybergenetics ©
TrueAllele® computer technology
Probabilistic Genotyping to the Rescue for Pinkins and Glenn
Forensic validation, error and reporting: a unified approach
DNA Identification: Mixture Interpretation
Exonerating the Innocent with Probabilistic Genotyping
Testifying about probabilistic genotyping results
David W. Bauer1, PhD Nasir Butt2, PhD Jeffrey Oblock2
Using probabilistic genotyping to distinguish family members
Reporting match error: casework, validation & language
Presentation transcript:

Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Joint Statistical Meetings August, 2014 Boston, MA Cybergenetics © 2003-2014 Cybergenetics © 2007-2012

One person, one genotype 8 12 locus

DNA reference data One or two allele peaks at a locus victim peak height peak size Cybergenetics © 2007-2010

Two people, two genotypes 8 12 locus 10 13

DNA mixture data Quantitative peak height pattern at 13 genetic loci One locus D7S820 Evidence from victim's fingernails victim other Cybergenetics © 2007-2010

Hierarchical Bayesian model Mixture weight Genotype • small DNA amounts • degraded contributions • K = 1, 2, 3, 4, 5, 6, ... unknown contributors • joint likelihood function

Markov chain Monte Carlo Sample from joint posterior probability distribution victim 93% 7% other

Separate mixture data into two contributor components Mixture weight Separate mixture data into two contributor components 7% 93% other victim

Genotype inference Thorough: consider every possible genotype solution Objective: does not know the comparison genotype Victim's allele pair Explain the peak pattern Another person's allele pair Better explanation has a higher likelihood

Evidence genotype Objective genotype determined solely from the DNA data. Never sees a comparison reference. 99.9%

How much more does the suspect match the evidence DNA match information How much more does the suspect match the evidence than a random person? Likelihood ratio 99.9% Prob(evidence match) Prob(coincidental match) LR = = 58 1.7%

DNA match statistic Product of 13 independent genetic loci A match between the victim's fingernails and the suspect is 189 billion times more probable than coincidence. Statistic Method CPI = 13 thousand inclusion (human) LR = 189 billion TrueAllele (computer) Cybergenetics © 2007-2010

TrueAllele Casework on Virginia DNA Mixture Evidence: Computer and Manual Interpretation in 72 Reported Criminal Cases. Perlin MW, Dormer K, Hornyak J, Schiermeier-Wood L, Greenspoon S PLoS ONE (2014) 9(3): e92837 Sensitive The extent to which interpretation identifies the correct person True DNA mixture inclusions 101 reported genotype matches 82 with DNA statistic over a million

TrueAllele sensitivity log(LR) match distribution 11.05 (5.42) 113 billion TrueAllele

Specific The extent to which interpretation does not misidentify the wrong person True exclusions, without false inclusions 101 matching genotypes x 10,000 random references x 3 ethnic populations, for over 1,000,000 nonmatching comparisons

TrueAllele specificity log(LR) mismatch distribution – 19.47

Reproducible The extent to which interpretation gives the same answer to the same question MCMC computing has sampling variation duplicate computer runs on 101 matching genotypes measure log(LR) variation

TrueAllele reproducibility Concordance in two independent computer runs standard deviation (within-group) 0.305

Manual inclusion method Over threshold, peaks become binary allele events Allele pairs 7, 7 7, 10 7, 12 7, 14 10, 10 10, 12 10, 14 12, 12 12, 14 14, 14 All-or-none allele peaks, disregard quantitative data Analytical threshold

Simplify data, easy procedure, CPI information 6.83 (2.22) 6.68 million CPI Combined probability of inclusion Simplify data, easy procedure, apply simple formula PI = (p1 + p2 + ... + pk)2

Modified inclusion method Higher threshold for human review Apply two thresholds, doubly disregard the data Stochastic threshold in 2010 Analytical threshold in 2000

Modified CPI information 6.83 (2.22) 6.68 million CPI 2.15 (1.68) 140 mCPI

Method comparison 6.83 (2.22) 6.68 million CPI 2.15 (1.68) mCPI 140 11.05 (5.42) 113 billion TrueAllele

Method accuracy Kolmogorov Smirnov test K-S p-value 0.106 0.215 0.106 0.215 0.561 1e-22 0.735 1e-25

Learn more about TrueAllele http://www.cybgen.com/information • Courses • Newsletters • Newsroom • Presentations • Publications http://www.youtube.com/user/TrueAllele TrueAllele YouTube channel perlin@cybgen.com