DNA Mixture Statistics Cybergenetics © 2003-2013 2013 Spring Institute Commonwealth's Attorney's Services Council Richmond, Virginia March, 2013 Mark W.

Slides:



Advertisements
Similar presentations
Overcoming DNA Stochastic Effects 2010 NEAFS & NEDIAI Meeting November, 2010 Manchester, VT Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics.
Advertisements

DNA Mixture Interpretations and Statistics – To Include or Exclude Cybergenetics © Prescription for Criminal Justice Forensics ABA Criminal Justice.
The Way Forward in the UK
Finding Truth in DNA Mixture Evidence Innocence Network Conference April, 2013 Charlotte, NC Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA.
TrueAllele ® Challenges in Court and Culture Cybergenetics © th Annual DNA Technology Educational Seminar Centre of Forensic Sciences and the.
How Inclusion Interpretation of DNA Mixture Evidence Reduces Identification Information American Academy of Forensic Sciences February, 2013 Washington,
Creating informative DNA libraries using computer reinterpretation of existing data Northeastern Association of Forensic Scientists November, 2011 Newport,
Preventing rape in the military through effective DNA computing Forensics Europe Expo Forensics Seminar Theatre April, 2014 London, UK Mark W Perlin, PhD,
Coding a Safer Society through Computer Interpretation of DNA Evidence Cybergenetics © MATLAB Virtual Conference March, 2014 Mark W Perlin, PhD,
Probabilistic Software Workshop September 29, 2014 TrueAllele® Casework Mark W. Perlin, PhD, MD, PhD.
TrueAllele ® Interpretation of DNA Mixture Evidence 9 th International Conference on Forensic Inference and Statistics August, 2014 Leiden University,
How TrueAllele® Works (Part 1)
TrueAllele ® Casework Validation on PowerPlex ® 21 Mixture Data Australian and New Zealand Forensic Science Society September, 2014 Adelaide, South Australia.
Using TrueAllele ® Casework to Separate DNA Mixtures of Relatives California Association of Criminalists October, 2014 San Francisco, CA Jennifer Hornyak,
Revolutionising DNA analysis in major crime investigations The Investigator Conferences Green Park Conference Centre May, 2014 Aylesbury, Buckinghamshire.
No DNA Left Behind: When "inconclusive" really means "informative" Schenectady County District Attorney’s Office January, 2014 Mark W Perlin, PhD, MD,
Challenging DNA Evidence The Fifth Judicial District of Pennsylvania Criminal Division February, 2015 Pittsburgh, PA Mark W Perlin, PhD, MD, PhD Cybergenetics,
Revolutionising DNA analysis in major crime investigations The Investigator Conferences Green Park Conference Centre May, 2014 Aylesbury, Buckinghamshire.
Solving Cold Cases by TrueAllele ® Analysis of DNA Evidence Finding Closure: The Science, Law and Politics of Cold Case Investigations October, 2014 Pittsburgh,
TrueAllele ® Mixture Interpretation Cybergenetics © th Annual DNA Technology Educational Seminar Centre of Forensic Sciences and the Promega.
More informative DNA identification: Computer reinterpretation of existing data Ria David, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
DNA Identification Science: The Search for Truth Cybergenetics © Summer Research Symposium Duquesne University July, 2010 Mark W Perlin, PhD,
Computer Interpretation of Uncertain DNA Evidence National Institute of Justice Computer v. Human June, 2011 Arlington, VA Mark W Perlin, PhD, MD, PhD.
TrueAllele ® Modeling of DNA Mixture Genotypes California Association of Crime Laboratory Directors October, 2014 San Francisco, CA Mark W Perlin, PhD,
How TrueAllele ® Works (Part 2) Degraded DNA and Allele Dropout Cybergenetics Webinar November, 2014 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh,
TrueAllele ® Genetic Calculator: Implementation in the NYSP Crime Laboratory NYS DNA Subcommittee May 19, 2010 Barry Duceman, Ph.D New York State Police.
Reanimating Zombie™ DNA Penn State Dickinson Law School September, 2012 State College, Pennsylvania Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh,
Computer interpretation of touch DNA mixtures Seminar for Chiefs of Police in Western Pennsylvania May, 2014 Allegheny County, PA Mark W Perlin, PhD, MD,
Separating Familial Mixtures, One Genotype at a Time Northeastern Association of Forensic Scientists November, 2014 Hershey, PA Ria David, PhD, Martin.
Cybergenetics Webinar January, 2015 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © How TrueAllele ® Works (Part 4)
Cracking the DNA mixture code – computer analysis of UK crime cases Forensics Europe Expo Forensic Innovation Conference April, 2014 London, UK Mark W.
Unleashing Forensic DNA through Computer Intelligence Forensics Europe Expo Forensic Innovation Conference April, 2013 London, UK Mark W Perlin, PhD, MD,
Rapid DNA Response: On the Wings of TrueAllele Mid-Atlantic Association of Forensic Scientists May, 2015 Cambridge, Maryland Martin Bowkley, Matthew Legler,
Getting Past First Bayes with DNA Mixtures American Academy of Forensic Sciences February, 2014 Seattle, WA Mark W Perlin, PhD, MD, PhD Cybergenetics,
Compute first, ask questions later: an efficient TrueAllele ® workflow Midwestern Association of Forensic Scientists October, 2014 St. Paul, MN Martin.
TrueAllele ® interpretation of Allegheny County DNA mixtures Cybergenetics © Continuing Legal Education Allegheny County Courthouse February,
Virginia TrueAllele ® Validation Study: Casework Comparison Presented at AAFS, February, 2013 Published in PLOS ONE, March, 2014 Mark W Perlin, PhD, MD,
TrueAllele ® Computing: All the DNA, all the time Continuing Professional Development Sydney, Australia March, 2014 Mark W Perlin, PhD, MD, PhD Cybergenetics,
Murder in McKeesport October 25, 2008 Tamir Thomas.
When Good DNA Goes Bad International Conference on Forensic Research & Technology October, 2012 Chicago, Illinois Mark W Perlin, PhD, MD, PhD Cybergenetics,
DNA Mapping the Crime Scene: Do Computers Dream of Electric Peaks? 23rd International Symposium on Human Identification October, 2012 Nashville, TN Mark.
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © Duquesne University October, 2015 Pittsburgh, PA What’s in a Match?
Open Access DNA Database Duquesne University March, 2013 Pittsburgh, PA Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
Objective DNA Mixture Information in the Courtroom: Relevance, Reliability & Acceptance NIST International Symposium on Forensic Science Error Management:
DNA-led investigation through computer interpretation of evidence Pennsylvania State Police Training Seminar Hershey, PA April, 2014 Mark W Perlin, PhD,
Data summary – “alleles” Threshold Over threshold, peaks are labeled as allele events All-or-none allele peaks, each given equal status Allele Pair 8,
Separating DNA Mixtures by Computer to Identify and Convict a Serial Rapist Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Garett Sugimoto,
Understanding DNA Evidence Beaver County Courthouse March, 2016 Beaver, PA Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
Four person DNA mixture
DNA: TrueAllele® Statistical Analysis, Probabilistic Genotyping
Validating TrueAllele® genotyping on ten contributor DNA mixtures
Shedding Light on Inconclusive DNA: TrueAllele® Computer Analysis
Explaining the Likelihood Ratio in DNA Mixture Interpretation
Machines can work it out: Automated TrueAllele® workflow
Virginia TrueAllele® Validation Study: Casework Comparison
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA, USA
Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence
Investigative DNA Databases that Preserve Identification Information
DNA Identification: Biology and Information
DNA Identification: Biology and Information
severed carotid artery
TrueAllele® computer technology
Probabilistic Genotyping to the Rescue for Pinkins and Glenn
DNA Identification: Biology and Information
DNA Identification: Mixture Interpretation
Exonerating the Innocent with Probabilistic Genotyping
Testifying about probabilistic genotyping results
David W. Bauer1, PhD Nasir Butt2, PhD Jeffrey Oblock2
Using probabilistic genotyping to distinguish family members
Presentation transcript:

DNA Mixture Statistics Cybergenetics © Spring Institute Commonwealth's Attorney's Services Council Richmond, Virginia March, 2013 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA

Child molestation case June, 2011: Northern Virginia daughter's birthday slumber party 10 year old girls sleeping in basement object sexual penetration aggravated sexual battery (2 counts) Prosecutor: CDCA Nicole Wittmann

CHILD & CHILD & CHILD VICTIM Television Cabinet Table L-Shaped Couch Bathroom Bedroom Laundry Bedroom Stairs Bookcase VICTIM Storage Door to Outside Closet underpants pajama pants

DNA mixture statistics Human review (using thresholds) underpants original = 10 million modified = 1 million pajama pants original = 2 million modified = 4 Computer interpretation requested

Prosecutor question What is the true match information of the evidence to the suspect?

Biology 1 trillion cells

Nucleus cell nucleus

DNA cell nucleus chromosomes

Locus cell nucleus chromosomes locus

Allele cell nucleus chromosomes locus Short Tandem Repeat (STR) alleles

cell nucleus chromosomes locus Short Tandem Repeat (STR) genotype 7, 8 alleles Genotype

Identification Evidence item

Identification Lab Evidence item Evidence data

Identification Evidence item Evidence data Evidence genotype , 12 LabInfer

Identification Evidence genotype Known genotype , 12 LabInfer Compare Evidence item Evidence data

Identification Known genotype , 12 LabInfer Compare Probability(identification) = Prob(suspect matches evidence) = 100% Evidence data Evidence item Evidence genotype

Coincidence Biological population

Coincidence Biological population Allele frequency data 10 Lab

Coincidence Biological population Allele frequency data Population genotype 10 10, 5% LabInfer Genotype product rule, combines alleles Prob(10, 12) = 2 x p 10 x p 12 Prob(10, 10) = p 10 x p 10

Coincidence Biological population Allele frequency data Population genotype Known genotype 10 10, 5% 10, 12 LabInfer Compare

Coincidence Biological population Allele frequency data Population genotype Known genotype 10 10, 5% 10, 12 LabInfer Compare Probability(coincidence) = Prob(coincidental match) = 5%

Identification information before data (population) after (evidence) Evidence changes our belief Prob(identification) Prob(coincidence) At the suspect's genotype, identification vs. coincidence?

Match statistic before data (population) after (evidence) At the suspect's genotype, identification vs. coincidence? Prob(evidence matches suspect) Prob(coincidental match) Perlin MW. Explaining the likelihood ratio in DNA mixture interpretation. Promega's Twenty First International Symposium on Human Identification, 2010; San Antonio, TX.

Match statistic Prob(evidence matches suspect) Prob(coincidental match) before data (population) after (evidence) 20 = 100% 5% = At the suspect's genotype, identification vs. coincidence?

Bayes theorem Calculate probability Belief in hypothesis after having seen data is proportional to how well hypothesis explains the data times our initial belief. All hypotheses must be considered. Need computers to do this properly. Hypothesis: Defendant contributed to DNA evidence Rev Bayes, 1763 Computers, 1985

Mixture interpretation varies National Institute of Standards and Technology Two Contributor Mixture Data, Known Victim 31 thousand (4) 213 trillion (14)

DNA mixture + Evidence item

Uncertainty Lab Evidence item Evidence data

Uncertainty , 50% 11, 30% 12, 20% + Evidence genotype LabInfer Evidence item Evidence data

Uncertainty Known genotype , 50% 11, 30% 12, 20% 10, 12 + Compare Evidence genotype LabInfer Evidence item Evidence data

Uncertainty , 50% 11, 30% 12, 20% 10, 12 + Compare Probability(identification) = Prob(suspect matches evidence) = 50% Evidence genotype LabInfer Evidence item Evidence data Known genotype

Identification information before data (population) after (evidence) Prob(identification) Prob(coincidence) At the suspect's genotype, identification vs. coincidence? Less weight of evidence, less change in our belief Numerator decreases Denominator unchanged

Match statistic before data (population) after (evidence) 10 = 50% 5% = At the suspect's genotype, identification vs. coincidence? Prob(evidence matches suspect) Prob(coincidental match)

TrueAllele operator Replicate computer runs for each item 2 or 3 unknown mixture contributors Victim genotype was considered STR evidence data.fsa genetic analyzer files Evidence genotypes probability distributions

DNA mixture data Quantitative peak heights at a locus peak size peak height

TrueAllele ® Casework ViewStation User Client Database Server Interpret/Match Expansion Visual User Interface VUIer™ Software Parallel Processing Computers

Mixture weight Separate mixture data into two contributor components 25%75%

Genotype inference Thorough: consider every possible genotype solution Objective: does not know the comparison genotype Explain the peak pattern Better explanation has a higher likelihood Victim's allele pair Another person's allele pair

Genotype inference Explain the peak pattern Worse explanation has a lower likelihood Victim's allele pair Another person's allele pair

Genotype separation major contributor

Genotype concordance

TrueAllele report Genotype probability distributions Evidence genotypeSuspect genotype Population genotype Likelihood ratio (LR) DNA match statistic

Probability(evidence match) Probability(coincidental match) 30x 3% 98% DNA match statistic

Match statistic at 15 loci

TrueAllele DNA match Black36.6 quintillion Caucasian20.7 quadrillion Hispanic212 quadrillion Black319 thousand Caucasian3.86 thousand Hispanic32.9 thousand LR match to Defendant UnderpantsPajama pants

Powers of Ten logarithmic scale thousand million billion trillionquadrillion quintillion … 000 number of zeros

Trial preparation discuss case report direct examination curriculum vitae PowerPoint slides background reading answer questions

Computer Interpretation of Quantitative DNA Evidence Commonwealth v Defendant April, 2012 Arlington, Virginia Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©

DNA genotype 8, ACGT A genetic locus has two DNA sentences, one from each parent. 9 locus Many alleles allow for many many allele pairs. A person's genotype is relatively unique. mother allele father allele repeated word An allele is the number of repeated words. A genotype at a locus is a pair of alleles.

DNA evidence interpretation Evidence item Evidence data LabInfer Evidence genotype Known genotype 10, 50% 11, 30% 12, 20% 10, 12 Compare

Computers can use all the data Quantitative peak heights at locus Penta E peak size peak height

How the computer thinks Consider every possible genotype solution Explain the peak pattern Better explanation has a higher likelihood Victim's allele pair Another person's allele pair

Evidence genotype Objective genotype determined solely from the DNA data. Never sees a suspect. 1% 98%

DNA match information Probability(evidence match) Probability(coincidental match) How much more does the suspect match the evidence than a random person? 30x 3% 98%

Match information at 15 loci

Is the suspect in the evidence? A match between the underpants and Defendant is: 36.6 quintillion times more probable than a coincidental match to an unrelated Black person 20.7 quadrillion times more probable than a coincidental match to an unrelated Caucasian person 212 quadrillion times more probable than a coincidental match to an unrelated Hispanic person

Is the suspect in the evidence? A match between the pajama pants and Defendant is: 319 thousand times more probable than a coincidental match to an unrelated Black person 3.86 thousand times more probable than a coincidental match to an unrelated Caucasian person 32.9 thousand times more probable than a coincidental match to an unrelated Hispanic person

Outcome Guilty object sexual penetration two counts of aggravated sexual battery Sentence 22 years imprisonment Court of Appeals DNA chain of custody appeal denied

TrueAllele mixture validation: Virginia case study Mark W Perlin, PhD, MD, PhD Kiersten Dormer, MS and Jennifer Hornyak, MS Cybergenetics, Pittsburgh, PA Lisa Schiermeier-Wood, MS and Susan Greenspoon, PhD Department of Forensic Science, Richmond, VA Establish the reliability of TrueAllele mixture interpretation

Case composition 72 criminal cases 92 evidence items 111 genotype comparisons Criminal offense 18 homicide 12 robbery 6 sexual assault 20 weapon

DNA mixture distribution

Data summary – “alleles” Threshold Over threshold, peaks are labeled as allele events All-or-none allele peaks, each given equal status Allele Pair 7, 7 7, 10 7, 12 7, 14 10, 10 10%10, 12 10, 14 12, 12 12, 14 14, 14

CPI information Nothing reported CPI Combined probability of inclusion

SWGDAM 2010 guidelines Threshold Under threshold, alleles less used Allele Pair 7, 7 7, 10 7, 12 7, 14 10, 10 0%10, 12 10, 14 12, 12 12, 14 14, 14 Higher threshold for human review

Modified CPI information Nothing reported CPI mCPI

SWGDAM 2010 guidelines If a stochastic threshold based on peak height is not used in the evaluation of DNA typing results, the laboratory must establish alternative criteria (e.g., quantitation values or use of a probabilistic genotype approach) for addressing potential stochastic amplification. The criteria must be supported by empirical data and internal validation and must be documented in the standard operating procedures. Use TrueAllele ® Casework for DNA mixture statistics

Validated genotyping method Perlin MW, Sinelnikov A. An information gap in DNA evidence interpretation. PLoS ONE. 2009;4(12):e8327. Perlin MW, Legler MM, Spencer CE, Smith JL, Allan WP, Belrose JL, Duceman BW. Validating TrueAllele ® DNA mixture interpretation. Journal of Forensic Sciences. 2011;56(6): Perlin MW, Belrose JL, Duceman BW. New York State TrueAllele ® Casework validation study. Journal of Forensic Sciences. 2013;58(6):in press.

TrueAllele reinterpretation Virginia reevaluates DNA evidence in 375 cases July 16, 2011 “Mixture cases are their own little nightmare,” says William Vosburgh, director of the D.C. police’s crime lab. “It gets really tricky in a hurry.” “If you show 10 colleagues a mixture, you will probably end up with 10 different answers” Dr. Peter Gill, Human Identification E-Symposium, 2005

Mixture weight Separate mixture data into two contributor components 25%75%

Genotype inference Thorough: consider every possible genotype solution Objective: does not know the comparison genotype Explain the peak pattern Better explanation has a higher likelihood Victim's allele pair Another person's allele pair Allele Pair 7, 7 7, 10 7, 12 7, 14 10, 10 98%10, 12 10, 14 12, 12 12, 14 14, 14

TrueAllele sensitivity Nothing reported CPI mCPI TrueAllele

TrueAllele specificity True exclusions, without false inclusions – 19.69

TrueAllele reproducibility log(LR 1 ) log(LR 2 ) Concordance in two independent computer runs standard deviation (within-group) 0.305

Validation results A reliable method sensitive specific reproducible TrueAllele ® Casework DNA mixture interpretation is: TrueAllele computer genotyping is more effective than human review

TrueAllele Virginia outcomes 144 cases analyzed 72 case reports – 10 trials CityCourtChargeSentence RichmondFederalWeapon50 years AlexandriaFederalBank robbery90 years QuanticoMilitaryRape3 years ChesapeakeStateRobbery26 years ArlingtonStateMolestation22 years RichmondStateHomicide35 years FairfaxStateAbduction33 years NorfolkStateHomicide8 years CharlottesvilleStateHomicide15 years HamptonStateHome invasion5 years

TrueAllele in Virginia Department of Forensic Science has their own TrueAllele system Training, validation, approvals Services centralized in Richmond DFS will provide DNA mixture statistics and court testimony

TrueAllele in the United States Casework system Interpretation services

More information Courses Newsletters Newsroom Presentations Publications