David W. Bauer1, PhD Nasir Butt2, PhD Jeffrey Oblock2

Slides:



Advertisements
Similar presentations
Overcoming DNA Stochastic Effects 2010 NEAFS & NEDIAI Meeting November, 2010 Manchester, VT Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics.
Advertisements

Forensic DNA Inference ICFIS 2008 Lausanne, Switzerland Mark W Perlin, PhD, MD, PhD Joseph B Kadane, PhD Robin W Cotton, PhD Cybergenetics ©
DNA Mixture Interpretations and Statistics – To Include or Exclude Cybergenetics © Prescription for Criminal Justice Forensics ABA Criminal Justice.
Finding Truth in DNA Mixture Evidence Innocence Network Conference April, 2013 Charlotte, NC Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA.
How Inclusion Interpretation of DNA Mixture Evidence Reduces Identification Information American Academy of Forensic Sciences February, 2013 Washington,
Creating informative DNA libraries using computer reinterpretation of existing data Northeastern Association of Forensic Scientists November, 2011 Newport,
How TrueAllele ® Works (Part 3) Kinship, Paternity and Missing Persons Cybergenetics Webinar December, 2014 Mark W Perlin, PhD, MD, PhD Cybergenetics,
Preventing rape in the military through effective DNA computing Forensics Europe Expo Forensics Seminar Theatre April, 2014 London, UK Mark W Perlin, PhD,
TrueAllele ® Casework Validation on PowerPlex ® 21 Mixture Data Australian and New Zealand Forensic Science Society September, 2014 Adelaide, South Australia.
Using TrueAllele ® Casework to Separate DNA Mixtures of Relatives California Association of Criminalists October, 2014 San Francisco, CA Jennifer Hornyak,
No DNA Left Behind: When "inconclusive" really means "informative" Schenectady County District Attorney’s Office January, 2014 Mark W Perlin, PhD, MD,
Revolutionising DNA analysis in major crime investigations The Investigator Conferences Green Park Conference Centre May, 2014 Aylesbury, Buckinghamshire.
Computer Interpretation of Uncertain DNA Evidence National Institute of Justice Computer v. Human June, 2011 Arlington, VA Mark W Perlin, PhD, MD, PhD.
How TrueAllele ® Works (Part 2) Degraded DNA and Allele Dropout Cybergenetics Webinar November, 2014 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh,
TrueAllele ® Genetic Calculator: Implementation in the NYSP Crime Laboratory NYS DNA Subcommittee May 19, 2010 Barry Duceman, Ph.D New York State Police.
Separating Familial Mixtures, One Genotype at a Time Northeastern Association of Forensic Scientists November, 2014 Hershey, PA Ria David, PhD, Martin.
Cybergenetics Webinar January, 2015 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © How TrueAllele ® Works (Part 4)
Cracking the DNA mixture code – computer analysis of UK crime cases Forensics Europe Expo Forensic Innovation Conference April, 2014 London, UK Mark W.
Unleashing Forensic DNA through Computer Intelligence Forensics Europe Expo Forensic Innovation Conference April, 2013 London, UK Mark W Perlin, PhD, MD,
Rapid DNA Response: On the Wings of TrueAllele Mid-Atlantic Association of Forensic Scientists May, 2015 Cambridge, Maryland Martin Bowkley, Matthew Legler,
Getting Past First Bayes with DNA Mixtures American Academy of Forensic Sciences February, 2014 Seattle, WA Mark W Perlin, PhD, MD, PhD Cybergenetics,
Compute first, ask questions later: an efficient TrueAllele ® workflow Midwestern Association of Forensic Scientists October, 2014 St. Paul, MN Martin.
TrueAllele ® Computing: All the DNA, all the time Continuing Professional Development Sydney, Australia March, 2014 Mark W Perlin, PhD, MD, PhD Cybergenetics,
Murder in McKeesport October 25, 2008 Tamir Thomas.
When Good DNA Goes Bad International Conference on Forensic Research & Technology October, 2012 Chicago, Illinois Mark W Perlin, PhD, MD, PhD Cybergenetics,
DNA Mapping the Crime Scene: Do Computers Dream of Electric Peaks? 23rd International Symposium on Human Identification October, 2012 Nashville, TN Mark.
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © Duquesne University October, 2015 Pittsburgh, PA What’s in a Match?
Open Access DNA Database Duquesne University March, 2013 Pittsburgh, PA Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
Exploring Forensic Scenarios with TrueAllele ® Mixture Automation 59th Annual Meeting American Academy of Forensic Sciences February, 2007 Mark W Perlin,
Objective DNA Mixture Information in the Courtroom: Relevance, Reliability & Acceptance NIST International Symposium on Forensic Science Error Management:
DNA Identification: Quantitative Data Modeling Cybergenetics © Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA TrueAllele ® Lectures.
DNA-led investigation through computer interpretation of evidence Pennsylvania State Police Training Seminar Hershey, PA April, 2014 Mark W Perlin, PhD,
Separating DNA Mixtures by Computer to Identify and Convict a Serial Rapist Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Garett Sugimoto,
Four person DNA mixture
A Match Likelihood Ratio for DNA Comparison
Forensic Stasis in a World of Flux
Overcoming Bias in DNA Mixture Interpretation
Validating TrueAllele® genotyping on ten contributor DNA mixtures
Error in the likelihood ratio: false match probability
Explaining the Likelihood Ratio in DNA Mixture Interpretation
DNA identification pathway
Distorting DNA evidence: methods of math distraction
On the threshold of injustice: manipulating DNA evidence
“Using Computer Technology to Overcome Bottlenecks in the Forensic DNA Testing Process and Improve Data Recovery from Complex Samples”
Machines can work it out: Automated TrueAllele® workflow
Virginia TrueAllele® Validation Study: Casework Comparison
Solving sexual assault cases using DNA mixture evidence
Suffolk County TrueAllele® Validation
American Academy of Forensic Sciences Criminalistics Section
Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence
Investigative DNA Databases that Preserve Identification Information
DNA Identification: Inclusion Genotype and LR
New York State Police TrueAllele® Validation
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA, USA
DNA Identification: Biology and Information
DNA Identification: Biology and Information
Forensic match information: exact calculation and applications
severed carotid artery
DNA identification pathway
2018 AAFS Annual Scientific Meeting February 22, 2018
Genotype Information Criteria for Forensic DNA Databases
Probabilistic Genotyping to the Rescue for Pinkins and Glenn
Forensic validation, error and reporting: a unified approach
DNA Identification: Biology and Information
DNA Identification: Mixture Interpretation
Exonerating the Innocent with Probabilistic Genotyping
Testifying about probabilistic genotyping results
Using probabilistic genotyping to distinguish family members
Reporting match error: casework, validation & language
Presentation transcript:

Developing investigative leads from a probabilistic genotyping database David W. Bauer1, PhD Nasir Butt2, PhD Jeffrey Oblock2 Mark W. Perlin1, PhD, MD, PhD  1Cybergenetics Pittsburgh, PA  2Cuyahoga County Forensic Laboratory Cleveland, OH Cybergenetics © 2003-2019

DNA evidence chain • DNA can provide identification information • DNA data requires interpretation Genotype • Link culprit to crime scene ATCGACTGATCGACGAGAACTACGATAGAGAC DATA

A broken link • Manual methods simplify the data • DNA identification information lost Allele List • Minimal connection back to culprit ATCGACTGATCGACGAGAACTACGATAGAGAC Human interpretation DATA

Probabilistic Genotyping • TrueAlleleTM probabilistic genotyping • Uses all of the data • Preserves ID information Genotype • Connect culprit to crime scene ATCGACTGATCGACGAGAACTACGATAGAGAC DATA

Investigative TrueAllele Database Create leads for criminal investigations evidence • Link suspect to evidence within cases suspect • Link suspects to other cases evidence suspect • Link evidence between crime scenes evidence

STR genotype Cell locus DNA Mother allele Father allele ACGT 1 2 3 4 5 6 7 8 9 Father allele 1 2 3 4 5 6 7 8 9 10 11 12

STR data Task: to infer genotypes from peaks Data focuses probability before after 9 Single source … 12 … 9, 9 3% 9, 9 9, 10 6% 9, 10 9, 11 3% 9, 11 RFU 9, 12 4% 9, 12 100% 10, 10 2% 10, 10 10, 11 5% 10, 11 10, 12 3% 11 10, 12 8 … … Size (bp)

STR mixture data more complexity more uncertainty less focused probability 9 No single answer 2 person mixture 7 8 … 7, 7 40% 7, 8 2% RFU 7, 9 3% 8, 8 35% 8, 9 5% 9, 9 15% Size (bp) …

Cleveland auto thefts • Spike in auto thefts during summer/fall 2015 • Gang involvement suspected 37 cases • 161 DNA evidence samples • 72 references samples

The challenges: Throughput 161 evidence samples 72 references ~12,000 comparisons X = 12,000 comparisons 10 minutes per comparison X = > 2,000 human-hours • A lot of human time performing repetitive task

The challenges: Data complexity Reported conclusions: “…complexity of the data…” “…mixture of at least 3 individuals…” “…therefore, no further conclusions can be drawn.” 11 13 9 12 RFU 10 8 Size (bp)

Interpreting complex data - TrueAllele computer proposes possibilities - Compares with data to calculate probability - Good explanations higher probability 11 13 9 12 minor major RFU middle 10 8 Size (bp)

Uncertainty ≠ Uninformative • TrueAllele separates out genotypes from data Evidence genotype for major contributor Probability 9,11 10,11 9,12 10,12 11,12 9,13 10,13 11,13 12,13 Allele pair

Uncertainty ≠ Uninformative • TrueAllele separates out genotypes from data Evidence genotype for major contributor Probability Allele pair frequency from population 9,11 10,11 9,12 10,12 11,12 9,13 10,13 11,13 12,13 Allele pair

Uncertainty ≠ Uninformative • Use genotypes to calculate match statistics Prob(evidence match) Prob(coincidental match) Likelihood ratio (LR) = Probability 7x 14% 2% 9,11 10,11 9,12 10,12 11,12 9,13 10,13 11,13 12,13 Allele pair

Uncertainty ≠ Uninformative • Allele lists loose identification information Allele list 9, 10, 11, 12, 13 1/CPI 1.14 Probability 7x 3.24, 5.92, 29.7, 34.4, 20.6 14% 2% 9,11 10,11 9,12 10,12 11,12 9,13 10,13 11,13 12,13 Allele pair

Linking suspects to crimes 161 items 72 references 127 ref-evi matches  1,200 comparisons  37 References 31 Cases

The missing links TrueAllele Database Manual & T.A. (59 case-ref) 37 References 31 Cases

Method comparison Manual review TrueAllele IPGD ~2 months from data to interpretations Few hours of analyst time 23 references 37 references At least one reference association for all 30 30 items ‘inconclusive’ 12 reported exclusions 12 positive associations

Match review Search Download 2015-4520-5.1_ncon3_100K 2 3 0.2975 0.0523 14.0146 2015-4515-5_ref 13.7692

Match review • Quick and easy access to data, genotypes, match statistics 2015-4520-5.1_ncon3_100K 2 3 0.2975 0.0523 14.0146 2015-4515-5_ref 13.7692

Match rarity • Quantify specificity • Calculated for each genotype Mean log(LR) = -7.5 • Quantify specificity Std dev = 3.13 • Calculated for each genotype For a match strength of 95.2 billion only one in 74.7 trillion people would match as strongly.

Connecting crime scenes • Use all the data • Investigative leads between cases • Database calculates match statistics between evidence items

Evidence in action • July & August 2013 in Bakersfield, CA • Three separate break-ins and sexual assault cases

Evidence in action • 37 swabs and cuttings Pants • Low level DNA mixtures • Evidence matches between cases • CODIS hit on zip tie Phone cord Zip tie in road Phone Bathtub handle

Evidence in action • 37 swabs and cuttings • Low level DNA mixtures • Evidence matches between cases • CODIS hit on zip tie • Arrest (2013) and conviction (2015) Zip tie in road

World Trade Center Powerful TrueAllele Database to identify victim remains Kinship profile TrueAllele database ~18,000 victim remains samples ~6,500 family references ~2,400 personal effects ~2,700 people

Conclusions • Automate DNA investigation • Reduce interpretation bottleneck • Extract more information from DNA data • Match suspects to crimes • Connect crime scenes together

More information www.cybgen.com • Courses • Newsletters • Newsroom • Presentations • Publications • Webinars TrueAllele YouTube channel: http://www.youtube.com/user/TrueAllele dave@cybgen.com