“Using Computer Technology to Overcome Bottlenecks in the Forensic DNA Testing Process and Improve Data Recovery from Complex Samples”

Slides:



Advertisements
Similar presentations
Overcoming DNA Stochastic Effects 2010 NEAFS & NEDIAI Meeting November, 2010 Manchester, VT Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics.
Advertisements

Finding Truth in DNA Mixture Evidence Innocence Network Conference April, 2013 Charlotte, NC Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA.
Creating informative DNA libraries using computer reinterpretation of existing data Northeastern Association of Forensic Scientists November, 2011 Newport,
#1 Task Determine if there is physical evidence that connects the suspect to the crime.Determine if there is physical evidence that connects the suspect.
Preventing rape in the military through effective DNA computing Forensics Europe Expo Forensics Seminar Theatre April, 2014 London, UK Mark W Perlin, PhD,
STATE AND NATIONAL DNA DATABASES: CODIS Division Scientific Services CT Forensic Science Laboratory Dr. Michael Bourke
How TrueAllele® Works (Part 1)
TrueAllele ® Casework Validation on PowerPlex ® 21 Mixture Data Australian and New Zealand Forensic Science Society September, 2014 Adelaide, South Australia.
The Basics of CODIS Combined DNA Index System
Using TrueAllele ® Casework to Separate DNA Mixtures of Relatives California Association of Criminalists October, 2014 San Francisco, CA Jennifer Hornyak,
Kern Regional Crime Laboratory Laboratory Director: Dr. Kevin W. P. Miller TRUEALLELE® WORK AND WORKFLOW: KERN COUNTY’S FIRST CASES APRIL 23, 2014.
Revolutionising DNA analysis in major crime investigations The Investigator Conferences Green Park Conference Centre May, 2014 Aylesbury, Buckinghamshire.
Expert Systems for Automated STR Analysis SWGDAM Quantico, VA Mark W. Perlin January, 2003.
Revolutionising DNA analysis in major crime investigations The Investigator Conferences Green Park Conference Centre May, 2014 Aylesbury, Buckinghamshire.
Stephen Gresko Senior Criminalist Nevada State CODIS Administrator WASHOE COUNTY SHERIFF’S OFFICE FORENSIC SCIENCE DIVISION.
More informative DNA identification: Computer reinterpretation of existing data Ria David, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
Scientific Validation of Mixture Interpretation Methods 17th International Symposium on Human Identification Sponsored by the Promega Corporation October,
Computer Interpretation of Uncertain DNA Evidence National Institute of Justice Computer v. Human June, 2011 Arlington, VA Mark W Perlin, PhD, MD, PhD.
Automated STR Data Analysis: Validation Studies Automated Analysis Databasing Validation Casework Studies Mark W. Perlin (Cybergenetics, Pittsburgh, PA)
TrueAllele ® Modeling of DNA Mixture Genotypes California Association of Crime Laboratory Directors October, 2014 San Francisco, CA Mark W Perlin, PhD,
Using Expert Systems (TrueAllele ® ) for Forensic STR DNA Analysis National Institute of Justice Grantees Meeting 2011 Using Expert Systems (TrueAllele.
TrueAllele ® Genetic Calculator: Implementation in the NYSP Crime Laboratory NYS DNA Subcommittee May 19, 2010 Barry Duceman, Ph.D New York State Police.
Separating Familial Mixtures, One Genotype at a Time Northeastern Association of Forensic Scientists November, 2014 Hershey, PA Ria David, PhD, Martin.
Cybergenetics Webinar January, 2015 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © How TrueAllele ® Works (Part 4)
An Expert System for Scoring DNA Database Profiles Dr. Mark W. Perlin Cybergenetics Pittsburgh, PA.
Unleashing Forensic DNA through Computer Intelligence Forensics Europe Expo Forensic Innovation Conference April, 2013 London, UK Mark W Perlin, PhD, MD,
Compute first, ask questions later: an efficient TrueAllele ® workflow Midwestern Association of Forensic Scientists October, 2014 St. Paul, MN Martin.
Virginia TrueAllele ® Validation Study: Casework Comparison Presented at AAFS, February, 2013 Published in PLOS ONE, March, 2014 Mark W Perlin, PhD, MD,
DNA Mapping the Crime Scene: Do Computers Dream of Electric Peaks? 23rd International Symposium on Human Identification October, 2012 Nashville, TN Mark.
Open Access DNA Database Duquesne University March, 2013 Pittsburgh, PA Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
Exploring Forensic Scenarios with TrueAllele ® Mixture Automation 59th Annual Meeting American Academy of Forensic Sciences February, 2007 Mark W Perlin,
PCR Y.Martinez, LSHS, 2014 DIRECTIONS: COPY NOTES IN ORANGE.
Forensics (DNA Technology). What is DNA? DNA is the carrier of genetic information and provides a structural plan for proteins. It consists of linear.
Simple Reporting of Complex DNA Evidence: Automated Computer Interpretation Promega 14th International Symposium on Human Identification Pointe Hilton.
How is DNA be used to solve crimes? 8 th Grade Forensic Science T. Trimpe
Separating DNA Mixtures by Computer to Identify and Convict a Serial Rapist Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Garett Sugimoto,
Explain how crime scene evidence is
Laboratory Information Management Systems (LIMS)
All rights Reserved Cengage/NGL/South-Western © 2016.
DNA: TrueAllele® Statistical Analysis, Probabilistic Genotyping
A Match Likelihood Ratio for DNA Comparison
Forensic Stasis in a World of Flux
Validating TrueAllele® genotyping on ten contributor DNA mixtures
How to Defend Yourself Against DNA Mixtures
All rights Reserved Cengage/NGL/South-Western © 2016.
DRILL: What are some things you have see on TV about the field of forensics?
Explaining the Likelihood Ratio in DNA Mixture Interpretation
DNA identification pathway
On the threshold of injustice: manipulating DNA evidence
Machines can work it out: Automated TrueAllele® workflow
Rules for DNA Comparison Analysis
Virginia TrueAllele® Validation Study: Casework Comparison
Suffolk County TrueAllele® Validation
History of Biological Evidence in Forensics
Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence
Investigative DNA Databases that Preserve Identification Information
New York State Police TrueAllele® Validation
Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA, USA
severed carotid artery
Aim: How is genetics used in law enforcement?
DNA identification pathway
TrueAllele® computer technology
2018 AAFS Annual Scientific Meeting February 22, 2018
Genotype Information Criteria for Forensic DNA Databases
The Triumph and Tragedy of DNA Evidence
Probabilistic Genotyping to the Rescue for Pinkins and Glenn
DNA Identification: Mixture Interpretation
Exonerating the Innocent with Probabilistic Genotyping
Testifying about probabilistic genotyping results
David W. Bauer1, PhD Nasir Butt2, PhD Jeffrey Oblock2
Presentation transcript:

“Using Computer Technology to Overcome Bottlenecks in the Forensic DNA Testing Process and Improve Data Recovery from Complex Samples”

Goals of DNA Testing Help identify perpetrators of crimes Eliminate the wrongfully accused Help prevent future crimes

Reducing DNA Testing Time Hire more staff Automate Humans for tasks requiring intelligence Machines for repetitive processes

Typical Laboratory Automation Setup Identify Material Human DNA Extraction Automated, ~15 years DNA Quantification DNA Amplification Automated, ~10-15 years Instrumental Analysis Automated, ~20 years DNA Profile Interpretation

How Many DNA Profiles? 96-well plate 4 to 8 allelic ladders At least two PCR controls Several DNA extraction blanks Typical plate could contain 80 to 90 DNA profiles

Interpretation Guidelines 2017 SWGDAM Guidelines for human interpretation To avoid confirmation bias, evidence samples should be interpreted before comparison to known samples

To Avoid Confirmation Bias Look at your evidence profile first Infer DNA types from the evidence without knowledge of known types Compare inferred DNA types to knowns Determine if match exists

5 minutes, 42 seconds per DNA profile Human Interpretation Eight hour workday 8 hours x 60 minutes = 480 minutes 480 minutes / 84 DNA profiles = 5 minutes, 42 seconds per DNA profile

Interpretation Bottleneck Volume of data Complexity Thresholds Data declared “Inconclusive” or “Too Complex for Interpretation”

So Throwing Out Data is the Only Way?

Alternate Approach - Automate Automate DNA interpretation using computers Use humans for tasks requiring intelligence, use machines for repetitive processes DNA interpretation can be automated with the TrueAllele® technology

TrueAllele® Processing ViewStation User Client Database Server Interpret/Match Expansion Visual User Interface VUIer™ Software Parallel Processing Computers Cybergenetics © 2007-2017

Automated Process Upload entire plate to server Computer interprets the data Check results and compare: To other data files on the same plate To known profiles To previous runs Perform detailed processing on potential matches

Benefits of Automated DNA Interpretation All data examined, nothing discarded Speed One plate in ~6 – 7 hours No confirmation bias Computer infers genotypes No prior knowledge

Benefits of Automated DNA Interpretation All data compared: Evidence, references, lab staff, crime scene investigators, controls Identify more case-to-case matches and potential contamination CODIS specimen and candidate match assessment

Database Matching TrueAllele implemented in January 2016 Server expansion modules installed May 2017 20 processors (casework and database screening) 8 (original) processors dedicated to database screening

TrueAllele Workflows Casework Database Traditional sample-by-sample analysis Database Process everything, look for matches Confirm matches in Casework process

Database Matching Uploaded all 7 years of data to BCSO TrueAllele database >7,500 DNA profiles >15,000 inferred genotypes Batched request run conditions: 5K/5K 3 unknown contributors

Database Matching ~30,000 potential matches returned Most were “within case” To date, ~80 previously unknown EVI-REF case-to-case matches

Match Evaluation Five samples from five different cases matched to each other EVI – EVI; no reference matches No matches to any other genotypes

Sample Contributor N Con KL 2016 Case (log LR) 3 19.9574 12.5962 2015 Case 1 19.675 14.7183 2017 Case 14.6881 6.2815 2015 Case 2 17.401 8.0723

Case Evaluation 2014 Home Invasion (unknown suspect) 2015 Attempted Murder (known suspect) 2015 Burglary (two known suspects) 2016 Murder (known suspect) 2017 B&E Auto (three known suspects)

Case Evaluation No suspects in common 2015 Attempted Murder & 2016 Murder: identity was not in dispute 2017 B&E Auto: suspects were witnessed and caught soon after

Possible Causes Wrongfully accused? An accomplice that we don’t know about? Contamination? Aliens?

Investigation Worked in lab at different times Worked in lab by three different examiners Genotypes did not match lab staff Genotypes did not match any case reference samples

Investigation All cases worked by same agency All items were touch DNA All evidence was collected or handled by same investigator Investigator was the only common link between all five cases

Sample Contributor N Con KL Reference (log LR) 2014 Home Invasion 2 3 19.9574 13.4844 2015 Burglary 1 19.675 15.4627 2016 Murder 23.3979 17.798 2017 B&E Auto 14.6881 11.0779 2015 Attempted Murder 17.401 4.0144

Batched request: 5K/5K 3 unknown contributors Run Conditions Batched request: 5K/5K 3 unknown contributors

2 Convergence 1.641 1.113 2.048 Automated Matching 5K/5K?

D12S391 D18S51 3 contributors?

Run Conditions Batched request: 5K/5K 1, 2, or 3 unknown contributors Identify potential matches for detailed processing

Quality Control Two of the profiles had been entered into CODIS LDIS match Other three were unsuitable for CODIS entry

CODIS Match Evaluation

CODIS Match Evaluation Example #1 Offender Human Review, 30 minutes TrueAllele,5 minutes LR CPI (1 in) #1 Uncertain Eliminated 1.5 39,000 #2 Not eliminated 2.7 #3 Match 73 billion

CODIS Match Evaluation Example #2 Offender MME CPI (1 in) LR #1 1.728 x 104 65 158 trillion

Previously Unidentified Matches 2014: Burglary case, uploaded to SDIS Offender hit Match confirmed in laboratory 2017: Process old data, upload to TrueAllele® 3 additional cases from 2012 - 2014 Never entered into CODIS

Can we use the automated process for CODIS screening? Minimum # 4x4 CPI Sample Major ? LR Contributors Rule? (1 in ) 2012 3 No No 28,000 6 quintillion 2013 3 No No 920 27 trillion 2014 3 No No 760 9 trillion Can we use the automated process for CODIS screening?

CODIS Screening KL computed by TrueAllele® MME calculated by CODIS Measures information value of inferred genotype MME calculated by CODIS Predicts matches at moderate stringency Compare MME, KL, and LR for CODIS profile assessment

What We Are Implementing Use KL to predict quality of match Use MME to filter adventitious matches High KL – build MME, search CODIS Low KL – do not upload

Summary DNA interpretation is automatable Reduce/eliminate interpretation bottlenecks Output searched internally and screened for suitable CODIS profiles Improved quality control More information recovered from same amount of data

Questions?