Familial searches and cold hit statistics Forensic Bioinformatics (www.bioforensics.com) Dan Krane Wright State University, Dayton, OH 45435.

Slides:



Advertisements
Similar presentations
Overcoming DNA Stochastic Effects 2010 NEAFS & NEDIAI Meeting November, 2010 Manchester, VT Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics.
Advertisements

Forensic DNA Inference ICFIS 2008 Lausanne, Switzerland Mark W Perlin, PhD, MD, PhD Joseph B Kadane, PhD Robin W Cotton, PhD Cybergenetics ©
DNA Mixture Interpretations and Statistics – To Include or Exclude Cybergenetics © Prescription for Criminal Justice Forensics ABA Criminal Justice.
New York State Police TrueAllele ® Casework Developmental Validation Cybergenetics © New York State DNA Subcommittee March, 2010.
Statistical Weights of DNA Profiles Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH.
Forensic DNA profiling workshop
Evaluating forensic DNA evidence Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH Steelman Visiting Scientist.
Juror Understanding of Random Match Probabilities Dale A. Nance Case Western Reserve University August, 2007.
What is necessary (and unnecessary) for analyses of offender databases Forensic Bioinformatics ( Jason R.
Forensic Bioinformatics (
Elementary Statistics for Lawyers References Evett and Weir, Interpreting DNA evidence. Balding, Weight-of-evidence for forensic DNA profiles.
Attaching statistical weight to DNA test results 1.Single source samples 2.Relatives 3.Substructure 4.Error rates 5.Mixtures/allelic drop out 6.Database.
Database Searches Non-random samples of N individuals Typically individuals convicted of some crime Maryland, people arrested but not convicted.
Database Searches Evidence profiles are compared to a list of up to 3,528,903 profiles (FBI CODIS) References Balding, Weight-of- evidence for forensic.
Welcome! 5 th Annual Expert Forum: The Science of DNA Profiling August 11-13, 2007 David H. Ponitz Conference Center Dayton, OH 5 th Annual Expert Forum:
Offender Databases and Familial Searches: Social, Ethical, and Legal Issues Mary Terrell White, PhD Department of Community Health Boonshoft School of.
Inferring the Number of Contributors to Mixed DNA Profiles David Paoletti.
Run-specific limits of quantitation and detection (an alternative to minimum peak height thresholds) Forensic Bioinformatics ( Dan.
Introductory Mathematics & Statistics for Business
Interim Analysis in Clinical Trials: A Bayesian Approach in the Regulatory Setting Telba Z. Irony, Ph.D. and Gene Pennello, Ph.D. Division of Biostatistics.
How strong is DNA evidence?
1 Probabilistic Linkage: Issues and Strategies Craig A. Mason, Ph.D. University of Maine
1 Matching EPIET introductory course Mahón, 2011.
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
CSPP European University Institute, Florence THE POLITICAL IMPLICATIONS OF INTERNET DIFFUSION Professor Richard Rose Director, Centre for the Study of.
Chapter 7 Sampling and Sampling Distributions
Type I & Type II errors Brian Yuen 18 June 2013.
Bayes’s Theorem and the Weighing of Evidence by Juries Philip Dawid University College London.
Confounding from Cryptic Relatedness in Association Studies Benjamin F. Voight (work jointly with JK Pritchard)
Putting Statistics to Work
How is DNA be used to solve crimes? 8 th Grade Forensic Science T. Trimpe
Chapter 11: The t Test for Two Related Samples
CSI Revisited The Science of Forensic DNA Analysis
Evaluating forensic DNA evidence: what software can and cannot do Forensic Bioinformatics ( Dan E. Krane, Wright State University,
Ethics and the DNA Database Presented by: Mick Carling, Police Science and Technology Unit Date: April 2014.
Chapter 1. Speed of Analysis (Technology) Power of Discrimination (Genetics) Low High SlowFast Markers Used (Biology) RFLP Single Locus Probes RFLP Multi-Locus.
DNA Forensics MUPGRET Workshop. “DNA evidence…offers prosecutors important new tools for the identification and apprehension of some of the most violent.
TrueAllele ® Casework Validation on PowerPlex ® 21 Mixture Data Australian and New Zealand Forensic Science Society September, 2014 Adelaide, South Australia.
Other DNA Issues Harry R Erwin, PhD CIS308 Faculty of Applied Sciences University of Sunderland.
DNA, crime and law enforcement Personal Genetics Education Project (pgEd) Harvard Medical School - Wu Laboratory
Inferential Statistics
Forensic Statistics From the ground up…. Basics Interpretation Hardy-Weinberg equations Random Match Probability Likelihood Ratio Substructure.
Doug Raiford Lesson 15.  Every cell has identical DNA  If know the sequence of a suspect can compare to evidence  But wait… Do we have to sequence.
Statistical weights of mixed DNA profiles Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH Forensic DNA.
How is DNA used to solve crimes? 8 th Grade Forensic Science T. Trimpe
Thinking About DNA Database Searches William C. Thompson Dept. of Criminology, Law & Society University of California, Irvine.
Cybergenetics Webinar January, 2015 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © How TrueAllele ® Works (Part 4)
DNA, crime and law enforcement Personal Genetics Education Project (pgEd) Harvard Medical School
Using Familias and FamLink Athens 29 May, Familias 10 w w.umb.no NORWEGIAN UNIVERSITY OF LIFE SCIENCES.
Implications of database searches for DNA profiling statistics Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton,
Statistical weights of single source DNA profiles Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH Forensic.
The Time Has Come to Analyze DNA Profile Databases Forensic Bioinformatics ( Dan E. Krane, Ph.D., Wright State University, Dayton,
Open Access DNA Database Duquesne University March, 2013 Pittsburgh, PA Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics ©
DNA, crime and law enforcement Personal Genetics Education Project (pgEd) Harvard Medical School
What can go wrong with DNA profiling Dan E. Krane, Wright State University, Dayton, OH Forensic DNA Profiling Video Series Forensic Bioinformatics (
All rights Reserved Cengage/NGL/South-Western © 2016.
DNA Evidence How can DNA be used to solve Crimes?.
DNA, crime and law enforcement Personal Genetics Education Project (pgEd) Harvard Medical School
Marjan Sjerps Kees van der Beek Ate Kloosterman
DNA technology in court
All rights Reserved Cengage/NGL/South-Western © 2016.
Statistical Weights of DNA Profiles
Validating TrueAllele® genotyping on ten contributor DNA mixtures
Error in the likelihood ratio: false match probability
All rights Reserved Cengage/NGL/South-Western © 2016.
JS 115 Validation Pre class activities Database issues- Continued
Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence
Aim: How is genetics used in law enforcement?
Variable Number Tandem Repeats (VNTR)
Exonerating the Innocent with Probabilistic Genotyping
Presentation transcript:

Familial searches and cold hit statistics Forensic Bioinformatics ( Dan Krane Wright State University, Dayton, OH 45435

Familial search Database search yields a close but imperfect DNA match Can suggest a relative is the true perpetrator Great Britain performs them routinely Reluctance to perform them in US since 1992 NRC report Neither the current or next generation of CODIS software performs effective searches

Relatedness does make a difference

Dr. Fred Bieber (leading proponent of searches) We ve been doing familial searches for years. The difference between investigating identical twins and other siblings is just a matter of degree.

Three strategies for familial searches Search for rare alleles (inefficient) Count matching alleles (arbitrary) Likelihood ratios with kinship analyses

Example 2003 North Carolina performed post- conviction DNA testing on evidence from a 1984 rape and murder Darryl Hunt (who had served 18 years of a life sentence) was exonerated Database search yielded best match to Anthony Brown with 16/26 alleles Brother Willard Brown tested and found to be a perfect match

Thresholds for similarity United Kingdom: being among those who match at the most alleles Virginia: be very, very close* California: appear useful* Florida: match at least 21 out of 26 alleles North Carolina: 16 out of 26 is enough * As quoted in a front page story in USA Today (by Richard Willing, Suspects get snared by a relatives DNA, 6/7/2005). Virginia has since stated that they do not and never have done familial searches. California has said that they did not do them but have just adopted a policy to allow them now.

Bieber et al.s Monte Carlo simulations 50% of the time, a sibling has the best match in a database of 50,000 80% of the time, a sibling is in the top 10 matches Investigating the relatives of people in the top 10 could increase cold hit rate from 10% to 14% 30,000 cold-hits in the U.S. as of 2006 could have been 33,000 Bieber, Brenner and Lazer Finding criminals through DNA of their relatives. Science. 312:

Is 16/26 close enough? How many pairs of randomly generated individuals match at 16+ alleles with unrelated databases of size… 1,000: 562 pairs of individuals 5,000: 13,872 pairs of individuals 10,000: 52,982 pairs of individuals Arizona DPS found 144 pairs of individuals matching at 9 or more loci in a database of 65,493 individuals

Approximate likelihood of finding a matching pair of DNA profiles in a database of unrelated individuals Database Size 1 in 10 billion 1 in 100 billion 1 in 1 trillion in 20,0001 in 200,0001 in 2 million 10,0001 in 2001 in in 20, ,0001 in 2.51 in 201 in 200 1,000,0001 in 1 1 in 2.5 Profile frequency

The birthday paradox The chance of a single, randomly chosen person having the same birthday as mine is approximately 1 in 365 But, in a group of 23 or more people there is at least a 50% chance that two will share the same birthday The number of pairwise comparisons is equal to N x (N-1)/2 Not an issue for an individual search, but how many searches are being performed?

Three strategies for familial searches Search for rare alleles (inefficient) Count matching alleles (arbitrary) Likelihood ratios with kinship analyses

Is the true DNA match a sibling or a random individual? Given a closely matching profile, who is more likely to match, a sibling or a randomly chosen, unrelated individual? Use a likelihood ratio

Probabilities of siblings matching at 0, 1 or 2 alleles Weir and NRC I only present probabilities that siblings match perfectly. HF = 1 for homozygous loci and 2 for heterozygous loci

Probabilities of parent/child matching at 0, 1 or 2 alleles Weir and NRC I only present probabilities that parent/child match perfectly.

Other familial relationships Cousins: Grandparent-grandchild; aunt/uncle-nephew- neice;half-sibings: HF = 1 for homozygous loci and 2 for heterozygous loci

Two types of errors False positives (Type I): an initial suspects family is investigated even though an unrelated individual is the actual source of the evidence sample. False negatives (Type II): an initial suspects family is not be investigated even though a relative really is the source of the evidence sample. A wide net (low LR threshold) catches more criminals but comes at the cost of more fruitless investigations. Paoletti, D., Doom, T., Raymer, M. and Krane, D Assessing the implications for close relatives in the event of similar but non-matching DNA profiles. Jurimetrics, 46:

Hypothesis testing using an LR threshold (and prior odds) of 1

Type I and II errors with prior odds of 1

Type I and II errors with prior odds of 1 and non-cognate allele frequencies

Problem(s) with familial searches Likelihood ratio Chance of error use of non- cognate database alternative suspect pool size false positive false negative

What statistical weight should be given to a familial hit? Probable Cause Case –Suspect is first identified by non- DNA evidence –DNA evidence is used to corroborate traditional police investigation Cold Hit Case –Suspect is first identified by search of DNA database –Traditional police work is no longer focus

Surveying the three (or four) proposed statistics for cold hits NRC I : 1992 National Research Council Report NRC II: 1996 National Research Council Report Bayesian (aka Balding and Donnelly): Widespread in UK and Western Europe DAB: 2000 DNA Advisory Board to FBI

The Problem: Ascertainment bias First three approaches differ in how they take into account ascertainment bias. –Ascertainment bias is a statistical effect of fact suspect first identified by search of a database –How must RMP be modified

Dr. Fred Bieber (leading proponent of searches) Familial searches create a new category of people... under lifetime genetic surveillance. It s composition would reflect existing demographic disparities in the criminal justice system. Familial searches potentially amplify these existing disparities. Bieber, Brenner and Lazer Finding criminals through DNA of their relatives. Science. 312:

Possible solutions to familial search problems Limit the size of the alternative suspect pool (e.g. by pre-screening with Y-STRs; investigator-initiated searches) Diminish the effect of incorrect allele frequency databases (e.g. with a ceiling principle approach) Use alleles not in common between the suspect and his relative to generate random match probability Limit demographic disparities (e.g. investigator-initiated searches)

Resources Internet –Forensic Bioinformatics Website: Scientists –Jason Gilder (Forensic Bioinformatics) –Fred Bieber (Harvard University) –Sandy Zabel (Northwestern University) –Larry Mueller (UC, Irvine) –Keith Inman (Forensic Analytical, Hayward, CA) Publications –Paoletti, D., Doom, T., Raymer, M. and Krane, D Assessing the implications for close relatives in the event of similar but non- matching DNA profiles. Jurimetrics, 46: –Bieber, F., Brenner, C. and Lazer, D Finding criminals through DNA of their relatives. Science 312: –Rudin, N. and Inman, K An introduction to forensic DNA analysis. New York, 2nd edition.

Recommendation 4.4: If the possible contributors of the evidence sample include relatives of the suspect, DNA profiles of those relatives should be obtained. If these profiles cannot be obtained, the probability of finding the evidence profile in those relatives should be calculated. NRC II, 1996