Thinking About DNA Database Searches William C. Thompson Dept. of Criminology, Law & Society University of California, Irvine.

Slides:



Advertisements
Similar presentations
Forensic DNA Inference ICFIS 2008 Lausanne, Switzerland Mark W Perlin, PhD, MD, PhD Joseph B Kadane, PhD Robin W Cotton, PhD Cybergenetics ©
Advertisements

Familial searches and cold hit statistics Forensic Bioinformatics ( Dan Krane Wright State University, Dayton, OH
Juror Understanding of Random Match Probabilities Dale A. Nance Case Western Reserve University August, 2007.
Elementary Statistics for Lawyers References Evett and Weir, Interpreting DNA evidence. Balding, Weight-of-evidence for forensic DNA profiles.
Attaching statistical weight to DNA test results 1.Single source samples 2.Relatives 3.Substructure 4.Error rates 5.Mixtures/allelic drop out 6.Database.
Database Searches Non-random samples of N individuals Typically individuals convicted of some crime Maryland, people arrested but not convicted.
How strong is DNA evidence?
Bayes’s Theorem and the Weighing of Evidence by Juries Philip Dawid University College London.
Ethics and the DNA Database Presented by: Mick Carling, Police Science and Technology Unit Date: April 2014.
Review of Probability.
PROBABILITY AND STATISTICS IN THE LAW Philip Dawid University College London.
Overview Fundamentals
Attacking DNA that Implicates Your Client William C. Thompson, J.D., Ph.D. Dept. of Criminology, Law & Society University of California, Irvine.
Trial By Probability Bayes’ Theorem in Court. Presented By... Dave Bucheger Jill Thompson Sally Danielson Justin Koplitz Eric Hartmann.
Evaluation and interpretation of crime forensic evidence Crime Trace recovery Potential sources of the traces scenarios producing the traces Evaluation.
Presented By: Syeda Saleha Raza. A young girl, Lulu, has been found murdered at her home with many knife wounds. The knife has not been found. Some bloodstains.
Processing physical evidence discovering, recognizing and examining it; collecting, recording and identifying it; packaging, conveying and storing it;
Fundamentals of Forensic DNA Typing Slides prepared by John M. Butler June 2009 Appendix 3 Probability and Statistics.
Part 4: Counting High 1.
Probability and Statistics of DNA Fingerprinting.
Nicholas Cowerdy QC, DPP Was pioneered in Australia by state Forensic Science Laboratory and introduced into casework in July 1989 As a result the following.
Conditional Probability: What’s the Probability of A, Given B?
The Inexpert Witness Born 1933 Distinguished paediatrician Famous for “Munchausen Syndrome by Proxy” Expert witness in cases of suspected child abuse and.
Chapter 4 Probability See.
Sections 4-1 and 4-2 Review and Preview and Fundamentals.
Statistical weights of mixed DNA profiles Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH Forensic DNA.
Apr. 8 Stat 100. To do Read Chapter 21, try problems 1-6 Skim Chapter 22.
Chapter 5: Elementary Probability Theory
Introduction to Probability Theory March 24, 2015 Credits for slides: Allan, Arms, Mihalcea, Schutze.
Nederlands Forensisch Instituut Observed and expected numbers of (partially) randomly matching profiles in the Dutch DNA database,
 Basic Concepts in Probability  Basic Probability Rules  Connecting Probability to Sampling.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
1 The Scientist Game Chris Slaughter, DrPH (courtesy of Scott Emerson) Dept of Biostatistics Vanderbilt University © 2002, 2003, 2006, 2008 Scott S. Emerson,
Comp. Genomics Recitation 3 The statistics of database searching.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Week 71 Hypothesis Testing Suppose that we want to assess the evidence in the observed data, concerning the hypothesis. There are two approaches to assessing.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Cybergenetics Webinar January, 2015 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © How TrueAllele ® Works (Part 4)
Lab 8: Individual Identity and Population Assignment.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Implications of database searches for DNA profiling statistics Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton,
8-3: Probability and Probability Distributions English Casbarro Unit 8.
Statistical weights of single source DNA profiles Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH Forensic.
Forensic DNA Analysis Basic Review 46 chromosomes per cell, 23 pairs Humans have approximately 25,000 genes Each gene has multiple versions,
Murder in McKeesport October 25, 2008 Tamir Thomas.
Individual Identity and Population Assignment Lab. 8 Date: 10/17/2012.
Attacking Non-Bayesian Reasoning William Thompson UC Irvine February 15, 2014.
Chapter 5: Probability in Our Daily Lives Section 5.4: Applying the Probability Rules.
Chapter 5 Probability in our Daily Lives Section 5.1: How can Probability Quantify Randomness?
Seventh Annual Prescriptions for Criminal Justice Forensics Program Fordham University School of Law June 3, 2016 DNA Panel.
Introduction to probability (3) Definition: - The probability of an event A is the sum of the weights of all sample point in A therefore If A1,A2,…..,An.
Marjan Sjerps Kees van der Beek Ate Kloosterman
9.3 Hypothesis Tests for Population Proportions
Forensics and CS Philip Chan.
How small probabilities affect our life?
Statistical Weights of DNA Profiles
Validating TrueAllele® genotyping on ten contributor DNA mixtures
Is 1 in a Trillion Coincidence? Not Really, Experts Find
Error in the likelihood ratio: false match probability
The science: accuracy, reliability and interpretation. Andrew P. Read
Explaining the Likelihood Ratio in DNA Mixture Interpretation
Distorting DNA evidence: methods of math distraction
Rules for DNA Comparison Analysis
Suffolk County TrueAllele® Validation
Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence
Forensic match information: exact calculation and applications
Probability.
Forensic validation, error and reporting: a unified approach
Reporting match error: casework, validation & language
Presentation transcript:

Thinking About DNA Database Searches William C. Thompson Dept. of Criminology, Law & Society University of California, Irvine

Value of DNA Match for Proving Identity Prior Odds x Likelihood Ratio = Posterior Odds May be very low 1 x RMP + FPP* *Actually RMP + [FPP x (1-RMP)], see Thompson, Taroni & Aitken, :1 million x 1 billion:1 = 1000:1 1:1 million x 1 million:1 = 1:1 1:1000 x 10,000:1 = 10:1

Mysterious Clusters and the Law of Truly Large Numbers In a truly large sample space, seemingly unusual events are bound to occur –E.g., double lottery winners; cancer clusters –See, Diaconis & Mosteller (1989). Methods for studying coincidences, JASA,

Taking Account of Coincidence When Searching Truly Large DNA Databases Should the frequency of the matching profile be presented to the jury? Standard answers: –No NRC I – test additional loci; report only freq. of those NRC II—multiply freq. by N (for database) –Yes Friedman/Donnelley—present LR but keep in mind prior odds may be very low Prosecutors Everywhere—jury should hear most impressive number possible “because it’s relevant”

My Solution: Present Profile Frequency Only When It Equals the RMP* Multiple Tests of Different Hypotheses –Search unsolved crime evidence against offender database –For each offender, p(match|not source) = frequency Multiple Tests of Same Hypothesis –Search suspect against unsolved crime database to see if he matches any unsolved crime –For this suspect, p(match|not source) = Freq. x N *RMP = p(match|suspect not the source)

My Solution: Present Profile Frequency Only When It Equals the RMP* Testing relatives of people who almost match –For most suspects, p(match|not source) = frequency of matching profile –For relatives of people who almost match, p(match|not source) >>>> frequency –Therefore it is misleading to present the frequency of the matching profile in cases where the suspect is selected because a relative almost matches

Database Searches and the Birthday Problem The probability that a randomly chosen person will have my birthday is 1 in 365 The probability that any two people in a room share a birthday can be far higher –With 23 people in a room, the likelihood that two will share a birthday exceeds 1 in 2 –With 60 people in the room, the probability is nearly 1 in 1

Database Searches and the Birthday Problem Suppose the probability of a random match between any two DNA profiles is between 1 in 10 billion and 1 in 1 trillion What is the probability of finding a match between two such profiles in a database of: –1,000 –100,000 –1,000,000

Approximate likelihood that two profiles in a DNA database will match Database Size1 in 10 billion 1 in 100 billion 1 in 1 trillion in 20,0001 in 200,0001 in 2 million 10,0001 in 2001 in in 20, ,0001 in 2.51 in 201 in 200 1,000,0001 in 1 1 in 2.5 Profile Frequency

Why present a birthday statistic in database cases? Because it is relevant…