ICFIS, Leiden 21 August 2014 Norman Fenton Queen Mary University of London and Agena Ltd Limitations and opportunities of the likelihood.

Slides:



Advertisements
Similar presentations
Forensic DNA Inference ICFIS 2008 Lausanne, Switzerland Mark W Perlin, PhD, MD, PhD Joseph B Kadane, PhD Robin W Cotton, PhD Cybergenetics ©
Advertisements

Database Searches Non-random samples of N individuals Typically individuals convicted of some crime Maryland, people arrested but not convicted.
Introductory Mathematics & Statistics for Business
In 1999, Sally Clark was convicted of the murder of her two sons. The data: In 1996, her first son died apparently of cot death at a few weeks of age.
THE RISKS OF LIFE Video Conference 5 Media & Misinformation.
CS CS1512 Foundations of Computing Science 2 Lecture 23 Probability and statistics (4) © J.
CS1512 Foundations of Computing Science 2 Week 3 (CSD week 32) Probability © J R W Hunter, 2006, K van Deemter 2007.
Bayesian inference PHILOSOPHY OF SCIENCE: Thomas Bayes
Comparing Classical and Bayesian Approaches to Hypothesis Testing James O. Berger Institute of Statistics and Decision Sciences Duke University
Chapter 10: The t Test For Two Independent Samples
Chapter 7 Hypothesis Testing
Inferential Statistics and t - tests
Statistics Hypothesis Testing.
What Went Wrong in the Case of Sally Clark?
Bayes’s Theorem and the Weighing of Evidence by Juries Philip Dawid University College London.
FT228/4 Knowledge Based Decision Support Systems
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
Testing Hypotheses About Proportions
One sample means Testing a sample mean against a population mean.
Naïve Bayes. Bayesian Reasoning Bayesian reasoning provides a probabilistic approach to inference. It is based on the assumption that the quantities of.
Slide 1 PGM 2012 The Sixth European Workshop on Probabilistic Graphical Models Granada, Spain 20 September 2012 Norman Fenton Queen Mary University of.
Bayesian Networks for Risk Assessment
PROBABILITY AND STATISTICS IN THE LAW Philip Dawid University College London.
Presented By: Syeda Saleha Raza. A young girl, Lulu, has been found murdered at her home with many knife wounds. The knife has not been found. Some bloodstains.
Fundamentals of Forensic DNA Typing Slides prepared by John M. Butler June 2009 Appendix 3 Probability and Statistics.
1 Chapter 12 Probabilistic Reasoning and Bayesian Belief Networks.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
PMAI 2003/04 Targil 1: Basic Probability (see further details in handouts)
Hypothesis Testing After 2 hours of frustration trying to fill out an IRS form, you are skeptical about the IRS claim that the form takes 15 minutes on.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Inductive Reasoning Bayes Rule. Urn problem (1) A B A die throw determines from which urn to select balls. For outcomes 1,2, and 3, balls are picked from.
Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
Does Naïve Bayes always work?
The Inexpert Witness Born 1933 Distinguished paediatrician Famous for “Munchausen Syndrome by Proxy” Expert witness in cases of suspected child abuse and.
Testing Hypotheses I Lesson 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics n Inferential Statistics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Overview Definition Hypothesis
ANOVA Greg C Elvers.
Page 79 Exercise 5A Homework - using GCSE notes for review prior to starting this unit.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Scales and probability measures The states of a random variable can be given on different scales 1) Nominal scale A scale where the states have no numerical.
 Basic Concepts in Probability  Basic Probability Rules  Connecting Probability to Sampling.
Bayesian Networks for Data Mining David Heckerman Microsoft Research (Data Mining and Knowledge Discovery 1, (1997))
Exeter University 10 October 2013 Norman Fenton Director of Risk & Information Management Research (Queen Mary University of London) and CEO of Agena Ltd.
Chapter 20 Testing hypotheses about proportions
Copyright © 2010, 2007, 2004 Pearson Education, Inc Section 8-2 Basics of Hypothesis Testing.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Uncertainty Management in Rule-based Expert Systems
Uncertainty in Expert Systems
why smart data is better than big data Queen Mary University of London
1 Chapter 12 Probabilistic Reasoning and Bayesian Belief Networks.
"Classical" Inference. Two simple inference scenarios Question 1: Are we in world A or world B?
Slide 1 UCL JDI Centre for the Forensic Sciences 21 March 2012 Norman Fenton Queen Mary University of London and Agena Ltd Bayes and.
Sally Clark. Sally Clark (August 1964 – 15 March 2007) was a British lawyer who became the victim of a miscarriage of justice when she was wrongly convicted.
Inferential Statistics Inferential statistics allow us to infer the characteristic(s) of a population from sample data Slightly different terms and symbols.
STA Lecture 221 !! DRAFT !! STA 291 Lecture 22 Chapter 11 Testing Hypothesis – Concepts of Hypothesis Testing.
Textbook Basics of an Expert System: – “Expert systems: Design and Development,” by: John Durkin, 1994, Chapters 1-4. Uncertainty (Probability, Certainty.
1 Definitions In statistics, a hypothesis is a claim or statement about a property of a population. A hypothesis test is a standard procedure for testing.
One Sample Inf-1 In statistical testing, we use deductive reasoning to specify what should happen if the conjecture or null hypothesis is true. A study.
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
Chapter 20 Testing Hypotheses About Proportions. confidence intervals and hypothesis tests go hand in hand:  A confidence interval shows us the range.
AP Statistics From Randomness to Probability Chapter 14.
Does Naïve Bayes always work?
Bayesian networks: opportunities and challenges in the law
What’s Reasonable About a Range?
Session II: Reserve Ranges Who Does What
Data Mining Lecture 11.
The Language of Statistical Decision Making
Honors Statistics From Randomness to Probability
Testing Hypotheses I Lesson 9.
Presentation transcript:

ICFIS, Leiden 21 August 2014 Norman Fenton Queen Mary University of London and Agena Ltd Limitations and opportunities of the likelihood ratio approach for evidence evaluation Fenton, N. E., D. Berger, D. Lagnado, M. Neil and A. Hsu, "When ‘neutral’ evidence still has probative value (with implications from the Barry George Case)", Science and Justice, Volume 54, Issue 4, Pages 274–287, July 2014

Overview 1.Revisiting the Likelihood Ratio – first principles, theoretical benefits and limitations 2.Revisiting the LR in two well-known cases 3.Why Bayesian networks are needed 4.Conclusions and way forward

1. REVISITING THE LIKELIHOOD RATIO – FIRST PRINCIPLES

Was Mrs Peacock the murderer? H: “Mrs Peacock guilty” P(H) = 1/6 E: “The murderer was female P(E | H) = 1 P(E) = 1/2 By Bayes P(H | E) = 1/3

When does evidence E support a hypothesis H? When our belief in H increases as a result of observing E, i.e. when P(H | E) > P(H) So, suppose E supports H and that H’ is an alternative mutually exclusive hypothesis. Can we conclude that our belief in H’ must have decreased? NO!! (suppose H’ is “Miss Scarlett is murderer”) Except when H’ = not H P(H’ | E) = 1 – P(H | E) < 1 – P(H) = P(H’)

A simple formal definition of probative value of evidence

Because of obsessive and irrational fear of the explicit PRIOR P(H)

Instead the Likelihood Ratio is used

Benefits of the LR Simple formula for probative value of evidence No need to explicitly consider prior for H Forces forensic experts to consider the likelihood of both the prosecution hypothesis and the defence hypothesis

But note: It is meaningless to talk about the Likelihood Ratio being a measure of the probative value of evidence without explicit reference to Bayes Theorem

And especially note….. When H’ ≠ not H the notion that the LR is a measure of probative value of evidence is tenuous and potentially misleading

When H’ ≠ not H… Knowing that LR>1 just tells us that the ratio of posterior probabilities (of H and H’) is greater than the ratio of prior probabilities So all we can conclude is E supports H more than it supports H’. But E may not support H at all because we can still have P(H|E) < P(H) Similarly LR=1 only tells us E supports both H and H’ equally. That does NOT mean E is neutral; P(H|E) might be very different to P(H)

Was Mrs Peacock the murderer? H: “Mrs Peacock guilty” E: “The murderer was female P(E | H) = 1 P(E | not H) = 2/5 LR= 2.5 But if H’: “Miss Scarlet guilty” P(E | H’) = 1 LR=1

Issues and limitations of the LR ‘probative value’ is not what people think it means when H’ is different from not H But it is difficult to work with exhaustive pairs of hypotheses Priors can never be truly ignored Evidence E is rarely ‘simple’ – normally involves E1 and E2 that require separate likelihoods Can be difficult even to avoid non-mutually exclusive hypotheses in practice Even if we get it all right LR of source level hypotheses tells us NOTHING about LR of offense level hypotheses

Example Fred and Joe live at the same address. Gun X is registered to that address. Bob is found murdered from a gun shot. Evidence E: “there is a gun in Fred’s house with FDR that matched that from the crime scene.” Fred is charged with the murder of Bob. The offence level hypotheses are: H p : Fred fired the shot that killed Bob not H p : Fred did not fire the shot that killed Bob The source level hypotheses are: H p1 : Fred owned gun that killed Bob not H p1 : Fred did not own gun that killed Bob

Some reasonable assumptions LR=1 for source level hypotheses….. but E has real probative value on Hp Essentially irrelevant

Prior state of the BN

Calculating the probability of evidence E under the two values for H1p P(E | not H1 p ) = (unchanged from prior) P(E | H1 p ) = (unchanged from prior)

Evidence is observed Probability of Hp jumps from 1% to over 9%..the evidence is certainly not neutral

2. REVISITING THE LR IN TWO WELL-KNOWN CASES

R v Sally Clark Convicted and ultimately cleared of murdering her 2 children

Sally Clark Revisited: A new issue in the probability experts’ reasoning Hd : Sally Clark’s two babies died of SIDS Hp : Sally Clark murdered her two babies “(Prior) probability of Hd over 100 times greater than (prior) probability of Hp” “So assuming LR of 5 posterior of Hd still 20 greater Hd : Sally Clark’s two babies died of SIDS Hp : Sally Clark murdered at least one of her two babies. (Prior) probability of Hd only 2.5 times greater than the (prior) probability of Hp

R v Barry George, Jill Dando

R v Barry George (revisiting the Appeal Court judgment) Hp: Hypothesis “BG was man who shot JD” E: “Single particle of FDR matching that from the gun that killed JD found in BG coat pocket Defence likelihood P(E|not Hp) = 1/100 …But Prosecution likelihood P(E| Hp) = 1/100 So LR = 1 and evidence ‘has no probative value’ But the appeal transcript suggests a problem… Fenton, N. E., D. Berger, D. Lagnado, M. Neil and A. Hsu, "When ‘neutral’ evidence still has probative value (with implications from the Barry George Case)", Science and Justice, Volume 54, Issue 4, Pages 274–287, July 2014

Confusion from experts about the hypotheses Not clear that Hp stated was really the same prosecution hypothesis considered by the experts – H1 p : “The particle found in BG’s pocket came from a gun fired by BG”. – H2 p : “The particle found in BG’s pocket came from the gun that killed JD”. Transcript suggests the experts did not adhere to the assumption that defence hypothesis Hd was simply “not Hp”, i.e. “BG was not the man who shot JD”. – H1 d : “Integrity of BG coat was corrupted”

LR=1? P(E | H p ) = P(E | H1 d ) but the evidence E is not neutral as concluded by expert and accepted by the court. It favours H p.

3. WHY BAYESIAN NETWORKS ARE NEEDED

More comprehensive BN model needed in BG case

Target is type X Target is source Source is type X Target tested X Source tested X Even single piece of forensic match evidence is NOT a 2-node BN Source is type X

Bayesian nets: what we need to stress Separate out assumptions from calculations Can incorporate subjective, expert judgement Can address the standard resistance to using subjective probabilities by using ranges. Easily show results from different assumptions …but must be seen as the ‘calculator’

The potential of Bayesian Networks “I assert that we now have a technology that is ready for use, not just by the scholars of evidence, but by trial lawyers.” Edwards, W. (1991). "Influence Diagrams, Bayesian Imperialism, and the Collins case: an appeal to reason." Cardozo Law Review 13:

4. CONCLUSIONS AND WAY FORWARD

Summary LR and probative value of evidence may not be what people think it is In isolation the LR may be highly misleading Doing things correctly requires fuller models - BNs But Bayesian arguments cannot be presented from first principles

Blatant Plug for Book CRC Press, ISBN: , ISBN 10: