DNA Identification: Biology and Information Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Continuing Legal Education Allegheny County Courthouse March, 2011 Cybergenetics © 2003-2011 Cybergenetics © 2007-2010
DNA cell nucleus
DNA chromosomes cell nucleus
DNA chromosomes cell nucleus locus
DNA chromosomes cell nucleus alleles locus Short Tandem Repeat (STR)
DNA chromosomes cell nucleus alleles locus Short Tandem Repeat (STR) genotype 7, 8
Identification Biological evidence
Identification Lab Biological evidence Questioned data 10 12
Identification Lab Infer Biological evidence Questioned data Questioned genotype Q 10, 12 10 12
Identification Lab Infer Biological evidence Questioned data Questioned genotype Q 10, 12 10 12 Compare Suspect genotype S 10, 12
Identification Lab Infer Biological evidence Questioned data Questioned genotype Q 10, 12 10 12 Compare Suspect genotype S Prob(identification) = Prob(suspect matches evidence) = 100% 10, 12
Biological population Coincidence Biological population
Biological population Coincidence Lab Biological population Allele frequency data 10 11 12 13 14 15 16 17
Coincidence Lab Infer Biological population Allele frequency data Population genotype R 10, 12 @ 5% 10 11 12 13 14 15 16 17 Genotype product rule, combines alleles Prob(10, 12) = 2p10p12 Prob(10, 10) = p10p10
Biological population Coincidence Lab Infer Biological population Allele frequency data Population genotype R 10, 12 @ 5% Compare 10 11 12 13 14 15 16 17 Suspect genotype S 10, 12
Biological population Coincidence Lab Infer Biological population Allele frequency data Population genotype R 10, 12 @ 5% Compare 10 11 12 13 14 15 16 17 Suspect genotype S Prob(coincidence) = Prob(suspect matches population) = 5% 10, 12
Information Likelihood ratio At the suspect's genotype, identification vs. coincidence? after (evidence) Prob(identification) data Prob(coincidence) before (population) Likelihood ratio
Information At the suspect's genotype, identification vs. coincidence? after (evidence) Prob(suspect matches evidence) data Prob(suspect matches population) before (population)
Information At the suspect's genotype, identification vs. coincidence? after (evidence) Prob(suspect matches evidence) 100% data = Prob(suspect matches population) 5% before (population) = 20
Uncertainty Biological evidence +
Uncertainty Lab Biological evidence Questioned data + 10 11 12
Uncertainty Lab Infer Biological evidence Questioned data Questioned genotype Q 10, 12 @ 50% 11, 12 @ 30% 12, 12 @ 20% + 10 11 12
Uncertainty Lab Infer Biological evidence Questioned data Questioned genotype Q 10, 12 @ 50% 11, 12 @ 30% 12, 12 @ 20% + 10 11 12 Compare Suspect genotype S 10, 12
Uncertainty Lab Infer Biological evidence Questioned data Questioned genotype Q 10, 12 @ 50% 11, 12 @ 30% 12, 12 @ 20% + 10 11 12 Compare Prob(identification) = Prob(suspect matches evidence) = 50% Suspect genotype S 10, 12
Information Likelihood ratio At the suspect's genotype, identification vs. coincidence? after (evidence) Prob(identification) data Prob(coincidence) before (population) Likelihood ratio
Information At the suspect's genotype, identification vs. coincidence? after (evidence) Prob(suspect matches evidence) 50% data = Prob(suspect matches population) 5% before (population) = 10