Fundamentals of Forensic DNA Typing. Slides prepared by John M. Butler, June 2009. Chapter 11: Statistical Interpretation.



Chapter 11 – Statistical Data Interpretation: Chapter Summary

Matching DNA results must be accompanied by a statistical interpretation to help determine their relevance. The frequencies of alleles and genotypes are estimated by sampling a particular population. Provided that the alleles and their loci are independent of one another, results can be combined using what is commonly referred to as “the product rule.” The random match probability for a particular DNA profile represents the chance of drawing this combination of alleles at random from a population of unrelated individuals. It is not the probability of guilt; equating the two is a mistake known as the “prosecutor’s fallacy.” Corrections for subpopulation structure and possible involvement of relatives make the profile appear less rare (that is, they raise the random match probability) and typically provide a more conservative estimate for the defendant. Unresolved mixtures and partial profiles, which are forensic realities, likewise reduce the strength of the match statistic for a particular sample.

DNA Testing Requires a Reference Sample
- Crime Scene Evidence compared to Suspect(s) (Forensic Case)
- Child compared to Alleged Father (Paternity Case)
- Victim’s Remains compared to Biological Relative (Mass Disaster ID)
- Soldier’s Remains compared to Direct Reference Sample (Armed Forces ID)
A DNA profile by itself is fairly useless because it has no context. DNA analysis for identity only works by comparison: you need a reference sample.

[Figure 11.1: A DNA profile (with specific alleles), population allele frequencies, and genetic formulas are combined to give a rarity estimate of the DNA profile (e.g., RMP or LR). John M. Butler (2009) Fundamentals of Forensic DNA Typing, Figure 11.1]

[Figure 11.2 (flowchart): John M. Butler (2009) Fundamentals of Forensic DNA Typing, Figure 11.2]
1. Decide on Number of Samples and Ethnic/Racial Grouping (usually >100 per group)
2. Gather Samples (often anonymous samples from a blood bank)
3. Analyze Samples at Desired Genetic Loci
4. Summarize DNA types (Ethnic/Racial Group 1, Ethnic/Racial Group 2)
5. Determine Allele Frequencies for Each Locus
6. Perform Statistical Tests on Data
   - Hardy-Weinberg equilibrium for allele independence
   - Linkage equilibrium for locus independence
7. Use Database(s) to Estimate an Observed DNA Profile Frequency
Other labels on the figure: See Table 11.1; Examination of genetic distance between populations.

[Figure 11.3: At each locus (Locus 1, Locus 2, Locus 3), a paternal allele and a maternal allele make up the genotype, and the genotypes together make up the DNA profile. Labels: HWE; Linkage Equilibrium (product rule). John M. Butler (2009) Fundamentals of Forensic DNA Typing, Figure 11.3]

How Statistical Calculations Are Made
- Generate data with set(s) of samples from desired population group(s)
  - Generally only samples are needed to obtain reliable allele frequency estimates
- Determine allele frequencies at each locus
  - Count number of each allele seen
- Allele frequency information is used to estimate the rarity of a particular DNA profile
  - Homozygotes (p²), heterozygotes (2pq)
  - Product rule used (multiply locus frequency estimates)
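The counting and multiplication steps on this slide can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the book's procedure: the function names, the locus names "LocusA" and "LocusB", and the five-person database are all hypothetical, and the sketch applies no minimum allele frequency or substructure correction.

```python
from collections import Counter
from math import prod

def allele_frequencies(genotypes):
    """Estimate allele frequencies at one locus by counting alleles.

    genotypes: list of (allele, allele) pairs, one pair per sampled person.
    """
    counts = Counter(a for pair in genotypes for a in pair)
    total = sum(counts.values())  # two alleles per person
    return {allele: n / total for allele, n in counts.items()}

def genotype_frequency(genotype, freqs):
    """Hardy-Weinberg expectation: p^2 for homozygotes, 2pq for heterozygotes."""
    a, b = genotype
    p, q = freqs[a], freqs[b]
    return p * p if a == b else 2 * p * q

def profile_frequency(profile, freqs_by_locus):
    """Product rule: multiply the per-locus genotype frequency estimates."""
    return prod(genotype_frequency(g, freqs_by_locus[locus])
                for locus, g in profile.items())

# Hypothetical toy database: five people typed at two made-up loci.
# A real database would have far more individuals per group.
database = {
    "LocusA": [(16, 17), (16, 16), (17, 18), (16, 17), (15, 16)],
    "LocusB": [(9, 9), (9, 11), (11, 11), (9, 11), (9, 10)],
}
freqs_by_locus = {locus: allele_frequencies(g) for locus, g in database.items()}

profile = {"LocusA": (16, 17), "LocusB": (9, 9)}
frequency = profile_frequency(profile, freqs_by_locus)
print(f"Estimated profile frequency: {frequency:.3f} (about 1 in {1 / frequency:.0f})")
```

With these made-up counts, allele 16 has frequency 0.5 and allele 17 has 0.3 at LocusA, so the heterozygote estimate is 2pq = 0.3. Allele 9 has frequency 0.5 at LocusB, so the homozygote estimate is p² = 0.25. The product rule then gives 0.075.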

How Are Such Large Numbers Generated with Random Match Probabilities?
- Each allele is sampled multiple times to produce a statistically stable allele frequency.
- Using a theoretical model from genetics, the frequency estimates for multiple loci are multiplied together to produce an estimate of the rarity of a particular DNA profile (a combination of STR alleles, based on individual allele frequencies).
- Remember that relatives share genetic characteristics and thus have STR profiles that are more similar to one another than those of unrelated individuals.
- We are not looking at every person on the planet, nor are we looking at every nucleotide in the suspect’s genome.

DNA Profile Frequency with All 13 CODIS STR Loci

Slide table columns: Locus, allele, value, allele, value, 1 in, Combined. Combined by the product rule.

Locus      Genotype
D3S1358    16,17
VWA        17,18
FGA        21,22
D8S1179    12,14
D21S11     28,30
D18S51     14,16
D5S818     12,13
D13S317    11,14
D7S820     9,9
D16S539    9,11
TH01       6,6
TPOX       8,8
CSF1PO     10,10

The Random Match Probability for this profile in the U.S. Caucasian population is 1 in 837 trillion (10¹²).

What would be entered into a DNA database for searching: 16,17-17,18-21,22-12,14-28,30-14,16-12,13-11,14-9,9-9,11-6,6-8,8-10,10

[Electropherogram: AmpFlSTR® Identifiler™ (Applied Biosystems), with peaks labeled AMEL, D3, TH01, TPOX, D2, D19, FGA, D21, D18, CSF, D16, D7, D13, D5, VWA, D8]

The Same 13 Locus STR Profile in Different Populations
- 1 in 0.84 quadrillion (10¹⁵) in U.S. Caucasian population (NIST) (1 in 837 trillion)
- 1 in 2.46 quadrillion (10¹⁵) in U.S. Caucasian population (FBI)*
- 1 in 1.86 quadrillion (10¹⁵) in Canadian Caucasian population*
- 1 in 16.6 quadrillion (10¹⁵) in African American population (NIST)
- 1 in 17.6 quadrillion (10¹⁵) in African American population (FBI)*
- 1 in 18.0 quadrillion (10¹⁵) in U.S. Hispanic population (NIST)
These values are for unrelated individuals assuming no population substructure (using only p² and 2pq).
NIST study: Butler, J.M., et al. (2003) Allele frequencies for 15 autosomal STR loci on U.S. Caucasian, African American, and Hispanic populations. J. Forensic Sci. 48(4): (

The Three Possible Outcomes of Evidence Examination
Comparison of the “Suspect” Known (K) Sample with the “Evidence” Question (Q) Sample:
- Exclusion (no match)
- Non-exclusion (“match” or “inclusion”)
- Inconclusive result: no result (or a complex mixture)
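As a rough illustration of the three outcomes, the sketch below classifies a Q-to-K comparison under a deliberately simplified single-source model. The function name `compare_profiles` and the locus labels are hypothetical. Real casework interpretation also has to deal with mixtures, allelic drop-out, and analyst judgment, none of which are modeled here.

```python
def compare_profiles(evidence, reference):
    """Classify a Q-to-K comparison under a simplified single-source model.

    evidence, reference: dicts mapping locus name -> genotype tuple.
    Loci absent from the evidence (a partial profile) are not compared.
    """
    comparable = [locus for locus in evidence if locus in reference]
    if not comparable:
        return "inconclusive"  # no result: nothing can be compared
    for locus in comparable:
        # Allele order within a genotype carries no meaning, so sort first.
        if sorted(evidence[locus]) != sorted(reference[locus]):
            return "exclusion"
    return "non-exclusion"

# Hypothetical known (K) sample typed at three loci.
reference = {"Locus 1": (16, 17), "Locus 2": (17, 18), "Locus 3": (21, 22)}

partial_evidence = {"Locus 1": (17, 16), "Locus 2": (17, 18)}
different_evidence = {"Locus 1": (15, 17), "Locus 2": (17, 18)}
no_result = {}

print(compare_profiles(partial_evidence, reference))
print(compare_profiles(different_evidence, reference))
print(compare_profiles(no_result, reference))
```

The partial evidence profile is still a non-exclusion because it agrees with the reference at every locus that can be compared.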

[Figure: The same DNA sample run with Applied Biosystems STR kits (Profiler Plus, COfiler, SGM Plus, Green I, Profiler, Blue), plotted by PCR product size (bp), with a random match probability reported for each kit. Loci shown across the kits: Amel, D3S1358, vWA, FGA, D8S1179, D21S11, D18S51, D5S818, D13S317, D7S820, D16S539, D19S433, D2S1338, TH01, TPOX, CSF1PO.]

The Statistic (Determining the Weight of the Evidence) Should Be Calculated from the Evidence

Evidence (partial profile):

Locus      Type     Statistic
Locus 1    16,17    1 in 9
Locus 2    17,18    1 in 9
Locus 3    21,22    1 in 12
Locus 4    12,14    1 in 16
Locus 5    28,30    1 in
Product = 1 in 171,000

Reference (full profile):

Locus      Type     Statistic
Locus 1    16,17    1 in 9
Locus 2    17,18    1 in 9
Locus 3    21,22    1 in 12
Locus 4    12,14    1 in 16
Locus 5    28,30    1 in 11
Locus 6    14,16    1 in 26
Locus 7    12,13    1 in 9
Locus 8    11,14    1 in 31
Locus 9    9,9      1 in 32
Locus 10   9,11     1 in 14
Locus 11   6,6      1 in 19
Locus 12   8,8      1 in 3
Locus 13   10,10    1 in
Product = 1 in 665 trillion

Match Observed at All Loci that May Be Compared. The reference sample is still a “match”; there is just less information available from the evidence for comparison.
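The partial-profile product can be checked with a short calculation. This is a sketch with one labeled assumption: the Locus 5 value is taken from the reference list, where the same 28,30 genotype is given as 1 in 11. The variable and function names are hypothetical.

```python
from math import prod

# "1 in N" values for the five loci recovered from the evidence.
# Assumption: Locus 5 uses the reference profile's value (1 in 11)
# for the same 28,30 genotype.
evidence_one_in = {
    "Locus 1": 9,
    "Locus 2": 9,
    "Locus 3": 12,
    "Locus 4": 16,
    "Locus 5": 11,
}

def combined_one_in(one_in_by_locus):
    """Product rule on '1 in N' values: multiply N across the compared loci."""
    return prod(one_in_by_locus.values())

partial_product = combined_one_in(evidence_one_in)
print(f"Partial profile: about 1 in {partial_product:,}")
```

The result, 171,072, agrees with the slide's rounded figure of 1 in 171,000. The statistic is built only from the loci that produced results in the evidence.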

Chapter 11 – Points for Discussion
- What is the purpose of providing a random match probability statistic when two DNA profiles match?
- What is the purpose of generating a population database?
- For a locus with n possible alleles, how many total genotypes are theoretically possible?
- Why utilize a minimum allele frequency?
- Why is it important to establish independence between alleles and between loci?
- What is wrong with simply saying that a suspect is included in a mixture without providing any statistics?