Individual Identity and Population Assignment Lab. 8 Date: 10/17/2012.

Slides:



Advertisements
Similar presentations
Forensic DNA Inference ICFIS 2008 Lausanne, Switzerland Mark W Perlin, PhD, MD, PhD Joseph B Kadane, PhD Robin W Cotton, PhD Cybergenetics ©
Advertisements

Juror Understanding of Random Match Probabilities Dale A. Nance Case Western Reserve University August, 2007.
Attaching statistical weight to DNA test results 1.Single source samples 2.Relatives 3.Substructure 4.Error rates 5.Mixtures/allelic drop out 6.Database.
How strong is DNA evidence?
Mapping genes with LOD score method
Statistics Review – Part II Topics: – Hypothesis Testing – Paired Tests – Tests of variability 1.
Lab 3 : Exact tests and Measuring of Genetic Variation.
Lab 3 : Exact tests and Measuring Genetic Variation.
Alleles = A, a Genotypes = AA, Aa, aa
Forensics and DNA Statistics Harry R Erwin, PhD CIS308 Faculty of Applied Sciences University of Sunderland.
Bayesian inference “Very much lies in the posterior distribution” Bayesian definition of sufficiency: A statistic T (x 1, …, x n ) is sufficient for 
Forensic DNA Analysis (Part II)
Lecture 16: Individual Identity and Paternity Analysis March 7, 2014.
Evaluation and interpretation of crime forensic evidence Crime Trace recovery Potential sources of the traces scenarios producing the traces Evaluation.
Joint Linkage and Linkage Disequilibrium Mapping
Processing physical evidence discovering, recognizing and examining it; collecting, recording and identifying it; packaging, conveying and storing it;
MALD Mapping by Admixture Linkage Disequilibrium.
. Learning – EM in ABO locus Tutorial #08 © Ydo Wexler & Dan Geiger.
Fundamentals of Forensic DNA Typing Slides prepared by John M. Butler June 2009 Appendix 3 Probability and Statistics.
Chapter 1. Speed of Analysis (Technology) Power of Discrimination (Genetics) Low High SlowFast Markers Used (Biology) RFLP Single Locus Probes RFLP Multi-Locus.
 How does the graph represent a gel? Each group filled in a ‘band’ that represents where different – sized DNA fragments would have migrated on a gel,
IGES 2003 How many markers are necessary to infer correct familial relationships in follow-up studies? Silvano Presciuttini 1,3, Chiara Toni 2, Fabio Marroni.
Statistics Are Fun! Analysis of Variance
DNA Forensics MUPGRET Workshop. “DNA evidence…offers prosecutors important new tools for the identification and apprehension of some of the most violent.
Population Genetics What is population genetics?
Probability and Statistics of DNA Fingerprinting.
Assigning individuals to ethnic groups based on 13 STR loci X. Fosella 1, F. Marroni 1, S. Manzoni 2, A. Verzeletti 2, F. De Ferrari 2, N. Cerri 2, S.
Genetic Statistic Application in Forensic Science Arthur J. Eisenberg, PhD Professor and Chairman Department of Forensic and Investigative Genetics Co-Director.
Other DNA Issues Harry R Erwin, PhD CIS308 Faculty of Applied Sciences University of Sunderland.
Forensic Statistics From the ground up…. Basics Interpretation Hardy-Weinberg equations Random Match Probability Likelihood Ratio Substructure.
Crime Scene Investigation Science in Motion Cedar Crest College.
Statistical weights of mixed DNA profiles Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH Forensic DNA.
Scientific Validation of Mixture Interpretation Methods 17th International Symposium on Human Identification Sponsored by the Promega Corporation October,
Nederlands Forensisch Instituut Observed and expected numbers of (partially) randomly matching profiles in the Dutch DNA database,
DNA evidence The DNA Double Helix Consists of so-called nucleobases always in pairs A-T, C-G. One part of the pair is inherited from the mother, the other.
Population Genetics: Chapter 3 Epidemiology 217 January 16, 2011.
Thinking About DNA Database Searches William C. Thompson Dept. of Criminology, Law & Society University of California, Irvine.
Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.
Lecture 14: Population structure and Population Assignment October 12, 2012.
Lab 7. Estimating Population Structure. Goals 1.Estimate and interpret statistics (AMOVA + Bayesian) that characterize population structure. 2.Demonstrate.
Cybergenetics Webinar January, 2015 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © How TrueAllele ® Works (Part 4)
Statistical Analysis of DNA Simple Repeats –Identical length and sequence agat agat agat agat agat Compound Repeats –Two or more adjacent simple repeats.
Lab 8: Individual Identity and Population Assignment.
Getting Past First Bayes with DNA Mixtures American Academy of Forensic Sciences February, 2014 Seattle, WA Mark W Perlin, PhD, MD, PhD Cybergenetics,
Lecture 14: Population Assignment and Individual Identity October 8, 2015.
Implications of database searches for DNA profiling statistics Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton,
Statistical weights of single source DNA profiles Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH Forensic.
Murder in McKeesport October 25, 2008 Tamir Thomas.
Lecture 15: Individual Identity and Paternity Analysis
Lab 7. Estimating Population Structure
Lecture 22: Quantitative Traits II
Objective DNA Mixture Information in the Courtroom: Relevance, Reliability & Acceptance NIST International Symposium on Forensic Science Error Management:
Exercise 1 DNA identification. To which population an individual belongs? Two populations of lab-mice have been accidentally put in a same cage. Your.
Chi square and Hardy-Weinberg
Disputed DNA Stats for a Low-level Sample: A Case Study By Dan Krane – Carrie Rowland –
Seventh Annual Prescriptions for Criminal Justice Forensics Program Fordham University School of Law June 3, 2016 DNA Panel.
Robert Page Doctoral Student in Dr. Voss’ Lab Population Genetics.
Lecture 15: Individual Identity and Forensics October 17, 2011.
Statistical Analysis of DNA
Forensics and CS Philip Chan.
Statistical Weights of DNA Profiles
Lecture 15: Individual Identity and Paternity Analysis
Explaining the Likelihood Ratio in DNA Mixture Interpretation
Distorting DNA evidence: methods of math distraction
“Using Computer Technology to Overcome Bottlenecks in the Forensic DNA Testing Process and Improve Data Recovery from Complex Samples”
Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence
Aim: How is genetics used in law enforcement?
TrueAllele® computer technology
Biotechnology Mader 19.4.
DNA Identification: Mixture Interpretation
Presentation transcript:

Individual Identity and Population Assignment Lab. 8 Date: 10/17/2012

Goals 1.Use LR method to assess probability that a person left DNA at a crime scene 2.Assign individuals to a predefined population using LR method. 3.Understand the role of human population differentiation in reliable population assignment.

Individual Identity Scenario: A person is accused of a murder. CSU was able to recover the skin cells of the perpetrator where the murder was alleged to have taken place. Genotyping of the sample gives a single DNA profile that matches the suspect.

Individual Identity Ideally… For the courtroom… =1 Expected Freq. based on HWE

Example LocusAllele 1 (frequency) Allele 2 (frequency) AA1 (0.3) A2 (0.7) BB1 (0.4)

Interpretation “It is about 15 times more likely that the sample came from the suspect than from a random person unrelated to the suspect.” Prosecutor’s Fallacy: “It is 15 times more likely that the sample is from the suspect than from someone else” Defense Attorney’s Fallacy: “Because the odds that the sample came from the suspect rather than someone else are only 15:1, there are hundreds of thousands of people who are just as likely to be the sources of the sample found at the scene

Interpretation Prosecutor’s Fallacy (part 2): “Given the DNA evidence, the probability that the sample came from somebody other than the suspect is ”.

Individual identity

Problem 1 The profile of a crime suspect genotyped for three of the CODIS loci is: Suspect Profile CODIS Locus Allele 1 (frequency) Allele 2 (frequency) D8S (0.1119) 14 (0.2238) D21S11 28 (0.1049) 31 (0.0664) D7S (0.2797) 11 (0.2797) 1.Calculate the likelihood ratio and provide a strictly correct interpretation 2.Calculate the posterior probability for each of the following scenarios: ScenarioPrior Probability P(H 1 ) Rationale for P(H 1 ) 1 1/(6.6  10 9 ) All ~6.6 billion people on the planet are considered equally likely to be the perpetrators. 2 1/(3.03  10 8 ) All ~303 million U.S. citizens are considered equally likely to be the perpetrators. 31/56,000 All ~56,000 people (including students) currently living in Morgantown are considered equally likely to be the perpetrators. 41/18 All 18 students currently enrolled for BIOL 464 are considered equally likely to be the perpetrators.

Accounting for Pop. Structure F ST typically much less than 0.01, but a conservative value of 0.01 is used with all ethnic groups. A value of 0.03 is used for Native Americans. Typically, the ethnic group with the lowest LR (most conservative) is used for the courtroom calculation.

Problem 2 There are two hypotheses about the ethnicity of the crime perpetrator from Problem 1, and the frequencies of the alleles from the profile of the suspect in these groups are as follows: Based on these allele frequencies, what would be the LR in each group: a)In the absence of substructure b)If F ST = 0.01 in group 1, and F ST = 0.03 in group 2. c)Which estimates should be used in court for each case and how should the evidence be presented? LocusAlleleFrequency in group 1Frequency in group 2 D8S D8S D21S D21S D7S

Problem 3 Starting with the profile from problem 1, use OmniPop200.1 to calculate likelihood ratios for a large number of ethnic groups. a)Determine how many additional CODIS loci would be necessary to obtain a likelihood ratio greater than in any of the ethnic groups included in the database. b)Which 2 ethnic groups show the largest difference in likelihoods? How great is the difference? What do you conclude about the importance of population structure for likelihood calculations using the CODIS loci? Does this conclusion depend on the loci or the genotypes observed?

Assigning individuals to predefined populations based on LR Lets say, H 1 is the hypothesis that an individual is from population 1 H 2 is the hypothesis that the individual is from population 2 G is the multilocus genotype of the individual. Assumptions: Both populations are in HWE. Selected loci are in linkage equilibrium. Example: LR = 230 means “the individual is 230 times more likely to originate from population 1 than from population 2”

Assignment success depends on: 1.# and variability of molecular markers. 2.# of potential source populations. 3.Accuracy of allele frequency estimations. 4.F ST.

Problem 4: Glacier National Park (British Columbia). Black bear- Less dangerous Brown bear- Dangerous Bear feces: Black or Brown Bear ?? LocusAlleleFrequency in brown bears Frequency in black bears G1A G1A196Not detected0.074 G1D G10B Not detected G10B Bear Profile LocusAllele 1Allele 2 G1A G1D178 G10B156158

Problem 5: Use GenAlex to analyze human_struc.xls and human_by_region.xls data to see success of population assignment in these two datasets. PopSelf PopOther Pop AFRICA913 AMERICA EAST_ASIA EURASIA OCEANIA Total Percent PopSelf PopOther Pop BantuKenya75 BiakaPygmy Mandenka MbutiPygmy Total Percent