Lecture 14: Population Assignment and Individual Identity October 8, 2015.



Last Time  Sample calculation of F_ST  Defining populations on genetic criteria: introduction to Structure

Structure Program  One of the most widely-used programs in population genetics (original paper cited >15,000 times since 2000)  Very flexible model can determine:  The most likely number of uniform groups (populations, K)  The genomic composition of each individual (admixture coefficients)  Possible population of origin

Structure is Hierarchical: Groups reveal more substructure when examined separately Rosenberg et al Science 298:

Today  Principal Components Analysis  Genotype likelihoods  Population assignment  Forensic identification

Alternative clustering method: Principal Components Analysis  Structure is very computationally intensive  Often no clear best-supported K-value  Alternative is to use traditional multivariate statistics to find uniform groups  Principal Components Analysis is the most commonly used algorithm  EIGENSOFT (PCA, Patterson et al., 2006; PLoS Genetics 2:e190). Eckert, Population Structure, 5-Aug

Principal Components Analysis  Efficient way to summarize multivariate data like genotypes  Each axis passes through maximum variation in data, explains a component of the variation  /pca/s1.htm
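The idea of an axis passing through the maximum variation can be sketched in a few lines. This is only an illustration, not the EIGENSOFT procedure: it assumes genotypes are coded as allele counts (0, 1, 2), centers each SNP without any further normalization, and finds the first axis by power iteration. The function name and the toy data are hypothetical.

```python
import math

def first_principal_component(genotypes, iterations=200):
    """Leading PCA axis and per-individual scores via power iteration.

    genotypes: list of rows, one per individual; each entry is the count
    (0, 1, or 2) of a reference allele at one SNP.
    """
    n, m = len(genotypes), len(genotypes[0])
    # Center every SNP column on its mean allele count.
    means = [sum(row[j] for row in genotypes) / n for j in range(m)]
    centered = [[row[j] - means[j] for j in range(m)] for row in genotypes]

    # Deterministic, non-uniform start vector (an arbitrary choice).
    axis = [float(j + 1) for j in range(m)]
    for _ in range(iterations):
        # scores = X v, then v <- X^T scores: one power step on X^T X.
        scores = [sum(x * v for x, v in zip(row, axis)) for row in centered]
        axis = [sum(centered[i][j] * scores[i] for i in range(n))
                for j in range(m)]
        norm = math.sqrt(sum(v * v for v in axis))
        if norm == 0.0:
            raise ValueError("no variation in the genotype matrix")
        axis = [v / norm for v in axis]
    scores = [sum(x * v for x, v in zip(row, axis)) for row in centered]
    return axis, scores

# Hypothetical toy data: three individuals from each of two groups, four SNPs.
group_a = [[2, 2, 0, 0], [2, 1, 0, 0], [1, 2, 0, 1]]
group_b = [[0, 0, 2, 2], [0, 1, 2, 2], [0, 0, 1, 2]]
axis, scores = first_principal_component(group_a + group_b)
scores_a, scores_b = scores[:3], scores[3:]
print([round(s, 2) for s in scores])
```

In this toy example the two groups fall on opposite sides of zero along the first axis, which is the kind of separation that lets PCA stand in for an explicit clustering model.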

Once you have populations defined, can you assign a migrant individual to their population of origin?

Human Population Assignment with SNPs  Assayed 500,000 SNP genotypes for 3,192 Europeans  Used Principal Components Analysis to ordinate samples in space  High correspondence between sample ordination and geographic origin of samples  Individuals assigned to populations of origin with high accuracy  Novembre et al. Nature 456:98

Population Assignment: Likelihood  Assume you find skin cells and blood under fingernails of a murder victim  Victim had major debts with the Sicilian mafia as well as the Chinese mafia  Can population assignment help to focus investigation?  What are H_1 and H_2?

Population Assignment: Likelihood  "Assignment Tests" based on allele frequencies in source populations and genetic composition of individuals  Likelihood-Based Approaches  Calculate likelihood that individual genotype originated in particular population  Assume Hardy-Weinberg and linkage equilibria  Genotype frequencies corrected for presence of sampled individual  Usually reported as log10 likelihood for origin in given population relative to other population  Implemented in ‘GENECLASS’ program ( eneclass.html)
Likelihood of origin in population l, for m loci:
\[ L_l = \prod_{k=1}^{m} P_{lk} \]
for homozygote A_i A_i in population l at locus k:
\[ P_{lk} = p_{ilk}^2 \]
for heterozygote A_i A_j in population l at locus k:
\[ P_{lk} = 2 p_{ilk} p_{jlk} \]
where p_{ilk} is the frequency of allele A_i in population l at locus k.
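A minimal sketch of the likelihood calculation described on this slide, assuming Hardy-Weinberg proportions within loci and independence among loci. The population names, allele labels, and frequencies are made up for illustration. The sketch does not apply the correction for the presence of the sampled individual, and it assumes every observed allele has a nonzero frequency in each reference population.

```python
import math

def genotype_log10_likelihood(genotype, freqs_by_locus):
    """log10 likelihood of a multilocus genotype in one population.

    genotype: list of (allele_a, allele_b) tuples, one per locus.
    freqs_by_locus: list of dicts mapping allele -> frequency, one per locus.
    """
    total = 0.0
    for (a, b), freqs in zip(genotype, freqs_by_locus):
        pa, pb = freqs[a], freqs[b]
        # Hardy-Weinberg genotype frequency: p^2 or 2pq.
        prob = pa * pa if a == b else 2.0 * pa * pb
        total += math.log10(prob)  # product over loci becomes a sum of logs
    return total

# Hypothetical reference populations with two microsatellite loci each.
reference = {
    "pop1": [{10: 0.8, 12: 0.2}, {7: 0.5, 9: 0.5}],
    "pop2": [{10: 0.1, 12: 0.9}, {7: 0.5, 9: 0.5}],
}
unknown = [(10, 10), (7, 9)]

log_likelihoods = {name: genotype_log10_likelihood(unknown, freqs)
                   for name, freqs in reference.items()}
best = max(log_likelihoods, key=log_likelihoods.get)
log_ratio = log_likelihoods["pop1"] - log_likelihoods["pop2"]
print(best, round(log_ratio, 3))
```

Working in log10 avoids numerical underflow when many loci are multiplied together, and the difference between two populations' values is the log-likelihood ratio that assignment programs typically report.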

Power of Population Assignment using Likelihood  Assignment success depends on:  Number of markers used  Polymorphism of markers  Number of possible source populations  Differentiation of populations  Accuracy of allele frequency estimations  Rules of Thumb (Cornuet et al. 1999) for 100% assignment success, for 10 reference populations need:  30 to 50 reference individuals per population  10 microsatellite loci  HE > 0.6  FST > 0.1

Population Assignment Example: A Fish Story  Fishing competition on Lake Saimaa in Southeast Finland  Contestant allegedly caught a 5.5 kg salmon, much larger than usual for the lake  Compared fish from the lake to fish from local markets (originating from Norway and Baltic Sea)  7 microsatellites  Based on likelihood analysis, fish was purchased rather than caught in lake
[Figure: -log10 of likelihood that the observed genotype could occur in Lake Saimaa, for Lake Saimaa fish and market fish]

Genetic Typing in Forensics  Highly polymorphic loci provide unique ‘fingerprint’ for each individual  Tie suspects to blood stains, semen, skin cells, hair  Revolutionized criminal justice in last 20 years  Also used in disasters and forensic anthropology  Principles of population genetics must be applied in calculating and interpreting probability of identity

Markers in Genetic Typing  Standard set of 13 core loci for forensics: CODIS (Combined DNA Index System)  Sets of highly polymorphic microsatellites (also called VNTR (Variable Number of Tandem Repeats), STR (Short Tandem Repeat) or SSR (Simple Sequence Repeat))  Most are amplified in a single multiplex reaction and analyzed in a single capillary  Very high “exclusion power” (ability to differentiate individuals)

Individual Identity: Likelihood  Assume you find skin cells and blood under fingernails of a murder victim  A hitman for the Sicilian mafia is seen exiting the apartment  You gather DNA evidence from the skin cells and from the suspect  They have identical genotypes  What is the likelihood that the evidence came from the suspect?  What are H_1 and H_2?

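Match Probability  Probability of observing a genotype at locus k by chance in population is a function of allele frequencies:
Homozygote A_i A_i:
\[ P_k = p_{ik}^2 \]
Heterozygote A_i A_j:
\[ P_k = 2 p_{ik} p_{jk} \]
for m loci:
\[ P = \prod_{k=1}^{m} P_k \]
where p_{ik} is the frequency of allele A_i at locus k.
 Assumes unlinked (independent) loci and Hardy-Weinberg equilibrium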
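As a worked illustration of the match probability, the sketch below multiplies Hardy-Weinberg genotype frequencies across loci. The allele labels and frequencies are invented and do not come from any real database. It requires Python 3.8 or later for `math.prod`.

```python
import math

def locus_match_probability(allele_a, allele_b, freqs):
    """Hardy-Weinberg frequency of one single-locus genotype."""
    pa, pb = freqs[allele_a], freqs[allele_b]
    return pa * pa if allele_a == allele_b else 2.0 * pa * pb

def profile_match_probability(profile, freqs_by_locus):
    """Product of single-locus frequencies, assuming independent loci."""
    return math.prod(
        locus_match_probability(a, b, freqs)
        for (a, b), freqs in zip(profile, freqs_by_locus)
    )

# Hypothetical two-locus profile and allele frequencies.
freqs_by_locus = [
    {"8": 0.1, "9": 0.9},
    {"14": 0.1, "15": 0.2, "16": 0.7},
]
profile = [("8", "8"), ("14", "15")]
p_match = profile_match_probability(profile, freqs_by_locus)
print(p_match)  # about 0.0004: 0.1^2 times 2 * 0.1 * 0.2
```

Each additional independent locus multiplies in another small number, which is why a panel of highly polymorphic loci can produce very small match probabilities.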

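Probability of Identity  Probability 2 randomly selected individuals have same profile at locus k (first term homozygotes, second term heterozygotes):
\[ P_{ID,k} = \sum_{i} p_{ik}^4 + \sum_{i} \sum_{j>i} (2 p_{ik} p_{jk})^2 \]
for m loci:
\[ P_{ID} = \prod_{k=1}^{m} P_{ID,k} \]
 Exclusion Probability (E):
\[ E = 1 - P_{ID} \]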
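The probability of identity can be checked numerically with a short sketch. It assumes Hardy-Weinberg proportions, so the chance that two random individuals share a genotype is the sum of squared genotype frequencies. The 13-locus panel below is hypothetical and uses the same two-allele locus throughout purely for illustration.

```python
import math
from itertools import combinations

def locus_probability_of_identity(freqs):
    """Chance that two random individuals share a genotype at one locus."""
    homozygotes = sum(p ** 4 for p in freqs)                # (p_i^2)^2
    heterozygotes = sum((2.0 * p * q) ** 2                  # (2 p_i p_j)^2
                        for p, q in combinations(freqs, 2))
    return homozygotes + heterozygotes

def multilocus_probability_of_identity(freqs_by_locus):
    """Product over loci, assuming the loci are independent."""
    return math.prod(locus_probability_of_identity(f) for f in freqs_by_locus)

one_locus = [0.5, 0.5]               # two equally common alleles
p_id_locus = locus_probability_of_identity(one_locus)
panel = [one_locus] * 13             # hypothetical panel of 13 such loci
p_id = multilocus_probability_of_identity(panel)
exclusion = 1.0 - p_id
print(p_id_locus, p_id, exclusion)
```

Even with only two alleles per locus the per-locus value of 0.375 shrinks to a few in a million across 13 loci, and real forensic loci are far more polymorphic than this.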

Which allele frequency to use?  Human populations show some level of substructuring  F_ST generally < 0.03  Challenge is to choose proper ethnic group and account for gene flow from other groups
[Figure: comparison of Illinois Caucasian, Georgia Caucasian, and U.S. Black samples]

Substructure in human populations  G ST is quite high among the 5 major groups of human populations for CODIS microsatellites  Relatively low within groups, but not 0!

NRC (1996) recommendations  Use population that provides highest probability of observing the genotype (unless other information is known)  Correct homozygous genotypes for substructure within selected population (e.g., Native Americans, Hispanics, African Americans, Caucasians, Asian Americans)  No correction for heterozygotes
Homozygotes:
\[ P(A_i A_i) = p_i^2 + p_i (1 - p_i) F_{ST} \]
Heterozygotes:
\[ P(A_i A_j) = 2 p_i p_j \]

Why is it ‘conservative’ (from the standpoint of proving a match) to ignore substructure for heterozygotes?
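One way to explore the question above is to compare the recommended genotype frequencies with what substructure would actually predict. The sketch assumes the usual form of the homozygote correction, p^2 + p(1 - p)F_ST, and Wright's expectation 2pq(1 - F_ST) for heterozygotes in a subdivided population. Function names and numbers are illustrative only.

```python
def nrc_homozygote(p, fst):
    """Homozygote frequency with the substructure correction applied."""
    return p * p + p * (1.0 - p) * fst

def nrc_heterozygote(p_i, p_j):
    """Heterozygote frequency as recommended: no substructure correction."""
    return 2.0 * p_i * p_j

def substructured_heterozygote(p_i, p_j, fst):
    """Heterozygote frequency expected when substructure is modeled."""
    return 2.0 * p_i * p_j * (1.0 - fst)

fst = 0.03  # upper end of the range quoted for human populations
hom_plain = 0.1 ** 2
hom_corrected = nrc_homozygote(0.1, fst)
het_uncorrected = nrc_heterozygote(0.1, 0.2)
het_substructured = substructured_heterozygote(0.1, 0.2, fst)
print(hom_plain, hom_corrected)
print(het_uncorrected, het_substructured)
```

Substructure raises homozygote frequencies and lowers heterozygote frequencies. Correcting homozygotes while leaving heterozygotes at 2pq therefore keeps both estimates on the high side, which makes a coincidental match look more probable and so favors the defendant.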

What if the slimy mob defense attorney argues that the most likely perpetrator is the mob hitman’s brother, who has conveniently “disappeared”? Does the general match probability apply to near relatives?

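Probability of identity for full sibs
Heterozygotes A_i A_j (terms for 2 alleles IBD, 1 allele IBD, 0 alleles IBD):
\[ P_{sib}(A_i A_j) = \frac{1}{4} + \frac{1}{2} \cdot \frac{p_i + p_j}{2} + \frac{1}{4} \cdot 2 p_i p_j \]
Homozygotes A_i A_i (terms for 2 alleles IBD, 1 allele IBD, 0 alleles IBD):
\[ P_{sib}(A_i A_i) = \frac{1}{4} + \frac{1}{2} p_i + \frac{1}{4} p_i^2 \]
General Probability of Identity for Full Sibs:
\[ P_{IDsib} = 0.25 + 0.5 \sum_i p_i^2 + 0.5 \Big( \sum_i p_i^2 \Big)^2 - 0.25 \sum_i p_i^4 \]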

Probability of identity for full sibs  For a locus with 5 alleles, each at a frequency of 0.2:  P_ID = 0.072 (probability of identity for unrelated individuals)  P_IDsib = 0.368

What is minimum probability of identity for full sibs?
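The sib calculation and the question above can be explored numerically. This sketch assumes the standard full-sib formula P_IDsib = 0.25 + 0.5 Σp² + 0.5 (Σp²)² − 0.25 Σp⁴, which follows from full sibs sharing 2, 1, or 0 alleles identical by descent with probabilities 1/4, 1/2, and 1/4. The function names are hypothetical.

```python
from itertools import combinations

def p_id_unrelated(freqs):
    """Probability of identity for two unrelated individuals at one locus."""
    return (sum(p ** 4 for p in freqs)
            + sum((2.0 * p * q) ** 2 for p, q in combinations(freqs, 2)))

def p_id_sib(freqs):
    """Probability of identity for two full sibs at one locus."""
    sum_p2 = sum(p ** 2 for p in freqs)
    sum_p4 = sum(p ** 4 for p in freqs)
    return 0.25 + 0.5 * sum_p2 + 0.5 * sum_p2 ** 2 - 0.25 * sum_p4

five_alleles = [0.2] * 5
unrelated = p_id_unrelated(five_alleles)
sibs = p_id_sib(five_alleles)

# An extremely polymorphic locus: 1000 alleles, each at frequency 0.001.
many_alleles = [0.001] * 1000
sibs_limit = p_id_sib(many_alleles)
print(round(unrelated, 3), round(sibs, 3), round(sibs_limit, 4))
```

However polymorphic the locus, the sib value stays above 0.25, because full sibs inherit the same two alleles from their parents a quarter of the time regardless of allele frequencies.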

Example: World Trade Center Victims  Match victims using DNA collected from toothbrushes, hair brushes, or relatives  Exact matches not guaranteed  Why not?  Use likelihood to match samples to victims