Familias and Forensic genetics Thore.

Slides:



Advertisements
Similar presentations
Forensic DNA Inference ICFIS 2008 Lausanne, Switzerland Mark W Perlin, PhD, MD, PhD Joseph B Kadane, PhD Robin W Cotton, PhD Cybergenetics ©
Advertisements

Attaching statistical weight to DNA test results 1.Single source samples 2.Relatives 3.Substructure 4.Error rates 5.Mixtures/allelic drop out 6.Database.
Single Nucleotide Polymorphism And Association Studies Stat 115 Dec 12, 2006.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
1 Statistical genetics and genetical statistics Thore Egeland, Rikshospitalet and Section of Medical Statistics Joint work with P. Mostad, NR, B. Olaisen,
1 Familias Thore Egeland, Rikshospitalet and Section of Medical Statistics Joint work with P. Mostad, NR, B. Olaisen, B. Mevåg, M. Stenersen, Inst of Forensic.
Evaluation and interpretation of crime forensic evidence Crime Trace recovery Potential sources of the traces scenarios producing the traces Evaluation.
STAT 135 LAB 14 TA: Dongmei Li. Hypothesis Testing Are the results of experimental data due to just random chance? Significance tests try to discover.
Basics of Linkage Analysis
Workshop Biostatistics in relationship testing ESWG-meeting 2014 Daniel Kling Norwegian Institute of Public Health and Norwegian University of Life Sciences,
Joint Linkage and Linkage Disequilibrium Mapping
Fundamentals of Forensic DNA Typing Slides prepared by John M. Butler June 2009 Appendix 3 Probability and Statistics.
Sample size computations Petter Mostad
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Hypothesis Testing II The Two-Sample Case.
Hypothesis testing – mean differences between populations
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
PowerPoint Slides for Chapter 16: Variation and Population Genetics Section 16.2: How can population genetic information be used to predict evolution?
Inferential Statistics 2 Maarten Buis January 11, 2006.
Psyc 235: Introduction to Statistics DON’T FORGET TO SIGN IN FOR CREDIT!
No criminal on the run The concept of test of significance FETP India.
Confidence intervals and hypothesis testing Petter Mostad
Lecture 19: Association Studies II Date: 10/29/02  Finish case-control  TDT  Relative Risk.
Joint Linkage and Linkage Disequilibrium Mapping Key Reference Li, Q., and R. L. Wu, 2009 A multilocus model for constructing a linkage disequilibrium.
VI. Regression Analysis A. Simple Linear Regression 1. Scatter Plots Regression analysis is best taught via an example. Pencil lead is a ceramic material.
Lecture 13: Linkage Analysis VI Date: 10/08/02  Complex models  Pedigrees  Elston-Stewart Algorithm  Lander-Green Algorithm.
1 B-b B-B B-b b-b Lecture 2 - Segregation Analysis 1/15/04 Biomath 207B / Biostat 237 / HG 207B.
Copyright © 2004 Pearson Prentice Hall, Inc. Chapter 7 Multiple Loci & Sex=recombination.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
Allele Frequencies: Staying Constant Chapter 14. What is Allele Frequency? How frequent any allele is in a given population: –Within one race –Within.
Statistics for Decision Making Basic Inference QM Fall 2003 Instructor: John Seydel, Ph.D.
Using Familias and FamLink Athens 29 May, Familias 10 w w.umb.no NORWEGIAN UNIVERSITY OF LIFE SCIENCES.
CHAPTER 27: One-Way Analysis of Variance: Comparing Several Means
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Lab 4: Inbreeding and Kinship. Inbreeding Reduces heterozygosity Does not change allele frequencies.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Association Mapping in Families Gonçalo Abecasis University of Oxford.
Lecture 15: Individual Identity and Forensics October 17, 2011.
Chapter 13 Simple Linear Regression
Mendelian genetics in Humans: Autosomal and Sex- linked patterns of inheritance Obviously examining inheritance patterns of specific traits in humans.
Gonçalo Abecasis and Janis Wigginton University of Michigan, Ann Arbor
ESWG paper challenges – A walkthrough Athen, May 29, 2014
Validating TrueAllele® genotyping on ten contributor DNA mixtures
Inference and Tests of Hypotheses
Statistical Data Analysis
Explaining the Likelihood Ratio in DNA Mixture Interpretation
More about Tests and Intervals
Hypothesis Testing Summer 2017 Summer Institutes.
Chapter 6 Hypothesis tests.
Chapter 9 Hypothesis Testing.
Sequence comparison: Multiple testing correction
Chapter 9 Hypothesis Testing.
The Y-haplotype geography problem
Discrete Event Simulation - 4
Friday’s Class 3 Extra Credit Questions
Forensic match information: exact calculation and applications
Pedigree analysis through genetic hypothesis testing
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE
Statistical Data Analysis
Pedigree Analysis.
Linkage Analysis Problems
Hypothesis Testing S.M.JOSHI COLLEGE ,HADAPSAR
X-chromosomal markers and FamLinkX
Chapter 4 Summary.
Hardy-Weinberg Lab Data
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

Familias and Forensic genetics Thore. Egeland@umb Familias and Forensic genetics Thore.Egeland@umb.no Norwegian University of Life Sciences San Andrés Island, June 4, 2012 Presentation Leonor Petter Mostad, Daniel Kling Title: Famlias 1995: Pater 2011, Familias 2.0 (1.97) 2012 Code also released Open Familias (R) Forensic genetics: Statistical aspects emphasized; most of you know more of the technical, or lab aspects Kinship problems, not crime (mixtures) Your background? My backgroun

Purpose "..practical workshop about the statistical evaluation of forensic cases (paternity and other family relationships) by using Familias" Quoted from the mandate Key-words: Practical. Exercises. Focus on Familias Software. Case work Little/no mathematics Statistical evaluation: Useful also if you choose to report only on a verbal scale

Contents Principles Statistical evaluation Familias. Exercises LR (Paternity index) Non-standard cases Theta-correction Mutations Complex pedigrees Silent alleles. Familias. Exercises Course homepage arken.umb.no/~theg/familias/ user: familias pass word: andres Principles: Just a little bit. There is an underlying philosphy, science: ”Look to DNA”, i.e., other forensic disciplines are not as well founded Non-standard cases: Software is needed Exercises: Practical hands on exercises Homepage 9:00 - 10:00 Review of basic forensic genetics.  Principles. LR for simple cases. Spanish/Portuguese summary. 10:00 - 11:15 Introduction to Familias. Demo. Spanish/Portuguese summary 11:15 - 11:45 Coffee break 11:45 - 12:30 Familias exercises. Participants will use computer with Familias installed. 12:30 - 14:00 Lunch 14:00 - 16:00 Non-standard cases. Spanish/Portuguese summary. Complex relationships. Mutation. Theta-correction 16:00 - 17:00 Familias exercises. 17:00- 18:00 Discussion of exercises. Spanish/Portuguese summary. More questions, contributions from participants.

Review of basic forensic genetics Spanish/Portuguese summary Part I: 09-10 Review of basic forensic genetics Spanish/Portuguese summary Practicalities ended: Leonor: Review so far?

Books: Evett&Weir, Balding, Buckleton et al. Principles To evaluate the uncertainty of any given proposition it is necessary to consider at least one alternative proposition. Scientific interpretation is based on questions of the kind: “What is the probability of the evidence given the proposition?” Scientific evidence is conditioned not only by the competing propositions, but also by the framework of circumstances within which they are to be evaluated Why not only ”Man is the father”? Evaluation of evidence depends on alternative Alternative: ”unrelated”: Typically strong evidence Alternative: brother of AF: Typically weaker evidence Reporting officer: statements about DNA given proposition Court, Decision maker: statements about proposition given DNA What is the alternative? Need to know something about the case. ”All probabilities are conditional” Books: Evett&Weir, Balding, Buckleton et al. RMI

Exercise S1 Standard paternity case H1: The alleged father (AF) is the real father H2: AF and the child are unrelated. Most basic ideas can be illustrated by this simple example. There are alternatives to notation and formulation Explain notation

Exercise S1 a) Calculate LR. First marker Probabilities 0-1 (or 0-100%, not used in calculation) Conditional probabilities Explain LR pA close to 0: Strong evidence pA>>0: Weaker evidence 0.05 : From where? Interpretation: Numbers of verbal scale? Observe that we have used principles 1,2, and 3. Before next slide: Q: What is special , unique with DNA? Compared to say fingerprints A: Ther is lot of independent data Interpretation: “The data is 20 times more likely assuming AF to be the father compared to the alternative that some unknown man is the father”.

Exercise S1 c) Calculate LR. Second marker Only slightly more difficult Table, excel can cover all cases of genotype combinations.

Exercise S1 c) Calculate LR. Two markers Assumptions HW. Within marker. This is a bit hidden without more mathematics, but we have used that what is passed down from mother and father is independent and this inependence assumption is secured by HW LE. Linkage equilibrium No artefacts (mutation, silent alleles, genotyping errors) Interpretation: “The data is 200 times more likely assuming AF to be the father compared to the alternative that some unknown man is the father”. Assumptions?

Non-standard cases Complications Theta-correction Mutations Complex pedigrees Silent alleles Later

Part II: 10-1115: Introduction to Familias. Demo. Exercises S1-S3, S5. Plenum Videos available Spanish/Portuguese summary Leonor: review!i As videos 1-3; deliberately present mutations before we discuss in more detail

Part III: 1145-1230: Hands on Familias exercises Exercises S1-S6

Part IV: 14-16 Repetition. Exercises S4, S6. Plenum Non-standard cases

Non-standard cases Complications Theta-correction Mutations Complex pedigrees Silent alleles Need software How to deal with four major complications in Familias

Theta - correction Corrects for deviations from Hardy-Weinberg-Equilibrium What is HWE? Dependence between the two alleles Theta models this dependence Theta 0.01-0.03 Theta>0 : if allele A is seen many times , the probability of seing A again increases Longer story Hardy-Weinberg may not be valid for a number of reasons including non-random mating. At the simplestl evel the theta correction can be considered to correct for deviation from HW. Theta is between 0 and 1 and for 0 the usual results obtained assuming HW would apply. For theta>0 we see that the there is an excess of homozygotes (actually simple estimates of theta can be based on the difference betweenobserved and expected number of heterozygotes).

Revisit Exercise S1h (plenum) Only kinship, theta, parameter used in this window Revisit Exercise S1h (plenum)

Modelling mutations Mutation rate varies with Sex of parent and locus. Alleles tend to mutate to close alleles: Several models Assume a laboratory handles 1000 cases a year. With 10 markes and two meioses for each case, we would have something like 1000*10*2*0.005=100 mutations with an average mutation rate of 0,005. It’s hard to deal analytically with mutations and this was our original motviation for developing software. There are some ad-hoc mutations models around (at least I consider to to be ad hoc, since they tend not be based on a proper model.)

Mutations Revisit Exercise S2, S7 Females versus males Stable versus non-stable First two require two parameters Second two require only one Revisit Exercise S2, S7

Complex pedigrees I. Exercise S6

Complex pedigrees II. Exercise S6

Silent alleles I. Exercise S11 Silent alleles, null alleles

Silent alleles II. Familias 0.05+0.1+0.1+0.75=1

Silent alleles III. Exercises S11 (S12 theoretical)

Part V: 16-17 Hands on exercises: S9, S10, S11 If time: 'Extra' exercises on homepage: arken.umb.no/~theg/familias/

Part VI: 17-18 Discussion of exercises. Spanish/Portuguese summary. Questions? Exercises S1h, S4c, S5d, S9d S1h ”Discuss the assumptions underlying the calculations of this exercise: HWE; if not theta corrrection LD Mutations, silent alleles Other? Forensic markers are non-coding S4c: GF and GS share the same haplotype, freq 0.0025 LR=1/0.0025=400. Multiply 3.4? Problem: Y-based LR refers to different hypothesis; Could replace GF by say a brother and GS by say a brother S5d: W easier to interpret? Problem: prior. Why 50-50? S9d: Is there a best mutation model? Balance practical demands and theoretical demands Should a mutation model be used routinely for all markers? Case for YES: Should not adjust model once specifics of a case materialise

References Familias reference: Egeland, Mostad et al., see also http://familias.name/