Forensic Bioinformatics (www.bioforensics.com)

Slides:



Advertisements
Similar presentations
Overcoming DNA Stochastic Effects 2010 NEAFS & NEDIAI Meeting November, 2010 Manchester, VT Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics.
Advertisements

How Inclusion Interpretation of DNA Mixture Evidence Reduces Identification Information American Academy of Forensic Sciences February, 2013 Washington,
Creating informative DNA libraries using computer reinterpretation of existing data Northeastern Association of Forensic Scientists November, 2011 Newport,
Objective Computer Assisted Review of DNA Evidence Dan E. Krane Wright State University, Dayton, OH
Familial searches and cold hit statistics Forensic Bioinformatics ( Dan Krane Wright State University, Dayton, OH
Peak Height Ratios in Forensic STR Analyses: Pattern of Occurrence & Effects of Concentration Data from: Illinois State Police Laboratory Illinois State.
Genophiler: A starting point for reviewing DNA testing results Michael L. Raymer, Ph.D. Travis Doom, Ph.D.
Attaching statistical weight to DNA test results 1.Single source samples 2.Relatives 3.Substructure 4.Error rates 5.Mixtures/allelic drop out 6.Database.
Inferring the Number of Contributors to Mixed DNA Profiles David Paoletti.
Run-specific limits of quantitation and detection (an alternative to minimum peak height thresholds) Forensic Bioinformatics ( Dan.
Detecting Degradation in DNA samples
3 Person Example #2 Suspect Boxer Shorts (The Ladies Man)
2 Person Mixture #3 Questioned samples from bomb remains, no references.
CHAPTER 15: Tests of Significance: The Basics Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
January Structure of the book Section 1 (Ch 1 – 10) Basic concepts and techniques Section 2 (Ch 11 – 15): Inference for quantitative outcomes Section.
Random Match Probability Statistics
CSI Revisited The Science of Forensic DNA Analysis
Standard Data Analysis - Integration
Regression Analysis Using Excel. Econometrics Econometrics is simply the statistical analysis of economic phenomena Here, we just summarize some of the.
Lecture 12: Autosomal STR DNA Profiling
Stat 112 – Notes 3 Homework 1 is due at the beginning of class next Thursday.
TrueAllele ® Casework Validation on PowerPlex ® 21 Mixture Data Australian and New Zealand Forensic Science Society September, 2014 Adelaide, South Australia.
Simple Linear Regression. Introduction In Chapters 17 to 19, we examine the relationship between interval variables via a mathematical equation. The motivation.
Forensic Statistics From the ground up…. Basics Interpretation Hardy-Weinberg equations Random Match Probability Likelihood Ratio Substructure.
Kern Regional Crime Laboratory Laboratory Director: Dr. Kevin W. P. Miller TRUEALLELE® WORK AND WORKFLOW: KERN COUNTY’S FIRST CASES APRIL 23, 2014.
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Inference for regression - Simple linear regression
Expert Systems for Automated STR Analysis SWGDAM Quantico, VA Mark W. Perlin January, 2003.
Handling Data and Figures of Merit Data comes in different formats time Histograms Lists But…. Can contain the same information about quality What is meant.
Chapter 19: Two-Sample Problems STAT Connecting Chapter 18 to our Current Knowledge of Statistics ▸ Remember that these formulas are only valid.
Statistical weights of mixed DNA profiles Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH Forensic DNA.
2 Person Mixture #2 Vaginal swab of Victim. Case Scenario Assault occurred in dorm room Suspect says it was consensual No other parties heard or saw anything.
Analyzing and Interpreting Quantitative Data
Multiple regression - Inference for multiple regression - A case study IPS chapters 11.1 and 11.2 © 2006 W.H. Freeman and Company.
Automated STR Data Analysis: Validation Studies Automated Analysis Databasing Validation Casework Studies Mark W. Perlin (Cybergenetics, Pittsburgh, PA)
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
A Process Control Screen for Multiple Stream Processes An Operator Friendly Approach Richard E. Clark Process & Product Analysis.
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
MATH 2400 Ch. 15 Notes.
TrueAllele ® Genetic Calculator: Implementation in the NYSP Crime Laboratory NYS DNA Subcommittee May 19, 2010 Barry Duceman, Ph.D New York State Police.
Artifacts and noise in DNA profiling Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH Forensic DNA Profiling.
Cybergenetics Webinar January, 2015 Mark W Perlin, PhD, MD, PhD Cybergenetics, Pittsburgh, PA Cybergenetics © How TrueAllele ® Works (Part 4)
An Expert System for Scoring DNA Database Profiles Dr. Mark W. Perlin Cybergenetics Pittsburgh, PA.
Statistical Analysis of DNA Simple Repeats –Identical length and sequence agat agat agat agat agat Compound Repeats –Two or more adjacent simple repeats.
Unleashing Forensic DNA through Computer Intelligence Forensics Europe Expo Forensic Innovation Conference April, 2013 London, UK Mark W Perlin, PhD, MD,
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
Compute first, ask questions later: an efficient TrueAllele ® workflow Midwestern Association of Forensic Scientists October, 2014 St. Paul, MN Martin.
Chapter 8: Simple Linear Regression Yang Zhenlin.
Statistical weights of single source DNA profiles Forensic Bioinformatics ( Dan E. Krane, Wright State University, Dayton, OH Forensic.
Murder in McKeesport October 25, 2008 Tamir Thomas.
What can go wrong with DNA profiling Dan E. Krane, Wright State University, Dayton, OH Forensic DNA Profiling Video Series Forensic Bioinformatics (
Data summary – “alleles” Threshold Over threshold, peaks are labeled as allele events All-or-none allele peaks, each given equal status Allele Pair 8,
Disputed DNA Stats for a Low-level Sample: A Case Study By Dan Krane – Carrie Rowland –
Seventh Annual Prescriptions for Criminal Justice Forensics Program Fordham University School of Law June 3, 2016 DNA Panel.
Chapter 13 Simple Linear Regression
26134 Business Statistics Week 5 Tutorial
Statistical Weights of DNA Profiles
Unit 5: Hypothesis Testing
Validating TrueAllele® genotyping on ten contributor DNA mixtures
Explaining the Likelihood Ratio in DNA Mixture Interpretation
“Using Computer Technology to Overcome Bottlenecks in the Forensic DNA Testing Process and Improve Data Recovery from Complex Samples”
Two-sided p-values (1.4) and Theory-based approaches (1.5)
Solving Crimes using MCMC to Analyze Previously Unusable DNA Evidence
Lecture 10/24/ Tests of Significance
2018 AAFS Annual Scientific Meeting February 22, 2018
Day 68 Agenda: 30 minute workday on Hypothesis Test --- you have 9 worksheets to use as practice Begin Ch 15 (last topic)
DNA Identification: Mixture Interpretation
David W. Bauer1, PhD Nasir Butt2, PhD Jeffrey Oblock2
Presentation transcript:

Forensic Bioinformatics (www.bioforensics.com) Genophiler® and GenoStat®: Tools for objective evaluation of DNA testing results Jason R. Gilder August 17, 2007 Forensic Bioinformatics (www.bioforensics.com)

Automation of GeneScan and Genotyper Consistent electronic analysis of all samples Organizes all output – can be read on any computer Summary table of all labeled peaks Draws your attention to potential issues

The * indicates that this peak may be involved in pullup… Summary Sheet The * indicates that this peak may be involved in pullup…

Clicking here will show us the GenoTyper annotated electropherogram… Pullup Clicking here will show us the GenoTyper annotated electropherogram…

Simplified table Overall view of samples Single line display Clicking sample displays electropherogram

Project Screen Shot

Analysis Parameters: Size Standards:

Standard electropherogram Similar to laboratory output Limited view Hard to evaluate low-level results

Genotyper zoom Close view of baseline and potential low-level alleles View from 0 to 150 RFU Helps expert make an informed decision

Another example of Genophiler zoom Genophiler also provides a zoomed graph to 150 RFUs Shows smaller peaks more clearly Here, the peaks that were not reported by the lab may be evidence of an unknown, minor contributor

Sample results window All detected peaks Size Peak height Peak area Data collection point

Sample information window Time and date of analysis Analysis instrument Machine conditions Analysis parameters

Raw data window Pre-separated spectral data

Poor raw data

Other Issues: EPT Data Shows current, voltage, laser power, and temperature For good analysis, should be constant (flat)

Problematic EPT data

Side-by-side interface

Consistency report We perform two analyses for every case GeneScan default RFU threshold (50) Reproduce the testing lab’s analysis Compares analysis results between testing laboratory and ABI defaults Highlight peaks that are labeled in one analysis and not the other

Genophiler® LOD/LOQ Table Sample Run Sample Name Sample Nickname Limit of Detection (LOD) Quantitation (LOQ) Average baseline signal Baseline std. dev. Run 2_17_06 C7P16 9947a 6K11_3ul 05-2 Pos Control (2-17)-2 19 49 5.86 4.34 6K11_3ul (2-17)-1 5.72 G7C16 9947a (2-17)-C 18 48 5.82 4.18 D7P16 RB Reagent Blank (2-17) 5.75 4.20    Notes:    - LOD/LOQ derived from the methodology validated in J. Gilder, T. Doom, K. Inman, and D. Krane. "Run-specific limits of detection and quantitation for STR-based DNA testing." Journal of Forensic Sciences. 2007;52(1):97-101.    - Limit of Detection (LOD) is the average baseline signal (in RFUs) plus three standard deviations (μb + 3σ).    - Limit of Quantitation (LOQ) is the average baseline signal (in RFUs) plus ten standard deviations (μb + 10σ).    - Positive controls not exhibiting the full 9947A profile are not included in the above table.

Removing labels in GenoTyper

Default view (as found in printout) Genotyper: Show peaks that were manually removed

Genophiler CD Full electronic record of the case All results in electronic format on CD-ROM All data files, analysis parameters, projects, macros, etc. Works on any computer with a web browser Can make printouts of everything

Potential issues that are flagged

Potential issues that are flagged Mixtures Peak height imbalance Missing genotype information High peaks Low average peak height Indications of degradation Pull-up Inconsistent PP/CO results

Mixture Locus or loci containing three or more labeled peaks

Peak height imbalance Any heterozygous loci differing by more than 30% 65% Defendant

Missing genotype information At least one locus with no labeled peaks

High peak Any peak above the point of saturation (4000 RFUs) Can lead to pull-up Defendant

Low average peak height An average peak height less than 400 RFUs

Degradation Potential breakdown of DNA sample Slope calculated through best-fit linear regression Slope = -17 Defendant

Pull-up Two peaks in two different dyes occurring at approximately the same data collection point Evidence Q5

Inconsistent PP/CO results Different results in Profiler Plus and COfiler Defendant

Potential issues that are flagged Mixtures Peak height imbalance Missing genotype information High peaks Low average peak height Indications of degradation Pull-up Inconsistent PP/CO results

Genophiler report “An expert will be able to explain the significance and implications of these findings in your particular case.” “All of the statements listed below about the data in your case can be verified by any competent expert who has access to GeneScan and Genotyper software and to the data you provided to us.”

Genophiler does not Perform mixture interpretation Make allele calls Assess the quality of a profile Make any judgment calls

GenoStat DNA statistics package Random match probabilities Related random match probabilities Mixture statistics (CPI) 167 population allele frequency databases Mixture resolution Separate mixtures into contributor components Free

Standard stats mode

Random match probability Standard stats mode Random match probability

Standard stats mode Unrelated Sibling Half-Sibling Parent/Child Uncle/Nephew Cousins Standard stats mode

Standard stats mode Mixture stats

Theta substructure factor Standard stats mode Theta substructure factor

FBI population databases Standard stats mode FBI population databases

Load any of 167 population databases Standard stats mode Load any of 167 population databases

Mixture resolution mode

Mixture resolution mode One hypothesis = fully resolved

Mixture resolution mode Set peak height ratio

Mixture resolution mode Minimum peak height threshold

Mixture resolution mode Inferred mixture ratio

Partially-resolved mixture

Partially-resolved mixture Manually exclude mixture hypotheses

Partially-resolved mixture Manually exclude loci from statistics

Partially-resolved mixture Constrained CPI uses only plausible mixture hypotheses

Questions? www.bioforensics.com