Tutorial #11 by Anna Tzemach. Background – Lander & Green’s HMM Recombinations across successive intervals are independent  sequential computation across.

Slides:



Advertisements
Similar presentations
Tutorial #8 by Ma’ayan Fishelson. Computational Difficulties Algorithms that perform multipoint likelihood computations sum over all the possible ordered.
Advertisements

Lecture 19: Parallel Algorithms
. Exact Inference in Bayesian Networks Lecture 9.
PHYLOGENETIC TREES Bulent Moller CSE March 2004.
Tutorial #5 by Ma’ayan Fishelson. Input Format of Superlink There are 2 input files: –The locus file describes the loci being analyzed and parameters.
Tutorial #2 by Ma’ayan Fishelson. Crossing Over Sometimes in meiosis, homologous chromosomes exchange parts in a process called crossing-over. New combinations.
Genetic linkage analysis Dotan Schreiber According to a series of presentations by M. Fishelson.
Basics of Linkage Analysis
Parallel Genehunter: Implementation of a linkage analysis package for distributed memory architectures Michael Moran CMSC 838T Presentation May 9, 2003.
. EM algorithm and applications Lecture #9 Background Readings: Chapters 11.2, 11.6 in the text book, Biological Sequence Analysis, Durbin et al., 2001.
Tutorial #6 by Ma’ayan Fishelson Based on notes by Terry Speed.
Why do we look similar to but different than our biological siblings?
Tutorial by Ma’ayan Fishelson Changes made by Anna Tzemach.
. Bayesian Networks For Genetic Linkage Analysis Lecture #7.
. Basic Model For Genetic Linkage Analysis Lecture #3 Prepared by Dan Geiger.
. Hidden Markov Models For Genetic Linkage Analysis Lecture #4 Prepared by Dan Geiger.
CASE STUDY: Genetic Linkage Analysis via Bayesian Networks
Reconstructing Genealogies: a Bayesian approach Dario Gasbarra Matti Pirinen Mikko Sillanpää Elja Arjas Department of Mathematics and Statistics
Chapter 11.2 (Pg ): Applying Mendel’s Principles
Tutorial #5 by Ma’ayan Fishelson Changes made by Anna Tzemach.
Tutorial #5 by Ma’ayan Fishelson
General Explanation There are 2 input files –The locus file describes the loci being analyzed and parameters for the different analyzing programs. –The.
. Basic Model For Genetic Linkage Analysis Lecture #5 Prepared by Dan Geiger.
Genetic Mapping Oregon Wolfe Barley Map (Szucs et al., The Plant Genome 2, )
Week 11 - Wednesday.  What did we talk about last time?  Graphs  Euler paths and tours.
Calculation of IBD State Probabilities Gonçalo Abecasis University of Michigan.
Copyright © Cengage Learning. All rights reserved. CHAPTER 10 GRAPHS AND TREES.
Bayesian MCMC QTL mapping in outbred mice Andrew Morris, Binnaz Yalcin, Jan Fullerton, Angela Meesaq, Rob Deacon, Nick Rawlins and Jonathan Flint Wellcome.
Markov Chain Monte Carlo Hadas Barkay Anat Hashavit.
Lecture 13: Linkage Analysis VI Date: 10/08/02  Complex models  Pedigrees  Elston-Stewart Algorithm  Lander-Green Algorithm.
Tutorial #10 by Ma’ayan Fishelson. Classical Method of Linkage Analysis The classical method was parametric linkage analysis  the Lod-score method. This.
Lecture 15: Linkage Analysis VII
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
Calculation of IBD probabilities David Evans University of Oxford Wellcome Trust Centre for Human Genetics.
Probability Probability is defined as the likelihood of an event occurring. The practice of genetics uses the concept of probability to predict the outcome.
1 HMM in crosses and small pedigrees Lecture 8, Statistics 246, February 17, 2004.
Practical With Merlin Gonçalo Abecasis. MERLIN Website Reference FAQ Source.
Problem 5 Problem Set 1 Winter 2011 I II III
. Basic Model For Genetic Linkage Analysis Prepared by Dan Geiger.
Guy Grebla1 Allegro, A new computer program for linkage analysis Guy Grebla.
Unit 3: Heredity Chromosomal Inheritance
Segregation of gametes Genes are on chromosomes.
I II III Normal Vision Normal Hearing Blindness Normal Hearing Normal Vision Deafness Blindness Deafness In humans, deafness (d) and blindness.
Lecture 17: Model-Free Linkage Analysis Date: 10/17/02  IBD and IBS  IBD and linkage  Fully Informative Sib Pair Analysis  Sib Pair Analysis with Missing.
Hidden Markov Models BMI/CS 576
Gonçalo Abecasis and Janis Wigginton University of Michigan, Ann Arbor
HMM in crosses and small pedigrees, cont.
6.5 Traits and Probability
Tutorial 2 Simple examples of Bayesian networks, Queries, And the stories behind them Tal Shor.
Can resemblance (e.g. correlations) between sib pairs, or DZ twins, be modeled as a function of DNA marker sharing at a particular chromosomal location?
Whose genes are you wearing?
Even/Odd Functions 5.5.
Lecture 22: Parallel Algorithms
Week 9 Vocab Definitions
Error Checking for Linkage Analyses
Calculation of IBD probabilities
Punnett squares illustrate genetic crosses.
Chromosomal Haplotypes by Genetic Phasing of Human Families
Segregation of gametes
Basic Model For Genetic Linkage Analysis Lecture #3
Two location Linkage Determining closeness of two markers
DIHYBRID CROSSES & GENE LINKAGE
Graphs and Vertex Coloring
IBD Estimation in Pedigrees
Linkage Analysis Problems
Tutorial #6 by Ma’ayan Fishelson
Closures of Relations Epp, section 10.1,10.2 CS 202.
11-2 Probability & Punnett Squares
7.1 Trees.
Gonçalo R. Abecasis, Janis E. Wigginton 
Presentation transcript:

Tutorial #11 by Anna Tzemach

Background – Lander & Green’s HMM Recombinations across successive intervals are independent  sequential computation across loci using the forward-backward algorithm is enabled. The algorithm computing the probability of the data given an inheritance vector is linear in the number of founders. We need to sum over all possible inheritance vectors (exponential in the number of non-founders). Complexity: Linear in the number of loci, and number of founders. Exponential in the number of non-founders.

Reminder – Compute marker data probability given inheritance data Assume that the descent graph vertices below represent the pedigree on the left v = ( 1,1; 0,0; 1,1; 1,1; 1,1; 0,0 ) v = (person 12; person 13; person 21; person 22; person 23; person 24) a/b a/b 2324 b/d a/c

Reminder (cont) (b,d) (a,b) (a,c) (a,b) Founder Graph (a,b) (a,c)(b,d)(a,b) Descent Graph

Reminder (cont) Alternatively we can rewrite function as

Simultaneous calculation of all vectors 1.J =0 2.For each meiosis j –Duplicate founder states –V[j] = 0 –Add corresponding edge to the first set of the founder alleles –V[j] = 1 –Add corresponding edge to the second set of the founder alleles –If j < 2 * number of nonfounder goto 2 3.Calculate all sets probability

Possible outcome Node with zero likelihood

Cutoff of zero nodes For each founder allele a i = Ǿ For each meiosis j (person p = j/2) –k j = alleles of person p; –If a i = Ǿ (allele assignment of corresponding founder) ai = k j –Else a i = a i ∩ k j If a i = Ǿ : return 0; Goto meiosis j+1 –if j == 2* number of people compute vector probability

Possible outcome Node with zero likelihood

Cutoff of symmetric nodes The node is symmetric node if parent pi is known to be homozygous or if offspring oi and all its descendants are not genotyped If we can not distinguish between v[i] = 0 and v[i] = 1 then P(data | v = ____1_____) = P(data | v = ____0_____) We do not want to calculate such nodes twice.

Possible outcome Node with zero likelihood Node identical to sibling

Founder Reduction No way to find ordered genotype for founders So, if we look on the child of the founder, we cannot distinguish between v[i] = 0 and v[i] =1 For each founder 1 bit in inheritance vector is constant. Total time/space save 2 f

Recombination speed-up The transition probability of vector i and vector j For small thetas only small number of recombinations make sense

Example Number of non founders = 3 Number of inheritance vectors = 2 2*3 = 64 Number of founders = 3 Founder reduction = 2 3 = 8 Reduced number of vectors = = 8

Example (cont) V[1] =0 A1 = {a,b} No need for v[1] = 1 -> founder reduction V[2] =0 A3 = {a,b} No need for v[2] = 1 -> founder reduction

Example (cont) V[3] = 0 => 000 a1 = {a,b} ∩ {a,c } = {a} V[3] = 1 => 001 a3 = {a,b} ∩ {a,c } = {a} V[4] = 0 => 0000 a5 = {a,c} V[4] = 0 = > 0010 a5 = {a,c} No need for v[4] = 1 -> founder reduction