Thomas Jellema & Wouter Van Gool 1 Question. 2Answer.

Slides:



Advertisements
Similar presentations
Numbers Treasure Hunt Following each question, click on the answer. If correct, the next page will load with a graphic first – these can be used to check.
Advertisements

1 A B C
Scenario: EOT/EOT-R/COT Resident admitted March 10th Admitted for PT and OT following knee replacement for patient with CHF, COPD, shortness of breath.
Variations of the Turing Machine
Angstrom Care 培苗社 Quadratic Equation II
AP STUDY SESSION 2.
1
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Author: Julia Richards and R. Scott Hawley
Properties Use, share, or modify this drill on mathematic properties. There is too much material for a single class, so you’ll have to select for your.
Objectives: Generate and describe sequences. Vocabulary:
UNITED NATIONS Shipment Details Report – January 2006.
David Burdett May 11, 2004 Package Binding for WS CDL.
We need a common denominator to add these fractions.
1 RA I Sub-Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Casablanca, Morocco, 20 – 22 December 2005 Status of observing programmes in RA I.
Properties of Real Numbers CommutativeAssociativeDistributive Identity + × Inverse + ×
Local Customization Chapter 2. Local Customization 2-2 Objectives Customization Considerations Types of Data Elements Location for Locally Defined Data.
Process a Customer Chapter 2. Process a Customer 2-2 Objectives Understand what defines a Customer Learn how to check for an existing Customer Learn how.
Custom Services and Training Provider Details Chapter 4.
CALENDAR.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt BlendsDigraphsShort.
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt RhymesMapsMathInsects.
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
1. PHOTO INDEX Bayside: Page 5-7 Other Colour Leon: Page 8-10 Cabrera Page Canaria Page Driftwood Page 16 Florence Florence and Corfu Page.
Break Time Remaining 10:00.
Turing Machines.
Table 12.1: Cash Flows to a Cash and Carry Trading Strategy.
PP Test Review Sections 6-1 to 6-6
1 The Blue Café by Chris Rea My world is miles of endless roads.
Bright Futures Guidelines Priorities and Screening Tables
EIS Bridge Tool and Staging Tables September 1, 2009 Instructor: Way Poteat Slide: 1.
Bellwork Do the following problem on a ½ sheet of paper and turn in.
CS 6143 COMPUTER ARCHITECTURE II SPRING 2014 ACM Principles and Practice of Parallel Programming, PPoPP, 2006 Panel Presentations Parallel Processing is.
Exarte Bezoek aan de Mediacampus Bachelor in de grafische en digitale media April 2014.
TESOL International Convention Presentation- ESL Instruction: Developing Your Skills to Become a Master Conductor by Beth Clifton Crumpler by.
Sample Service Screenshots Enterprise Cloud Service 11.3.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
Basel-ICU-Journal Challenge18/20/ Basel-ICU-Journal Challenge8/20/2014.
1..
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
Adding Up In Chunks.
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
Artificial Intelligence
1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.
Subtraction: Adding UP
: 3 00.
5 minutes.
1 hi at no doifpi me be go we of at be do go hi if me no of pi we Inorder Traversal Inorder traversal. n Visit the left subtree. n Visit the node. n Visit.
Analyzing Genes and Genomes
1 Let’s Recapitulate. 2 Regular Languages DFAs NFAs Regular Expressions Regular Grammars.
Speak Up for Safety Dr. Susan Strauss Harassment & Bullying Consultant November 9, 2012.
Essential Cell Biology
Converting a Fraction to %
Clock will move after 1 minute
PSSA Preparation.
Copyright © 2013 Pearson Education, Inc. All rights reserved Chapter 11 Simple Linear Regression.
Essential Cell Biology
Immunobiology: The Immune System in Health & Disease Sixth Edition
Physics for Scientists & Engineers, 3rd Edition
Energy Generation in Mitochondria and Chlorplasts
Select a time to count down from the clock above
Copyright Tim Morris/St Stephen's School
1.step PMIT start + initial project data input Concept Concept.
How to create Magic Squares
1 Decidability continued…. 2 Theorem: For a recursively enumerable language it is undecidable to determine whether is finite Proof: We will reduce the.
Copyright (c) 2002 by SNU CSE Biointelligence Lab 1 Chap. 4 Pairwise alignment using HMMs Biointelligence Laboratory School of Computer Sci. & Eng. Seoul.
Presentation transcript:

Thomas Jellema & Wouter Van Gool 1 Question

2Answer

3 Pairwise alignment using HMMs Wouter van Gool and Thomas Jellema

Thomas Jellema & Wouter Van Gool 4 Contents Most probable path Thomas Probability of an alignment Thomas Sub-optimal alignments Thomas Pause Posterior probability that xi is aligned to yi Wouter Pair HMMs versus FSAs for searchingWouter Conclusion and summaryWouter Questions Pairwise alignment using HMMs

Thomas Jellema & Wouter Van Gool Most probable path Model that emits a single sequene

Thomas Jellema & Wouter Van Gool Most probable path Begin and end state

Thomas Jellema & Wouter Van Gool Most probable path Model that emits a pairwise alignment

Thomas Jellema & Wouter Van Gool Most probable path Example of a sequence Seq1: A C T _ C Seq2: T _ G G C All : M X M Y M

Thomas Jellema & Wouter Van Gool Most probable path Begin and end state

Thomas Jellema & Wouter Van Gool Most probable path Finding the most probable path - The path you choose is the path that has the highest probability of being the correct alignment. - The state we choose to be part of the alignment has to be the state with the highest probability of being correct. - We calculate the probability of the state being a M, X or Y and choose the one with the highest probability - If the probability of ending the alignment is higher then the next state being a M, X or Y then we end the alignment

Thomas Jellema & Wouter Van Gool Most probable path The probability of emmiting an M is the highest probability of: 1 previous state X new state M 2 previous state Y new state M 3 previous state M new state M

Thomas Jellema & Wouter Van Gool Most probable path Probability of going to the M state

Thomas Jellema & Wouter Van Gool Most probable path Viterbi algorithm for pair HMMs

Thomas Jellema & Wouter Van Gool Most probable path Finding the most probable path using FSAs -The most probable path is also the optimal FSA alignment

Thomas Jellema & Wouter Van Gool Most probable path Finding the most probable path using FSAs

Thomas Jellema & Wouter Van Gool Most probable path Recurrence relations

Thomas Jellema & Wouter Van Gool Most probable path We wish to know if the alignment score is above or below the score of random alignment. The log-odds ratio s(a,b) = log (p ab / q a q b ). log (p ab / q a q b )>0 iff the probability that a and b are related by our model is larger than the probability that they are picked at random. The log odds scoring function

Thomas Jellema & Wouter Van Gool Most probable path Random model

Thomas Jellema & Wouter Van Gool 19 1 EN D η1- ηY η X ENDYX 1 τε 1-ε -τ Y τ ε X τδδ 1-2δ - τ M END YXM “Model” “Random” 4.1 Most probable path

Thomas Jellema & Wouter Van Gool Most probable path Transitions

Thomas Jellema & Wouter Van Gool Most probable path Transitions

Thomas Jellema & Wouter Van Gool Most probable path Optimal log-odds alignment

Thomas Jellema & Wouter Van Gool Most probable path A pair HMM for local alignment

Thomas Jellema & Wouter Van Gool 24 Contents Most probable path Thomas Probability of an alignment Thomas Sub-optimal alignments Thomas Pause Posterior probability that xi is aligned to yi Wouter Pair HMMs versus FSAs for searchingWouter Conclusion and summaryWouter Questions Pairwise alignment using HMMs

Thomas Jellema & Wouter Van Gool Probability of an allignment Probability that a given pair of sequences are related.

Thomas Jellema & Wouter Van Gool Probability of an allignment Summing the probabilities

Thomas Jellema & Wouter Van Gool Probability of an allignment

Thomas Jellema & Wouter Van Gool 28 Contents Most probable path Thomas Probability of an alignment Thomas Sub-optimal alignments Thomas Pause Posterior probability that xi is aligned to yi Posterior probability that xi is aligned to yi Wouter Pair HMMs versus FSAs for searchingWouter Conclusion and summaryWouter Questions Pairwise alignment using HMMs

Thomas Jellema & Wouter Van Gool Suboptimal alignment Finding suboptimal alignments How to make sample alignments?

Thomas Jellema & Wouter Van Gool Suboptimal alignment Finding distinct suboptimal alignments

Thomas Jellema & Wouter Van Gool 31 Contents Most probable path Thomas Probability of an alignment Thomas Sub-optimal alignments Thomas Pause Posterior probability that xi is aligned to yi Wouter ExampleWouter Pair HMMs versus FSAs for searchingWouter Conclusion or summaryWouter Questions Pairwise alignment using HMMs

Thomas Jellema & Wouter Van Gool 32 Contents Most probable path Thomas Probability of an alignment Thomas Sub-optimal alignments Thomas Pause Posterior probability that xi is aligned to yi Wouter Pair HMMs versus FSAs for searchingWouter Conclusion and summaryWouter Questions Pairwise alignment using HMMs

Thomas Jellema & Wouter Van Gool 33 Posterior probability that x i is aligned to y i Local accuracy of an alignment? Reliability measure for each part of an alignment HMM as a local alignment measure Idea: P(all alignments trough (x i,y i )) P(all alignments of (x,y))

Thomas Jellema & Wouter Van Gool 34 Posterior probability that x i is aligned to y i Notation: x i ◊ y i means x i is aligned to y i

Thomas Jellema & Wouter Van Gool 35 Posterior probability that x i is aligned to y i

Thomas Jellema & Wouter Van Gool 36 Posterior probability that x i is aligned to y i

Thomas Jellema & Wouter Van Gool 37 Probability alignment Miyazawa: it seems attractive to find alignment by maximising P(x i ◊ y i ) May lead to inconsistencies: e.g. pairs (i 1,i 1 ) & (i 2,j 2 ) i 2 > i 1 and j 1 < j 2 Restriction to pairs (i,j) for which P(x i ◊ y i )>0.5

Thomas Jellema & Wouter Van Gool 38 Posterior probability that x i is aligned to y i The expected accuracy of an alignment  Expected overlap between π and paths sampled from the posterior distribution  Dynamic programming

Thomas Jellema & Wouter Van Gool 39 Contents Most probable path Thomas Probability of an alignment Thomas Sub-optimal alignments Thomas Pause Posterior probability that xi is aligned to yi Wouter Pair HMMs versus FSAs for searchingWouter Conclusion and summaryWouter Questions Pairwise alignment using HMMs

Thomas Jellema & Wouter Van Gool 40 Contents Most probable path Thomas Probability of an alignment Thomas Sub-optimal alignments Thomas Pause Posterior probability that xi is aligned to yi Wouter Pair HMMs versus FSAs for searchingWouter Conclusion and summaryWouter Questions Pairwise alignment using HMMs

Thomas Jellema & Wouter Van Gool 41 Pair HMMs versus FSAs for searching P(D | M) > P(M | D) HMM: maximum data likelihood by giving the same parameters (i.e. transition and emission probabilities) Bayesian model comparison with random model R

Thomas Jellema & Wouter Van Gool 42 Pair HMMs versus FSAs for searching Problems: 1. Most algorithms do not compute full probability P(x,y | M) but only best match or Viterbi path 2. FSA parameters may not be readily translated into probabilities

Thomas Jellema & Wouter Van Gool 43 Pair HMMs vs FSAs for searching Example: a model whose parameters match the data need not be the best model abacqaqa S B α 1-α P S (abac) = α 4 q a q b q a q c P B (abac) = 1-α Model comparison using the best match rather than the total probability

Thomas Jellema & Wouter Van Gool 44 Pair HMMs vs FSAs for searching Problem: no fixed scaling procedure can make the scores of this model into the log probabilities of an HMM

Thomas Jellema & Wouter Van Gool 45 Pair HMMs vs FSAs for searching Bayesian model comparision: both HMMs have same log-odds ratio as previous FSA

Thomas Jellema & Wouter Van Gool 46 Pair HMMs vs FSAs for searching Conversion FSA into probabilistic model – Probabilistic models may underperform standard alignment methods if Viterbi is used for database searching. – Buf if forward algorithm is used, it would be better than standard methods.

Thomas Jellema & Wouter Van Gool 47 Contents Most probable path Thomas Probability of an alignment Thomas Sub-optimal alignments Thomas Pause Posterior probability that xi is aligned to yi Wouter ExampleWouter Pair HMMs versus FSAs for searchingWouter Conclusion and summaryWouter Questions Pairwise alignment using HMMs

Thomas Jellema & Wouter Van Gool 48 Why try to use HMMs? Many complicated alignment algorithms can be described as simple Finite State Machines. HMMs have many advantages: - Parameters can be trained to fit the data: no need for PAM/BLOSSUM matrices - HMMs can keep track of all alignments, not just the best one

Thomas Jellema & Wouter Van Gool 49 New things HMMs we can do with pair HMMs Compute probability over all alignments. Compute relative probability of Viterbi alignment (or any other alignment). Sample over all alignments in proportion to their probability. Find distinct sub-optimal alignments. Compute reliability of each part of the best alignment. Compute the maximally reliable alignment.

Thomas Jellema & Wouter Van Gool 50 Conclusion Pairs-HMM work better for sequence alignment and database search than penalty score based alignment algorithms. Unfortunately both approaches are O(mn) and hence too slow for large database searches!

Thomas Jellema & Wouter Van Gool 51 Contents Most probable path Thomas Probability of an alignment Thomas Sub-optimal alignments Thomas Pause Posterior probability that xi is aligned to yi Wouter Pair HMMs versus FSAs for searchingWouter Conclusion or summaryWouter Questions Pairwise alignment using HMMs