Algebraic Statistics for Computational Biology Lior Pachter and Bernd Sturmfels Ch.5: Parametric Inference R. Mihaescu Παρουσίαση: Aγγελίνα Βιδάλη Αλγεβρικοί.

Slides:



Advertisements
Similar presentations
Lecture 16 Hidden Markov Models. HMM Until now we only considered IID data. Some data are of sequential nature, i.e. have correlations have time. Example:
Advertisements

Hidden Markov Models (1)  Brief review of discrete time finite Markov Chain  Hidden Markov Model  Examples of HMM in Bioinformatics  Estimations Basic.
1 Hidden Markov Model Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520.
Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.
Lecture 8: Hidden Markov Models (HMMs) Michael Gutkin Shlomi Haba Prepared by Originally presented at Yaakov Stein’s DSPCSP Seminar, spring 2002 Modified.
Introduction to Hidden Markov Models
Tutorial on Hidden Markov Models.
Hidden Markov Models.
Hidden Markov Models Bonnie Dorr Christof Monz CMSC 723: Introduction to Computational Linguistics Lecture 5 October 6, 2004.
 CpG is a pair of nucleotides C and G, appearing successively, in this order, along one DNA strand.  CpG islands are particular short subsequences in.
Hidden Markov Models Modified from:
Page 1 Hidden Markov Models for Automatic Speech Recognition Dr. Mike Johnson Marquette University, EECE Dept.
Statistical NLP: Lecture 11
Hidden Markov Models Theory By Johan Walters (SR 2003)
Statistical NLP: Hidden Markov Models Updated 8/12/2005.
Hidden Markov Models Fundamentals and applications to bioinformatics.
Hidden Markov Models in NLP
GS 540 week 6. HMM basics Given a sequence, and state parameters: – Each possible path through the states has a certain probability of emitting the sequence.
… Hidden Markov Models Markov assumption: Transition model:
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
Hidden Markov Model 11/28/07. Bayes Rule The posterior distribution Select k with the largest posterior distribution. Minimizes the average misclassification.
Hidden Markov Models Pairwise Alignments. Hidden Markov Models Finite state automata with multiple states as a convenient description of complex dynamic.
Tropical Geometry for Biology Lior Pachter and Bernd Sturmfels Department of Mathematics U.C. Berkeley.
Hidden Markov Models I Biology 162 Computational Genetics Todd Vision 14 Sep 2004.
Part 4 b Forward-Backward Algorithm & Viterbi Algorithm CSE717, SPRING 2008 CUBS, Univ at Buffalo.
Hidden Markov Model Special case of Dynamic Bayesian network Single (hidden) state variable Single (observed) observation variable Transition probability.
Hidden Markov Models. Hidden Markov Model In some Markov processes, we may not be able to observe the states directly.
Face Recognition Using Embedded Hidden Markov Model.
Lior, Bernd & Seth Algebraic Statistics for Computational Biology.
Timothy and RahulE6886 Project1 Statistically Recognize Faces Based on Hidden Markov Models Presented by Timothy Hsiao-Yi Chin Rahul Mody.
Hidden Markov Models K 1 … 2. Outline Hidden Markov Models – Formalism The Three Basic Problems of HMMs Solutions Applications of HMMs for Automatic Speech.
Conditional Random Fields
Hidden Markov Models Usman Roshan BNFO 601. Hidden Markov Models Alphabet of symbols: Set of states that emit symbols from the alphabet: Set of probabilities.
Part 4 c Baum-Welch Algorithm CSE717, SPRING 2008 CUBS, Univ at Buffalo.
Hidden Markov Models David Meir Blei November 1, 1999.
Hidden Markov Models 戴玉書
Hidden Markov Models. Hidden Markov Model In some Markov processes, we may not be able to observe the states directly.
Dynamic Time Warping Applications and Derivation
Annotation and Alignment of the Drosophila Genomes.
Visual Recognition Tutorial1 Markov models Hidden Markov models Forward/Backward algorithm Viterbi algorithm Baum-Welch estimation algorithm Hidden.
. Class 5: Hidden Markov Models. Sequence Models u So far we examined several probabilistic model sequence models u These model, however, assumed that.
Parametric Inference for Biological Sequence Analysis Lior Pachter and Bernd Sturmfels Mathematics Dept., U.C. Berkeley.
1 Markov Chains. 2 Hidden Markov Models 3 Review Markov Chain can solve the CpG island finding problem Positive model, negative model Length? Solution:
Probabilistic Sequence Alignment BMI 877 Colin Dewey February 25, 2014.
Computer vision: models, learning and inference
CS344 : Introduction to Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 21- Forward Probabilities and Robotic Action Sequences.
Fundamentals of Hidden Markov Model Mehmet Yunus Dönmez.
Hidden Markov Models Yves Moreau Katholieke Universiteit Leuven.
Hidden Markov Models Usman Roshan CS 675 Machine Learning.
CS Statistical Machine learning Lecture 24
1 CONTEXT DEPENDENT CLASSIFICATION  Remember: Bayes rule  Here: The class to which a feature vector belongs depends on:  Its own value  The values.
Pattern Recognition and Machine Learning-Chapter 13: Sequential Data
1 Hidden Markov Model Observation : O1,O2,... States in time : q1, q2,... All states : s1, s2,... Si Sj.
CS Statistical Machine learning Lecture 25 Yuan (Alan) Qi Purdue CS Nov
1 Hidden Markov Model Observation : O1,O2,... States in time : q1, q2,... All states : s1, s2,..., sN Si Sj.
Hidden Markov Models. A Hidden Markov Model consists of 1.A sequence of states {X t |t  T } = {X 1, X 2,..., X T }, and 2.A sequence of observations.
Definition of the Hidden Markov Model A Seminar Speech Recognition presentation A Seminar Speech Recognition presentation October 24 th 2002 Pieter Bas.
Other Models for Time Series. The Hidden Markov Model (HMM)
4.2 - Algorithms Sébastien Lemieux Elitra Canada Ltd.
Visual Recognition Tutorial1 Markov models Hidden Markov models Forward/Backward algorithm Viterbi algorithm Baum-Welch estimation algorithm Hidden.
1 Hidden Markov Model Xiaole Shirley Liu STAT115, STAT215.
Hidden Markov Models BMI/CS 576
Hidden Markov Models.
Today.
Statistical Models for Automatic Speech Recognition
Hidden Markov Autoregressive Models
Hidden Markov Model LR Rabiner
CONTEXT DEPENDENT CLASSIFICATION
Algorithms of POS Tagging
Hidden Markov Models By Manish Shrivastava.
Presentation transcript:

Algebraic Statistics for Computational Biology Lior Pachter and Bernd Sturmfels Ch.5: Parametric Inference R. Mihaescu Παρουσίαση: Aγγελίνα Βιδάλη Αλγεβρικοί & Γεωμετρικοί Αλγόριθμοι στη Μοριακή Βιολογία Διδάσκων: Ι. Εμίρης

Convenient algebraic structure for stating dynamic programming algorithms: the tropical semiring Tropical arithmetic (Convex hull) (Minkowski sum) The polytope agebra ( P d natural higher-dimensional generalization:

Inference From Observed random variables Y 1 = σ 1,…,Y n = σ n we want to infer values for the Hidden random variables Χ 1,…,Χ m : Unknown biological data, i.e.: How do two sequences allign? MAP estimation: given an observation σ 1,…,σ n which is the most probable explanation X 1 =h 1,…, Χ m =h m ? Model parameters give transition probabilities p hσ : hidden state hσ observed state Observation: σ 1,…,σ n : Known biological data

Observation: σ 1,…,σ n We want to compute an explanation for the observation: the sequence h 1,…,h m which yields the maximum a prosteriori probability (MAP): We can efficiently compute the marginal probabilities: Hidden Markov Model (HMM)

Computation of the marginal probabilities: p σ has the decomposition which gives the “Forward algorithm”. Markov chain: Independent probabilities

Viterbi algorithm problem of computing p σ Tropicalization:u ij =-log(p’ ij )v ij =-log(p ij ) We can now efficiently find an explanation h 1,…,h m for the observation σ 1,…,σ n using the recursion: It is again the Forward algorithm.

Pair Hidden Markov Model (pHMM) The algebraic statistical model for sequence alignment, known as the pair hidden Markov model, is the image of the map where A n,m is the set of all alignments of the sequences σ 1, σ 2.

The Needleman-Wunsch algorithm for finding the shortest path in the alignment graph is the tropicalization of the pair hidden Markov model for sequence allignment. gttta- gt--gc g t g c gttta Example: n=5, m=4 **

The polytope propagation algorithm Tropical sum-product algorithm in general fashion. f is the density function for a statistical model. From the d monomials find the one that maximizes Solution: Tropicalization: w i =-logp i & Computation in the ploytope algebra

Density function for a statistical model: f(p 1,p 2 )=p 1 3 +p 1 2 p 2 2 +p 1 p 2 2 +p 1 +p 2 4 Find the index j of the monomial that minimizes the function e j. w. Find an explanation Find the index j of the monomial with maximal value Tropicalization: w i =-logp i

Explanations are vertices of the Newton Polytope of f p13p13 p11p11 f(p 1,p 2 )=p 1 3 +p 1 2 p 2 2 +p 1 p 2 2 +p 1 +p 2 4 we find a point for each exponent vector of a monomial

Normal fan The normal fan partitions the parameter space into regions such that: the explanation(s) for all sets of parameters in a given region is given by the polytope vertex(face) associated to that region.

Parametric MAP estimation problem Local: given a choice of parameters determine the set of all parameters with the same MAP estimate. Solution: Computation of the normal cone of the Newton Polytope. Global: asks for a partition of the space of parameters such that any two parameters lie in the same part iff they yield the same MAP estimate. Solution: Computation of the normal fan of the Newton Polytope.