CSE 5290: Algorithms for Bioinformatics Fall 2011


CSE 5290: Algorithms for Bioinformatics, Fall 2011
Suprakash Datta (datta@cse.yorku.ca)
Office: CSEB 3043, Phone: 416-736-2100 ext 77875
Course page: http://www.cse.yorku.ca/course/5290

Next: Markov chains and hidden Markov models
Some of the following slides are based on slides by the authors of our text.

Markov chains
- The simplest probabilistic model with states
- A finite state machine with probabilistic state transitions
- Useful for modeling many, many systems

Example 0: Random walks

Example 1: Modeling exons and introns
- What is the probability of seeing a T after an A?
- Such dinucleotide probabilities differ between exons and introns, so they can be used to tell the two apart.

Example 2: The casino problem
- The dealer switches probabilistically between a fair coin F and an unfair (biased) coin B.
- We define the probabilities: P(H|F) = P(T|F) = ½; P(H|B) = ¾, P(T|B) = ¼.
- The crooked dealer changes between F and B with probability 10%.

Example 3: CpG islands
- The CG dinucleotide is typically modified by methylation; the methylated C is then more likely to mutate to a T.
- Methylation is suppressed in short stretches (clusters), typically around genes, so CpG occurs more frequently in these islands.
- Finding the CpG islands in a genome is therefore an important problem.

The Markov property
- “Memoryless” state transitions (note: memoryless is not the same as stateless).
- The path taken to the current state has no influence on the current (or future) transitions.

Probabilities of paths
- Paths are sequences of states.
- The Markov property simplifies the chain rule:
  P(x1 x2 … xn) = P(xn | x1 x2 … xn-1) · P(x1 x2 … xn-1)
                = P(xn | xn-1) · P(x1 x2 … xn-1)
                = a_{n-1,n} · P(x1 x2 … xn-1)
                = …
                = P(x1) · a_{1,2} · … · a_{n-3,n-2} · a_{n-2,n-1} · a_{n-1,n}
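
As a small illustration of this factorization, the sketch below multiplies an initial probability by successive transition probabilities. The two-state chain and its numbers are hypothetical values chosen for the example, not part of the lecture.

```python
# Chain-rule probability of a path through a two-state Markov chain.
# The states, initial distribution and transition matrix are hypothetical
# illustration values, not taken from the lecture.
init = {"F": 0.5, "B": 0.5}                     # P(x1)
trans = {("F", "F"): 0.9, ("F", "B"): 0.1,      # a_{i,i+1} = P(x_{i+1} | x_i)
         ("B", "F"): 0.1, ("B", "B"): 0.9}

def path_probability(path):
    """P(x1 x2 ... xn) = P(x1) * a_{1,2} * ... * a_{n-1,n}."""
    p = init[path[0]]
    for prev, cur in zip(path, path[1:]):
        p *= trans[(prev, cur)]
    return p

print(path_probability(["F", "F", "B", "B"]))   # 0.5 * 0.9 * 0.1 * 0.9 = 0.0405
```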

Graph-theoretic formulation
- Directed graph: nodes are states, edge weights are transition probabilities.
- The structure of the graph determines the properties of the chain.

Linear-algebraic formulation
Relationship between the state probability vectors in successive time steps:
  X_{t+1} = A X_t

Fundamental questions
- When is there a “steady state”?
- What is the state probability vector at steady state?
- How quickly does the Markov chain approach steady state?

Answers
- Existence of a steady state can be formulated in terms of graph properties (e.g., all pairs of states reachable from each other).
- The steady-state probability vector is easier to obtain from the linear-algebraic formulation: solve π = A π.
- Speed of convergence is governed by the second-largest eigenvalue of A.
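
The sketch below illustrates these three points numerically with NumPy on a hypothetical 2×2 column-stochastic matrix; the matrix and the number of iterations are illustration choices, not course material.

```python
import numpy as np

# Hypothetical column-stochastic transition matrix A (columns sum to 1),
# so that x_{t+1} = A x_t as on the previous slide.
A = np.array([[0.9, 0.1],
              [0.1, 0.9]])
x = np.array([1.0, 0.0])            # start in state 0 with certainty

for _ in range(50):                 # propagate the state distribution
    x = A @ x
print(x)                            # approaches the steady state

# The steady state pi solves pi = A pi: the eigenvector of A for eigenvalue 1.
vals, vecs = np.linalg.eig(A)
pi = np.real(vecs[:, np.argmax(np.real(vals))])
pi = pi / pi.sum()
print(pi)                           # [0.5, 0.5] for this symmetric example

# Speed of convergence is governed by the second-largest eigenvalue modulus.
print(sorted(np.abs(vals))[-2])     # 0.8 here
```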

Hidden Markov Models (HMMs)
Motivating task: determining when the casino switches from the fair coin to the unfair coin, or vice versa.

The Fair Bet Casino Problem
- Any observed outcome of coin tosses could have been generated by any sequence of states!
- We need a way to score different state sequences differently.
- This leads to the Decoding Problem.

P(x|fair coin) vs. P(x|biased coin)
Suppose first that the dealer never changes coins. Some definitions:
- P(x|fair coin): probability of the dealer using the F coin and generating the outcome x.
- P(x|biased coin): probability of the dealer using the B coin and generating the outcome x.

P(x|fair coin) vs. P(x|biased coin)
  P(x|fair coin) = P(x1…xn|fair coin) = Π_{i=1..n} p(xi|fair coin) = (1/2)^n
  P(x|biased coin) = P(x1…xn|biased coin) = Π_{i=1..n} p(xi|biased coin) = (3/4)^k (1/4)^(n-k) = 3^k / 4^n
where k is the number of Heads in x.

P(x|fair coin) vs. P(x|biased coin)
When are the two models equally likely?
  P(x|fair coin) = P(x|biased coin)
  (1/2)^n = 3^k / 4^n
  2^n = 3^k
  n = k·log2(3), i.e. k = n / log2(3) ≈ 0.63 n
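
A quick numerical check of this crossover point, using the fair/biased probabilities from the slides; the sequence length and head counts below are arbitrary illustration values.

```python
import math

# P(x | fair) = (1/2)^n and P(x | biased) = 3^k / 4^n, where k = number of heads in x.
n = 100
print(n / math.log2(3))                # ~ 63.1: the head count where both models tie

for k in (55, 63, 75):                 # compare log-probabilities for a few head counts
    log_fair = n * math.log2(1 / 2)
    log_biased = k * math.log2(3) - n * math.log2(4)
    print(k, "fair" if log_fair > log_biased else "biased")
```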

Log-odds Ratio
We define the log-odds ratio as:
  log2( P(x|fair coin) / P(x|biased coin) ) = Σ_{i=1..n} log2( p(xi|fair coin) / p(xi|biased coin) ) = n − k·log2(3)

Computing the Log-odds Ratio in Sliding Windows
- Consider a sliding window over the outcome sequence x1 x2 … xn and compute the log-odds ratio for each short window: a positive value means the fair coin was most likely used, a negative value means the biased coin was most likely used.
- Disadvantages:
  - the length of a CpG island (or of a biased-coin stretch) is not known in advance;
  - different windows may classify the same position differently.
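
A minimal sketch of this windowed classification for the coin-toss setting; the window length and the example sequence are arbitrary choices made for illustration.

```python
import math

def window_log_odds(x, w=10):
    """log2 P(window|fair) - log2 P(window|biased) for every window of x (1 = Heads)."""
    scores = []
    for i in range(len(x) - w + 1):
        k = x[i:i + w].count("1")            # heads in this window
        scores.append(w - k * math.log2(3))  # n - k*log2(3), with n = w
    return scores

# positive score: fair coin more likely for that window; negative: biased coin
print([round(s, 2) for s in window_log_odds("11111011110010001000")])
```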

Hidden Markov Model (HMM)
- Can be viewed as an abstract machine with k hidden states that emits symbols from an alphabet Σ.
- Each state has its own transition and emission probability distributions, and the machine switches states and emits symbols according to them.
- While in a certain state, the machine makes two decisions:
  - What state should I move to next?
  - What symbol, from the alphabet Σ, should I emit?

Why “Hidden”?
- Observers can see the emitted symbols of an HMM but cannot observe which state the HMM is currently in.
- Thus, the goal is to infer the most likely sequence of hidden states from the given sequence of emitted symbols.

HMM Parameters
- Σ: set of emission characters.
  Examples: Σ = {H, T} for coin tossing; Σ = {1, 2, 3, 4, 5, 6} for dice rolling.
- Q: set of hidden states, each emitting symbols from Σ.
  Example: Q = {F, B} for coin tossing.

HMM Parameters (cont’d)
- A = (a_kl): a |Q| × |Q| matrix of the probabilities of changing from state k to state l.
  a_FF = 0.9, a_FB = 0.1, a_BF = 0.1, a_BB = 0.9
- E = (e_k(b)): a |Q| × |Σ| matrix of the probabilities of emitting symbol b while in state k.
  e_F(0) = ½, e_F(1) = ½, e_B(0) = ¼, e_B(1) = ¾

HMM for the Fair Bet Casino
The Fair Bet Casino in HMM terms:
- Σ = {0, 1} (0 for Tails, 1 for Heads)
- Q = {F, B} (F for the Fair coin, B for the Biased coin)

Transition probabilities A:
            Fair          Biased
  Fair      a_FF = 0.9    a_FB = 0.1
  Biased    a_BF = 0.1    a_BB = 0.9

Emission probabilities E:
            Tails (0)     Heads (1)
  Fair      e_F(0) = ½    e_F(1) = ½
  Biased    e_B(0) = ¼    e_B(1) = ¾
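
For reference in the sketches that follow, here is the same model written out as plain Python data structures (a direct transcription of the parameters above):

```python
# The Fair Bet Casino HMM as plain Python data structures.
states = ("F", "B")                          # Q: Fair and Biased coins
alphabet = ("0", "1")                        # Sigma: 0 = Tails, 1 = Heads

trans = {("F", "F"): 0.9, ("F", "B"): 0.1,   # A = (a_kl)
         ("B", "F"): 0.1, ("B", "B"): 0.9}

emit = {("F", "0"): 0.5, ("F", "1"): 0.5,    # E = (e_k(b))
        ("B", "0"): 0.25, ("B", "1"): 0.75}

# sanity check: every row of A and of E sums to 1
for k in states:
    assert abs(sum(trans[(k, l)] for l in states) - 1) < 1e-9
    assert abs(sum(emit[(k, b)] for b in alphabet) - 1) < 1e-9
```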

HMM for Fair Bet Casino (cont’d)
[Figure: state diagram of the HMM for the Fair Bet Casino Problem]

Hidden Paths
- A path π = π1…πn in the HMM is defined as a sequence of states.
- Consider the path π = FFFBBBBBFFF and the sequence x = 01011101001.

  x              0     1     0     1     1     1     0     1     0     0     1
  π              F     F     F     B     B     B     B     B     F     F     F
  P(xi|πi)       ½     ½     ½     ¾     ¾     ¾     ¼     ¾     ½     ½     ½
  P(πi-1 → πi)   ½     9/10  9/10  1/10  9/10  9/10  9/10  9/10  1/10  9/10  9/10

P(xi|πi) is the probability that xi was emitted from state πi; P(πi-1 → πi) is the transition probability from state πi-1 to state πi (the first entry is the probability of starting in π1).

P(x|π) Calculation
P(x|π): probability that sequence x was generated by the path π:
  P(x|π) = P(π0 → π1) · Π_{i=1..n} [ P(xi|πi) · P(πi → πi+1) ]
         = a_{π0,π1} · Π_{i=1..n} [ e_{πi}(xi) · a_{πi,πi+1} ]

P(x|π) Calculation (cont’d)
  P(x|π) = P(π0 → π1) · Π_{i=1..n} [ P(xi|πi) · P(πi → πi+1) ]
         = a_{π0,π1} · Π_{i=1..n} [ e_{πi}(xi) · a_{πi,πi+1} ]
         = Π_{i=0..n} e_{πi+1}(xi+1) · a_{πi,πi+1}    (if we count from i = 0 instead of i = 1)
Here πn+1 is a fictitious end state from which there is a null emission.
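
A worked check of this formula on the path and sequence from the Hidden Paths slide. The begin transition is taken as ½ and the transition into the fictitious end state is ignored (treated as 1), which matches the table on that slide; both choices are assumptions made for this illustration.

```python
# P(x, pi) for x = 01011101001 and pi = FFFBBBBBFFF under the casino HMM.
x = "01011101001"
pi = "FFFBBBBBFFF"

trans = {("F", "F"): 0.9, ("F", "B"): 0.1, ("B", "F"): 0.1, ("B", "B"): 0.9}
emit = {("F", "0"): 0.5, ("F", "1"): 0.5, ("B", "0"): 0.25, ("B", "1"): 0.75}

p = 0.5                                    # a_{begin, pi_1}, taken as 1/2
for i in range(len(x)):
    p *= emit[(pi[i], x[i])]               # e_{pi_i}(x_i)
    if i + 1 < len(x):
        p *= trans[(pi[i], pi[i + 1])]     # a_{pi_i, pi_{i+1}}
print(p)                                   # ~ 2.7e-6 for this path
```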

Decoding Problem
Goal: find an optimal hidden path of states given the observations.
Input: a sequence of observations x = x1…xn generated by an HMM M(Σ, Q, A, E).
Output: a path π that maximizes P(x|π) over all possible paths.

Building Manhattan paths for the Decoding Problem
- Andrew Viterbi used the Manhattan grid model to solve the Decoding Problem.
- Every choice of π = π1…πn corresponds to a path in the graph.
- The only valid direction in the graph is eastward.
- The graph has |Q|^2·(n−1) edges.

Edit Graph for the Decoding Problem
[Figure: the edit graph for the Decoding Problem]

Decoding Problem vs. Alignment Problem
[Figure: valid directions in the alignment problem vs. valid directions in the decoding problem]

Decoding Problem as Finding a Longest Path in a DAG
- The Decoding Problem is reduced to finding a longest path in the directed acyclic graph (DAG) above.
- Note: the length of a path is defined as the product of its edge weights, not their sum.

Decoding Problem (cont’d)
- Every path π in the graph has an associated probability P(x|π).
- The Viterbi algorithm finds the path that maximizes P(x|π) among all possible paths.
- The Viterbi algorithm runs in O(n·|Q|^2) time.

Decoding Problem: weights of edges
Consider the edge from node (k, i) to node (l, i+1). The weight w is given by: ???

Decoding Problem: weights of edges
Recall that P(x|π) = Π_{i=0..n} e_{πi+1}(xi+1) · a_{πi,πi+1}.
For the edge from (k, i) to (l, i+1), the weight w is given by: ??

Decoding Problem: weights of edges
The i-th term of the product is e_{πi+1}(xi+1) · a_{πi,πi+1}.
For the edge from (k, i) to (l, i+1), the weight w is given by: ?

Decoding Problem: weights of edges
The i-th term is e_{πi+1}(xi+1) · a_{πi,πi+1} = e_l(xi+1) · a_kl for πi = k, πi+1 = l.
Therefore the weight of the edge from (k, i) to (l, i+1) is w = e_l(xi+1) · a_kl.

Decoding Problem and Dynamic Programming
  s_{l,i+1} = max_{k ∈ Q} { s_{k,i} · (weight of the edge between (k,i) and (l,i+1)) }
            = max_{k ∈ Q} { s_{k,i} · a_kl · e_l(xi+1) }
            = e_l(xi+1) · max_{k ∈ Q} { s_{k,i} · a_kl }

Decoding Problem (cont’d)
Initialization: s_{begin,0} = 1 and s_{k,0} = 0 for k ≠ begin.
Let π* be the optimal path. Then P(x|π*) = max_{k ∈ Q} { s_{k,n} · a_{k,end} }.

Viterbi Algorithm
- The value of the product can become extremely small, which leads to numerical underflow.
- To avoid underflow, use log values instead:
  s_{l,i+1} = log e_l(xi+1) + max_{k ∈ Q} { s_{k,i} + log(a_kl) }
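
A minimal log-space Viterbi sketch for the casino HMM. The transition and emission probabilities are those given earlier; the uniform start distribution over {F, B} is an assumption made for this sketch (the slides use an abstract begin state instead).

```python
import math

states = ("F", "B")
start = {"F": 0.5, "B": 0.5}       # assumed start distribution
trans = {("F", "F"): 0.9, ("F", "B"): 0.1, ("B", "F"): 0.1, ("B", "B"): 0.9}
emit = {("F", "0"): 0.5, ("F", "1"): 0.5, ("B", "0"): 0.25, ("B", "1"): 0.75}

def viterbi(x):
    # s[l] = best log-probability over all paths ending in state l after the prefix so far
    s = {l: math.log(start[l]) + math.log(emit[(l, x[0])]) for l in states}
    back = []                                 # back-pointers for the traceback
    for c in x[1:]:
        ptr, new_s = {}, {}
        for l in states:
            best = max(states, key=lambda k: s[k] + math.log(trans[(k, l)]))
            ptr[l] = best
            new_s[l] = math.log(emit[(l, c)]) + s[best] + math.log(trans[(best, l)])
        back.append(ptr)
        s = new_s
    last = max(states, key=lambda k: s[k])    # best final state
    path = [last]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    return "".join(reversed(path)), s[last]   # most probable path and its log-probability

print(viterbi("01011101001"))
```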

A variant of the problem
We want to know the most likely state at any given point in time.

Forward-Backward Problem
Given: a sequence of coin tosses generated by an HMM.
Goal: find the probability that the dealer was using the biased coin at a particular time.

Forward Algorithm
Define the forward probability f_{k,i} as the probability of emitting the prefix x1…xi and ending in state πi = k.
The recurrence for the forward algorithm:
  f_{k,i} = e_k(xi) · Σ_{l ∈ Q} f_{l,i-1} · a_lk
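
A minimal sketch of this recurrence for the casino HMM; the uniform start distribution over {F, B} is again an assumption made for the sketch.

```python
states = ("F", "B")
start = {"F": 0.5, "B": 0.5}       # assumed start distribution
trans = {("F", "F"): 0.9, ("F", "B"): 0.1, ("B", "F"): 0.1, ("B", "B"): 0.9}
emit = {("F", "0"): 0.5, ("F", "1"): 0.5, ("B", "0"): 0.25, ("B", "1"): 0.75}

def forward(x):
    """Return the table of forward probabilities, one dict {state: f} per position."""
    f = [{k: start[k] * emit[(k, x[0])] for k in states}]
    for c in x[1:]:
        prev = f[-1]
        f.append({k: emit[(k, c)] * sum(prev[l] * trans[(l, k)] for l in states)
                  for k in states})
    return f

f = forward("01011101001")
print(sum(f[-1].values()))   # P(x): the sum of the forward values at the last position
```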

Backward Algorithm
- The forward probability is not the only factor affecting P(πi = k|x); this may seem counterintuitive.
- The sequence of transitions and emissions that the HMM undergoes between positions i+1 and n also affects P(πi = k|x).
[Figure: the sequence split at xi into a forward part and a backward part]

Backward Algorithm (cont’d)
Define the backward probability b_{k,i} as the probability of being in state πi = k and emitting the suffix xi+1…xn.
The recurrence for the backward algorithm:
  b_{k,i} = Σ_{l ∈ Q} e_l(xi+1) · b_{l,i+1} · a_kl

Forward-Backward Algorithm
The probability that the dealer used the biased coin at any moment i:
  P(πi = k | x) = P(x, πi = k) / P(x) = f_k(i) · b_k(i) / P(x)
where P(x) is the sum of P(x, πi = k) over all k.
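
A sketch of this posterior computation, combining the forward and backward recurrences for the casino HMM (the uniform start distribution is again an assumption):

```python
states = ("F", "B")
start = {"F": 0.5, "B": 0.5}       # assumed start distribution
trans = {("F", "F"): 0.9, ("F", "B"): 0.1, ("B", "F"): 0.1, ("B", "B"): 0.9}
emit = {("F", "0"): 0.5, ("F", "1"): 0.5, ("B", "0"): 0.25, ("B", "1"): 0.75}

def forward(x):
    f = [{k: start[k] * emit[(k, x[0])] for k in states}]
    for c in x[1:]:
        prev = f[-1]
        f.append({k: emit[(k, c)] * sum(prev[l] * trans[(l, k)] for l in states)
                  for k in states})
    return f

def backward(x):
    b = [{k: 1.0 for k in states}]                     # b_{k,n} = 1 for every k
    for c in reversed(x[1:]):                          # c = x_{i+1} while filling row i
        nxt = b[0]
        b.insert(0, {k: sum(emit[(l, c)] * nxt[l] * trans[(k, l)] for l in states)
                     for k in states})
    return b

x = "01011101001"
f, b = forward(x), backward(x)
px = sum(f[-1].values())                               # P(x)
posterior_biased = [f[i]["B"] * b[i]["B"] / px for i in range(len(x))]
print([round(p, 3) for p in posterior_biased])         # P(dealer used biased coin at i)
```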

How are these computed?
By an iterative procedure, an instance of Expectation Maximization (EM).

Next… Profile HMMs

Profile Representation of Protein Families
- Aligned DNA sequences can be represented by a 4 × n profile matrix reflecting the frequencies of nucleotides at every aligned position.
- Similarly, a protein family can be represented by a 20 × n profile matrix reflecting the frequencies of amino acids.

Profiles and HMMs
- HMMs can also be used for aligning a sequence against a profile representing a protein family.
- A 20 × n profile P corresponds to n sequentially linked match states M1,…,Mn in the profile HMM of P.

Profile HMM
[Figure: a profile HMM]

Building a profile HMM
- A multiple alignment is used to construct the HMM model.
- Assign each alignment column to a Match state in the HMM; add Insertion and Deletion states.
- Estimate the emission probabilities from the amino acid counts in each column; different positions in the protein will have different emission probabilities.
- Estimate the transition probabilities between the Match, Deletion and Insertion states.
- The HMM is then trained to derive the optimal parameters.

States of a Profile HMM
- Match states M1…Mn (plus begin/end states)
- Insertion states I0, I1, …, In
- Deletion states D1…Dn

Transition Probabilities in a Profile HMM
- log(a_MI) + log(a_IM) = gap initiation penalty
- log(a_II) = gap extension penalty

Emission Probabilities in a Profile HMM
Probability of emitting a symbol a at an insertion state Ij:
  e_{Ij}(a) = p(a)
where p(a) is the frequency of occurrence of the symbol a in all the sequences.

Profile HMM Alignment
- Define v^M_j(i) as the log-likelihood score of the best path for matching x1…xi to the profile HMM, ending with xi emitted by state Mj.
- v^I_j(i) and v^D_j(i) are defined similarly.

Profile HMM Alignment: Dynamic Programming

  v^M_j(i) = log( e_{Mj}(xi) / p(xi) ) + max of:
               v^M_{j-1}(i-1) + log(a_{Mj-1, Mj})
               v^I_{j-1}(i-1) + log(a_{Ij-1, Mj})
               v^D_{j-1}(i-1) + log(a_{Dj-1, Mj})

  v^I_j(i) = log( e_{Ij}(xi) / p(xi) ) + max of:
               v^M_j(i-1) + log(a_{Mj, Ij})
               v^I_j(i-1) + log(a_{Ij, Ij})
               v^D_j(i-1) + log(a_{Dj, Ij})

Paths in the Edit Graph and the Profile HMM
[Figure: a path through an edit graph and the corresponding path through a profile HMM]

HMM Parameter Estimation
- So far, we have assumed that the transition and emission probabilities are known.
- However, in most HMM applications the probabilities are not known, and estimating them is hard.

HMM Parameter Estimation Problem
Given: an HMM with states and alphabet (emission characters), and independent training sequences x1, …, xm.
Find: HMM parameters Θ (that is, the a_kl and e_k(b)) that maximize P(x1, …, xm | Θ), the joint probability of the training sequences.

Maximizing the likelihood
- P(x1, …, xm | Θ), viewed as a function of Θ, is called the likelihood of the model.
- The training sequences are assumed independent, therefore P(x1, …, xm | Θ) = Π_i P(x^i | Θ).
- The parameter estimation problem seeks the Θ that maximizes Π_i P(x^i | Θ).
- In practice the log-likelihood is computed to avoid underflow errors.

Two situations
- Known paths for the training sequences:
  - CpG islands are marked on the training sequences
  - one evening the casino dealer lets us see when he changes coins
- Unknown paths:
  - CpG islands are not marked
  - we do not see when the casino dealer changes coins

Known paths
- A_kl = number of times the transition k → l is taken in the training sequences
- E_k(b) = number of times b is emitted from state k in the training sequences
- Compute a_kl and e_k(b) as maximum-likelihood estimators:
  a_kl = A_kl / Σ_{l'} A_kl'        e_k(b) = E_k(b) / Σ_{b'} E_k(b')
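
A small sketch of these counts and estimators; the short training pair below is a made-up toy example, not data from the course.

```python
from collections import Counter

x  = "0101110100"      # toy observed sequence
pi = "FFFBBBBBFF"      # toy known state path of the same length

A = Counter(zip(pi, pi[1:]))     # A_kl: counts of transitions k -> l
E = Counter(zip(pi, x))          # E_k(b): counts of emitting b while in state k

states, symbols = sorted(set(pi)), sorted(set(x))
# a_kl = A_kl / sum_l' A_kl'   and   e_k(b) = E_k(b) / sum_b' E_k(b')
# (with pseudocounts, as on the next slide, r_kl and r_k(b) would be added to A and E first)
a = {(k, l): A[(k, l)] / sum(A[(k, m)] for m in states) for k in states for l in states}
e = {(k, b): E[(k, b)] / sum(E[(k, c)] for c in symbols) for k in states for b in symbols}
print(a)
print(e)
```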

Pseudocounts
- Some state k may not appear in any of the training sequences. Then A_kl = 0 for every state l, and a_kl cannot be computed with the given equation.
- To avoid this overfitting, use predetermined pseudocounts r_kl and r_k(b):
  A_kl = (number of transitions k → l) + r_kl
  E_k(b) = (number of emissions of b from state k) + r_k(b)
- The pseudocounts reflect our prior biases about the probability values.

Unknown paths: Baum-Welch
Idea:
- Guess initial values for the parameters (art and experience, not science).
- Estimate new (better) values for the parameters (how?).
- Repeat until a stopping criterion is met (what criterion?).

Better values for the parameters
- We would need the A_kl and E_k(b) values, but we cannot count them (the path is unknown), and we do not want to commit to a single most probable path.
- Instead, for all states k, l, symbols b and training sequences x, compute A_kl and E_k(b) as expected values given the current parameters.

Notation
For any sequence of characters x emitted along some unknown path π, denote by πi = k the assumption that the state at position i (in which xi is emitted) is k.

The Baum-Welch algorithm
Initialization: pick the best guess for the model parameters (or arbitrary values).
Iteration:
- run the forward algorithm for each x
- run the backward algorithm for each x
- calculate A_kl and E_k(b)
- calculate the new a_kl and e_k(b)
- calculate the new log-likelihood
Repeat until the log-likelihood does not change much.
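
A compact sketch of one Baum-Welch iteration for the casino HMM, computing the expected counts from the forward and backward tables and re-normalizing. The start distribution, the initial parameter guesses and the toy training sequence are assumptions made for illustration; in practice the step would be repeated until the log-likelihood stops improving.

```python
states = ("F", "B")
start = {"F": 0.5, "B": 0.5}       # assumed start distribution

def forward(x, a, e):
    f = [{k: start[k] * e[(k, x[0])] for k in states}]
    for c in x[1:]:
        prev = f[-1]
        f.append({k: e[(k, c)] * sum(prev[l] * a[(l, k)] for l in states) for k in states})
    return f

def backward(x, a, e):
    b = [{k: 1.0 for k in states}]
    for c in reversed(x[1:]):
        nxt = b[0]
        b.insert(0, {k: sum(e[(l, c)] * nxt[l] * a[(k, l)] for l in states) for k in states})
    return b

def baum_welch_step(seqs, a, e, symbols=("0", "1")):
    A = {kl: 0.0 for kl in a}                     # expected transition counts A_kl
    E = {kb: 0.0 for kb in e}                     # expected emission counts E_k(b)
    for x in seqs:
        f, b = forward(x, a, e), backward(x, a, e)
        px = sum(f[-1].values())                  # P(x | current parameters)
        for i in range(len(x)):
            for k in states:
                E[(k, x[i])] += f[i][k] * b[i][k] / px
                if i + 1 < len(x):
                    for l in states:
                        A[(k, l)] += f[i][k] * a[(k, l)] * e[(l, x[i + 1])] * b[i + 1][l] / px
    new_a = {(k, l): A[(k, l)] / sum(A[(k, m)] for m in states) for k in states for l in states}
    new_e = {(k, s): E[(k, s)] / sum(E[(k, t)] for t in symbols) for k in states for s in symbols}
    return new_a, new_e

# one iteration on a toy sequence, starting from a rough parameter guess
a0 = {("F", "F"): 0.8, ("F", "B"): 0.2, ("B", "F"): 0.2, ("B", "B"): 0.8}
e0 = {("F", "0"): 0.5, ("F", "1"): 0.5, ("B", "0"): 0.3, ("B", "1"): 0.7}
print(baum_welch_step(["01011101001"], a0, e0))
```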

Baum-Welch analysis
- The log-likelihood increases with each iteration.
- Baum-Welch is a particular case of the EM (expectation maximization) algorithm.
- It converges to a local maximum; the choice of initial parameters determines which local maximum the algorithm converges to.