Slide 1: Understanding LSTM Networks
with figures from Colah's blog
Slide 2: Recurrent Neural Network
Slide 3: Recurrent Neural Network
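Not on the slides themselves, but as a concrete companion to the figures, one step of a vanilla RNN can be sketched roughly as follows (function and weight names here are illustrative, not from the deck):

    import numpy as np

    def rnn_step(x_t, h_prev, W_xh, W_hh, b_h):
        # The new hidden state mixes the current input with the
        # previous hidden state, squashed through tanh.
        return np.tanh(W_xh @ x_t + W_hh @ h_prev + b_h)

The same cell is applied at every time step, which is what the unrolled diagrams depict.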
Slide 4: Long-Term Dependencies
Example: "The clouds are in the sky." Predicting the last word needs only nearby context, so the gap between the relevant information and the point where it is needed is small.
Slide 5: Longer-Term Dependencies
When that gap grows large, standard RNNs struggle to learn the connection.
Slide 6: LSTM comes in! (Long Short-Term Memory)
Figure: this is just a standard RNN, shown for comparison.
Slide 7: LSTM comes in! (Long Short-Term Memory)
Figures: this is the LSTM; the other is just a standard RNN.
Slide 8: Overall Architecture
Figure labels: (cell) state → next (cell) state; forget gate, input gate, output gate; hidden state → next hidden state; input; output = hidden state.
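Putting those labels together, here is a minimal sketch of one full LSTM step, assuming the standard formulation with a single fused weight matrix (all names are illustrative):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def lstm_step(x_t, h_prev, c_prev, W, b):
        # W maps the concatenated [h_prev, x_t] to the stacked
        # pre-activations of the four internal layers; H is the hidden size.
        z = W @ np.concatenate([h_prev, x_t]) + b
        H = h_prev.shape[0]
        f = sigmoid(z[0*H:1*H])         # forget gate
        i = sigmoid(z[1*H:2*H])         # input gate
        g = np.tanh(z[2*H:3*H])         # candidate cell state
        o = sigmoid(z[3*H:4*H])         # output gate
        c_t = f * c_prev + i * g        # next (cell) state
        h_t = o * np.tanh(c_t)          # next hidden state = output
        return h_t, c_t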
Slide 9: The Core Idea
Slide 10: Step-by-Step
Forget gate: decide what information we're going to throw away from the cell state.
Input gate: decide what new information we're going to store in the cell state.
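A sketch of just these two decisions, using separate per-gate weights as in Colah's post (weight names are illustrative):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def forget_and_input(x_t, h_prev, W_f, b_f, W_i, b_i, W_c, b_c):
        hx = np.concatenate([h_prev, x_t])
        f_t = sigmoid(W_f @ hx + b_f)      # 0..1 per entry: what to throw away
        i_t = sigmoid(W_i @ hx + b_i)      # 0..1 per entry: what to store
        c_tilde = np.tanh(W_c @ hx + b_c)  # candidate values to add to the state
        return f_t, i_t, c_tilde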
Slide 11: Step-by-Step
Update (cell state): update, scaled by how much we decide to keep or add: new_state = forget_gate*prev_state + input_gate*candidate_state.
Output gate (hidden state): output a filtered version of the updated state: output_gate*tanh(new_state).
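A sketch of these last two steps, reusing the gate values from the previous snippet (the output-gate weights are again illustrative):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def update_and_output(x_t, h_prev, c_prev, f_t, i_t, c_tilde, W_o, b_o):
        c_t = f_t * c_prev + i_t * c_tilde  # forget old information, add new
        o_t = sigmoid(W_o @ np.concatenate([h_prev, x_t]) + b_o)  # output gate
        h_t = o_t * np.tanh(c_t)            # output a filtered view of the state
        return c_t, h_t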
Slide 12: Again
Figure labels: (cell) state → next (cell) state; forget gate, input gate, output gate; hidden state → next hidden state; input; output.
Slide 13: Gated Recurrent Unit
Cho, Kyunghyun, et al. "Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation." arXiv preprint arXiv:1406.1078 (2014).
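The GRU step is not written out on the slide; here is a sketch following the convention in Colah's post (the original paper swaps the roles of z_t and 1 - z_t; biases omitted for brevity, weight names illustrative):

    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def gru_step(x_t, h_prev, W_z, W_r, W_h):
        hx = np.concatenate([h_prev, x_t])
        z_t = sigmoid(W_z @ hx)   # update gate: merges forget and input gates
        r_t = sigmoid(W_r @ hx)   # reset gate: how much past to use for the candidate
        h_tilde = np.tanh(W_h @ np.concatenate([r_t * h_prev, x_t]))  # candidate
        return (1.0 - z_t) * h_prev + z_t * h_tilde  # no separate cell state

Compared with the LSTM, the GRU merges the cell state and hidden state and combines the forget and input gates into a single update gate.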