Download presentation
Presentation is loading. Please wait.
1
Textual Video Prediction
REU Student: Emily Cosgrove Graduate Student: Amir Mazaheri Professor: Dr. Shah
2
Preliminary Overview Deep Learning, CNNs, and RNNs
Computer Vision and Natural Language Processing General Adversarial Networks (GANs) Video Prediction Missing Idea?
3
Problem description Goal: Use NLP and textual information for video prediction Possible Contribution: Enhanced/different video prediction
4
Problem Description Current Video Prediction Systems: Our System:
Input Frames GAN Predicted Frames Our System: Input Frames GAN Predicted Frames Input Sentence
5
Tasks Step 3: Prepare our measurements Step 4: Formulate our solution
Step 1: Study current methods to predict videos Learn how to run and setup current methodβs codes Step 2: Study datasets which have been used for video prediction so far Possibly provide textual annotations for some of them. Step 3: Prepare our measurements How do we evaluate our results? Which other methods can we compare with? Step 4: Formulate our solution Discuss ideas to solve the problem Step 5: Implementation We will use Keras or Tensorflow to implement our ideas. Step 6: Baseline experiments
6
Weekly Progress Introductory Meetings with Mentor
Read papers related to topic General Adversarial Networks (Goodfellow) Decomposing Motion and Content for Natural Video Sequence Prediction (Ruben Villegas, et.) Began Step 1 Model we are currently working with
7
Research Paper: General Adversarial Networks
Author: Goodfellow Generator v. Discriminator Input: Random Noise Loss Functions Discriminator Generator π» ΞΈ g 1 m π=1 π log (1 βπ· πΊ π§ π ) π» ΞΈ d 1 m π=1 π log π· π₯ π + log (1 βπ· πΊ π§ π )
8
Next week Continue Step 1 Preprocess movie dataset
Study codes for current methods Read and study paper related to code Preprocess movie dataset
9
References Goodfellow, Ian, et al. "Generative adversarial nets." Advances in neural information processing systems Mathieu, Michael, Camille Couprie, and Yann LeCun. "Deep multi-scale video prediction beyond mean square error." arXiv preprint arXiv: (2015). Villegas, Ruben, Jimei Yang, Seunghoon Hong, Xunyun Lin, and Honglak Lee. βDecomposing Motion and Content for Natural Video Sequence Prediction.β ICLR (2017).
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.