Textual Video Prediction

Slides:



Advertisements
Similar presentations
Problem Solving.  Similar to Solving Math Word Problem  Read the Problem  Decide how to go about Solving the Problem  Solve the Problem  Test the.
Advertisements

Automatic Lung Nodule Detection Using Deep Learning
Yann LeCun Other Methods and Applications of Deep Learning Yann Le Cun The Courant Institute of Mathematical Sciences New York University
Conditional Generative Adversarial Networks
Generative Adversarial Nets
Automatic Advertisement Rating
Convolutional Neural Network
Video Generation with GAN
Week 3 (June 6 – June10 , 2016) Summary :
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Textual Video Prediction Week 2
Summary of Week 1 (May 23 – May 27, 2016)
Query-Focused Video Summarization – Week 1
Automatic Lung Cancer Diagnosis from CT Scans (Week 4)
CSCI 5922 Neural Networks and Deep Learning Generative Adversarial Networks Mike Mozer Department of Computer Science and Institute of Cognitive Science.
Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:
Intelligent Information System Lab
Synthesis of X-ray Projections via Deep Learning
CNN Demo LIU Pengpeng.
Authors: Jun-Yan Zhu*, Taesun Park*, Phillip Isola, Alexei A. Efros
Textual Video Prediction
Adri`a Recasens, Aditya Khosla, Carl Vondrick, Antonio Torralba
Low Dose CT Image Denoising Using WGAN and Perceptual Loss
Master’s Thesis defense Ming Du Advisor: Dr. Yi Shang
Distributed Representation of Words, Sentences and Paragraphs
Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:
Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:
Project 5: Generating Privacy and Security Threat Summary for Internet of Things REU Student: Nicole Fella Graduate Mentor: Kexin Liao Faculty Mentor:
Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:
Project # 5 Generating Privacy and Security Threat Summary for Internet of Things REU student: Domonique Cox Graduate mentors: Kaiqiang Song Faculty mentor(s):
ECE 599/692 – Deep Learning Lecture 1 - Introduction
Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:
Image recognition: Defense adversarial attacks
Word embeddings based mapping
Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:
Image to Image Translation using GANs
REU student: Domonique Cox Graduate mentors: Kaiqiang Song
REU student: Domonique Cox Graduate mentors: Kaiqiang Song
Project # 12, Smart Walker REU student: Jonathan Guilbe Graduate mentors: Sharare Zehtabian, Siavash Khodadadeh Faculty mentor(s): Dr. Turgut, Dr. Boloni.
Lip movement Synthesis from Text
View Inter-Prediction GAN: Unsupervised Representation Learning for 3D Shapes by Learning Global Shape Memories to Support Local View Predictions 1,2 1.
Attack and defense on learning-based security system
Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:
Presentation By: Eryk Helenowski PURE Mentor: Vincent Bindschaedler
Adversarial Learning for Security System
Compressive Image Recovery using Recurrent Generative Model
Abnormally Detection
Deep Learning Authors: Yann LeCun, Yoshua Bengio, Geoffrey Hinton
Attention for translation
Learn to Comment Mentor: Mahdi M. Kalayeh
Recurrent Neural Networks (RNNs)
Textual Video Prediction
Query-based video summarization
Angel A. Cantu, Nami Akazawa Department of Computer Science
Weeks 1 and 2 Aaron Ott.
UCF-REU in Computer Vision
Weak-supervision based Multi-Object Tracking
CRCV REU 2019 Kara Schatz.
Cengizhan Can Phoebe de Nooijer
Text-to-speech (TTS) Traditional approaches (before 2016) Neural TTS
CRCV REU 2019 Week 8 Aaron Honculada.
Bidirectional LSTM-CRF Models for Sequence Tagging
REU 2019 Week 2 Volodymyr Bobyr.
End-to-End Speech-Driven Facial Animation with Temporal GANs
Week 1 Overview - Cecilia La Place
Week 5 Cecilia La Place.
Anirban Laha and Vikas C. Raykar, IBM Research – India.
CS249: Neural Language Model
REU Program 2019 Week 6 Alex Ruiz Jyoti Kini.
SDSEN: Self-Refining Deep Symmetry Enhanced Network
Presentation transcript:

Textual Video Prediction REU Student: Emily Cosgrove Graduate Student: Amir Mazaheri Professor: Dr. Shah

Preliminary Overview Deep Learning, CNNs, and RNNs Computer Vision and Natural Language Processing General Adversarial Networks (GANs) Video Prediction Missing Idea?

Problem description Goal: Use NLP and textual information for video prediction Possible Contribution: Enhanced/different video prediction

Problem Description Current Video Prediction Systems: Our System: Input Frames   GAN Predicted Frames Our System: Input Frames   GAN Predicted Frames Input Sentence

Tasks Step 3: Prepare our measurements Step 4: Formulate our solution Step 1: Study current methods to predict videos Learn how to run and setup current method’s codes Step 2: Study datasets which have been used for video prediction so far Possibly provide textual annotations for some of them. Step 3: Prepare our measurements How do we evaluate our results? Which other methods can we compare with? Step 4: Formulate our solution Discuss ideas to solve the problem Step 5: Implementation We will use Keras or Tensorflow to implement our ideas. Step 6: Baseline experiments

Weekly Progress Introductory Meetings with Mentor Read papers related to topic General Adversarial Networks (Goodfellow) Decomposing Motion and Content for Natural Video Sequence Prediction (Ruben Villegas, et.) Began Step 1 https://github.com/tensorflow/models/tree/master/video_prediction Model we are currently working with

Research Paper: General Adversarial Networks Author: Goodfellow Generator v. Discriminator Input: Random Noise Loss Functions Discriminator Generator 𝛻 θ g 1 m 𝑖=1 𝑚 log (1 −𝐷 𝐺 𝑧 𝑖 ) 𝛻 θ d 1 m 𝑖=1 𝑚 log 𝐷 𝑥 𝑖 + log (1 −𝐷 𝐺 𝑧 𝑖 )

Next week Continue Step 1 Preprocess movie dataset Study codes for current methods Read and study paper related to code Preprocess movie dataset

References Goodfellow, Ian, et al. "Generative adversarial nets." Advances in neural information processing systems. 2014. Mathieu, Michael, Camille Couprie, and Yann LeCun. "Deep multi-scale video prediction beyond mean square error." arXiv preprint arXiv:1511.05440 (2015). Villegas, Ruben, Jimei Yang, Seunghoon Hong, Xunyun Lin, and Honglak Lee. “Decomposing Motion and Content for Natural Video Sequence Prediction.” ICLR (2017).