Textual Video Prediction

Slides:

Advertisements

Similar presentations

Problem Solving.  Similar to Solving Math Word Problem  Read the Problem  Decide how to go about Solving the Problem  Solve the Problem  Test the.

Advertisements

Automatic Lung Nodule Detection Using Deep Learning

Yann LeCun Other Methods and Applications of Deep Learning Yann Le Cun The Courant Institute of Mathematical Sciences New York University

Conditional Generative Adversarial Networks

Generative Adversarial Nets

Automatic Advertisement Rating

Convolutional Neural Network

Video Generation with GAN

Week 3 (June 6 – June10 , 2016) Summary :

Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek

Textual Video Prediction Week 2

Summary of Week 1 (May 23 – May 27, 2016)

Query-Focused Video Summarization – Week 1

Automatic Lung Cancer Diagnosis from CT Scans (Week 4)

CSCI 5922 Neural Networks and Deep Learning Generative Adversarial Networks Mike Mozer Department of Computer Science and Institute of Cognitive Science.

Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:

Intelligent Information System Lab

Synthesis of X-ray Projections via Deep Learning

CNN Demo LIU Pengpeng.

Authors: Jun-Yan Zhu*, Taesun Park*, Phillip Isola, Alexei A. Efros

Textual Video Prediction

Adri`a Recasens, Aditya Khosla, Carl Vondrick, Antonio Torralba

Low Dose CT Image Denoising Using WGAN and Perceptual Loss

Master’s Thesis defense Ming Du Advisor: Dr. Yi Shang

Distributed Representation of Words, Sentences and Paragraphs

Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:

Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:

Project 5: Generating Privacy and Security Threat Summary for Internet of Things REU Student: Nicole Fella Graduate Mentor: Kexin Liao Faculty Mentor:

Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:

Project # 5 Generating Privacy and Security Threat Summary for Internet of Things REU student: Domonique Cox Graduate mentors: Kaiqiang Song Faculty mentor(s):

ECE 599/692 – Deep Learning Lecture 1 - Introduction

Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:

Image recognition: Defense adversarial attacks

Word embeddings based mapping

Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:

Image to Image Translation using GANs

REU student: Domonique Cox Graduate mentors: Kaiqiang Song

REU student: Domonique Cox Graduate mentors: Kaiqiang Song

Project # 12, Smart Walker REU student: Jonathan Guilbe Graduate mentors: Sharare Zehtabian, Siavash Khodadadeh Faculty mentor(s): Dr. Turgut, Dr. Boloni.

Lip movement Synthesis from Text

View Inter-Prediction GAN: Unsupervised Representation Learning for 3D Shapes by Learning Global Shape Memories to Support Local View Predictions 1,2 1.

Attack and defense on learning-based security system

Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:

Presentation By: Eryk Helenowski PURE Mentor: Vincent Bindschaedler

Adversarial Learning for Security System

Compressive Image Recovery using Recurrent Generative Model

Abnormally Detection

Deep Learning Authors: Yann LeCun, Yoshua Bengio, Geoffrey Hinton

Attention for translation

Learn to Comment Mentor: Mahdi M. Kalayeh

Recurrent Neural Networks (RNNs)

Textual Video Prediction

Query-based video summarization

Angel A. Cantu, Nami Akazawa Department of Computer Science

Weeks 1 and 2 Aaron Ott.

UCF-REU in Computer Vision

Weak-supervision based Multi-Object Tracking

CRCV REU 2019 Kara Schatz.

Cengizhan Can Phoebe de Nooijer

Text-to-speech (TTS) Traditional approaches (before 2016) Neural TTS

CRCV REU 2019 Week 8 Aaron Honculada.

Bidirectional LSTM-CRF Models for Sequence Tagging

REU 2019 Week 2 Volodymyr Bobyr.

End-to-End Speech-Driven Facial Animation with Temporal GANs

Week 1 Overview - Cecilia La Place

Week 5 Cecilia La Place.

Anirban Laha and Vikas C. Raykar, IBM Research – India.

CS249: Neural Language Model

REU Program 2019 Week 6 Alex Ruiz Jyoti Kini.

SDSEN: Self-Refining Deep Symmetry Enhanced Network

Presentation transcript:

Textual Video Prediction REU Student: Emily Cosgrove Graduate Student: Amir Mazaheri Professor: Dr. Shah

Preliminary Overview Deep Learning, CNNs, and RNNs Computer Vision and Natural Language Processing General Adversarial Networks (GANs) Video Prediction Missing Idea?

Problem description Goal: Use NLP and textual information for video prediction Possible Contribution: Enhanced/different video prediction

Problem Description Current Video Prediction Systems: Our System: Input Frames GAN Predicted Frames Our System: Input Frames GAN Predicted Frames Input Sentence

Tasks Step 3: Prepare our measurements Step 4: Formulate our solution Step 1: Study current methods to predict videos Learn how to run and setup current method’s codes Step 2: Study datasets which have been used for video prediction so far Possibly provide textual annotations for some of them. Step 3: Prepare our measurements How do we evaluate our results? Which other methods can we compare with? Step 4: Formulate our solution Discuss ideas to solve the problem Step 5: Implementation We will use Keras or Tensorflow to implement our ideas. Step 6: Baseline experiments

Weekly Progress Introductory Meetings with Mentor Read papers related to topic General Adversarial Networks (Goodfellow) Decomposing Motion and Content for Natural Video Sequence Prediction (Ruben Villegas, et.) Began Step 1 https://github.com/tensorflow/models/tree/master/video_prediction Model we are currently working with

Research Paper: General Adversarial Networks Author: Goodfellow Generator v. Discriminator Input: Random Noise Loss Functions Discriminator Generator 𝛻 θ g 1 m 𝑖=1 𝑚 log (1 −𝐷 𝐺 𝑧 𝑖 ) 𝛻 θ d 1 m 𝑖=1 𝑚 log 𝐷 𝑥 𝑖 + log (1 −𝐷 𝐺 𝑧 𝑖 )

Next week Continue Step 1 Preprocess movie dataset Study codes for current methods Read and study paper related to code Preprocess movie dataset

References Goodfellow, Ian, et al. "Generative adversarial nets." Advances in neural information processing systems. 2014. Mathieu, Michael, Camille Couprie, and Yann LeCun. "Deep multi-scale video prediction beyond mean square error." arXiv preprint arXiv:1511.05440 (2015). Villegas, Ruben, Jimei Yang, Seunghoon Hong, Xunyun Lin, and Honglak Lee. “Decomposing Motion and Content for Natural Video Sequence Prediction.” ICLR (2017).