Textual Video Prediction Week 2

Slides:

Advertisements

Similar presentations

Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting 卷积LSTM网络:利用机器学习预测短期降雨施行健香港科技大学 VALSE 2016/03/23.

Advertisements

A Hierarchical Deep Temporal Model for Group Activity Recognition

Automatic Advertisement Ratings Discussion Methods Problem and Motivation The goal is to automatically generate an objective score or ranking for an advertisement.

Gist of Achieving Human Parity in Conversational Speech Recognition

Deep Learning for Dual-Energy X-Ray

Automatic Advertisement Rating

Project #5: Generating Privacy and Security Threat Summary for Internet of Things REU Student: Ray Yan Graduate mentors: Logan Lebanoff Faculty Mentor(s):

Week 3 (June 6 – June10 , 2016) Summary :

Summary of Week 1 (May 23 – May 27, 2016)

Query-Focused Video Summarization – Week 1

Automatic Lung Cancer Diagnosis from CT Scans (Week 4)

Week 8 REU Nolan Warner.

Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:

CNN Demo LIU Pengpeng.

Week 6 Cecilia La Place.

Summary Presentation.

Single Image Super-Resolution

Efficient Deep Model for Monocular Road Segmentation

Textual Video Prediction

Multiple Organ Detection in CT Volumes using CNN Week 4

Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:

Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:

Attention-based Caption Description Mun Jonghwan.

Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:

Recurrent Neural Networks

Project # 5 Generating Privacy and Security Threat Summary for Internet of Things REU student: Domonique Cox Graduate mentors: Kaiqiang Song Faculty mentor(s):

Week 6 Fatemeh Yazdiananari.

Machine Learning in Laparoscopy

Lip movement Synthesis from Text

Project 1: Smart Home REU student: Jason Ling Graduate mentors: Safa Bacanli Faculty mentor(s): Damla Turgut Week 8 (July 2 – July ) Accomplishments:

Week 1 (May 23 – May 27, 2016) Accomplishments:

Zhedong Zheng, Liang Zheng and Yi Yang

Textual Video Prediction

Airport Parking Space Navigation

Video Imagination from a Single Image with Transformation Generation

TPGAN overview.

Abnormally Detection

Learn to Comment Mentor: Mahdi M. Kalayeh

Rachit Saluja 13th Feb SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference Rowan Zeller, Yonatan.

Chuan Wang1, Haibin Huang1, Xiaoguang Han2, Jue Wang1

Automatic Handwriting Generation

Textual Video Prediction

Query-based video summarization

REU - End to End Self Driving Car

Weeks 1 and 2 Aaron Ott.

Week 3 Presentation Ngoc Ta Aidean Sharghi.

Week 3: Moving Target Detection Using Infrared Sensors

Deep screen image crop and enhance

Multi-UAV to UAV Tracking

Deep screen image crop and enhance

Weak-supervision based Multi-Object Tracking

CRCV REU 2019 Kara Schatz.

Cengizhan Can Phoebe de Nooijer

Week 3 Volodymyr Bobyr.

Report 2 Brandon Silva.

Deep screen image crop and enhance

REU 2019 Week 2 Volodymyr Bobyr.

End-to-End Speech-Driven Facial Animation with Temporal GANs

Week 7 Presentation Ngoc Ta Aidean Sharghi

Additional text exploring the video clip.

Deep screen image crop and enhance

Multi-Target Detection and Tracking of UAVs from a UAV

Week 1 Overview - Cecilia La Place

Self-Supervised Cross-View Action Synthesis

Week 6: Moving Target Detection Using Infrared Sensors

CRCV REU 2019 Aaron Honculada.

Deep screen image crop and enhance

Deep screen image crop and enhance

Week 4: Moving Target Detection Using Infrared Sensors

Presentation transcript:

Textual Video Prediction Week 2 REU Student: Emily Cosgrove Graduate Student: Amir Mazaheri Professor: Dr. Shah

Project Overview Video Prediction Generative Adversarial Networks (GANs) Textual information for video prediction Goal: Enhanced video prediction

Progress Large Scale Movie Description Challenge (LSMDC) challenge dataset 128,000 videos Each 2-20 seconds Comes with text Clipped each video into 1 second clips (about 30 frames) Now have 359,000 video clips Text coming with each We use standard training-validation-test split (used in LSMDC- 2016) 90% data as training, 5% validation, and 5% testing

Annotation: trying to lighten the mood. Someone smiles sheepishly

1 2 3 4 5 6 LSTM LSTM LSTM LSTM LSTM LSTM CNN CNN CNN CNN CNN CNN CNN DE-CONV DE-CONV DE-CONV LSTM LSTM LSTM LSTM LSTM LSTM CNN CNN CNN CNN CNN CNN CNN CNN CNN CNN CNN 1 2 3 4 5 6

Progress Two Research Papers Generative Adversarial Text to Image Synthesis (Scott Reed, et.) Uses GANs Developed a model to generate images based on textual description Convolutional LSTM Network: A Machine Learning Approach to Precipitation Nowcasting (Xingjian Shi, et.) Precipitation Nowcasting: challenging weather forecasting problem Proposed convolutional LSTM (ConvLSTM)

Next Week Make model more complete by adding Adversarial loss and text