Textual Video Prediction Week 2

Slides:



Advertisements
Similar presentations
Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting 卷积LSTM网络:利用机器学习预测短期降雨 施行健 香港科技大学 VALSE 2016/03/23.
Advertisements

A Hierarchical Deep Temporal Model for Group Activity Recognition
Automatic Advertisement Ratings Discussion Methods Problem and Motivation The goal is to automatically generate an objective score or ranking for an advertisement.
Gist of Achieving Human Parity in Conversational Speech Recognition
Deep Learning for Dual-Energy X-Ray
Automatic Advertisement Rating
Project #5: Generating Privacy and Security Threat Summary for Internet of Things REU Student: Ray Yan Graduate mentors: Logan Lebanoff Faculty Mentor(s):
Week 3 (June 6 – June10 , 2016) Summary :
Summary of Week 1 (May 23 – May 27, 2016)
Query-Focused Video Summarization – Week 1
Automatic Lung Cancer Diagnosis from CT Scans (Week 4)
Week 8 REU Nolan Warner.
Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:
CNN Demo LIU Pengpeng.
Week 6 Cecilia La Place.
Summary Presentation.
Single Image Super-Resolution
Efficient Deep Model for Monocular Road Segmentation
Textual Video Prediction
Multiple Organ Detection in CT Volumes using CNN Week 4
Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:
Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:
Attention-based Caption Description Mun Jonghwan.
Project 7: Modeling Social Network Structures and their Dynamic Evolutions with User- Generated Data from IoT REU Student: Emma Ambrosini Graduate mentors:
Recurrent Neural Networks
Project # 5 Generating Privacy and Security Threat Summary for Internet of Things REU student: Domonique Cox Graduate mentors: Kaiqiang Song Faculty mentor(s):
Week 6 Fatemeh Yazdiananari.
Papers 15/08.
Machine Learning in Laparoscopy
Lip movement Synthesis from Text
Project 1: Smart Home REU student: Jason Ling Graduate mentors: Safa Bacanli Faculty mentor(s): Damla Turgut Week 8 (July 2 – July ) Accomplishments:
Week 1 (May 23 – May 27, 2016) Accomplishments:
Zhedong Zheng, Liang Zheng and Yi Yang
Textual Video Prediction
Airport Parking Space Navigation
Video Imagination from a Single Image with Transformation Generation
TPGAN overview.
Abnormally Detection
Learn to Comment Mentor: Mahdi M. Kalayeh
Rachit Saluja 13th Feb SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference Rowan Zeller, Yonatan.
Chuan Wang1, Haibin Huang1, Xiaoguang Han2, Jue Wang1
Automatic Handwriting Generation
Textual Video Prediction
Query-based video summarization
REU - End to End Self Driving Car
Weeks 1 and 2 Aaron Ott.
Week 3 Presentation Ngoc Ta Aidean Sharghi.
Week 3: Moving Target Detection Using Infrared Sensors
Deep screen image crop and enhance
Multi-UAV to UAV Tracking
Deep screen image crop and enhance
Weak-supervision based Multi-Object Tracking
CRCV REU 2019 Kara Schatz.
Cengizhan Can Phoebe de Nooijer
Week 3 Volodymyr Bobyr.
Report 2 Brandon Silva.
Deep screen image crop and enhance
REU 2019 Week 2 Volodymyr Bobyr.
End-to-End Speech-Driven Facial Animation with Temporal GANs
Week 7 Presentation Ngoc Ta Aidean Sharghi
Additional text exploring the video clip.
Deep screen image crop and enhance
Multi-Target Detection and Tracking of UAVs from a UAV
Week 1 Overview - Cecilia La Place
Self-Supervised Cross-View Action Synthesis
Week 6: Moving Target Detection Using Infrared Sensors
CRCV REU 2019 Aaron Honculada.
Deep screen image crop and enhance
Deep screen image crop and enhance
Week 4: Moving Target Detection Using Infrared Sensors
Presentation transcript:

Textual Video Prediction Week 2 REU Student: Emily Cosgrove Graduate Student: Amir Mazaheri Professor: Dr. Shah

Project Overview Video Prediction Generative Adversarial Networks (GANs) Textual information for video prediction Goal: Enhanced video prediction

Progress Large Scale Movie Description Challenge (LSMDC) challenge dataset 128,000 videos Each 2-20 seconds Comes with text Clipped each video into 1 second clips (about 30 frames) Now have 359,000 video clips Text coming with each We use standard training-validation-test split (used in LSMDC- 2016) 90% data as training, 5% validation, and 5% testing

Annotation: trying to lighten the mood. Someone smiles sheepishly

1 2 3 4 5 6 LSTM LSTM LSTM LSTM LSTM LSTM CNN CNN CNN CNN CNN CNN CNN DE-CONV DE-CONV DE-CONV LSTM LSTM LSTM LSTM LSTM LSTM CNN CNN CNN CNN CNN CNN CNN CNN CNN CNN CNN 1 2 3 4 5 6

Progress Two Research Papers Generative Adversarial Text to Image Synthesis (Scott Reed, et.) Uses GANs Developed a model to generate images based on textual description Convolutional LSTM Network: A Machine Learning Approach to Precipitation Nowcasting (Xingjian Shi, et.) Precipitation Nowcasting: challenging weather forecasting problem Proposed convolutional LSTM (ConvLSTM)

Next Week Make model more complete by adding Adversarial loss and text