Textual Video Prediction

Slides:



Advertisements
Similar presentations
CSE 589 Applied Algorithms Spring 1999 Image Compression Vector Quantization Nearest Neighbor Search.
Advertisements

Tools. Center for TeleInFrastructure 2 Tools  NetMeter  AudioMeter  VideoMeter.
Losslessy Compression of Multimedia Data Hao Jiang Computer Science Department Sept. 25, 2007.
©Brooks/Cole, 2003 Chapter 2 Data Representation.
Sadaf Ahamed G/4G Cellular Telephony Figure 1.Typical situation on 3G/4G cellular telephony [8]
Agenda Last class: Memory, Digitizing Numbers Today: Digitizing: Text
Compression video overview 演講者:林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.
Post-processing of JPEG image using MLP Fall 2003 ECE539 Final Project Report Data Fok.
1 資訊隱藏技術之研究 The Study of Information Hiding Mechanisms 指導教授: Chang, Chin-Chen ( 張真誠 ) 研究生: Lu, Tzu-Chuen ( 呂慈純 ) Department of Computer Science and Information.
Advanced Science and Technology Letters Vol.35(Security 2013), pp Image Steganograpy via Video Using Lifting.
Week 4: 6/6 – 6/10 Jeffrey Loppert. This week.. Coded a Histogram of Oriented Gradients (HOG) Feature Extractor Extracted features from positive and negative.
Using Tables to Find Equivalent Ratios. Example 1: Use a table to find three equivalent ratios of. Strategy: Create a table and multiply or divide both.
Presenting: Shlomo Ben-Shoshan, Nir Straze Supervisors: Dr. Ofer Hadar, Dr. Evgeny Kaminsky.
Automatic Advertisement Rating
Performance Analysis of 1D and 2D Statistical Measures on Standard Facial Image Databases International Conference on Emerging Trends in Engineering &
Quality Evaluation and Comparison of SVC Encoders
WAVELET VIDEO PROCESSING TECHNOLOGY
Week 3 (June 6 – June10 , 2016) Summary :
Textual Video Prediction Week 2
Multiplication table. x
Summary of Week 1 (May 23 – May 27, 2016)
Injong Rhee ICMCS’98 Presented by Wenyu Ren
Security Issues for Visual Data: Copyright and Access Control
Visual Saliency Detection
Textual Video Prediction
Digital Multimedia Coding
Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission Vineeth Shetty Kolkeri EE Graduate,UTA.
Historic Document Image De-Noising using Principal Component Analysis (PCA) and Local Pixel Grouping (LPG) Han-Yang Tang1, Azah Kamilah Muda1, Yun-Huoy.
Dots 5 × TABLES MULTIPLICATION.
Dots 5 × TABLES MULTIPLICATION.
Dots 2 × TABLES MULTIPLICATION.
5 × 7 = × 7 = 70 9 × 7 = CONNECTIONS IN 7 × TABLE
5 × 8 = 40 4 × 8 = 32 9 × 8 = CONNECTIONS IN 8 × TABLE
Dots 3 × TABLES MULTIPLICATION.
How to succeed in CSCI 4333 Design of Database Systems
Dots 6 × TABLES MULTIPLICATION.
4 × 6 = 24 8 × 6 = 48 7 × 6 = CONNECTIONS IN 6 × TABLE
5 × 6 = 30 2 × 6 = 12 7 × 6 = CONNECTIONS IN 6 × TABLE
Reversible Data Hiding Scheme Using Two Steganographic Images
Dots 2 × TABLES MULTIPLICATION.
Kyoungwoo Lee, Minyoung Kim, Nikil Dutt, and Nalini Venkatasubramanian
Dots 4 × TABLES MULTIPLICATION.
Textual Video Prediction
A Self-Reference Watermarking Scheme Based on Wet Paper Coding
10 × 8 = 80 5 × 8 = 40 6 × 8 = CONNECTIONS IN 8 × TABLE MULTIPLICATION.
Information Hiding and Its Applications
3 × 12 = 36 6 × 12 = 72 7 × 12 = CONNECTIONS IN 12 × TABLE
Chair Professor Chin-Chen Chang (張真誠) National Tsing Hua University
5 × 12 = × 12 = × 12 = CONNECTIONS IN 12 × TABLE MULTIPLICATION.
{Name}, School of Criminal Justice - Texas State University
Query-based video summarization
REU - End to End Self Driving Car
5 × 9 = 45 6 × 9 = 54 7 × 9 = CONNECTIONS IN 9 × TABLE
Bill Lotter, Harvard Biophysics PhD Candidate
Week 3 Presentation Ngoc Ta Aidean Sharghi.
Deep screen image crop and enhance
Visual Question Answering Aaron Honculada, Aisha Urooj Khan, Dr
Report 7 Brandon Silva.
REU Week 3: Real-Time Video Anomaly Detection
3 × 7 = 21 6 × 7 = 42 7 × 7 = CONNECTIONS IN 7 × TABLE
Deep screen image crop and enhance
Dots 3 × TABLES MULTIPLICATION.
Week 7 Presentation Ngoc Ta Aidean Sharghi
Deep screen image crop and enhance
Week 6: Moving Target Detection Using Infrared Sensors
Visual Question Answering
CRCV REU 2019 Week 4.
Deep screen image crop and enhance
The experiment based on hier-attention
Week 4: Moving Target Detection Using Infrared Sensors
Presentation transcript:

Textual Video Prediction REU Student: Emily Cosgrove Graduate Student: Amir Mazaheri Professor: Dr. Shah

PSNR PSNR (peak signal to noise ratio) Most common measurement for video prediction PSNR = 10 log 10 𝑀𝐴𝑋 𝑖 2 𝑀𝑆𝐸

Prednet We trained and tested PREDNET on our data Movie dataset It was originally trained on the KITTI datasets It predicts one frame We are working on to change the code to predict multiple frames

PSNR Table Method Name PSNR Details PREDNET 17.58 Predicts Just one Frame N/A Predicts Multiple Frames (Working on the code) ConvLSTM 21.3 Predicts Multiple Frames STN (Prediction of tx and ty) 31.235 ConvLSTM + Text

Next steps Compute the spatial attention Possible Usage of Text Copy pixels out of attention area Predict pixels inside the attention area

Video Spatial Attention Generated Text LSTM Background Video Video Spatial Attention