Textual Video Prediction

Slides:

Advertisements

Similar presentations

CSE 589 Applied Algorithms Spring 1999 Image Compression Vector Quantization Nearest Neighbor Search.

Advertisements

Tools. Center for TeleInFrastructure 2 Tools  NetMeter  AudioMeter  VideoMeter.

Losslessy Compression of Multimedia Data Hao Jiang Computer Science Department Sept. 25, 2007.

©Brooks/Cole, 2003 Chapter 2 Data Representation.

Sadaf Ahamed G/4G Cellular Telephony Figure 1.Typical situation on 3G/4G cellular telephony [8]

Agenda Last class: Memory, Digitizing Numbers Today: Digitizing: Text

Compression video overview 演講者：林崇元. Outline Introduction Fundamentals of video compression Picture type Signal quality measure Video encoder and decoder.

Post-processing of JPEG image using MLP Fall 2003 ECE539 Final Project Report Data Fok.

1 資訊隱藏技術之研究 The Study of Information Hiding Mechanisms 指導教授： Chang, Chin-Chen ( 張真誠 ) 研究生： Lu, Tzu-Chuen ( 呂慈純 ) Department of Computer Science and Information.

Advanced Science and Technology Letters Vol.35(Security 2013), pp Image Steganograpy via Video Using Lifting.

Week 4: 6/6 – 6/10 Jeffrey Loppert. This week.. Coded a Histogram of Oriented Gradients (HOG) Feature Extractor Extracted features from positive and negative.

Using Tables to Find Equivalent Ratios. Example 1: Use a table to find three equivalent ratios of. Strategy: Create a table and multiply or divide both.

Presenting: Shlomo Ben-Shoshan, Nir Straze Supervisors: Dr. Ofer Hadar, Dr. Evgeny Kaminsky.

Automatic Advertisement Rating

Performance Analysis of 1D and 2D Statistical Measures on Standard Facial Image Databases International Conference on Emerging Trends in Engineering &

Quality Evaluation and Comparison of SVC Encoders

WAVELET VIDEO PROCESSING TECHNOLOGY

Week 3 (June 6 – June10 , 2016) Summary :

Textual Video Prediction Week 2

Multiplication table. x

Summary of Week 1 (May 23 – May 27, 2016)

Injong Rhee ICMCS’98 Presented by Wenyu Ren

Security Issues for Visual Data: Copyright and Access Control

Visual Saliency Detection

Textual Video Prediction

Digital Multimedia Coding

Research Topic Error Concealment Techniques in H.264/AVC for Wireless Video Transmission Vineeth Shetty Kolkeri EE Graduate,UTA.

Historic Document Image De-Noising using Principal Component Analysis (PCA) and Local Pixel Grouping (LPG) Han-Yang Tang1, Azah Kamilah Muda1, Yun-Huoy.

Dots 5 × TABLES MULTIPLICATION.

Dots 5 × TABLES MULTIPLICATION.

Dots 2 × TABLES MULTIPLICATION.

5 × 7 = × 7 = 70 9 × 7 = CONNECTIONS IN 7 × TABLE

5 × 8 = 40 4 × 8 = 32 9 × 8 = CONNECTIONS IN 8 × TABLE

Dots 3 × TABLES MULTIPLICATION.

How to succeed in CSCI 4333 Design of Database Systems

Dots 6 × TABLES MULTIPLICATION.

4 × 6 = 24 8 × 6 = 48 7 × 6 = CONNECTIONS IN 6 × TABLE

5 × 6 = 30 2 × 6 = 12 7 × 6 = CONNECTIONS IN 6 × TABLE

Reversible Data Hiding Scheme Using Two Steganographic Images

Dots 2 × TABLES MULTIPLICATION.

Kyoungwoo Lee, Minyoung Kim, Nikil Dutt, and Nalini Venkatasubramanian

Dots 4 × TABLES MULTIPLICATION.

Textual Video Prediction

A Self-Reference Watermarking Scheme Based on Wet Paper Coding

10 × 8 = 80 5 × 8 = 40 6 × 8 = CONNECTIONS IN 8 × TABLE MULTIPLICATION.

Information Hiding and Its Applications

3 × 12 = 36 6 × 12 = 72 7 × 12 = CONNECTIONS IN 12 × TABLE

Chair Professor Chin-Chen Chang (張真誠) National Tsing Hua University

5 × 12 = × 12 = × 12 = CONNECTIONS IN 12 × TABLE MULTIPLICATION.

{Name}, School of Criminal Justice - Texas State University

Query-based video summarization

REU - End to End Self Driving Car

5 × 9 = 45 6 × 9 = 54 7 × 9 = CONNECTIONS IN 9 × TABLE

Bill Lotter, Harvard Biophysics PhD Candidate

Week 3 Presentation Ngoc Ta Aidean Sharghi.

Deep screen image crop and enhance

Visual Question Answering Aaron Honculada, Aisha Urooj Khan, Dr

Report 7 Brandon Silva.

REU Week 3: Real-Time Video Anomaly Detection

3 × 7 = 21 6 × 7 = 42 7 × 7 = CONNECTIONS IN 7 × TABLE

Deep screen image crop and enhance

Dots 3 × TABLES MULTIPLICATION.

Week 7 Presentation Ngoc Ta Aidean Sharghi

Deep screen image crop and enhance

Week 6: Moving Target Detection Using Infrared Sensors

Visual Question Answering

CRCV REU 2019 Week 4.

Deep screen image crop and enhance

The experiment based on hier-attention

Week 4: Moving Target Detection Using Infrared Sensors

Presentation transcript:

Textual Video Prediction REU Student: Emily Cosgrove Graduate Student: Amir Mazaheri Professor: Dr. Shah

PSNR PSNR (peak signal to noise ratio) Most common measurement for video prediction PSNR = 10 log 10 𝑀𝐴𝑋 𝑖 2 𝑀𝑆𝐸

Prednet We trained and tested PREDNET on our data Movie dataset It was originally trained on the KITTI datasets It predicts one frame We are working on to change the code to predict multiple frames

PSNR Table Method Name PSNR Details PREDNET 17.58 Predicts Just one Frame N/A Predicts Multiple Frames (Working on the code) ConvLSTM 21.3 Predicts Multiple Frames STN (Prediction of tx and ty) 31.235 ConvLSTM + Text

Next steps Compute the spatial attention Possible Usage of Text Copy pixels out of attention area Predict pixels inside the attention area

Video Spatial Attention Generated Text LSTM Background Video Video Spatial Attention