Query-Focused Video Summarization – Week 1

Slides:

Advertisements

Similar presentations

VisualRank: Applying PageRank to Large-Scale Image Search Yushi Jing, Member, IEEE, and Shumeet Baluja, Member, IEEE.

Advertisements

Ch. 17 Basic Statistical Models CIS 2033: Computational Probability and Statistics Prof. Longin Jan Latecki Prepared by: Nouf Albarakati.

Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,

Discriminative Segment Annotation in Weakly Labeled Video Kevin Tang, Rahul Sukthankar Appeared in CVPR 2013 (Oral)

IJCAI Wei Zhang, 1 Xiangyang Xue, 2 Jianping Fan, 1 Xiaojing Huang, 1 Bin Wu, 1 Mingjie Liu 1 Fudan University, China; 2 UNCC, USA {weizh,

Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.

Announcements  Project proposal is due on 03/11  Three seminars this Friday (EB 3105) Dealing with Indefinite Representations in Pattern Recognition.

Complete and Present Prototype Project 6 Status report: Tuesday, October 5 th Due: Saturday, October 9 th Presentation: Tuesday, October 12 th.

Distributed Representations of Sentences and Documents

MLP Exercise (2006) Become familiar with the Neural Network Toolbox in Matlab Construct a single hidden layer, feed forward network with sigmoidal units.

Information Retrieval in Practice

Project 4 Image Search based on BoW model with Inverted File System

A Genetic Algorithms Approach to Feature Subset Selection Problem by Hasan Doğu TAŞKIRAN CS 550 – Machine Learning Workshop Department of Computer Engineering.

Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.

PageRank for Product Image Search Kevin Jing (Googlc IncGVU, College of Computing, Georgia Institute of Technology) Shumeet Baluja (Google Inc.) WWW 2008.

An Example of Course Project Face Identification.

NEURAL NETWORKS FOR DATA MINING

Beauty is Here! Evaluating Aesthetics in Videos Using Multimodal Features and Free Training Data Yanran Wang, Qi Dai, Rui Feng, Yu-Gang Jiang School of.

Project 1: Machine Learning Using Neural Networks Ver 1.1.

Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.

1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.

WEEK4 RESEARCH Amari Lewis Aidean Sharghi. PREPARING THE DATASET  Cars – 83 samples  3 images for each sample when x=0  7 images for each sample when.

OBJECT TRACKING USING PARTICLE FILTERS. Table of Contents Tracking Tracking Tracking as a probabilistic inference problem Tracking as a probabilistic.

MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.

NN k Networks for browsing and clustering image collections Daniel Heesch Communications and Signal Processing Group Electrical and Electronic Engineering.

CS791 - Technologies of Google Spring A Webbased Kernel Function for Measuring the Similarity of Short Text Snippets By Mehran Sahami, Timothy.

Deep Learning Overview Sources: workshop-tutorial-final.pdf

Height Estimation from Egocentric Video- Week 1 Dr. Ali Borji Aisha Urooj Khan Jessie Finocchiaro UCF CRCV REU 2016.

Big data classification using neural network

Learning to Compare Image Patches via Convolutional Neural Networks

Advanced Image Processing

Week 3 (June 6 – June10 , 2016) Summary :

Why it is Called Tensor Flow Parallelism in ANNs Project Ideas and Discussion Glenn Fung Presents Batch Renormalizating Paper.

Summary of Week 1 (May 23 – May 27, 2016)

Query Based Video Summarization

Week III: Deep Tracking

Personalized Social Image Recommendation

Multimodal Learning with Deep Boltzmann Machines

COMP61011 : Machine Learning Ensemble Models

Presenter: Hajar Emami

Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT

Video Summarization via Determinantal Point Processes (DPP)

Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science

Introduction to Deep Learning for neuronal data analyses

Protection of AI Inventions in Japan

Project 1: Text Classification by Neural Networks

Chap. 7 Regularization for Deep Learning (7.8~7.12 )

Pose Estimation for non-cooperative Spacecraft Rendevous using CNN

On Convolutional Neural Network

Ying Dai Faculty of software and information science,

Ying Dai Faculty of software and information science,

Ying Dai Faculty of software and information science,

Textual Video Prediction

Autoencoders Supervised learning uses explicit labels/correct output in order to train a network. E.g., classification of images. Unsupervised learning.

Attention for translation

Automatic Handwriting Generation

Unsupervised learning of visual sense models for Polysemous words

Learning and Memorization

Query-based video summarization

Presented By: Harshul Gupta

Weekly Learning Alex Omar Ruiz Irene.

Week 3 Presentation Ngoc Ta Aidean Sharghi.

UCF-REU in Computer Vision

CRCV REU 2019 Kara Schatz.

Cengizhan Can Phoebe de Nooijer

Week 3 Volodymyr Bobyr.

Week 7 Presentation Ngoc Ta Aidean Sharghi

Self-Supervised Cross-View Action Synthesis

Iterative Projection and Matching: Finding Structure-preserving Representatives and Its Application to Computer Vision.

Presentation transcript:

Query-Focused Video Summarization – Week 1 Jacob Laurel Mentors: Aidean Sharghi and Dr. Boqing Gong UCF CRCV REU 2016

Project Proposal Goal: Summarize a video with a diverse subset of frames (meaning no redundancy). Also frames will be annotated so that a semantic summary will also be produced by the algorithm. Annotate a video data set to use for testing and training Possible applications of this project: Egocentric video summarization (i.e. Google Glass) , surveillance footage summarization, video search engine recommendations, pose estimation

Determinantal Point Process Probabilistic process Motivation: We wish to randomly take a diverse subset A, of some set S We want to formulate our subset such that elements with diverse/different features are most likely (Intuitively, elements with similar features “repel” each other) Such probabilities can be computed via determinants of a matrix (hence name)

Fig. 3. A diverse subset of frames from a video Illustration of DPPs Fig 1. MATLAB demo of a SDPP Fig 2. Graphical illustration of points uniformly sampled (left) and distributed according to a DPP (right) Fig. 3. A diverse subset of frames from a video

Step 1) Data Preparation Annotate and prepare videos taken from UT Egocentric dataset This will be done in conjunction with a GUI that can specify semantic information about the frames (used when querying)

Step 2) Design of the DPP Our Design will follow a sequential structure Different Neural Network configurations can be used to learn the best kernel matrix Existing structure incorporates a single hidden layer Different image features will be experimented with to determine the best model. Potential directions (as of 5/26) SIFT features GIST global image descriptor Visual Bag of words model We also propose to incorporate semantic features into the feature vector

Step 3) Experimental Procedure The DPP will be tested using various feature vectors in OpenCV and Matlab Neural Network to learn the kernel matrix will be constructed in Keras or Caffe Model will need to be trained from scratch since no existing weights for this application are readily available Scope will focus on feed-forward Neural Networks, as opposed to CNN’s to reduce computational requirements and allow for more degrees of freedom for other areas

Checklist for this week: Downloaded all necessary software Familiarized with Keras and reproduced simple NN’s Ran and edited MATLAB DPP code Familiarized with existing literature and downloaded the Data set from UT’s website Reproduced simple version of Dr. Gong’s paper

Results from implementing a simple DPP Fig. 5. Original video Fig. 4. Subset of images generated by the DPP kernel

Next Week To-do Annotate Data set, with short descriptions Generate Ground truth summaries Compare method with other state-of-the-art approaches