Predicting Visual Search Targets via Eye Tracking Data

Slides:



Advertisements
Similar presentations
Face Recognition and Biometric Systems Eigenfaces (2)
Advertisements

On the Relationship between Visual Attributes and Convolutional Networks Paper ID - 52.
October 14, 2010Neural Networks Lecture 12: Backpropagation Examples 1 Example I: Predicting the Weather We decide (or experimentally determine) to use.
Oral Defense by Sunny Tang 15 Aug 2003
Convolutional Neural Networks for Image Processing with Applications in Mobile Robotics By, Sruthi Moola.
Richard Socher Cliff Chiung-Yu Lin Andrew Y. Ng Christopher D. Manning
Combining multiple learners Usman Roshan. Bagging Randomly sample training data Determine classifier C i on sampled data Goto step 1 and repeat m times.
PRESENTATION REU IN COMPUTER VISION 2014 AMARI LEWIS CRCV UNIVERSITY OF CENTRAL FLORIDA.
Feedforward semantic segmentation with zoom-out features
Dense Color Moment: A New Discriminative Color Descriptor Kylie Gorman, Mentor: Yang Zhang University of Central Florida I.Problem:  Create Robust Discriminative.
Convolutional LSTM Networks for Subcellular Localization of Proteins
Neural Network Recognition of Frequency Disturbance Recorder Signals Stephen Tang REU Final Presentation July 22, 2014.
A Hierarchical Deep Temporal Model for Group Activity Recognition
Manisha Gupta PHD Student: Fawad Casian Mentor: Dr. Ali Borji.
Learning to Compare Image Patches via Convolutional Neural Networks SERGEY ZAGORUYKO & NIKOS KOMODAKIS.
Automatic Advertisement Ratings Discussion Methods Problem and Motivation The goal is to automatically generate an objective score or ranking for an advertisement.
Journal of Vision. 2013;13(14):10. doi: / Figure Legend:
Recent developments in object detection
Big data classification using neural network
Learning to Compare Image Patches via Convolutional Neural Networks
Demo.
The Relationship between Deep Learning and Brain Function
an introduction to: Deep Learning
Sentence Modeling Representation of sentences is the heart of Natural Language Processing A sentence model is a representation and analysis of semantic.
Query Based Video Summarization
Week III: Deep Tracking
Understanding and Predicting Image Memorability at a Large Scale
Tulane University University of Central Florida Problem Overview
EMOTIONAL INTELLIGENCE
Predicting Search Targets -Week 6
Lecture 5 Smaller Network: CNN
PixelLaser: Range from texture
Training Techniques for Deep Neural Networks
CS6890 Deep Learning Weizhen Cai
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Dynamic Routing Using Inter Capsule Routing Protocol Between Capsules
PixelLaser: Range from texture
Introduction to Convolutional Neural Network (CNN/ConvNET)-insights from amateur George (Tian Zhou)
Bird-species Recognition Using Convolutional Neural Network
Introduction to Neural Networks
Image Classification.
Deep neural networks (DNNs) can successfully identify, count, and describe animals in camera-trap images. Deep neural networks (DNNs) can successfully.
Vessel Extraction in X-Ray Angiograms Using Deep Learning
Deep learning Introduction Classes of Deep Learning Networks
Eco 6380 Predictive Analytics For Economists Spring 2016
Neuro-Computing Lecture 4 Radial Basis Function Network
Creating Data Representations
Driver Verification Using Eye Movements and Blinking
Lecture: Deep Convolutional Neural Networks
Visualizing and Understanding Convolutional Networks
RCNN, Fast-RCNN, Faster-RCNN
Introduction to Artificial Intelligence Lecture 24: Computer Vision IV
ImageNet Classification with Deep Convolutional Neural Networks
Heterogeneous convolutional neural networks for visual recognition
An introduction to: Deep Learning aka or related to Deep Neural Networks Deep Structural Learning Deep Belief Networks etc,
Department of Computer Science Ben-Gurion University of the Negev
Automatic Handwriting Generation
Predicting Search Targets -Week 2
Deep Object Co-Segmentation
Using Clustering to Make Prediction Intervals For Neural Networks
Predicting Search Targets -Week 2
VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION
Modeling IDS using hybrid intelligent systems
Multi-UAV to UAV Tracking
Weak-supervision based Multi-Object Tracking
Visual Question Answering Aaron Honculada, Aisha Urooj Khan, Dr
Week 3 Volodymyr Bobyr.
THE ASSISTIVE SYSTEM SHIFALI KUMAR BISHWO GURUNG JAMES CHOU
Tschandl P1,2, Argenziano G3, Razmara M4, Yap J4
Iterative Projection and Matching: Finding Structure-preserving Representatives and Its Application to Computer Vision.
Presentation transcript:

Predicting Visual Search Targets via Eye Tracking Data Dr. Ali Borji University of Central Florida Fawad Ahmed University of Central Florida Manisha Gupta manishagupta@utexas.edu University of Texas at Austin I. Purpose & Applications III. Gaze Information V. Euclidean Distances Participants’ gaze information was plotted against the collages Size of the fixation points corresponds to the duration Predict visual search targets in closed and open world settings Develop a learning mechanism over an unknown set of targets Applications: Implemented in cameras to scan the environment and pinpoint where a lost object (ex: set of keys) is located II. Dataset Extracted book covers Search target (top left) 6 participants searched for a target within a collage 20 variations of each collage 5 search targets (shown next to each collage) Yields 600 search tasks *Generated for all 600 search tasks *Repeated for the 20 variations of each collage per search target *Generated for 5 search tasks for each of the three types of collages Lower distances demonstrates higher similarities between the search target and the fixation point IV. Convolutional Neural Network Pre-trained Siamese model Extracts 2x512 vector features from linear layer Compares and outputs image similarities Pipeline: Architecture: Search Target ConvNet Decision Network Image Similarity Extracted Book Cover 1 Amazon book covers: Distinct structure/Distinct color Extracted Book Cover 2 ---- Extracted Book Cover N *Repeated for all six users The value of average Euclidean distances and lingering percentages imply that the set of Amazon collages was the most difficult search task. O’Reilly book covers: Low structure/Distinct color Decision Network Decision Network Branch Network 1 Image 1 Conv + ReLu Max pooling Max Pooling VI. Future Work VII. Acknowledgements Send feature vector and search targets to a binary SVM Add weights to fixation points to account for gaze duration Thank you to Dr. Mubarak Shah and Dr. Niels Lobo for their guidance during the program. The research project presented is funded by NSF REU Summer Program 2016. Un-shared (pseudo-siamese) Shared (siamese) Image 2 Conv + ReLu Max pooling Max Pooling Branch Network 2 Mugshots: Low structure/Low color