Volodymyr Bobyr Supervised by Aayushjungbahadur Rana

Slides:



Advertisements
Similar presentations
Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
Advertisements

INRETS, Villeneuve d’Ascq, December 15 th -16 th 2005 ETISEO Annotation rules Data structure Annotation tool and format Ground truth creation rules Reference.
Robust Object Segmentation Using Adaptive Thresholding Xiaxi Huang and Nikolaos V. Boulgouris International Conference on Image Processing 2007.
Programme 2pm Introduction –Andrew Zisserman, Chris Williams 2.10pm Overview of the challenge and results –Mark Everingham (Oxford) 2.40pm Session 1: The.
Handwritten Character Recognition using Hidden Markov Models Quantifying the marginal benefit of exploiting correlations between adjacent characters and.
WEEK VI Malcolm Collins-Sibley Mentor: Shervin Ardeshir.
Object Detection with Discriminatively Trained Part Based Models
Extending context models for privacy in pervasive computing environments Jadwiga Indulska The School of Information Technology and Electrical Engineering,
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
Human pose recognition from depth image MS Research Cambridge.
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
CSSE463: Image Recognition Day 29 This week This week Today: Surveillance and finding motion vectors Today: Surveillance and finding motion vectors Tomorrow:
Nottingham Image Analysis School, 23 – 25 June NITS Image Segmentation Guoping Qiu School of Computer Science, University of Nottingham
Using decision trees to build an a framework for multivariate time- series classification 1 Present By Xiayi Kuang.
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
Multi-view Traffic Sign Detection, Recognition and 3D Localisation Radu Timofte, Karel Zimmermann, and Luc Van Gool.
Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,
CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.
CSSE463: Image Recognition Day 29
Visual Attributes in Video
Action-Grounded Push Affordance Bootstrapping of Unknown Objects
Object Detection based on Segment Masks
Sequences.
Introduction – Process View on Management
Improving Chinese handwriting Recognition by Fusing speech recognition
Krishna Kumar Singh, Yong Jae Lee University of California, Davis
DM-Group Meeting Liangzhe Chen, Nov
2nd Level Analysis Methods for Dummies 2010/11 - 2nd Feb 2011
Week 9 Emily Hand UNR.
Epileptic Seizure Prediction
Algorithm Analysis CSE 2011 Winter September 2018.
Tingdan Luo 05/02/2016 Interactively Optimizing Information Retrieval Systems as a Dueling Bandits Problem Tingdan Luo
Week 6 Cecilia La Place.
Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.
Reinforcement Learning
Microsoft Visual Basic 2005 BASICS
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Other Algorithms Follow Up
LinkedIn Training.
הפקולטה להנדסת חשמל - המעבדה לבקרה ורובוטיקה גילוי תנועה ועקיבה אחר מספר מטרות מתמרנות הטכניון - מכון טכנולוגי לישראל TECHNION.
Administrivia Course Web:
Globally Optimal Generalized Maximum Multi Clique Problem (GMMCP) using Python code for Pedestrian Object Tracking By Beni Mulyana.
CSSE463: Image Recognition Day 29
A Bayesian Estimation of Building Shape using MCMC
Introduction to Data Mining, 2nd Edition
On-going research on Object Detection *Some modification after seminar
Oral presentation for ACM International Conference on Multimedia, 2014
Introduction Task: extracting relational facts from text
Ensembles.
Object Detection Creation from Scratch Samsung R&D Institute Ukraine
Section 3.3 Graphing Linear Functions
Deep Neural Networks: A Hands on Challenge Deep Neural Networks: A Hands on Challenge Deep Neural Networks: A Hands on Challenge Deep Neural Networks:
CSSE463: Image Recognition Day 29
Grace W. Tang, Russ B. Altman  Structure 
CSSE463: Image Recognition Day 29
Evaluation of UMD Object Tracking in Video
Mark Elliot National Centre for Research Methods
CSSE463: Image Recognition Day 29
Deep neural networks for spike sorting: exploring options
Object Detection Implementations
Week 3: Moving Target Detection Using Infrared Sensors
Multi-UAV to UAV Tracking
SafeDrive: Online Driving Anomaly Detection From Large-Scale Vehicle Data ECE 693 Big Data Security.
Volodymyr Bobyr Supervised by Aayushjungbahadur Rana
Report 2 Brandon Silva.
Actor-Object Relation in Videos
Week 7 Presentation Ngoc Ta Aidean Sharghi
Volodymyr Bobyr Supervised by Aayushjungbahadur Rana
Using simple machine learning for image segmentation
Initial Progress Report
Presentation transcript:

Volodymyr Bobyr Supervised by Aayushjungbahadur Rana Week 8 Volodymyr Bobyr Supervised by Aayushjungbahadur Rana

Goals Goal Completed ✓ X Optimize the Data Loader Object & Action Segmentations and Centroid Results Incorporate Mean Average Precision metrics Rough Comparison to Challenge Results Full (temporal tube) Comparison to Challenge Results X

Quick Info Managed to bring training time back to 20 minutes Trained actions, objects, and centroids separately Planning to train in a sequence: Objects -> Actions -> Relations -> Centroids Order subject to change Got good checkpoints for objects & actions Actions Objects

Segmentation to B-Boxes Process: Segmentation Threshold: (0, 1) labels Segmentation to Blobs: connected segments are extracted as ‘blobs’ Blob Filtering: dispose of blobs with area < blob threshold (currently: 25) Bounding Box Generation: corners of the blobs Likely Problems: A single blob may represent multiple objects Centroids will be used to separate instances

Mean Average Precision Used as main metric in: ACM Multimedia 2019 Grand Challenge Process: Generate bounding boxes from segmentations Compare each generated bounding box to ground truth bounding box If IoU > threshold (0.5), the bounding box is a true positive If a bounding box wasn’t marked as true positive, it’s a false positive Mean-AP is the average over all classes, frames, and samples

Mean Average Precision If true positives == false positives == 0 for a class: Two different ways: If there weren’t any labels for that class, mark it as 1 Bumps up mean-ap by a lot because most channels will return a 1 If there weren’t any labels for that class, ignore this box If there were labels, mark it as 0 Results: 1st Way 2nd Way Objects 0.9363 0.2407 Actions 0.9576 0.3148

Approach Shift Combine actions and relations as they share most context Split the actions/relations into unidirectional and bidirectional Bidirectional: Actions/Relations that have no need for subject/object separation Ex: ‘next to’ Predicted between 0 and 1 Unidirectional: Actions/Relations where subjects & objects have to be clearly defined Ex: ‘holding’, ‘behind’, etc. Predicted between -1 and 1 (-1: object, 1: subject) When predicting some actions, the counterpart can often be assumed (post-processing) IE: (A, ‘behind’, B) implies (B, ‘in front’, A)