Volodymyr Bobyr Supervised by Aayushjungbahadur Rana

Slides:

Advertisements

Similar presentations

Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.

Advertisements

INRETS, Villeneuve d’Ascq, December 15 th -16 th 2005 ETISEO Annotation rules Data structure Annotation tool and format Ground truth creation rules Reference.

Robust Object Segmentation Using Adaptive Thresholding Xiaxi Huang and Nikolaos V. Boulgouris International Conference on Image Processing 2007.

Programme 2pm Introduction –Andrew Zisserman, Chris Williams 2.10pm Overview of the challenge and results –Mark Everingham (Oxford) 2.40pm Session 1: The.

Handwritten Character Recognition using Hidden Markov Models Quantifying the marginal benefit of exploiting correlations between adjacent characters and.

WEEK VI Malcolm Collins-Sibley Mentor: Shervin Ardeshir.

Object Detection with Discriminatively Trained Part Based Models

Extending context models for privacy in pervasive computing environments Jadwiga Indulska The School of Information Technology and Electrical Engineering,

PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.

Human pose recognition from depth image MS Research Cambridge.

Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.

CSSE463: Image Recognition Day 29 This week This week Today: Surveillance and finding motion vectors Today: Surveillance and finding motion vectors Tomorrow:

Nottingham Image Analysis School, 23 – 25 June NITS Image Segmentation Guoping Qiu School of Computer Science, University of Nottingham

Using decision trees to build an a framework for multivariate time- series classification 1 Present By Xiayi Kuang.

Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.

Multi-view Traffic Sign Detection, Recognition and 3D Localisation Radu Timofte, Karel Zimmermann, and Luc Van Gool.

Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,

CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.

CSSE463: Image Recognition Day 29

Visual Attributes in Video

Action-Grounded Push Affordance Bootstrapping of Unknown Objects

Object Detection based on Segment Masks

Introduction – Process View on Management

Improving Chinese handwriting Recognition by Fusing speech recognition

Krishna Kumar Singh, Yong Jae Lee University of California, Davis

DM-Group Meeting Liangzhe Chen, Nov

2nd Level Analysis Methods for Dummies 2010/11 - 2nd Feb 2011

Week 9 Emily Hand UNR.

Epileptic Seizure Prediction

Algorithm Analysis CSE 2011 Winter September 2018.

Tingdan Luo 05/02/2016 Interactively Optimizing Information Retrieval Systems as a Dueling Bandits Problem Tingdan Luo

Week 6 Cecilia La Place.

Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.

Reinforcement Learning

Microsoft Visual Basic 2005 BASICS

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Other Algorithms Follow Up

LinkedIn Training.

הפקולטה להנדסת חשמל - המעבדה לבקרה ורובוטיקה גילוי תנועה ועקיבה אחר מספר מטרות מתמרנות הטכניון - מכון טכנולוגי לישראל TECHNION.

Administrivia Course Web:

Globally Optimal Generalized Maximum Multi Clique Problem (GMMCP) using Python code for Pedestrian Object Tracking By Beni Mulyana.

CSSE463: Image Recognition Day 29

A Bayesian Estimation of Building Shape using MCMC

Introduction to Data Mining, 2nd Edition

On-going research on Object Detection *Some modification after seminar

Oral presentation for ACM International Conference on Multimedia, 2014

Introduction Task: extracting relational facts from text

Object Detection Creation from Scratch Samsung R&D Institute Ukraine

Section 3.3 Graphing Linear Functions

Deep Neural Networks: A Hands on Challenge Deep Neural Networks: A Hands on Challenge Deep Neural Networks: A Hands on Challenge Deep Neural Networks:

CSSE463: Image Recognition Day 29

Grace W. Tang, Russ B. Altman Structure

CSSE463: Image Recognition Day 29

Evaluation of UMD Object Tracking in Video

Mark Elliot National Centre for Research Methods

CSSE463: Image Recognition Day 29

Deep neural networks for spike sorting: exploring options

Object Detection Implementations

Week 3: Moving Target Detection Using Infrared Sensors

Multi-UAV to UAV Tracking

SafeDrive: Online Driving Anomaly Detection From Large-Scale Vehicle Data ECE 693 Big Data Security.

Volodymyr Bobyr Supervised by Aayushjungbahadur Rana

Report 2 Brandon Silva.

Actor-Object Relation in Videos

Week 7 Presentation Ngoc Ta Aidean Sharghi

Volodymyr Bobyr Supervised by Aayushjungbahadur Rana

Using simple machine learning for image segmentation

Initial Progress Report

Presentation transcript:

Volodymyr Bobyr Supervised by Aayushjungbahadur Rana Week 8 Volodymyr Bobyr Supervised by Aayushjungbahadur Rana

Goals Goal Completed ✓ X Optimize the Data Loader Object & Action Segmentations and Centroid Results Incorporate Mean Average Precision metrics Rough Comparison to Challenge Results Full (temporal tube) Comparison to Challenge Results X

Quick Info Managed to bring training time back to 20 minutes Trained actions, objects, and centroids separately Planning to train in a sequence: Objects -> Actions -> Relations -> Centroids Order subject to change Got good checkpoints for objects & actions Actions Objects

Segmentation to B-Boxes Process: Segmentation Threshold: (0, 1) labels Segmentation to Blobs: connected segments are extracted as ‘blobs’ Blob Filtering: dispose of blobs with area < blob threshold (currently: 25) Bounding Box Generation: corners of the blobs Likely Problems: A single blob may represent multiple objects Centroids will be used to separate instances

Mean Average Precision Used as main metric in: ACM Multimedia 2019 Grand Challenge Process: Generate bounding boxes from segmentations Compare each generated bounding box to ground truth bounding box If IoU > threshold (0.5), the bounding box is a true positive If a bounding box wasn’t marked as true positive, it’s a false positive Mean-AP is the average over all classes, frames, and samples

Mean Average Precision If true positives == false positives == 0 for a class: Two different ways: If there weren’t any labels for that class, mark it as 1 Bumps up mean-ap by a lot because most channels will return a 1 If there weren’t any labels for that class, ignore this box If there were labels, mark it as 0 Results: 1st Way 2nd Way Objects 0.9363 0.2407 Actions 0.9576 0.3148

Approach Shift Combine actions and relations as they share most context Split the actions/relations into unidirectional and bidirectional Bidirectional: Actions/Relations that have no need for subject/object separation Ex: ‘next to’ Predicted between 0 and 1 Unidirectional: Actions/Relations where subjects & objects have to be clearly defined Ex: ‘holding’, ‘behind’, etc. Predicted between -1 and 1 (-1: object, 1: subject) When predicting some actions, the counterpart can often be assumed (post-processing) IE: (A, ‘behind’, B) implies (B, ‘in front’, A)