Visual Manipulation Relationship Network for Autonomous Robotics

Presentation transcript:

Visual Manipulation Relationship Network for Autonomous Robotics
Hanbo Zhang, Xuguang Lan, Xinwen Zhou, Zhiqiang Tian, Yang Zhang and Nanning Zheng
Khawla AlHamdan
2018 IEEE-RAS 18th International Conference on Humanoid Robots (Humanoids), Beijing, China, November 6-9, 2018

Object discovery and grasp detection with a shared convolutional neural network. IEEE International Conference on Robotics and Automation (ICRA)

What this paper works on: manipulation relationship recognition; the Visual Manipulation Relationship Network (VMRN); the Visual Manipulation Relationship Dataset (VMRD)

Background: Object Detection. Sliding window; deep features; Object Pairing Pooling Layer (OP2L)

Background: Visual Relationship Detection. Spatial relationships; image and video; deep learning

Background: Spatial Relation Reasoning. Point clouds

Visual Manipulation Relationship Network (VMRN). 2 stages: object detection result; recognition result of the manipulation relationship

Visual Manipulation Relationship Network (VMRN)

Object Detector: SSD (Single Shot Detector). Input: convolution feature maps (VGG16 or ResNet50). Output: (cls; xmin; ymin; xmax; ymax)
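The detection output (cls; xmin; ymin; xmax; ymax) can be read as a class index plus the two corners of an axis-aligned box. A minimal sketch of that record follows; the `Detection` class and `from_vector` helper are hypothetical names for illustration and are not taken from the paper or from any SSD implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Detection:
    """One detected object: class index plus box corners (hypothetical type)."""
    cls: int
    xmin: float
    ymin: float
    xmax: float
    ymax: float

    @property
    def width(self) -> float:
        return self.xmax - self.xmin

    @property
    def height(self) -> float:
        return self.ymax - self.ymin

    @property
    def area(self) -> float:
        return self.width * self.height


def from_vector(vec) -> Detection:
    """Build a Detection from a 5-element (cls, xmin, ymin, xmax, ymax) sequence."""
    cls, xmin, ymin, xmax, ymax = vec
    if xmax <= xmin or ymax <= ymin:
        raise ValueError("box must satisfy xmin < xmax and ymin < ymax")
    return Detection(int(cls), float(xmin), float(ymin), float(xmax), float(ymax))


det = from_vector([3, 10, 20, 50, 80])
print(det.width, det.height, det.area)
```

Pixel coordinates with the origin at the top-left corner are assumed here; the slide itself does not state the coordinate convention.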

Object Pairing Pooling Layer (OP2L). Inputs: object locations (cls; xmin; ymin; xmax; ymax) and the shared feature map CNN(I; Φ). Output: mini-batch for predicting manipulation relationships
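To make the pairing step concrete, the sketch below enumerates object pairs from a list of detected boxes and collects the regions that would be cropped from the shared feature map for each pair. Several choices are assumptions made for illustration, not details stated on the slide: pairs are treated as ordered, each pair contributes the two object boxes plus their union box, and the actual pooling of features is left out. The names `union_box` and `pair_objects` are hypothetical.

```python
from itertools import permutations


def union_box(a, b):
    """Smallest axis-aligned box containing both boxes (xmin, ymin, xmax, ymax)."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def pair_objects(boxes):
    """Enumerate ordered object pairs and the regions to crop for each pair.

    Assumption: each ordered pair (i, j) yields three regions, the box of
    object i, the box of object j, and their union. One entry per pair forms
    the mini-batch passed on to the relationship predictor.
    """
    batch = []
    for i, j in permutations(range(len(boxes)), 2):
        batch.append({
            "first": i,
            "second": j,
            "regions": (boxes[i], boxes[j], union_box(boxes[i], boxes[j])),
        })
    return batch


boxes = [(0, 0, 10, 10), (5, 5, 20, 20), (30, 30, 40, 40)]
batch = pair_objects(boxes)
print(len(batch))  # n * (n - 1) ordered pairs for n objects
```

With ordered pairs, n detected objects give n(n - 1) entries, so the mini-batch size grows quadratically with the number of objects in the image.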

Object Pairing Pooling Layer (OP2L)

Manipulation Relationship Predictor

Manipulation Relationship Tree

Manipulation Relationship Tree. Relation between objects: (1) Object1 is the parent of Object2; (2) Object2 is the parent of Object1; (3) Object1 and Object2 have no relation. Labeling criterion
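The three pairwise labels above are enough to assemble a parent-child structure over all detected objects. The sketch below builds that structure and derives one possible manipulation order. It assumes, as an illustration rather than something the slide states, that leaf objects (those with no children) are handled before their parents. The `Relation` enum and both function names are hypothetical.

```python
from enum import Enum


class Relation(Enum):
    FIRST_IS_PARENT = 0   # Object1 is the parent of Object2
    SECOND_IS_PARENT = 1  # Object2 is the parent of Object1
    NO_RELATION = 2       # Object1 and Object2 have no relation


def build_children(pairwise):
    """Map each object to the set of its children from pairwise labels."""
    children = {}
    for (first, second), rel in pairwise.items():
        children.setdefault(first, set())
        children.setdefault(second, set())
        if rel is Relation.FIRST_IS_PARENT:
            children[first].add(second)
        elif rel is Relation.SECOND_IS_PARENT:
            children[second].add(first)
    return children


def leaves_first_order(children):
    """Return objects ordered so that every child precedes its parents."""
    remaining = {node: set(kids) for node, kids in children.items()}
    order = []
    while remaining:
        leaves = sorted(node for node, kids in remaining.items() if not kids)
        if not leaves:
            raise ValueError("relations contain a cycle")
        order.extend(leaves)
        for leaf in leaves:
            del remaining[leaf]
        for kids in remaining.values():
            kids.difference_update(leaves)
    return order


pairwise = {
    ("book", "cup"): Relation.FIRST_IS_PARENT,
    ("book", "pen"): Relation.FIRST_IS_PARENT,
    ("cup", "pen"): Relation.NO_RELATION,
}
print(leaves_first_order(build_children(pairwise)))
```

The object names in the usage example are made up. Because an object may end up with more than one parent, the structure is handled here as a general directed graph, and contradictory labels that form a cycle are reported as an error.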

Visual Manipulation Relationship Dataset (VMRD): 5185 images, 17000 objects, 51530 manipulation relationships

ACCURACY OF OBJECT DETECTION AND VISUAL MANIPULATION RELATIONSHIP RECOGNITION

ROI-based Robotic Grasp Detection for Object Overlapping Scenes Region of Interest (ROI)

Thank You