Visual Manipulation Relationship Network for Autonomous Robotics

Slides:



Advertisements
Similar presentations
Mapping between Scene and Screen. Screen (0, W-1) (W-1, H-1) (W-1, 0)(0, 0) Projection Plane (Xmin, Ymin)(Xmax, Ymin) (Xmax, Ymax) (Xmin, Ymax) Screen.
Advertisements

K-means Based Unsupervised Feature Learning for Image Recognition Ling Zheng.
Spatial Pyramid Pooling in Deep Convolutional
Feature extraction Feature extraction involves finding features of the segmented image. Usually performed on a binary image produced from.
IIIT Hyderabad Learning Semantic Interaction among Graspable Objects Swagatika Panda, A.H. Abdul Hafez, C.V. Jawahar Center for Visual Information Technology,
Mentor Prof. Amitabha Mukerjee Deepak Pathak Kaustubh Tapi 10346
Learning Hierarchical Features for Scene Labeling
Clip versus Overlay Module 4:. Module 3: Project Programs ArcGI S ArcCatalo g ArcMap ArcToolbox.
Spatial Localization and Detection
Deep Residual Learning for Image Recognition
Gaussian Conditional Random Field Network for Semantic Segmentation
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
Facial Smile Detection Based on Deep Learning Features Authors: Kaihao Zhang, Yongzhen Huang, Hong Wu and Liang Wang Center for Research on Intelligent.
City Forensics: Using Visual Elements to Predict Non-Visual City Attributes Sean M. Arietta, Alexei A. Efros, Ravi Ramamoorthi, Maneesh Agrawala Presented.
A Hierarchical Deep Temporal Model for Group Activity Recognition
Face Recognition based on 2D-PCA and CNN
When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.
Wenchi MA CV Group EECS,KU 03/20/2017
Recent developments in object detection
Faster R-CNN – Concepts
Convolutional Neural Network
Object Detection based on Segment Masks
Compact Bilinear Pooling
From Vision to Grasping: Adapting Visual Networks
Computer Science and Engineering, Seoul National University
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
The Problem: Classification
Saliency-guided Video Classification via Adaptively weighted learning
RESEARCH APPROACH.
Regularizing Face Verification Nets To Discrete-Valued Pain Regression
Combining CNN with RNN for scene labeling (segmentation)
YOLO9000:Better, Faster, Stronger
Part-Based Room Categorization for Household Service Robots
Week 6 Cecilia La Place.
Presentation by Ryan Brand
Deep Residual Learning for Image Recognition
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Adversarially Tuned Scene Generation
Solve Linear and Quadratic Systems Algebraically
Introduction to Neural Networks
Neural network systems
Face Recognition with Deep Learning Method
SBNet: Sparse Blocks Network for Fast Inference
A Comparative Study of Convolutional Neural Network Models with Rosenblatt’s Brain Model Abu Kamruzzaman, Atik Khatri , Milind Ikke, Damiano Mastrandrea,
Convolutional Neural Networks for Visual Tracking
RGB-D Image for Scene Recognition by Jiaqi Guo
Road Traffic Sign Recognition
Object Detection + Deep Learning
Progress Report Meng-Ting Zhong 2015/5/6.
Object Detection Creation from Scratch Samsung R&D Institute Ukraine
Faster R-CNN By Anthony Martinez.
Outline Background Motivation Proposed Model Experimental Results
Object Tracking: Comparison of
RCNN, Fast-RCNN, Faster-RCNN
Comparison of EET and Rank Pooling on UCF101 (split 1)
Solving a System of Linear and Quadratic Equations Algebraically
Heterogeneous convolutional neural networks for visual recognition
Course Recap and What’s Next?
b) Create a graph of your table.
Human-object interaction
Deep Object Co-Segmentation
Natalie Lang Tomer Malach
Object Detection Implementations
End-to-End Facial Alignment and Recognition
CRCV REU 2019 Kara Schatz.
Week 3 Volodymyr Bobyr.
Report 2 Brandon Silva.
Deep Video Quality Assessor: From Spatio-temporal Visual Sensitivity to A convolutional Neural Aggregation Network Woojae Kim1, Jongyoo Kim2, Sewoong Ahn1,Jinwoo.
CRCV REU 2019 Aaron Honculada.
SDSEN: Self-Refining Deep Symmetry Enhanced Network
Presentation transcript:

Visual Manipulation Relationship Network for Autonomous Robotics Hanbo Zhang, Xuguang Lan, Xinwen Zhou, Zhiqiang Tian, Yang Zhang and Nanning Zheng Khawla AlHamdan 2018 IEEE-RAS 18th International Conference on Humanoid Robots (Humanoids) Beijing, China, November 6-9, 2018

IEEE International Conference on Robotics and Automation (ICRA) Object discovery and grasp detection with a shared convolutional neural network IEEE International Conference on Robotics and Automation (ICRA)

What this paper work on. Manipulation relationship recognition Visual Manipulation Relationship Network (VMRN) VMRD (Visual Manipulation relationship Dataset)

Background: Object Detection Sliding window Deep feature Object Pairing Pooling Layer (OP2L)

Background: Visual Relationship Detection Spatial relationship Image and Video Deep learning

Background: Spatial Relation Reasoning Cloud Points

Visual Manipulation Relationship Network (VMRN) 2 stages : object detection result recognition result of the manipulation relationship

Visual Manipulation Relationship Network (VMRN)

(cls; xmin; ymin; xmax; ymax) Convolution Feature Maps Object Detector SSD (Single Shot Detector ) (cls; xmin; ymin; xmax; ymax) Convolution Feature Maps (VGG16 or ResNet50)

Object Pairing Pooling Layer (OP2L) Object Location (cls; xmin; ymin; xmax; ymax) mini-batch predicting manipulation relationships Shared Feature map CNN(I; Ф)

Object Pairing Pooling Layer (OP2L)

Manipulation Relationship Predictor

Manipulation Relationship Tree

Manipulation Relationship Tree Relation between objects: Object1 is the parent of Object2 Object2 is the parent of Object1 Object1 and Object2 has no relation. Labeling Criterion

Visual Manipulation Relationship Network (VMRN) 2 stages : object detection result recognition result of the manipulation relationship

Visual Manipulation Relationship Dataset (VMRD) 5185 images 17000 object 51530 the manipulation relationship

ACCURACY OF OBJECT DETECTION AND VISUAL MANIPULATION RELATIONSHIP RECOGNITION

ROI-based Robotic Grasp Detection for Object Overlapping Scenes Region of Interest (ROI)

Thank You