Depth estimation and Plane detection

Slides:



Advertisements
Similar presentations
Efficient High-Resolution Stereo Matching using Local Plane Sweeps Sudipta N. Sinha, Daniel Scharstein, Richard CVPR 2014 Yongho Shin.
Advertisements

 Share assignments and files  Student and parent home access  Announcements  Classroom information.
Face Alignment with Part-Based Modeling
Cambridge, Massachusetts Pose Estimation in Heavy Clutter using a Multi-Flash Camera Ming-Yu Liu, Oncel Tuzel, Ashok Veeraraghavan, Rama Chellappa, Amit.
Segmentation into Planar Patches for Recovery of Unmodeled Objects Kok-Lim Low COMP Computer Vision 4/26/2000.
Contents Description of the big picture Theoretical background on this work The Algorithm Examples.
A Novel 2D To 3D Image Technique Based On Object- Oriented Conversion.
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
What is the Best Multi-Stage Architecture for Object Recognition Kevin Jarrett, Koray Kavukcuoglu, Marc’ Aurelio Ranzato and Yann LeCun Presented by Lingbo.
Face Alignment Using Cascaded Boosted Regression Active Shape Models
Convolutional Neural Networks for Image Processing with Applications in Mobile Robotics By, Sruthi Moola.
Presented by: Kamakhaya Argulewar Guided by: Prof. Shweta V. Jain
Avoiding Segmentation in Multi-digit Numeral String Recognition by Combining Single and Two-digit Classifiers Trained without Negative Examples Dan Ciresan.
Acquiring 3D models of objects via a robotic stereo head David Virasinghe Department of Computer Science University of Adelaide Supervisors: Mike Brooks.
Wenqi Zhu 3D Reconstruction From Multiple Views Based on Scale-Invariant Feature Transform.
Face Detection Final Presentation Mark Lee Nic Phillips Paul Sowden Andy Tait 9 th May 2006.
Essential components of the implementation are:  Formation of the network and weight initialization routine  Pixel analysis of images for symbol detection.
Grauman Today: Image Filters Smooth/Sharpen Images... Find edges... Find waldo…
Camera Calibration Course web page: vision.cis.udel.edu/cv March 24, 2003  Lecture 17.
May 2003 SUT Color image segmentation – an innovative approach Amin Fazel May 2003 Sharif University of Technology Course Presentation base on a paper.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.
Deep Learning Overview Sources: workshop-tutorial-final.pdf
Optical Character Recognition
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching Link: singhashwini.mesinghashwini.me.
Yann LeCun Other Methods and Applications of Deep Learning Yann Le Cun The Courant Institute of Mathematical Sciences New York University
When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.
A Plane-Based Approach to Mondrian Stereo Matching
CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.
Demo.
What Convnets Make for Image Captioning?
Summary of “Efficient Deep Learning for Stereo Matching”
Object Detection based on Segment Masks
DeepCount Mark Lenson.
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Rotational Rectification Network for Robust Pedestrian Detection
Fast Preprocessing for Robust Face Sketch Synthesis
Lecture 5 Smaller Network: CNN
Convolutional Networks
CS6890 Deep Learning Weizhen Cai
Deeply Supervised Salient Object Detection with Short Connections Qibin Hou, Ming-Ming Cheng, Xiaowei Hu, Ali Borji, Zhuowen Tu, Philip H. S. Torr Observations:
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Adversarially Tuned Scene Generation
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Bird-species Recognition Using Convolutional Neural Network
RGB-D Image for Scene Recognition by Jiaqi Guo
Convolutional Neural Networks
Smart Robots, Drones, IoT
Oral presentation for ACM International Conference on Multimedia, 2014
CSC 578 Neural Networks and Deep Learning
CornerNet: Detecting Objects as Paired Keypoints
A Proposal Defense On Deep Residual Network For Face Recognition Presented By SAGAR MISHRA MECE
On Convolutional Neural Network
Papers 15/08.
Use 3D Convolutional Neural Network to Inspect Solder Ball Defects
Outline Background Motivation Proposed Model Experimental Results
Visualizing and Understanding Convolutional Networks
RCNN, Fast-RCNN, Faster-RCNN
Course 6 Stereo.
边缘检测年度进展概述 Ming-Ming Cheng Media Computing Lab, Nankai University
Heterogeneous convolutional neural networks for visual recognition
Department of Computer Science Ben-Gurion University of the Negev
Human-object interaction
Object Detection Implementations
Debasis Bhattacharya, JD, DBA University of Hawaii Maui College
SFNet: Learning Object-aware Semantic Correspondence
CSC 578 Neural Networks and Deep Learning
Sign Language Recognition With Unsupervised Feature Learning
Point Set Representation for Object Detection and Beyond
Introduction Face detection and alignment are essential to many applications such as face recognition, facial expression recognition, age identification,
Presented By: Firas Gerges (fg92)
Presentation transcript:

Depth estimation and Plane detection for Image Analysis and Understanding Cheng Hanni Huang Jingjing Song Mengting Xiao Ying

Project Description Depth estimation is the set of techniques and algorithms aiming to obtain the distance of each 3D point to the image plane. The goal of plane detection is to identify pixels in a 2D image which locates on a 2D plane. Depth estimation and Plane detection are for 3D reconstruction, scene understanding and robotics etc.

Overview Literature investigation Website establishment Depth estimation Plane Detection Conclusion

Literature investigation Learning Depth from Single Monocular Images In Proc. of NIPS.2005 Depth Map Prediction from Single Image using Multi Scale Deep Network. in Proc. of NIPS, 2014. Accurate 3D Ground Plane Estimation from a Single Image, In Proc. of ICRA 2009 Detecting planes and estimating their orientation from a single image, In Proc. of BMVC 2011.

Website Establishment Django is a free and open-source web framework, written in Python, which follows the model-view-template (MVT) architectural pattern. Our templates include demo.html and show_pic.html The core code of depth-estimation and plane-detection are under the folder ‘homework (app)’,we call the two functions in views.py. If I run the django project on my computer, then the url “http://localhost:8000/demo/” can leads to our demo.

Depth estimation Methods:Multi-Scale Deep Network Global Coarse-Scale Network: contains five feature extraction layers of convolution and max-pooling,followed by two fully connected layers.to predict the overall depth map structure using a global view of the scene.

Depth estimation Methods:Multi-Scale Deep Network Local Fine-Scale Network:The fine-scale network stack consists of convolutional layers only, along with one pooling stage for the first layer edge features. To edit the coarse prediction it receives to align with local detais such as object ande wall edges.

Depth estimation Scale-Invariant Error

Plane Detection Methods SLIC Algorithm to over-segment a picture (TPAMI 2012) For each 2D point in the picture, add the information of depth map to reconstruct a 3D point. Get the adjacent-matrix according to the segmented clusters. Go through the adjacent matrix using BFS, judge if two adjacent matrix are coplanar.(dimensionality reduction using SVD) Compute cosine of each plane

Plane Detection Experiment Results a) Source image b) Depth map c) Plane with avg color(ICRA 2009) d) Plane with box (demo video)

Plane Detection Analysis Solution The method in ICRA 2009 divide a picture into several irregular parts, draw a bounding box according to the top-left and bottom-right point of a part can’t get the precise result. The method depends on many parameters,(sample number, cosine threshold), one certain parameter cannot fit all picture. Solution Train a number of pictures to choose the right parameters which can make the training error smallest. Extract other features (texture/gradient) from one picture to get better result. But time is limited, I… don’t try above methods.()

Conclusion We use Django Framework written in python to establish our website, so that our front-end page can connect with our picture-processing codes. CNN can be applied to depth estimation. We write a python script to get the output depth map from the pre-trained caffe model. Plane detection includes three steps. Reconstruct 3D points using depth information Over-segment the picture using SLIC algorithm Treat the coplanar parts of the segmented clusters as a plane.

Links http://10.13.49.144:8000/demo/ (Not always available) Demo: Project website: http://itec.hust.edu.cn/~chenghn/project/Depth_Estimation/index.html Source code: https://github.com/honey0920/depth-estimation-plane-detection Course website: http://lbmedia.ece.ucsb.edu/member/uweb/Teaching/website/index.htm

Thanks.