Self-Supervised Segmentation of River Scenes Supreeth Achar *, Bharath Sankaran ‡, Stephen Nuske *, Sebastian Scherer *, Sanjiv Singh * * ‡

Slides:



Advertisements
Similar presentations
Automatic Photo Pop-up Derek Hoiem Alexei A.Efros Martial Hebert Carnegie Mellon University.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
SHREYAS PARNERKAR. Motivation Texture analysis is important in many applications of computer image analysis for classification or segmentation of images.
Weakly supervised learning of MRF models for image region labeling Jakob Verbeek LEAR team, INRIA Rhône-Alpes.
Zhimin CaoThe Chinese University of Hong Kong Qi YinITCS, Tsinghua University Xiaoou TangShenzhen Institutes of Advanced Technology Chinese Academy of.
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Learning Techniques for Video Shot Detection Under the guidance of Prof. Sharat Chandran by M. Nithya.
INTRODUCTION Heesoo Myeong, Ju Yong Chang, and Kyoung Mu Lee Department of EECS, ASRI, Seoul National University, Seoul, Korea Learning.
Real-Time Human Pose Recognition in Parts from Single Depth Images Presented by: Mohammad A. Gowayyed.
Fast and Extensible Building Modeling from Airborne LiDAR Data Qian-Yi Zhou Ulrich Neumann University of Southern California.
Mid-level Visual Element Discovery as Discriminative Mode Seeking Harley Montgomery 11/15/13.
Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei Li,
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
High Speed Obstacle Avoidance using Monocular Vision and Reinforcement Learning Jeff Michels Ashutosh Saxena Andrew Y. Ng Stanford University ICML 2005.
R-CNN By Zhang Liliang.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Handwritten Character Recognition using Hidden Markov Models Quantifying the marginal benefit of exploiting correlations between adjacent characters and.
Multiclass object recognition
Bag-of-Words based Image Classification Joost van de Weijer.
Computer vision.
Image Recognition using Hierarchical Temporal Memory Radoslav Škoviera Ústav merania SAV Fakulta matematiky, fyziky a informatiky UK.
Why Categorize in Computer Vision ?. Why Use Categories? People love categories!
Describing Images using Inferred Visual Dependency Representations Authors : Desmond Elliot & Arjen P. de Vries Presentation of Paper by : Jantre Sanket.
Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman University of Texas at Austin Jingnan.
Beauty is Here! Evaluating Aesthetics in Videos Using Multimodal Features and Free Training Data Yanran Wang, Qi Dai, Rui Feng, Yu-Gang Jiang School of.
Supervised Learning of Edges and Object Boundaries Piotr Dollár Zhuowen Tu Serge Belongie.
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
INTRODUCTION Heesoo Myeong and Kyoung Mu Lee Department of ECE, ASRI, Seoul National University, Seoul, Korea Tensor-based High-order.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
Pattern Recognition April 19, 2007 Suggested Reading: Horn Chapter 14.
CS654: Digital Image Analysis
Modern Boundary Detection II Computer Vision CS 143, Brown James Hays Many slides Michael Maire, Jitendra Malek Szeliski 4.2.
Tell Me What You See and I will Show You Where It Is Jia Xu 1 Alexander G. Schwing 2 Raquel Urtasun 2,3 1 University of Wisconsin-Madison 2 University.
NA-MIC National Alliance for Medical Image Computing Evaluating Brain Tissue Classifiers S. Bouix, M. Martin-Fernandez, L. Ungar, M.
Context Neelima Chavali ECE /21/2013. Roadmap Introduction Paper1 – Motivation – Problem statement – Approach – Experiments & Results Paper 2 Experiments.
Learning video saliency from human gaze using candidate selection CVPR2013 Poster.
Convolutional Restricted Boltzmann Machines for Feature Learning Mohammad Norouzi Advisor: Dr. Greg Mori Simon Fraser University 27 Nov
1.Learn appearance based models for concepts 2.Compute posterior probabilities or Semantic Multinomial (SMN) under appearance models. -But, suffers from.
6.S093 Visual Recognition through Machine Learning Competition Image by kirkh.deviantart.com Joseph Lim and Aditya Khosla Acknowledgment: Many slides from.
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
Cell Segmentation in Microscopy Imagery Using a Bag of Local Bayesian Classifiers Zhaozheng Yin RI/CMU, Fall 2009.
Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,
Deep Learning Overview Sources: workshop-tutorial-final.pdf
Unveiling Zeus Automated Classification of Malware Samples Abedelaziz Mohaisen Omar Alrawi Verisign Inc, VA, USA Verisign Labs, VA, USA
1 2D TO 3D IMAGE AND VIDEO CONVERSION. INTRODUCTION The goal is to take already existing 2D content, and artificially produce the left and right views.
Week 3 Emily Hand UNR. Online Multiple Instance Learning The goal of MIL is to classify unseen bags, instances, by using the labeled bags as training.
Date of download: 7/8/2016 Copyright © 2016 SPIE. All rights reserved. A scalable platform for learning and evaluating a real-time vehicle detection system.
A Hierarchical Deep Temporal Model for Group Activity Recognition
Segmentation of Building Facades using Procedural Shape Priors
CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.
Signal and Image Processing Lab
A Forest of Sensors: Using adaptive tracking to classify and monitor activities in a site Eric Grimson AI Lab, Massachusetts Institute of Technology
Supervised Time Series Pattern Discovery through Local Importance
Nonparametric Semantic Segmentation
Fast Preprocessing for Robust Face Sketch Synthesis
Basic machine learning background with Python scikit-learn
Using Transductive SVMs for Object Classification in Images
MLP Based Feedback System for Gas Valve Control in a Madison Symmetric Torus Andrew Seltzman Dec 14, 2010.
Adversarially Tuned Scene Generation
Fast and Robust Object Tracking with Adaptive Detection
K-means and Hierarchical Clustering
A New Approach to Track Multiple Vehicles With the Combination of Robust Detection and Two Classifiers Weidong Min , Mengdan Fan, Xiaoguang Guo, and Qing.
Object-Graphs for Context-Aware Category Discovery
Eric Grimson, Chris Stauffer,
CellNetQL Image Segmentation without Feature Definition
RGB-D Image for Scene Recognition by Jiaqi Guo
Automatic Handwriting Generation
THE ASSISTIVE SYSTEM SHIFALI KUMAR BISHWO GURUNG JAMES CHOU
Presentation transcript:

Self-Supervised Segmentation of River Scenes Supreeth Achar *, Bharath Sankaran ‡, Stephen Nuske *, Sebastian Scherer *, Sanjiv Singh * * ‡

Task Detect extent of river in monocular imagery Input Image Extent of River 1

Motivation: River Mapping Explore and map river, canopy with a low flying micro air vehicle Water segmentation used for guidance and mapping 2

Challenges 3

Appearance varies between environments Appearance varies within the same environment due to changing weather, wind, seasons Appearance of rivers changes so much that a single model or classifier isn't going to work well under different conditions and at different places 4

Intuition: Exploit Structure Horizon line is known and river can't appear above horizon Regions below horizon have a higher probability of being part of shore if they are visually similar to above horizon region river if they are visually dissimilar to above horizon region 5

Self Supervised Algorithm Overview Step 1: Features computed for each patch in input image Step 2: Extract and discard the sky region, find horizon line Step 3: Calculate for each patch in the image Step 4: Automatically select training patches, build classifier Step 5: Classify all patches in the image, use MRF to enforce spatial coherence Output: detected extent of river 6

1. Compute Features Image broken into square patches with a grid Trade off : speed v/s fineness of segmentation Feature descriptor is computed for each patch 7

2. Estimate Sky and Horizon Sky can be confused with shiny parts of water surface Segmented with pretrained classifier, ignored in future steps Horizon line can be fitted to image using IMU measurements 8

9 3. Calculating

9

9

9

Chow-Liu trees built to model and Trees built with from data in current image These are used to compute for each patch 10

4. Train the Classifier Patches with are considered part of river All above horizon patches are used as non-river examples Linear SVM trained on these automatically selected patches 11

5. Detect River SVM is used to classify the entire image into river / non river SVM output is smoothened spatially with a Markov Random Field 12

5. Detect River SVM is used to classify the entire image into river / non river SVM output is smoothened spatially with a Markov Random Field 13

Datasets 4 datasets of images each with manually labeled ground truth Datasets were collected from boats moving along river NameLocationTimeSeasonWeather Allegheny DayHerrs IslandAfternoonSummerOvercast Allegheny DuskHerrs IslandEveningSummerLight rain Allegheny FallHerrs IslandAfternoonFallSunny YoughioghenyPerryopolisNoonSummerSunny 14

Sample Results River detection on an image from the “Allegheny - Dusk” dataset. Top left is input, top right is probability of patch being non-river based on appearance (warmer colors map to higher probabilities). The bottom left image shows the automatically extracted training examples, the bottom right image shows the final river detection result. 15

Sample Results River detection on an image from the “Allegheny - Day” dataset. Top left is input, top right is probability of patch being non-river based on appearance (warmer colors map to higher probabilities). The bottom left image shows the automatically extracted training examples, the bottom right image shows the final river detection result. 16

Results Error Rate (Percentage of Image Area Missclassified) Supervised Self Supervised Allghy. Day Allghy. Dusk Allghy. Fall Yough.All but Self All Allghy. Day Allghy. Dusk Allghy. Fall Yough Test Set Supervised approach performs poorly on previously unseen data Our self supervised method performs well on new data 17

Results Error Rate (Percentage of Image Area Missclassified) Supervised Self Supervised Allghy. Day Allghy. Dusk Allghy. Fall Yough.All but Self All Allghy. Day Allghy. Dusk Allghy. Fall Yough Test Set Supervised approach performs poorly on previously unseen data Our self supervised method performs well on new data 17

Results Error Rate (Percentage of Image Area Missclassified) Supervised Self Supervised Allghy. Day Allghy. Dusk Allghy. Fall Yough.All but Self All Allghy. Day Allghy. Dusk Allghy. Fall Yough Test Set Supervised approach performs poorly on previously unseen data Our self supervised method performs well on new data 17

Results Error Rate (Percentage of Image Area Missclassified) Supervised Self Supervised Allghy. Day Allghy. Dusk Allghy. Fall Yough.All but Self All Allghy. Day Allghy. Dusk Allghy. Fall Yough Test Set Supervised approach performs poorly on previously unseen data Our self supervised method performs well on new data 17

Video #1 Self Supervised River Segmentation on the Allegheny Fall sequence 18

Ongoing Work Detect Reflections Memory Adaptive models Find novel objects 19

Video #2 Self Supervised River Segmentation on the Allegheny Day sequence 20

Conclusions River appearance changes, segmenting is hard Difficult to build a single classifier that works everywhere By exploiting scene structure, we can make a system adapt and train itself 21

Thank You 22

Backup Slides

Related Work Geometry from Appearance Road Following Water Hazard Detection 5

Execution Time Single threaded C++ implementation takes 350ms to segment a 640x360 image This rate is fast enough for vehicle guidance C++ Running Time Profile StepTime (s) Feature Extraction0.25s Appearance Modeling0.02s SVM update0.03s Classification0.05s Total0.35s