Training Regimes Motivation  Allow state-of-the-art subcomponents  With “Black-box” functionality  This idea also occurs in other application areas.

Slides:



Advertisements
Similar presentations
Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)
Advertisements

Automatic Photo Pop-up Derek Hoiem Alexei A.Efros Martial Hebert Carnegie Mellon University.
The Layout Consistent Random Field for detecting and segmenting occluded objects CVPR, June 2006 John Winn Jamie Shotton.
Learning Specific-Class Segmentation from Diverse Data M. Pawan Kumar, Haitherm Turki, Dan Preston and Daphne Koller at ICCV 2011 VGG reading group, 29.
Combining Detectors for Human Hand Detection Antonio Hernández, Petia Radeva and Sergio Escalera Computer Vision Center, Universitat Autònoma de Barcelona,
EE462 MLCV Lecture 5-6 Object Detection – Boosting Tae-Kyun Kim.
From Interactive to Semantic Image Segmentation Varun Gulshan Supervisors: Prof. Andrew Blake Prof. Andrew Zisserman 20 Jan 2012.
A generic model to compose vision modules for holistic scene understanding Adarsh Kowdle *, Congcong Li *, Ashutosh Saxena, and Tsuhan Chen Cornell University,
Ľubor Ladický1 Phil Torr2 Andrew Zisserman1
Joint Optimisation for Object Class Segmentation and Dense Stereo Reconstruction Ľubor Ladický, Paul Sturgess, Christopher Russell, Sunando Sengupta, Yalin.
Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Wrap Up. We talked about Filters Edges Corners Interest Points Descriptors Image Stitching Stereo SFM.
Analyzing Semantic Segmentation Using Hybrid Human-Machine CRFs Roozbeh Mottaghi 1, Sanja Fidler 2, Jian Yao 2, Raquel Urtasun 2, Devi Parikh 3 1 UCLA.
Detecting Pedestrians by Learning Shapelet Features
Retrieving Actions in Group Contexts Tian Lan, Yang Wang, Greg Mori, Stephen Robinovitch Simon Fraser University Sept. 11, 2010.
Recognition: A machine learning approach
Quantifying and Transferring Contextual Information in Object Detection Professor: S. J. Wang Student : Y. S. Wang 1.
1 Integrating Vision Models for Holistic Scene Understanding Geremy Heitz CS223B March 4 th, 2009.
LARGE-SCALE NONPARAMETRIC IMAGE PARSING Joseph Tighe and Svetlana Lazebnik University of North Carolina at Chapel Hill CVPR 2011Workshop on Large-Scale.
Learning Spatial Context: Using stuff to find things Geremy Heitz Daphne Koller Stanford University October 13, 2008 ECCV 2008.
Self-Supervised Segmentation of River Scenes Supreeth Achar *, Bharath Sankaran ‡, Stephen Nuske *, Sebastian Scherer *, Sanjiv Singh * * ‡
Scene Understanding through Transfer Learning Stephen Gould Ben Packer Geremy Heitz Daphne Koller DARPA Update September 11, 2008.
An opposition to Window- Scanning Approaches in Computer Vision Presented by Tomasz Malisiewicz March 6, 2006 Advanced The Robotics Institute.
Graph Cut based Inference with Co-occurrence Statistics Ľubor Ladický, Chris Russell, Pushmeet Kohli, Philip Torr.
Object Class Recognition using Images of Abstract Regions Yi Li, Jeff A. Bilmes, and Linda G. Shapiro Department of Computer Science and Engineering Department.
1 How do ideas from perceptual organization relate to natural scenes?
1 Accurate Object Detection with Joint Classification- Regression Random Forests Presenter ByungIn Yoo CS688/WST665.
CONTEXT. Learned Satellite Clusters Results - Satellite Prior: Detector Only Posterior: TAS Model Region Labels.
1 Outline Overview Integrating Vision Models CCM: Cascaded Classification Models Learning Spatial Context TAS: Things and Stuff Descriptive Querying of.
Learning Spatial Context: Can stuff help us find things? Geremy Heitz Daphne Koller April 14, 2008 DAGS Stuff (n): Material defined by a homogeneous or.
What, Where & How Many? Combining Object Detectors and CRFs
Multi-modal robotic perception Stephen Gould, Paul Baumstarck, Morgan Quigley, Andrew Ng, Daphne Koller PAIL, January 2008.
3D Scene Models Object recognition and scene understanding Krista Ehinger.
Sung Ju Hwang and Kristen Grauman University of Texas at Austin.
Crash Course on Machine Learning
Object Detection Sliding Window Based Approach Context Helps
Detecting Pedestrians Using Patterns of Motion and Appearance Paul Viola Microsoft Research Irfan Ullah Dept. of Info. and Comm. Engr. Myongji University.
Face detection Slides adapted Grauman & Liebe’s tutorial
Learning Collections of Parts for Object Recognition and Transfer Learning University of Illinois at Urbana- Champaign.
Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman University of Texas at Austin Jingnan.
INTRODUCTION Heesoo Myeong and Kyoung Mu Lee Department of ECE, ASRI, Seoul National University, Seoul, Korea Tensor-based High-order.
Recognition II Ali Farhadi. We have talked about Nearest Neighbor Naïve Bayes Logistic Regression Boosting.
Putting Context into Vision Derek Hoiem September 15, 2004.
Object detection, deep learning, and R-CNNs
Learning from Big Data Lecture 5
Extracting Simple Verb Frames from Images Toward Holistic Scene Understanding Prof. Daphne Koller Research Group Stanford University Geremy Heitz DARPA.
Object-Graphs for Context-Aware Category Discovery Yong Jae Lee and Kristen Grauman University of Texas at Austin 1.
Context Neelima Chavali ECE /21/2013. Roadmap Introduction Paper1 – Motivation – Problem statement – Approach – Experiments & Results Paper 2 Experiments.
Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.
1.Learn appearance based models for concepts 2.Compute posterior probabilities or Semantic Multinomial (SMN) under appearance models. -But, suffers from.
M ACHINE L EARNING : F OUNDATIONS C OURSE TAU – 2012A P ROF. Y ISHAY M ANSOUR o TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class.
Joint Object-Material Category Segmentation from Audio-Visual Cues
Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,
Learning Hierarchical Features for Scene Labeling Cle’ment Farabet, Camille Couprie, Laurent Najman, and Yann LeCun by Dong Nie.
Machine Vision ENT 273 Lecture 4 Hema C.R.
Outline Scene Understanding Motivation for Context
2. Skin - color filtering.
Learning a Region-based Scene Segmentation Model
Nonparametric Semantic Segmentation
Object detection as supervised classification
Efficiently Selecting Regions for Scene Understanding
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
HOGgles Visualizing Object Detection Features
Identifying Human-Object Interaction in Range and Video Data
Object-Graphs for Context-Aware Category Discovery
Brief Review of Recognition + Context
Cascaded Classification Models
Adarsh Kowdle*, Congcong Li*, Ashutosh Saxena, and Tsuhan Chen
Semantic Segmentation
Presentation transcript:

Training Regimes Motivation  Allow state-of-the-art subcomponents  With “Black-box” functionality  This idea also occurs in other application areas Audio Signal ProcessingText Understanding STANFORD STANFORD Cascaded Classification Models: Combining Models for Holistic Scene Understanding Geremy Heitz, Stephen Gould, Ashutosh Saxena, and Daphne Koller Cascaded Classification Models  Independent models classify using only independent features  Further levels of “context-aware” classifiers include the original features, and features produced from the previous level’s outputs Object Detection Results Multi-class Segmentation 3D Reconstruction ROAD BUILDING CAR BUS PEOPLE BUILDING ROAD STREET SCENE Currently, the community tends to solve these problems in isolation We want a simple, flexible method for solving them jointly, allowing for the sharing of contextual cues What’s happening in this picture? To solve the scene understanding problem… …we need to know: What are the objects? What are the regions? What are the surfaces? What is the layout? What is the category? HOG DETECTOR Dalal & Triggs Base Detector: D(W) = HOG Detector Score Goal: Put a box around all the cars, people, motorcycles, boats, sheep, and cows Context Aware Detector: Goal: Label each pixel as belonging to one of 7 classes. Base Model: CRF with boosted singleton classifiers + pairwise smoothing Context Aware Model: Relative location map features Grass more likely Independent Models use only their own features Ground Models use features from the other tasks’ groundtruth labels CCM Models use features from the other tasks’ MAP outputs INDEPENDENT CCM CARPEDESTRIANMOTORBIKEBOAT CATEGORIES REGION LABELS ? Scene Categorization  From Scene Category  MAP category, marginals  From Region Labels  How much of each label is in a window adjacent to W  From Depths  Mean, variance of depths, estimate of “true” object size  Final Classifier P(Y) = Logistic(Φ(W)) Scene Type: Urban scene % of “building” above WVariance of depths in W Goal: Label each pixel as belonging to one of 7 classes. Base Model: CRF with boosted singleton classifiers + pairwise smoothing BLACK BOX GRASS SKY Find d* Reoptimize depths with new constraints: d CCM = argmin α||d - d*|| + β||d - d CONTEXT || Context Aware Model: Relative location map features Goal: Label each pixel as belonging to one of 7 classes. Base Model: Logistic over simple features RURAL Context Aware Model: Additional Features  From Detections  Number of each object type present  From Segmentation  Fraction of image of each region type DetectionCarPersonBikeBoatSheepCowDepth INDEP m 2-CCM m RegionsTreeRoadGrassWaterSkyBuildingFG INDEP CCM Boats & Water INDEPPred. RoadPred. Water True Road True Water CCMPred. RoadPred. Water True Road True Water DS1: 422 Images, 5 fold testing Categorization, Detection, Region Labeling All images have full groundtruth for all tasks DS1: 1745 Images Detection, Region Labeling, Depth Reconstruction Disjoint groundtruth = Water = Sky = Building = Tree = Road = Grass = Foreground = Car = Person = Motorcycle = Boat = Sheep = Cow INDEPENDENT CCM Independent Objects Independent Regions CCM Objects Independent Objects Independent Regions CCM Objects Independent Objects Independent Regions CCM Regions Independent Objects Independent Regions CCM Regions