Lecture 18: Context and segmentation CS6670: Computer Vision Noah Snavely.

Slides:



Advertisements
Similar presentations
Fitting a transformation: feature-based alignment Wed, Feb 23 Prof. Kristen Grauman UT-Austin.
Advertisements

Lecture 24: Segmentation CS6670: Computer Vision Noah Snavely From Sandlot ScienceSandlot Science.
I Images as graphs Fully-connected graph – node for every pixel – link between every pair of pixels, p,q – similarity w ij for each link j w ij c Source:
Internet Vision - Lecture 3 Tamara Berg Sept 10. New Lecture Time Mondays 10:00am-12:30pm in 2311 Monday (9/15) we will have a general Computer Vision.
Graph-Based Image Segmentation
Stephen J. Guy 1. Photomontage Photomontage GrabCut – Interactive Foreground Extraction 1.
Fitting a transformation: feature-based alignment
Active Contours (SNAKES) Back to boundary detection –This time using perceptual grouping. This is non-parametric –We’re not looking for a contour of a.
Lecture 6 Image Segmentation
Event prediction CS 590v. Applications Video search Surveillance – Detecting suspicious activities – Illegally parked cars – Abandoned bags Intelligent.
Segmentation. Terminology Segmentation, grouping, perceptual organization: gathering features that belong together Fitting: associating a model with observed.
A Study of Approaches for Object Recognition
Segmentation CSE P 576 Larry Zitnick Many slides courtesy of Steve Seitz.
Problem Sets Problem Set 3 –Distributed Tuesday, 3/18. –Due Thursday, 4/3 Problem Set 4 –Distributed Tuesday, 4/1 –Due Tuesday, 4/15. Probably a total.
Announcements Project 2 more signup slots questions Picture taking at end of class.
Today: Image Segmentation Image Segmentation Techniques Snakes Scissors Graph Cuts Mean Shift Wednesday (2/28) Texture analysis and synthesis Multiple.
From Pixels to “Blobs” : Rendering and Image Processing Alexei Efros.
Snakes Goes from edges to boundaries. Edge is strong change in intensity. Boundary is boundary of an object. –Smooth (more or less) –Closed. –…
Image Segmentation Today’s Readings Intelligent Scissors, Mortensen et. al, SIGGRAPH 1995Intelligent Scissors From Sandlot ScienceSandlot Science.
Announcements Kevin Matzen office hours – Tuesday 4-5pm, Thursday 2-3pm, Upson 317 TA: Yin Lou Course lab: Upson 317 – Card access will be setup soon Course.
Lecture 17: Parts-based models and context CS6670: Computer Vision Noah Snavely.
Visual motion Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys.
Image Segmentation A Graph Theoretic Approach. Factors for Visual Grouping Similarity (gray level difference) Similarity (gray level difference) Proximity.
Image Segmentation Today’s Readings Forsyth & Ponce, Chapter 14
Announcements Mailing list: –you should have received messages Office hours onlineonline –start next week.
Perceptual Organization: Segmentation and Optical Flow.
Segmentation and Clustering Today’s Readings Forsyth & Ponce, Chapter 7 (plus lots of optional references in the slides) From Sandlot ScienceSandlot Science.
Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.
Announcements vote for Project 3 artifacts Project 4 (due next Wed night) Questions? Late day policy: everything must be turned in by next Friday.
Announcements Project 3 questions Photos after class.
Computer Vision - A Modern Approach Set: Segmentation Slides by D.A. Forsyth Segmentation and Grouping Motivation: not information is evidence Obtain a.
Graph-based Segmentation
Performance Evaluation of Grouping Algorithms Vida Movahedi Elder Lab - Centre for Vision Research York University Spring 2009.
MRFs and Segmentation with Graph Cuts Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 02/24/10.
Recovering Surface Layout from a Single Image D. Hoiem, A.A. Efros, M. Hebert Robotics Institute, CMU Presenter: Derek Hoiem CS 598, Spring 2009 Jan 29,
Lecture 2: Edge detection CS4670: Computer Vision Noah Snavely From Sandlot ScienceSandlot Science.
Lecture 29: Face Detection Revisited CS4670 / 5670: Computer Vision Noah Snavely.
Lecture 4: Image Resampling and Reconstruction CS4670: Computer Vision Kavita Bala.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Visual motion Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys.
Why is computer vision difficult?
Putting Context into Vision Derek Hoiem September 15, 2004.
Brent M. Dingle, Ph.D Game Design and Development Program Mathematics, Statistics and Computer Science University of Wisconsin - Stout Edge Detection:
CS654: Digital Image Analysis Lecture 28: Advanced topics in Image Segmentation Image courtesy: IEEE, IJCV.
A global approach Finding correspondence between a pair of epipolar lines for all pixels simultaneously Local method: no guarantee we will have one to.
Lecture 30: Segmentation CS4670 / 5670: Computer Vision Noah Snavely From Sandlot ScienceSandlot Science.
Cutting Images: Graphs and Boundary Finding Computational Photography Derek Hoiem, University of Illinois 09/20/12 “The Double Secret”, Magritte.
Image segmentation.
Course Introduction to Medical Imaging Segmentation 1 – Mean Shift and Graph-Cuts Guy Gilboa.
SHAHAB iCV Research Group.
Image Segmentation Today’s Readings Szesliski Chapter 5
Announcements CS accounts Project 1 is out today
Lecture: Clustering and Segmentation
CS4670/5670: Image Scissors Noah Snavely Today’s Readings
Fitting a transformation: Feature-based alignment May 2nd, 2017
Lecture 1: Images and image filtering
Announcements Reader is in the BOOK STORE (not Comm Bldg.)
Brief Review of Recognition + Context
Lecture 31: Graph-Based Image Segmentation
Announcements Photos right now Project 3 questions
Digital Image Processing
Seam Carving Project 1a due at midnight tonight.
Segmentation (continued)
Announcements Project 4 questions Evaluations.
Announcements Project 1 is out today
Announcements Project 4 out today (due Wed March 10)
Lecture 1: Images and image filtering
Announcements Project 1 is out today help session at the end of class.
“Traditional” image segmentation
Presentation transcript:

Lecture 18: Context and segmentation CS6670: Computer Vision Noah Snavely

Announcements Project 3 out soon Final project

When is context helpful? Distance Information Local features Contextual features Slide by Antonio Torralba

Is it just for small / blurry things? Slide by Antonio Torralba

Is it just for small / blurry things? Slide by Antonio Torralba

Is it just for small / blurry things? Slide by Antonio Torralba

Context is hard to fight! Thanks to Paul Viola for showing me these

more “Look-alikes”

1 Don’t even need to see the object Slide by Antonio Torralba

Don’t even need to see the object Chance ~ 1/30000 Slide by Antonio Torralba

The influence of an object extends beyond its physical boundaries But object can say a lot about the scene

Object priming Torralba, Sinha, Oliva, VSS 2001 Increasing contextual information

Object priming Torralba, Sinha, Oliva, VSS 2001

Why context is important? Typical answer: to “guess” small / blurry objects based on a prior most current vision systems

So, you think it’s so simple? What is this? slide by Takeo Kanade

Why is this a car?

…because it’s on the road! Why is this road?

Why is this a road?

Same problem in real scenes

Why context is important? Typical answer: to “guess” small / blurry objects based on a prior Most current vision systems Deeper answer: to make sense of the visual world much work yet to be done!

Why context is important? To resolve ambiguity Even high-res objects can be ambiguous –e.g. there are more people than faces in the world! There are 30,000+ objects, but only a few dozen can happen within a given scene To notice “unusual” things Prevents mental overload To infer function of unknown object

Sources of Context from Divvala et al, CVPR 2009

Semantic Context Berg et al Fei-Fei and Perona 2005 Gupta & Davis 2008 Rabinovich et al. 2007

Local Pixel Context Wolf & Bileschi 2006Dalal & Triggs 2005

Scene Gist Context Global (low-dimensional) scene statistics

Geometric Context Geometric Context [Hoiem et al. ’2005] Ground Sky FrontalLeftRight

Photogrammetric Context Hoiem et al 2006 Lalonde et al. 2008

Illumination and Weather Context Illumination context (Lalonde et al)

Geographic Context Hays & Efros 2008

Temporal Context Gallagher et al 2008 Liu et al. 2008

Cultural Context Photographer Bias Society Bias

Flickr Paris

“uniformly sampled” Paris

…or Notre Dame

Why Photographers are Biased? People want their pictures to be recognizable and/or interesting vs.

Why Photographers are Biased? People follow photographic conventions vs. Simon & Seitz 2008

“100 Special Moments” by Jason Salavon

Empirical Evaluation of Context Divvala, Hoiem, Hays, Efros, Hebert, CVPR 2009

Change in Accuracies TypeChange Small97% Large15% Occluded79% Non-Occluded-5% Difficult57% Total47% Percentage change in average precisions before and after including context information Context helps in case of impoverished appearance

Detecting difficult objects Office Maybe there is a mouse Start recognizing the scene Torralba, Murphy, Freeman. NIPS 2004.

Detecting difficult objects Detect first simple objects (reliable detectors) that provide strong contextual constraints to the target (screen -> keyboard -> mouse) Torralba, Murphy, Freeman. NIPS 2004.

Detecting difficult objects (object parts) Detect first simple objects (reliable detectors) that provide strong contextual constraints to the target (screen -> keyboard -> mouse) Torralba, Murphy, Freeman. NIPS 2004.

…on the other hand

Who needs context anyway? We can recognize objects even out of context Banksy Slide by Antonio Torralba

Lecture 18a: Segmentation CS6670: Computer Vision Noah Snavely From Sandlot ScienceSandlot Science

From images to objects What defines an object? Subjective problem, but has been well-studied Gestalt Laws seek to formalize this –proximity, similarity, continuation, closure, common fate –see notes by Steve Joordens, U. Torontonotes

Extracting objects How could we do this automatically (or at least semi-automatically)?

The Gestalt school Grouping is key to visual perception Elements in a collection can have properties that result from relationships “The whole is greater than the sum of its parts” subjective contours occlusion familiar configuration Slide from S.Lazebnik

The ultimate Gestalt? Slide from S.Lazebnik

Gestalt factors These factors make intuitive sense, but are very difficult to translate into algorithms Slide from S.Lazebnik

Semi-automatic binary segmentation

Intelligent Scissors (demo)

Intelligent Scissors [Mortensen 95] Approach answers a basic question – Q: how to find a path from seed to mouse that follows object boundary as closely as possible?

GrabCut Grabcut [Rother et al., SIGGRAPH 2004]Rother et al., SIGGRAPH 2004

Is user-input required? Our visual system is proof that automatic methods are possible classical image segmentation methods are automatic Argument for user-directed methods? only user knows desired scale/object of interest

q Automatic graph cut [Shi & Malik] Fully-connected graph node for every pixel link between every pair of pixels, p,q cost c pq for each link –c pq measures similarity »similarity is inversely proportional to difference in color and position p C pq c

Segmentation by Graph Cuts Break Graph into Segments Delete links that cross between segments Easiest to break links that have low cost (similarity) –similar pixels should be in the same segments –dissimilar pixels should be in different segments w ABC

Cuts in a graph Link Cut set of links whose removal makes a graph disconnected cost of a cut: A B Find minimum cut gives you a segmentation

But min cut is not always the best cut...

Cuts in a graph A B Normalized Cut a cut penalizes large segments fix by normalizing for size of segments volume(A) = sum of costs of all edges that touch A

Interpretation as a Dynamical System Treat the links as springs and shake the system elasticity proportional to cost vibration “modes” correspond to segments –can compute these by solving an eigenvector problem –

Interpretation as a Dynamical System Treat the links as springs and shake the system elasticity proportional to cost vibration “modes” correspond to segments –can compute these by solving an eigenvector problem –

Color Image Segmentation