Segmentation by Clustering Reading: Chapter 14 (skip 14.5) Data reduction - obtain a compact representation for interesting image data in terms of a set.

Slides:



Advertisements
Similar presentations
Unsupervised learning
Advertisements

Lecture 07 Segmentation Lecture 07 Segmentation Mata kuliah: T Computer Vision Tahun: 2010.
Computer Vision Lecture 16: Region Representation
Foreground Modeling The Shape of Things that Came Nathan Jacobs Advisor: Robert Pless Computer Science Washington University in St. Louis.
Segmentation Course web page: vision.cis.udel.edu/~cv May 2, 2003  Lecture 29.
Human-Computer Interaction Human-Computer Interaction Segmentation Hanyang University Jong-Il Park.
Introduction to Bioinformatics
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
Lecture 6 Image Segmentation
Clustering II.
EE 7730 Image Segmentation.
Clustering… in General In vector space, clusters are vectors found within  of a cluster vector, with different techniques for determining the cluster.
Region Segmentation. Find sets of pixels, such that All pixels in region i satisfy some constraint of similarity.
Image Segmentation Chapter 14, David A. Forsyth and Jean Ponce, “Computer Vision: A Modern Approach”.
Segmentation and Clustering. Segmentation: Divide image into regions of similar contentsSegmentation: Divide image into regions of similar contents Clustering:
Segmentation CSE P 576 Larry Zitnick Many slides courtesy of Steve Seitz.
Segmentation Divide the image into segments. Each segment:
Segmentation Kyongil Yoon. Segmentation Obtain a compact representation of what is helpful (in the image) No comprehensive theory of segmentation Human.
Region-Level Motion- Based Background Modeling and Subtraction Using MRFs Shih-Shinh Huang Li-Chen Fu Pei-Yung Hsiao 2007 IEEE.
Clustering Color/Intensity
Computer Vision Segmentation Marc Pollefeys COMP 256 Some slides and illustrations from D. Forsyth, T. Darrel,...
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Segmentation and Grouping
Image Segmentation Some slides: courtesy of O. Capms, Penn State, J.Ponce and D. Fortsyth, Computer Vision Book.
Segmentation and Clustering. Déjà Vu? Q: Haven’t we already seen this with snakes?Q: Haven’t we already seen this with snakes? A: no, that was about boundaries,
Texture Reading: Chapter 9 (skip 9.4) Key issue: How do we represent texture? Topics: –Texture segmentation –Texture-based matching –Texture synthesis.
A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications Lucia Maddalena and Alfredo Petrosino, Senior Member, IEEE.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Computer Vision Segmentation Marc Pollefeys COMP 256 Some slides and illustrations from D. Forsyth, T. Darrel,...
Segmentation Slides Credit: Jim Rehg, G.Tech. Christopher Rasmussen, UD John Spletzer, Lehigh Also, Slides adopted from material provided by David Forsyth.
1 Video Surveillance systems for Traffic Monitoring Simeon Indupalli.
Computer Vision - A Modern Approach Set: Segmentation Slides by D.A. Forsyth Segmentation and Grouping Motivation: not information is evidence Obtain a.
1 Activity and Motion Detection in Videos Longin Jan Latecki and Roland Miezianko, Temple University Dragoljub Pokrajac, Delaware State University Dover,
Image Segmentation by Clustering using Moments by, Dhiraj Sakumalla.
Computer Vision James Hays, Brown
CSE 185 Introduction to Computer Vision Pattern Recognition.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
1 Mean shift and feature selection ECE 738 course project Zhaozheng Yin Spring 2005 Note: Figures and ideas are copyrighted by original authors.
Segmentation by Clustering
Computer Vision Lecture 5. Clustering: Why and How.
Chapter 14: SEGMENTATION BY CLUSTERING 1. 2 Outline Introduction Human Vision & Gestalt Properties Applications – Background Subtraction – Shot Boundary.
CSE 185 Introduction to Computer Vision Pattern Recognition 2.
Clustering Supervised vs. Unsupervised Learning Examples of clustering in Web IR Characteristics of clustering Clustering algorithms Cluster Labeling 1.
Basic Machine Learning: Clustering CS 315 – Web Search and Data Mining 1.
Chapter 10 Image Segmentation.
December 9, 2014Computer Vision Lecture 23: Motion Analysis 1 Now we will talk about… Motion Analysis.
CS654: Digital Image Analysis
Expectation-Maximization (EM) Case Studies
Segmentation & Grouping Tuesday, Sept 23 Kristen Grauman UT-Austin.
Image Segmentation Shengnan Wang
Basic Machine Learning: Clustering CS 315 – Web Search and Data Mining 1.
Visual Tracking by Cluster Analysis Arthur Pece Department of Computer Science University of Copenhagen
1 Overview representing region in 2 ways in terms of its external characteristics (its boundary)  focus on shape characteristics in terms of its internal.
Mestrado em Ciência de Computadores Mestrado Integrado em Engenharia de Redes e Sistemas Informáticos VC 15/16 – TP10 Advanced Segmentation Miguel Tavares.
Thresholding Foundation:. Thresholding In A: light objects in dark background To extract the objects: –Select a T that separates the objects from the.
Representing Moving Images with Layers J. Y. Wang and E. H. Adelson MIT Media Lab.
Lecture 30: Segmentation CS4670 / 5670: Computer Vision Noah Snavely From Sandlot ScienceSandlot Science.
Rodney Nielsen Many of these slides were adapted from: I. H. Witten, E. Frank and M. A. Hall Data Science Algorithms: The Basic Methods Clustering WFH:
K-Means Segmentation.
Miguel Tavares Coimbra
Lecture: Clustering and Segmentation
Range Imaging Through Triangulation
Segmentation and Grouping
Perceiving and Recognizing Objects
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
Segmentation Lecture 04 Computer Vision
Emel Doğrusöz Esra Ataer Muhammet Baştan Tolga Can
EM Algorithm and its Applications
Learning complex visual concepts
Presentation transcript:

Segmentation by Clustering Reading: Chapter 14 (skip 14.5) Data reduction - obtain a compact representation for interesting image data in terms of a set of components Find components that belong together (form clusters) Frame differencing - Background Subtraction and Shot Detection Slide credits for this chapter: David Forsyth, Christopher Rasmussen

Segmentation by Clustering

From: Object Recognition as Machine Translation, Duygulu, Barnard, de Freitas, Forsyth, ECCV02

General ideas Tokens –whatever we need to group (pixels, points, surface elements, etc., etc.) Top down segmentation –tokens belong together because they lie on the same object Bottom up segmentation –tokens belong together because they are locally coherent These two are not mutually exclusive

Why do these tokens belong together?

Top-down segmentation

Basic ideas of grouping in human vision Figure-ground discrimination –grouping can be seen in terms of allocating some elements to a figure, some to ground –Can be based on local bottom-up cues or high level recognition Gestalt properties –Psychologists have studies a series of factors that affect whether elements should be grouped together Gestalt properties

Elevator buttons in Berkeley Computer Science Building

“Illusory Contours”

Segmentation as clustering Cluster together (pixels, tokens, etc.) that belong together Agglomerative clustering –merge closest clusters –repeat Divisive clustering –split cluster along best boundary –repeat Point-Cluster distance –single-link clustering –complete-link clustering –group-average clustering Dendrograms –yield a picture of output as clustering process continues

Dendrogram from Agglomerative Clustering Instead of a fixed number of clusters, the dendrogram represents a hierarchy of clusters

Feature Space Every token is identified by a set of salient visual characteristics called features. For example: –Position –Color –Texture –Motion vector –Size, orientation (if token is larger than a pixel) The choice of features and how they are quantified implies a feature space in which each token is represented by a point Token similarity is thus measured by distance between points (“feature vectors”) in feature space Slide credit: Christopher Rasmussen

K -Means Clustering Initialization: Given K categories, N points in feature space. Pick K points randomly; these are initial cluster centers (means) m 1, …, m K. Repeat the following: 1.Assign each of the N points, x j, to clusters by nearest m i (make sure no cluster is empty) 2.Recompute mean m i of each cluster from its member points 3.If no mean has changed, stop Effectively carries out gradient descent to minimize: Slide credit: Christopher Rasmussen

K-Means Minimizing squared distances to the center implies that the center is at the mean: Derivative of error is zero at the minimum

Example: 3-means Clustering from Duda et al. Convergence in 3 steps

K-means clustering using intensity alone and color alone Image Clusters on intensityClusters on color

Technique: Background Subtraction If we know what the background looks like, it is easy to segment out new regions Applications –Person in an office –Tracking cars on a road –Surveillance –Video game interfaces Approach: –use a moving average to estimate background image –subtract from current frame –large absolute values are interesting pixels

Background Subtraction The problem: Segment moving foreground objects from static background Current image from C. Stauffer and W. Grimson Background imageForeground pixels courtesy of C. Wren Pfinder Slide credit: Christopher Rasmussen

Algorithm video sequencebackground frame differencethresholded frame diff for t = 1:N Update background model Compute frame difference Threshold frame difference Noise removal end Objects are detected where is non-zero

Background Modeling Offline average –Pixel-wise mean values are computed during training phase (also called Mean and Threshold) Adjacent Frame Difference –Each image is subtracted from previous image in sequence Moving average –Background model is linear weighted sum of previous frames

Results & Problems for Simple Approaches

Background Subtraction: Issues Noise models –Unimodal: Pixel values vary over time even for static scenes –Multimodal: Features in background can “oscillate”, requiring models which can represent disjoint sets of pixel values (e.g., waving trees against sky) Gross illumination changes –Continuous: Gradual illumination changes alter the appearance of the background (e.g., time of day) –Discontinuous: Sudden changes in illumination and other scene parameters alter the appearance of the background (e.g., flipping a light switch Bootstrapping –Is a training phase with “no foreground” necessary, or can the system learn what’s static vs. dynamic online? Slide credit: Christopher Rasmussen

Application: Sony Eyetoy For most games, this apparently uses simple frame differencing to detect regions of motion However, some applications use background subtraction to cut out an image of the user to insert in video Over 4 million units sold

Technique: Shot Boundary Detection Find the shots in a sequence of video –shot boundaries usually result in big differences between succeeding frames Strategy –compute interframe distances –declare a boundary where these are big Distance measures –frame differences –histogram differences –block comparisons –edge differences Applications –representation for movies, or video sequences obtain “most representative” frame –supports search