Learning to Perceive Coherent Objects Nimrod Dorfman, Daniel Harari, Shimon Ullman Weizmann Institute of Science WEIZMANN INSTITUTE OF SCIENCE COGSCI 2013.

Slides:



Advertisements
Similar presentations
Chapter 2: Marr’s theory of vision. Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Overview Introduce Marr’s distinction between.
Advertisements

Top-Down & Bottom-Up Segmentation
Dynamic Occlusion Analysis in Optical Flow Fields
Perception and Pattern Recognition  What types of information do we use to perceive the world correctly?  What are the major theories about how we recognize.
1 Computational Vision CSCI 363, Fall 2012 Lecture 35 Perceptual Organization II.
Low-Level Vision. Low Level Vision--outline Problem to be solved Example of one computation—lines How simple computations yield more complex information.
3D Human Body Pose Estimation from Monocular Video Moin Nabi Computer Vision Group Institute for Research in Fundamental Sciences (IPM)
Contour-Based Joint Clustering of Multiple Segmentations Daniel Glasner * 1 Shiv N. Vitaladevuni * 2 Ronen Basri 1 * equal contribution authors 1 2.
Facial feature localization Presented by: Harvest Jang Spring 2002.
GrabCut Interactive Image (and Stereo) Segmentation Carsten Rother Vladimir Kolmogorov Andrew Blake Antonio Criminisi Geoffrey Cross [based on Siggraph.
Foreground Modeling The Shape of Things that Came Nathan Jacobs Advisor: Robert Pless Computer Science Washington University in St. Louis.
Adviser : Ming-Yuan Shieh Student ID : M Student : Chung-Chieh Lien VIDEO OBJECT SEGMENTATION AND ITS SALIENT MOTION DETECTION USING ADAPTIVE BACKGROUND.
Segmentation Course web page: vision.cis.udel.edu/~cv May 2, 2003  Lecture 29.
GrabCut Interactive Image (and Stereo) Segmentation Joon Jae Lee Keimyung University Welcome. I will present Grabcut – an Interactive tool for foreground.
Gestalt Principles Gurpreet. What are the Gestalt Principles? Gestalt is a psychological term which means ‘unified whole’. It was discovered by the German.
What happens when no correspondence is possible? Highly mismatched stereo-pairs lead to ‘binocular rivalry’ TANGENT ALERT! Open question: Can rivalry and.
MESA LAB Depth ordering Guimei Zhang MESA LAB MESA (Mechatronics, Embedded Systems and Automation) LAB School of Engineering, University of California,
Robust Moving Object Detection & Categorization using self- improving classifiers Omar Javed, Saad Ali & Mubarak Shah.
A Novel 2D-to-3D Conversion System Using Edge Information IEEE Transactions on Consumer Electronics 2010 Chao-Chung Cheng Chung-Te li Liang-Gee Chen.
Object Perception. Perceptual Grouping and Gestalt Laws Law of Good continuation. This is perceived as a square and triangle, not as a combination of.
1 Learning to Detect Natural Image Boundaries David Martin, Charless Fowlkes, Jitendra Malik Computer Science Division University of California at Berkeley.
CVR05 University of California Berkeley 1 Familiar Configuration Enables Figure/Ground Assignment in Natural Scenes Xiaofeng Ren, Charless Fowlkes, Jitendra.
Types of Perceptual Processes Bottom-up - work up from sensory info. Top-down - apply knowledge and experience.
Real-time Hand Pose Recognition Using Low- Resolution Depth Images
Segmentation by Clustering Reading: Chapter 14 (skip 14.5) Data reduction - obtain a compact representation for interesting image data in terms of a set.
1 Ecological Statistics and Perceptual Organization Charless Fowlkes work with David Martin and Jitendra Malik at University of California at Berkeley.
Using Image Priors in Maximum Margin Classifiers Tali Brayer Margarita Osadchy Daniel Keren.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
Chapter 14: SEGMENTATION BY CLUSTERING 1. 2 Outline Introduction Human Vision & Gestalt Properties Applications – Background Subtraction – Shot Boundary.
Lecture 29: Face Detection Revisited CS4670 / 5670: Computer Vision Noah Snavely.
Traffic Sign Pattern Recognition Pilho Kim (ECE), Zhaohua Wang, Yichang (James) Tsai (CE)
Visual motion Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys.
“When” rather than “Whether”: Developmental Variable Selection Melissa Dominguez Robert Jacobs Department of Computer Science University of Rochester.
CS 4487/6587 Algorithms for Image Analysis
Recognizing Action at a Distance Alexei A. Efros, Alexander C. Berg, Greg Mori, Jitendra Malik Computer Science Division, UC Berkeley Presented by Pundik.
 Detecting system  Training system Human Emotions Estimation by Adaboost based on Jinhui Chen, Tetsuya Takiguchi, Yasuo Ariki ( Kobe University ) User's.
CS654: Digital Image Analysis Lecture 25: Hough Transform Slide credits: Guillermo Sapiro, Mubarak Shah, Derek Hoiem.
Efficient Visual Object Tracking with Online Nearest Neighbor Classifier Many slides adapt from Steve Gu.
Figure ground segregation in video via averaging and color distribution Introduction to Computational and Biological Vision 2013 Dror Zenati.
DETECTING AND TRACKING TRACTOR-TRAILERS USING VIEW-BASED TEMPLATES Masters Thesis Defense by Vinay Gidla Apr 19,2010.
Inference in generative models of images and video John Winn MSR Cambridge May 2004.
1 Computational Vision CSCI 363, Fall 2012 Lecture 1 Introduction to Vision Science Course webpage:
IEEE International Conference on Multimedia and Expo.
Suspicious Behavior in Outdoor Video Analysis - Challenges & Complexities Air Force Institute of Technology/ROME Air Force Research Lab Unclassified IED.
Occlusion Tracking Using Logical Models Summary. A Variational Partial Differential Equations based model is used for tracking objects under occlusions.
How we actively interpret our environment..  Perception: The process in which we understand sensory information.  Illusions are powerful examples of.
Meeting 8: Features for Object Classification Ullman et al.
Motion Segmentation at Any Speed Shrinivas J. Pundlik Department of Electrical and Computer Engineering, Clemson University, Clemson, SC.
Fast Human Detection in Crowded Scenes by Contour Integration and Local Shape Estimation Csaba Beleznai, Horst Bischof Computer Vision and Pattern Recognition,
Learning Image Statistics for Bayesian Tracking Hedvig Sidenbladh KTH, Sweden Michael Black Brown University, RI, USA
Modeling Perspective Effects in Photographic Composition Zihan Zhou, Siqiong He, Jia Li, and James Z. Wang The Pennsylvania State University.
9/30/ Cognitive Robotics1 Gestalt Perception Cognitive Robotics David S. Touretzky & Ethan Tira-Thompson Carnegie Mellon Spring 2006.
9.012 Presentation by Alex Rakhlin March 16, 2001
A Novel 2D-to-3D Conversion System Using Edge Information
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
Dynamical Statistical Shape Priors for Level Set Based Tracking
Week 8 Nicholas Baker.
“The Truth About Cats And Dogs”
Background Perception Animation - principles Animation - history
Institute of Neural Information Processing (Prof. Heiko Neumann •
Vincent DeVito Computer Systems Lab
Pattern recognition (…and object perception).
Learning complex visual concepts
Perception.
CSSE463: Image Recognition Day 30
Digital Image Processing Week III
CSSE463: Image Recognition Day 30
CSSE463: Image Recognition Day 30
Learning complex visual concepts
Presentation transcript:

Learning to Perceive Coherent Objects Nimrod Dorfman, Daniel Harari, Shimon Ullman Weizmann Institute of Science WEIZMANN INSTITUTE OF SCIENCE COGSCI 2013

Object segregation is learned 2 5 months Even basic Gestalt cues are initially missing [Schmidt et al. 1986]

Object segregation is learned 3 Adults

4 How do we learn to segregate objects?

5 We propose a computational model: -Explain the first steps of learning -Based on psychophysical findings -Computationally tested on videos

It all begins with motion 6

Grouping by common motion precedes figural goodness [Spelke review] Motion discontinuities provide an early cue for occlusion boundaries [Granrud et al. 1984] 7

Our model 8 Static segregation Local occlusion boundaries Object form Motion discontinuities Common motion Boundary General Accurate Noisy Incomplete Global Object-specific Complete Inaccurate Motion-based segregation

Intensity edges? 9 Boundary

Occlusion cues 10 Extremal edges Convexity T-junctions [Ghose & Palmer 2010] Boundary

Familiar object 11 Global

12 How does it actually work?

Moving object 13 Motion

Moving object 14 Figure Ground Unknown Motion

15 Motion BoundaryGlobal

16 Need many examples for good results (1000+) Boundary Good examples

Prediction 17 Figure or Ground? Figure or Ground? Novel object, novel background 78% success Using 100,000 training examples 78% success Using 100,000 training examples Boundary

Entire image 18 Boundary FigureBackground

Learning an object 19 Standard object recognition algorithm Learns local features and their relative locations Global

Detection 20 Global

Combining information sources 21 Combined Boundary Accurate Noisy & Incomplete Global Complete Inaccurate

More complex algorithms Default GrabCutWith segregation cue [Rother et al. 2004] 22

Summary Static segregation is learned from motion Two simple mechanisms: Boundary Motion discontinuities  Occlusion boundaries (Need a rich library, including extremal edges) Global Common motion  Object form These mechanisms work in synergy This is enough to get started, adult segregation is much more complex 23

24