Learning object affordances based on structural object representation Kadir F. Uyanik Asil Kaan Bozcuoglu EE 583 Pattern Recognition Jan 4, 2011.

Slides:



Advertisements
Similar presentations
Chapter 2: Marr’s theory of vision. Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Overview Introduce Marr’s distinction between.
Advertisements

November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
DREAM PLAN IDEA IMPLEMENTATION Introduction to Image Processing Dr. Kourosh Kiani
Lecture 18 – Recognition 1. Visual Recognition 1)Contours 2)Objects 3)Faces 4)Scenes.
Lecture 07 Segmentation Lecture 07 Segmentation Mata kuliah: T Computer Vision Tahun: 2010.
Computer Vision Lecture 16: Region Representation
Instructor: Mircea Nicolescu Lecture 15 CS 485 / 685 Computer Vision.
September 10, 2013Computer Vision Lecture 3: Binary Image Processing 1Thresholding Here, the right image is created from the left image by thresholding,
Train DEPOT PROBLEM USING PERMUTATION GRAPHS
Last week... why object recognition is difficult, the template model the feature recognition model, word recognition as a case study Today... Recognition.
Abstraction + Geometry = Realization (Views of Illuminated Surfaces) A (potential) application of singularity theory to vision BIRS, Banff, 30 August 2011.
Localization of Piled Boxes by Means of the Hough Transform Dimitrios Katsoulas Institute for Pattern Recognition and Image Processing University of Freiburg.
1 Visual Recognition Lecturers: Ehud Rivlin, Michael Rudzsky, Ilan Shimshoni Tutor: Michael Rudzsky.
Visual Cognition II Object Perception. Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural.
Recognition by Parts Visual Recognition Lecture 12 “The whole is equal to the sum of its parts” Euclid.
A Study of Approaches for Object Recognition
Processing Digital Images. Filtering Analysis –Recognition Transmission.
Local Symmetry - 2D Ribbons, SATs and Smoothed Local Symmetries Asaf Yaffe Image Processing Seminar, Haifa University, March 2005.
Object Perception. Perceptual Grouping and Gestalt Laws Law of Good continuation. This is perceived as a square and triangle, not as a combination of.
Visual Cognition II Object Perception. Theories of Object Recognition Template matching models Feature matching Models Recognition-by-components Configural.
Fuzzy Medical Image Segmentation
E.G.M. PetrakisTexture1 Repeative patterns of local variations of intensity on a surface –texture pattern: texel Texels: similar shape, intensity distribution.
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Lecture 4: Edge Based Vision Dr Carole Twining Thursday 18th March 2:00pm – 2:50pm.
Michael Arbib & Laurent Itti: CS664 – USC, spring Lecture 6: Object Recognition 1 CS664, USC, Spring 2002 Lecture 6. Object Recognition Reading Assignments:
00/4/103DVIP-011 Part Three: Descriptions of 3-D Objects and Scenes.
November 29, 2004AI: Chapter 24: Perception1 Artificial Intelligence Chapter 24: Perception Michael Scherger Department of Computer Science Kent State.
Hubert CARDOTJY- RAMELRashid-Jalal QURESHI Université François Rabelais de Tours, Laboratoire d'Informatique 64, Avenue Jean Portalis, TOURS – France.
Circular Augmented Rotational Trajectory (CART) Shape Recognition & Curvature Estimation Presentation for 3IA 2007 Russel Ahmed Apu & Dr. Marina Gavrilova.
Xiaojiang Ling CSE 668, Animate Vision Principles for 3D Image Sequences CSE Department, SUNY Buffalo
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
嵌入式視覺 Pattern Recognition for Embedded Vision Template matching Statistical / Structural Pattern Recognition Neural networks.
Shape Spaces Kathryn Leonard 22 January 2005 MSRI Intro to Image Analysis.
Gwangju Institute of Science and Technology Intelligent Design and Graphics Laboratory Multi-scale tensor voting for feature extraction from unstructured.
Perception Introduction Pattern Recognition Image Formation
Digital Image Processing In The Name Of God Digital Image Processing Lecture8: Image Segmentation M. Ghelich Oghli By: M. Ghelich Oghli
HOUGH TRANSFORM Presentation by Sumit Tandon
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
Shape Analysis and Retrieval Structural Shape Descriptors Notes courtesy of Funk et al., SIGGRAPH 2004.
Learning Decompositional Shape Models from Examples Alex Levinshtein Cristian Sminchisescu Sven Dickinson Sven Dickinson University of Toronto.
Chapter 10 Image Segmentation.
November 13, 2014Computer Vision Lecture 17: Object Recognition I 1 Today we will move on to… Object Recognition.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
Texture Detection & Texture related clustering C601 Project Jing Qin Fall 2003.
Korea University Dept.of Industrial System & Information Engineering User Interface Lab Chapter 3 _ Object Recognition + 이병용.
CS654: Digital Image Analysis Lecture 25: Hough Transform Slide credits: Guillermo Sapiro, Mubarak Shah, Derek Hoiem.
1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Fundamentals of Sensation and Perception RECOGNIZING VISUAL OBJECTS ERIK CHEVRIER NOVEMBER 23, 2015.
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
Low level Computer Vision 1. Thresholding 2. Convolution 3. Morphological Operations 4. Connected Component Extraction 5. Feature Extraction 1.
By Sridhar Godavarthy. Co-Author: Joshua Candamo Ph.D Advisors: Dr. Kasturi Rangachar Dr. Dmitry Goldgof.
Image features and properties. Image content representation The simplest representation of an image pattern is to list image pixels, one after the other.
Instructor: Mircea Nicolescu Lecture 10 CS 485 / 685 Computer Vision.
Real-Time Hierarchical Scene Segmentation and Classification Andre Uckermann, Christof Elbrechter, Robert Haschke and Helge Ritter John Grossmann.
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
Dr Nazir A. Zafar Advanced Algorithms Analysis and Design Advanced Algorithms Analysis and Design By Dr. Nazir Ahmad Zafar.
Visual Recognition Lecture 12
Course : T Computer Vision
DIGITAL SIGNAL PROCESSING
Recognizing Deformable Shapes
Perceiving and Recognizing Objects
3D Models and Matching particular matching techniques
CSc4730/6730 Scientific Visualization
Brief Review of Recognition + Context
Object Recognition Today we will move on to… April 12, 2018
Grouping/Segmentation
Descriptions of 3-D Objects and Scenes
Presentation transcript:

Learning object affordances based on structural object representation Kadir F. Uyanik Asil Kaan Bozcuoglu EE 583 Pattern Recognition Jan 4, 2011

Content Goal Inspirations Potential Difficulties Problem Definition Proposed Method References Appendix

Goal

Inspirations Ecological Psychologist James Jerome Gibson Cognitive Psychologist Irving Biederman

Inspirations: Affordances [1] [1] J. J. Gibson (1977), The Theory of Affordances. In Perceiving, Acting, and Knowing, Eds. Robert Shaw and John Bransford, ISBN [2] E. Sahin, M. Cakmak, M.R.Dogar, E. Ugur, G. Ucoluk, To Afford or Not to Afford: A New Formalization of Affordances Toward Affordance-Based Robot Control, Adaptive Behavior, 2007 pp: “… an affordance is neither an objective property nor a subjective property; or both if you like. An affordance cuts across the dichotomy of subjective- objective and helps us to understand its inadequacy. It is equally a fact of the environment and a fact of behavior. It is both physical and psychical, yet neither. An affordance points both ways, to the environment and to the observer.”

Inspirations: Affordances [1] [1] J. J. Gibson (1977), The Theory of Affordances. In Perceiving, Acting, and Knowing, Eds. Robert Shaw and John Bransford, ISBN [2] E. Sahin, M. Cakmak, M.R.Dogar, E. Ugur, G. Ucoluk, To Afford or Not to Afford: A New Formalization of Affordances Toward Affordance-Based Robot Control, Adaptive Behavior, 2007 pp: “… an affordance is neither an objective property nor a subjective property; or both if you like. An affordance cuts across the dichotomy of subjective- objective and helps us to understand its inadequacy. It is equally a fact of the environment and a fact of behavior. It is both physical and psychical, yet neither. An affordance points both ways, to the environment and to the observer.”

Inspirations: Affordances [1] [1] J. J. Gibson (1977), The Theory of Affordances. In Perceiving, Acting, and Knowing, Eds. Robert Shaw and John Bransford, ISBN environment agent (, ) Revised Definition: An affordance is an acquired relation between a tuple of an agent such that the application of the on the generates a certain [2]. [2] E. Sahin, M. Cakmak, M.R.Dogar, E. Ugur, G. Ucoluk, To Afford or Not to Afford: A New Formalization of Affordances Toward Affordance-Based Robot Control, Adaptive Behavior, 2007 pp: “… an affordance is neither an objective property nor a subjective property; or both if you like. An affordance cuts across the dichotomy of subjective- objective and helps us to understand its inadequacy. It is equally a fact of the environment and a fact of behavior. It is both physical and psychical, yet neither. An affordance points both ways, to the environment and to the observer.”

[3] Recognition-by-components: A theory of Human Image Understanding, Psychological Review, Vol. 94 (1987), pp “There are small number of geometric components that constitute the primitive elements of the object recognition system (like letters to form words)” Inspirations: Human Image Understanding [3]

[3] Recognition-by-components: A theory of Human Image Understanding, Psychological Review, Vol. 94 (1987), pp “There are small number of geometric components that constitute the primitive elements of the object recognition system (like letters to form words)” Inspirations: Human Image Understanding [3]

Potential Difficulties [4] Structural description not enough, also need metric info [4] M. A. Arbib CS564 – Brain Theory and Artificial Intelligence, USC, Fall 2001, Lecture 7: Object Recognition

Potential Difficulties [4] Structural description not enough, also need metric info Difficult to extract geons from real images [4] M. A. Arbib CS564 – Brain Theory and Artificial Intelligence, USC, Fall 2001, Lecture 7: Object Recognition

Potential Difficulties [4] Structural description not enough, also need metric info Difficult to extract geons from real images Ambiguity in the structural description: most often we have several candidates [4] M. A. Arbib CS564 – Brain Theory and Artificial Intelligence, USC, Fall 2001, Lecture 7: Object Recognition

Potential Difficulties [4] Structural description not enough, also need metric info Difficult to extract geons from real images Ambiguity in the structural description: most often we have several candidates For some objects, deriving a structural representation can be difficult [4] M. A. Arbib CS564 – Brain Theory and Artificial Intelligence, USC, Fall 2001, Lecture 7: Object Recognition

Problem Definition HOW TO decompose objects into parts/components ? find relations between components ? find a generic graph representation of an three tuple ?

Object Decomposition Proposed Algorithm

Object Decomposition What is missing? Use/try different clustering algorithms Triangulate 3D surfaces, Delaunay Compute gaussian curvature on each vertex Detect region boundaries, curvature thresholding Perform iterative region growing, flood fill

Graphical Representation We represent each objects in non-directed graphs as follows: –Each node has the info of geometric shape of the part –Each edge has the information of direction of edge for three axises, i.e from node1 to node2, x axis increases.

Graphical Representation Similarity Checking [isIsomorphic, label_list]= check_Isomorphism(G1, G2) If isIsomorphic Check geometric shapes of same labeled nodes in two graphs Check direction of equivalent edges in both graphs If both are matched, return true Else return false

Isomorphism check: Two candidates: - n1 = n6, n2 = n4, n3 = n5 (Attributes matched!) - n1 = n4, n2 = n6, n3 = n5 (Attributes isn’t matched) Graphical Representation Similarity Checking

Current System 80% is successful Assumes no occlusion. –For the cup case, handles should always be visible Needs metric info to distinguish bigger objects from small ones

One way to go… Learning a generic graph for each affordance type. Checking the maximal- cliques of the match graph while comparing graph of an object and a generic graph. Mahalanobis distance metric for generic graphs and use MLE

Tools

References [1] J. J. Gibson (1977), The Theory of Affordances. In Perceiving, Acting, and Knowing, Eds. Robert Shaw and John Bransford, ISBN [2] E. Sahin, M. Cakmak, M.R.Dogar, E. Ugur, G. Ucoluk, To Afford or Not to Afford: A New Formalization of Affordances Toward Affordance-Based Robot Control, Adaptive Behavior, 2007 pp: [3] Recognition-by-components: A theory of Human Image Understanding, Psychological Review, Vol. 94 (1987), pp [4] M. A. Arbib CS564 – Brain Theory and Artificial Intelligence, USC, Fall 2001, Lecture 7: Object Recognition

Thanks for listening

Appendix

Human Image Understanding Hypothesis: small number of geometric components that constitute the primitive elements of the object recognition system (like letters to form words) Geons are directly recognized from edges, based on their nonaccidental properties (i.e., 3D features that are usually preserved by the projective imaging process). –edges are straight or curved –pairs of edges are parallel or non-parallel –vertices will always appear to be vertices Non-accidental properties allows geons to be recognized from any perspective. The information in the geons are redundant so that they can be recognized even when partially occluded.

Appendix The Importance of spatial arrangement

Appendix The Principal of non-accidentalness Examples: Colinearity Smoothness Symmetry Parallelism Cotermination

Appendix Some non-accidental differences