Zhu, Song-Chun and Mumford, David. A Stochastic Grammar of Images. Foundations and Trends in Computer Graphics and Vision 2(4), 259-362 (2006) Hemerson.

Slides:



Advertisements
Similar presentations
Bayesian Belief Propagation
Advertisements

Fitting: The Hough transform. Voting schemes Let each feature vote for all the models that are compatible with it Hopefully the noise features will not.
1 Approximated tracking of multiple non-rigid objects using adaptive quantization and resampling techniques. J. M. Sotoca 1, F.J. Ferri 1, J. Gutierrez.
Chapter 4 Enterprise Modeling.
GRAMMAR & PARSING (Syntactic Analysis) NLP- WEEK 4.
Image Parsing: Unifying Segmentation and Detection Z. Tu, X. Chen, A.L. Yuille and S-C. Hz ICCV 2003 (Marr Prize) & IJCV 2005 Sanketh Shetty.
Fitting: The Hough transform
Learning to Detect A Salient Object Reporter: 鄭綱 (3/2)
Shet, Vinay et alii Predicate Logic based Image Grammars for Complex Pattern International Journal of Computer Vision (2011) Hemerson Pistori Biotechnology.
Primal Sketch Integrating Structure and Texture Ying Nian Wu UCLA Department of Statistics Keck Meeting April 28, 2006 Guo, Zhu, Wu (ICCV, 2003; GMBV,
Feature vs. Model Based Vocal Tract Length Normalization for a Speech Recognition-based Interactive Toy Jacky CHAU Department of Computer Science and Engineering.
Big Ideas in Cmput366. Search Blind Search Iterative deepening Heuristic Search A* Local and Stochastic Search Randomized algorithm Constraint satisfaction.
Optical flow and Tracking CISC 649/849 Spring 2009 University of Delaware.
Belief Propagation Kai Ju Liu March 9, Statistical Problems Medicine Finance Internet Computer vision.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Workspace-based Connectivity Oracle An Adaptive Sampling Strategy for PRM Planning Hanna Kurniawati and David Hsu Presented by Nicolas Lee and Stephen.
Computer vision: models, learning and inference Chapter 10 Graphical Models.
Syntactic Pattern Recognition Statistical PR:Find a feature vector x Train a system using a set of labeled patterns Classify unknown patterns Ignores relational.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
Cue Integration in Figure/Ground Labeling Xiaofeng Ren, Charless Fowlkes and Jitendra Malik, U.C. Berkeley We present a model of edge and region grouping.
(2.1) Grammars  Definitions  Grammars  Backus-Naur Form  Derivation – terminology – trees  Grammars and ambiguity  Simple example  Grammar hierarchies.
Grammar of Image Zhaoyin Jia, Problems  Enormous amount of vision knowledge:  Computational complexity  Semantic gap …… Classification,
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Information Retrieval in Practice
1 Adaptive Automata and Grammars Prof. Dr. Hemerson Pistori INOVISAO – R&D&I Group - Biotechnology Department Dom Bosco Catholic University (UCDB) Campo.
The Three R’s of Vision Jitendra Malik.
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Probabilistic Context Free Grammars for Representing Action Song Mao November 14, 2000.
Recognition using Regions (Demo) Sudheendra V. Outline Generating multiple segmentations –Normalized cuts [Ren & Malik (2003)] Uniform regions –Watershed.
Lecture 3: Region Based Vision
Scalable Inference and Training of Context- Rich Syntactic Translation Models Michel Galley, Jonathan Graehl, Keven Knight, Daniel Marcu, Steve DeNeefe.
Validated Model Transformation Tihamér Levendovszky Budapest University of Technology and Economics Department of Automation and Applied Informatics Applied.
Boltzmann Machine (BM) (§6.4) Hopfield model + hidden nodes + simulated annealing BM Architecture –a set of visible nodes: nodes can be accessed from outside.
A Graph-based Friend Recommendation System Using Genetic Algorithm
Parsing Prepared by Manuel E. Bermúdez, Ph.D. Associate Professor University of Florida Programming Language Principles Lecture 3.
Handover and Tracking in a Camera Network Presented by Dima Gershovich.
Computer Vision Michael Isard and Dimitris Metaxas.
Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)
Fitting: The Hough transform
Graph Visualization and Beyond … Anne Denton, April 4, 2003 Including material from a paper by Ivan Herman, Guy Melançon, and M. Scott Marshall.
Expectation-Maximization (EM) Case Studies
PARSING 2 David Kauchak CS159 – Spring 2011 some slides adapted from Ray Mooney.
Segmentation of Vehicles in Traffic Video Tun-Yu Chiang Wilson Lau.
Grammars in computer vision
Multiple track hypotheses maintained for each possible blob-object mapping Parse driven by blob interaction events Domain-general events are detected (e.g.,
Probabilistic Context Free Grammars Grant Schindler 8803-MDM April 27, 2006.
Motion Estimation using Markov Random Fields Hrvoje Bogunović Image Processing Group Faculty of Electrical Engineering and Computing University of Zagreb.
Stereo Vision Local Map Alignment for Robot Environment Mapping Computer Vision Center Dept. Ciències de la Computació UAB Ricardo Toledo Morales (CVC)
Discriminative Training and Machine Learning Approaches Machine Learning Lab, Dept. of CSIE, NCKU Chih-Pin Liao.
Department of Computer Science The University of Texas at Austin USA Joint Entity and Relation Extraction using Card-Pyramid Parsing Rohit J. Kate Raymond.
Enhanced hypertext categorization using hyperlinks Soumen Chakrabarti (IBM Almaden) Byron Dom (IBM Almaden) Piotr Indyk (Stanford)
Compiler Syntactic Analysis r Two general classes of parsing techniques m Bottom-up (Operator-Precedence parsing) Begin with the terminal nodes.
Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň,
Bayesian Inference and Visual Processing: Image Parsing & DDMCMC. Alan Yuille (Dept. Statistics. UCLA) Tu, Chen, Yuille & Zhu (ICCV 2003).
Daphne Koller Introduction Motivation and Overview Probabilistic Graphical Models.
03/12/13 Trees and CFGs Discrete Structures (CS 173) Derek Hoiem, University of Illinois 1.
1 Algorithms for adjusting parameters of combination of acyclic adjacency graphs in the problems of texture image processing Postgraduate student of TSU.
Inference as a Feedforward Network
Advanced Higher Computing Based on Heriot-Watt University Scholar Materials Applications of AI – Vision and Languages 1.
Games with Chance Other Search Algorithms
Iterative Optimization
Chapter 6 : Game Search 게임 탐색 (Adversarial Search)
Visualizing Prim’s MST Algorithm Used to Trace the Algorithm in Class
Image Parsing & DDMCMC. Alan Yuille (Dept. Statistics. UCLA)
المشرف د.يــــاســـــــــر فـــــــؤاد By: ahmed badrealldeen
SYNTAX DIRECTED DEFINITION
Boltzmann Machine (BM) (§6.4)
Two – One Problem Legal Moves: Slide Rules: 1s’ move right Hop
Two – One Problem Legal Moves: Slide Rules: 1s’ move right Hop
Random Neural Network Texture Model
Presentation transcript:

Zhu, Song-Chun and Mumford, David. A Stochastic Grammar of Images. Foundations and Trends in Computer Graphics and Vision 2(4), (2006) Hemerson Pistori Biotechnology Dept., Dom Bosco Catholic University January, 2012 Bristol, UK Most of the pictures used in this presentation were extracted from Zhu's paper

Grammars as Universal And-Or Trees Universal And-Or Tree can have an infinite size (as in the example) Rules are explicitly named (r1, r2, …) Each or-node A have one child for each rule having A at its left side A parsing tree is a sub-graph of a universal and-or tree “ab” not “abb”

Ambiguity in Visual And-Or Trees Cycles or Diamonds Overlaping Reusable Parts Not dealt with yet (2007) Most common causes in images: – Ambiguous scenes – Multiple partial patterns – Joints – Occlusion (most common) … extra leaves, interpolation

Main issues in Visual Grammars unseen in Textual Grammars No left-to-right ordering in language Solution: explicitly add horizontal edges to represent adjancy Objects appear in arbitrary scales Solution: termination rules at different levels (higher leaves) Much wider spectrum of quite irregular local patterns Solution: combine Markov random fields with stochastic grammars

Contextual information Horizontal lines to represent relations and constraints: – Bonds and connections (more dense) – Joints and junctions – Interactions and semantics (less dense). E.g.: person eating an apple

Stochastic information Probabilities for rules (stochastic grammars). One local probability at each Or-node to account for the relative frequency of each alternative Probabilities of relations (Markov random fields). Local energies associated with each horizontal link. A Configurations is a “word” of the “visual language”.

Visual Vocabulary Bonds – topological information Three hierarchical and connected levels Image primitives: Textons (blobs, bars, terminators and crosses) Geometric groupings: Graphlets Object parts

Clock example And-Or Graph (Grammar)And-Or Parse Graphs

Learning and Estimation with And-Or Graphs Main elements to be learned: (1) Vocabulary and And-Or tree, (2) Relations – Horizontal Line and (3) Parameters What is available (training data): Images and parse trees (manually constructed ground-truths) Three phases: – Learning parameters from training data given relations and vocabulary (gradient method) – Learning news relations given vocabulary and learned parameters (inspired in texture synthesis) – Learning vocabulary and And-Or tree

Image Parsing Iterative Heuristic top-down + bottom-up search Bottom-up: – Hough transforms, Adaboosting … identify possible terminals from images – Bind a number of parts Top-down: – Expand top level nodes – Update the weights of the current hypothesis Key issue: schudeling of bottom-up and top-down search steps (depends on the problem)

What we can explore Learning visual structure from pure textual information Top level grammars or constraints based on Santini's work on Semiological Level of Significance Define and Adaptive Stochastic And-Or Graph to deal with non-static scenes Propose a way to deal with temporal information in the context of Visual Grammars