Modeling Vision as Bayesian Inference: Is it Worth the Effort? Alan L. Yuille. UCLA. Co-Director: Centre for Image and Vision Sciences. Dept. Statistics.

Slides:



Advertisements
Similar presentations
The influence of domain priors on intervention strategy Neil Bramley.
Advertisements

Chapter 2: Marr’s theory of vision. Cognitive Science  José Luis Bermúdez / Cambridge University Press 2010 Overview Introduce Marr’s distinction between.
Deep Learning Bing-Chen Tsai 1/21.
Advanced topics.
Computer vision: models, learning and inference Chapter 18 Models for style and identity.
What is Vision? Why is it Hard ? Alan L. Yuille. UCLA.
Bart Selman CS CS 475: Uncertainty and Multi-Agent Systems Prof. Bart Selman Introduction.
Introduction of Probabilistic Reasoning and Bayesian Networks
Probabilistic Models of Cognition Conceptual Foundations Chater, Tenenbaum, & Yuille TICS, 10(7), (2006)
Causes and coincidences Tom Griffiths Cognitive and Linguistic Sciences Brown University.
Decision Making: An Introduction 1. 2 Decision Making Decision Making is a process of choosing among two or more alternative courses of action for the.
Image Parsing: Unifying Segmentation and Detection Z. Tu, X. Chen, A.L. Yuille and S-C. Hz ICCV 2003 (Marr Prize) & IJCV 2005 Sanketh Shetty.
1 Hierarchical Image-Motion Segmentation using Swendsen-Wang Cuts Adrian Barbu Siemens Corporate Research Princeton, NJ Acknowledgements: S.C. Zhu, Y.N.
PSY 5018H: Math Models Hum Behavior, Prof. Paul Schrater, Spring 2004 Vision as Optimal Inference The problem of visual processing can be thought of as.
Organizational Notes no study guide no review session not sufficient to just read book and glance at lecture material midterm/final is considered hard.
LEARNING FROM OBSERVATIONS Yılmaz KILIÇASLAN. Definition Learning takes place as the agent observes its interactions with the world and its own decision-making.
Decision making as a model 7. Visual perception as Bayesian decision making.
Exploring subjective probability distributions using Bayesian statistics Tom Griffiths Department of Psychology Cognitive Science Program University of.
PSY 5018H: Math Models Hum Behavior, Prof. Paul Schrater, Spring 2005 Making Decisions: Modeling perceptual decisions.
LEARNING FROM OBSERVATIONS Yılmaz KILIÇASLAN. Definition Learning takes place as the agent observes its interactions with the world and its own decision-making.
Overview and History of Cognitive Science. How do minds work? What would an answer to this question look like? What is a mind? What is intelligence? How.
Computational Vision Jitendra Malik University of California at Berkeley Jitendra Malik University of California at Berkeley.
© 2004 by Davi GeigerComputer Vision January 2004 L1.1 Introduction.
Summer 2011 Wednesday, 8/3. Biological Approaches to Understanding the Mind Connectionism is not the only approach to understanding the mind that draws.
Baysian Approaches Kun Guo, PhD Reader in Cognitive Neuroscience School of Psychology University of Lincoln Quantitative Methods 2011.
What is Cognitive Science? … is the interdisciplinary study of mind and intelligence, embracing philosophy, psychology, artificial intelligence, neuroscience,
Jochen Triesch, UC San Diego, 1 COGS Visual Modeling Jochen Triesch & Martin Sereno Dept. of Cognitive Science UC.
Laurent Itti: CS599 – Computational Architectures in Biological Vision, USC Lecture 7: Coding and Representation 1 Computational Architectures in.
Vision in Man and Machine. STATS 19 SEM Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology.
Grammar of Image Zhaoyin Jia, Problems  Enormous amount of vision knowledge:  Computational complexity  Semantic gap …… Classification,
1. Introduction to Pattern Recognition and Machine Learning. Prof. A.L. Yuille. Dept. Statistics. UCLA. Stat 231. Fall 2004.
CMSC 426: Image Processing (Computer Vision) David Jacobs.
Computer vision: models, learning and inference Chapter 6 Learning and Inference in Vision.
1 Using R for consumer psychological research Research Analytics | Strategy & Insight September 2014.
The Three R’s of Vision Jitendra Malik.
A Brief Introduction to Graphical Models
1 AI and Agents CS 171/271 (Chapters 1 and 2) Some text and images in these slides were drawn from Russel & Norvig’s published material.
Soft Computing Lecture 17 Introduction to probabilistic reasoning. Bayesian nets. Markov models.
A General Framework for Tracking Multiple People from a Moving Camera
1 / 41 Inference and Computation with Population Codes 13 November 2012 Inference and Computation with Population Codes Alexandre Pouget, Peter Dayan,
STUDY, MODEL & INTERFACE WITH MOTOR CORTEX Presented by - Waseem Khatri.
Lecture note for Stat 231: Pattern Recognition and Machine Learning 4. Maximum Likelihood Prof. A.L. Yuille Stat 231. Fall 2004.
Leo Zhu CSAIL MIT Joint work with Chen, Yuille, Freeman and Torralba 1.
Graphical Models in Vision. Alan L. Yuille. UCLA. Dept. Statistics.
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
Generic Tasks by Ihab M. Amer Graduate Student Computer Science Dept. AUC, Cairo, Egypt.
AI ● Dr. Ahmad aljaafreh. What is AI? “AI” can be defined as the simulation of human intelligence on a machine, so as to make the machine efficient to.
Ch.9 Bayesian Models of Sensory Cue Integration (Mon) Summarized and Presented by J.W. Ha 1.
6. Population Codes Presented by Rhee, Je-Keun © 2008, SNU Biointelligence Lab,
RULES Patty Nordstrom Hien Nguyen. "Cognitive Skills are Realized by Production Rules"
Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.
A Brief History of AI Fall 2013 COMP3710 Artificial Intelligence Computing Science Thompson Rivers University.
Bayesian Inference and Visual Processing: Image Parsing & DDMCMC. Alan Yuille (Dept. Statistics. UCLA) Tu, Chen, Yuille & Zhu (ICCV 2003).
Introduction to Cogsci April 07, Central Theme Cognitive Science was occuppied with the algorithmic level for much of its history: successive manipulation.
Pattern Recognition. What is Pattern Recognition? Pattern recognition is a sub-topic of machine learning. PR is the science that concerns the description.
Learning Image Statistics for Bayesian Tracking Hedvig Sidenbladh KTH, Sweden Michael Black Brown University, RI, USA
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
Cognitive Modeling Cogs 4961, Cogs 6967 Psyc 4510 CSCI 4960 Mike Schoelles
Bayesian Decision Theory Introduction to Machine Learning (Chap 3), E. Alpaydin.
Brief Intro to Machine Learning CS539
A probabilistic approach to cognition
Chapter 11: Artificial Intelligence
Nicolas Alzetta CoNGA: Cognition and Neuroscience Group of Antwerp
Information-Theoretic Listening
Image Parsing & DDMCMC. Alan Yuille (Dept. Statistics. UCLA)
Network Inference Chris Holmes Oxford Centre for Gene Function, &,
Expectation-Maximization & Belief Propagation
Introduction to Object Tracking
Will Penny Wellcome Trust Centre for Neuroimaging,
Presentation transcript:

Modeling Vision as Bayesian Inference: Is it Worth the Effort? Alan L. Yuille. UCLA. Co-Director: Centre for Image and Vision Sciences. Dept. Statistics Joint Appointments: Computer Science, Psychology

What is the purpose of Vision? To extract information about the world from the input images. This is a decoding problem: inverse inference.

What is Bayes? Inverse Inference. Inverse Inference/Image decoding: Posterior P(S|I) = P(I|S) P(S)/P(I) Likelihood P(I|S), Prior P(S). Cf. Pavan Sinha

How does this relate to the Brain? Marr’s computation level. Information Processing. Bayes derives from Decision Theory – same roots as signal detection theory and ideal observers. Plausible neuronal implementations.

The Challenge of Bayes What are P(I|S) and P(S) for realistic images? Probability Distributions on Structured Representations (Graphs/Grammars). Considerable progress – in many communities; NIPS,Machine Vision, Machine Learning,Artificial Intelligence. Natural Language processing…

Why is Bayes Complicated? It is complicated because of the difficulty of the vision problem. In information processing terms – images are extremely complex and ambiguous. Vision is much more complicated – by many orders of magnitude – than any “solved” inverse inference problem.

Bayes Research Program This program is to model the world with increasingly realistic distributions P(I|S) and P(S). Relate performance on these models to Psychophysics experiments. Speculate (test?) neuronally plausible implementations of these models.

Brief History: 1980’s. Convergent strands: Signal Detection Theory, Ideal Observers. Pattern Theory and Bayesian Inference. Energy Function formulations. Computer Graphics Psychophysics. Axis of Bayes: Brown, Harvard, MIT?

Probabilities defined on Graphs Graphs represent the causal generation of images. (Pearl 1988).

Taxonomy of Vision Tasks. Symbolic taxonomy. Kersten, Mamassian, Yuille (2004). 150 references…

Basic Bayes Bulthoff and Mallot (1988). The perception of shape and depth from visual cues. Trade-off between data-driven and prior- driven (e.g. Weiss et al 1997).

Discounting and Task Dependence What do you care about? The identity of the keys? Their location? Their materials?

Cue Integration How to combine cues? Clark and Yuille 1990 (Amazon says “1 copy still available!”). Bulthoff and Yuille Weak coupling or strong coupling. Knill’s talk.

Perceptual “Explaining away” Knill and Kersten Small changes of the boundary shape can explain away the intensity changes as being due to geometry and not to materials.

Ideal versus Bayes Ideal Ideal for the experiment? Or Ideal for real images? Random Dot Kinematograms (RDK’s).

Ideal for the experiment? Barlow&Tripathy (1997), Lu&Yuille (2006) derived an Ideal Observer model for these stimuli. This model predicts trends, but humans are much worse than the ideal.

Ideal for the Real World? But human performance is well matched to a Bayesian model that uses a slow&smooth prior (Lu&Yuille 2006). Slow&Smooth (Yuille&Grzywaxz 1987, Weiss et al Slow&smooth – consistent with knowledge of motion sequence statistics.

Neural Hypotheses The graphical nature of the models makes it straightforward to propose neuronally plausible implementations – nodes of graph as neurons. Koch, Marroquin, Yuille (1987), Lee (1995). Population models. Heroic efforts to test these models by Lee and his collaborators. Analysis by Synthesis: Feedforward and feedback connections (Mumford 1991). Some experimental support from fMRI (Murray et al.).

Bayesian Research Program. Increasing sophisticated models, inference algorithms, learning. Model LearningInference DATA

And/Or Probabilistic Grammar: P(I|S) & P(S) for human Poses.

Bayes Research Program The Lotus Hill “Genome Project”. Hand parse millions of images. Benchmarks for evaluating algorithms. Facility for learning generative models of images. Directed by Prof. S-C. Zhu.

Lotus Hill Hand parsed images (dataset released).

Inference Algorithms: Bottom-Up/Top-Down Integrating generative and discriminative methods

Unsupervised Learning Learn a model of a horse by hierarchical composition. (Leo Zhu & Yuille). Strategy: (i) suspicious coincidences, (ii) competitive exclusion, (iii) build hierarchy by composition.

Unsupervised Learning. Learn hierarchical schema by composition. Input 10 images, unknown pose/osition. Output; hierarchical model of a horse. Can detect and parse horses >300 real images.

Vision and Cognition The same probabilistic modeling techniques are being successfully applied to model other aspects of cognition. Vision, language, reasoning, motor control, and so on. This offers a theoretical framework for all of cognition.

Is Bayes complicated enough? “To myself I am only a child playing with pebbles on the beach, while vast oceans of truth lie undiscovered before me.” – Isaac Newton. You can’t explore the ocean without the right techniques and tools.

Conclusion “Bayes” – probabilistic models on structured representations – is extremely promising as a model of vision/cognition. There is good progress at scaling Bayes up to deal with the complexities of realistic images and visual tasks. There is encouraging progress at using Bayes to model psychophysics.

Facilities for Learning How to learn Bayes? Tutorial Programs – e.g. the UCLA IPAM website. Videos and Pdf’s of lectures by world experts.