CS 558 Computer Vision John Oliensis. Today’s class What is vision What is computer vision How we can solve vision problems –Important tools –Overall.

Slides:



Advertisements
Similar presentations
CSE 473/573 Computer Vision and Image Processing (CVIP)
Advertisements

16-721: Advanced Machine Perception Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: David Bradley 2216 NSH Web Page:
Computer Vision - A Modern Approach Set: Introduction to Vision Slides by D.A. Forsyth Why study Computer Vision? Images and movies are everywhere Fast-growing.
CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.
Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.
Visual Perception: It Is All In Your Mind Mona Moshtaghi Neuroscience Texture Shape Size Disparity Motion Processing Constraint.
CS 561, Sessions 27 1 Towards intelligent machines Thanks to CSCI561, we now know how to… - Search (and play games) - Build a knowledge base using FOL.
Overview of Computer Vision CS491E/791E. What is Computer Vision? Deals with the development of the theoretical and algorithmic basis by which useful.
Object Recognition. So what does object recognition involve?
16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Tomasz Malisiewicz Smith Hall.
2007Theo Schouten1 Introduction. 2007Theo Schouten2 Human Eye Cones, Rods Reaction time: 0.1 sec (enough for transferring 100 nerve.
Color, lightness & brightness Lavanya Sharan February 7, 2011.
Computational Vision Jitendra Malik University of California at Berkeley Jitendra Malik University of California at Berkeley.
What is “Image Processing and Computer Vision”?
Materials II Lavanya Sharan March 2nd, Computational thinking about materials Physics-basedPseudo physics-based.
© 2004 by Davi GeigerComputer Vision January 2004 L1.1 Introduction.
Visual Object Recognition Rob Fergus Courant Institute, New York University
16-721: Learning-based Methods in Vision Staff: Instructor: Alexei (Alyosha) Efros 4207 TA: Jean-Francois Lalonde A521.
Computer Vision Marc Pollefeys COMP 256 Administrivia Classes: Mon & Wed, 11-12:15, SN115 Instructor: Marc Pollefeys (919) Room.
What should be done at the Low Level?
Computer Vision (CSE P576) Staff Prof: Steve Seitz TA: Jiun-Hung Chen Web Page
Jochen Triesch, UC San Diego, 1 COGS Visual Modeling Jochen Triesch & Martin Sereno Dept. of Cognitive Science UC.
December 2, 2014Computer Vision Lecture 21: Image Understanding 1 Today’s topic is.. Image Understanding.
Vision in Man and Machine. STATS 19 SEM Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology.
Approaches to Representing and Recognizing Objects Visual Classification CMSC 828J – David Jacobs.
55:148 Digital Image Processing Chapter 11 3D Vision, Geometry Topics: Basics of projective geometry Points and hyperplanes in projective space Homography.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
CMSC 426: Image Processing (Computer Vision) David Jacobs.
CIS 601 Fall 2004 Introduction to Computer Vision and Intelligent Systems Longin Jan Latecki Parts are based on lectures of Rolf Lakaemper and David Young.
G52IIP, School of Computer Science, University of Nottingham What we will learn … Topics relate to the use of computer to Acquire/generate Process/manipulate/store.
1 Perception and VR MONT 104S, Spring 2008 Lecture 22 Other Graphics Considerations Review.
Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.
Introduction to Computer Vision Olac Fuentes Computer Science Department University of Texas at El Paso El Paso, TX, U.S.A.
IIIT-B Computer Vision, Fall 2006 Lecture 1 Introduction to Computer Vision Arvind Lakshmikumar Technology Manager, Sarnoff Corporation Adjunct Faculty,
Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.
CSCE 5013 Computer Vision Fall 2011 Prof. John Gauch
Computing & Information Sciences Kansas State University Wednesday, 03 Dec 2008CIS 530 / 730: Artificial Intelligence Lecture 38 of 42 Wednesday, 03 December.
Introduction EE 520: Image Analysis & Computer Vision.
Computer Vision Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications –building representations.
CS 8690: Computer Vision Ye Duan. CS8690 Computer Vision University of Missouri at Columbia Instructor Ye Duan (209 Engr West)
Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.
DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.
Graphical Models in Vision. Alan L. Yuille. UCLA. Dept. Statistics.
EIE426-AICV 1 Computer Vision Filename: eie426-computer-vision-0809.ppt.
Why is computer vision difficult?
Computer Vision Michael Isard and Dimitris Metaxas.
1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.
12/7/10 Looking Back, Moving Forward Computational Photography Derek Hoiem, University of Illinois Photo Credit Lee Cullivan.
CSE 185 Introduction to Computer Vision Stereo. Taken at the same time or sequential in time stereo vision structure from motion optical flow Multiple.
11/26/2015 Copyright G.D. Hager Class 2 - Schedule 1.Optical Illusions 2.Lecture on Object Recognition 3.Group Work 4.Sports Videos 5.Short Lecture on.
Efficient Visual Object Tracking with Online Nearest Neighbor Classifier Many slides adapt from Steve Gu.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
CSE 140: Computer Vision Camillo J. Taylor Assistant Professor CIS Dept, UPenn.
Vision Overview  Like all AI: in its infancy  Many methods which work well in specific applications  No universal solution  Classic problem: Recognition.
Computer Vision, CS766 Staff Instructor: Li Zhang TA: Yu-Chi Lai
Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT Recognizing and Learning Object Categories ICCV 2005 Beijing, Short Course, Oct 15.
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
1Ellen L. Walker 3D Vision Why? The world is 3D Not all useful information is readily available in 2D Why so hard? “Inverse problem”: one image = many.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Computational Vision Jitendra Malik University of California, Berkeley.
COS 429: Computer Vision Thanks to Chris Bregler.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
From local motion estimates to global ones - physiology:
SHAHAB iCV Research Group.
Common Classification Tasks
Brief Review of Recognition + Context
CMSC 426: Image Processing (Computer Vision)
CSE (c) S. Tanimoto, 2007 Image Understanding
Learning complex visual concepts
Presentation transcript:

CS 558 Computer Vision John Oliensis

Today’s class What is vision What is computer vision How we can solve vision problems –Important tools –Overall approaches

Why is Vision Interesting? Psychology –~ 50% of cerebral cortex is for vision. –Vision is how we experience the world. Engineering –Want machines to interact with world. –Digital images are everywhere.

Vision is inferential

Inferring Surface “Lightness” How do we determine the “true” surface color at A and B? ?Discount slow changes from lighting, keep quick paint changes?

Inferring Surface Color We perceive true surface color despite unknown or changing light!

Vision is Inferential (surface brightness) plaid-movie, haze movie

Vision is inferential: Shape from light

Shape from Motion

Vision is Inferential: Prior Knowledge

Computer Vision Inference  Computation Building machines that see Modeling biological perception

So what do humans care about? slide by Fei Fei, Fergus & Torralba

Verification: is that a bus? slide by Fei Fei, Fergus & Torralba

Detection: are there cars? slide by Fei Fei, Fergus & Torralba

Identification: is that a picture of Mao? slide by Fei Fei, Fergus & Torralba

Object categorization sky building flag wall banner bus cars bus face street lamp slide by Fei Fei, Fergus & Torralba

Scene and context categorization outdoor city traffic … slide by Fei Fei, Fergus & Torralba

Rough 3D layout, depth ordering slide by Fei Fei, Fergus & Torralba

Challenges 1: view point variation Michelangelo slide by Fei Fei, Fergus & Torralba

Challenges 2: illumination slide credit: S. Ullman

Challenges 3: occlusion Magritte, 1957 slide by Fei Fei, Fergus & Torralba

Challenges 4: scale slide by Fei Fei, Fergus & Torralba

Challenges 5: deformation Xu, Beihong 1943 slide by Fei Fei, Fergus & Torralba

Challenges 6: background clutter Klimt, 1913 slide by Fei Fei, Fergus & Torralba

Challenges 7: object intra-class variation slide by Fei-Fei, Fergus & Torralba

Challenges 8: local ambiguity slide by Fei-Fei, Fergus & Torralba

Summary: Same object can appear very different! How can you isolate what’s the same in these two pictures (the horse) given the huge differences?

Quick Tour of Computer Vision

Approach: local cues The entire image is too complex. Try to find distinctive small patches which may help to interpret it Example: brightness boundaries Maybe part of object’s outline? May help in inferring object shapes. Build larger interpretations from these small “clues”

Local cue: Brightness Boundary

Could this be part of the outline of something?

Local cue: Brightness Boundary Part of the leaf outline

Local cue: Brightness Boundary

Could this be part of the outline of something?

Local cue: Brightness Boundary

Not an outline, Just a highlight

Where’s the squirrel outline?

Integrating information over larger regions Finding outlines Finding regions that might correspond to objects

Boundary Detection

Boundary Detection Finding the Corpus Callosum (G. Hamarneh, T. McInerney, D. Terzopoulos)

Segmentation (foreground versus background) (Sharon, Balun, Brandt, Basri)

Segmentation (foreground versus background) Different approach JO

Different approach

A Classical View of Vision Grouping / Segmentation Figure/Ground Organization Object and Scene Recognition pixels, boundaries, small windows… Low-level Mid-level High-level

A Contemporary View of Vision Figure/Ground Organization Grouping / Segmentation Object and Scene Recognition pixels, boundaries, small windows… Low-level Mid-level High-level But where do we draw this line?

Boundaries and regions  Shape Texture  appearance

Texture RepetitionSynthesis Learn the statistics of a texture to recognize it Synthesize texture based on learned model Original

RepetitionSynthesis Texture Textures over time (Smoke, flame,waterfall...) Original

Tracking (JO+HZ)

Understanding Action Tracking face features  emotions Tracking pedestrians  surveillance

Tracking office workers

Stereo

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923 (Slide courtesy Steve Seitz)

Stereo Image 1Image 2 Camera 1Camera 2

Stereo

Stereo

Estimated Camera Motion Structure from Motion Motion and shape from movies

movie to shape Estimated 3D shape

Movie  shape Important for humans

Motion – Application Inserting virtual objects into video (

Motion Application Aligning virtual & real objects despite camera motion Visually guided surgery

Recognition (despite appearance change) Lighting affects appearance

Classification (Funkhauser, Min, Kazhdan, Chen, Halderman, Dobkin, Jacobs)

Viola and Jones: Real time Face Detection

Approaches to Vision

Approach 1: Toy Models + Algorithms 1) Start with simple idealized model of world, images Find good algorithms 2) Experiment on real world. 3) Update model, algorithms Real Problem is going beyond idealizations!

Example: 3D shape from shading (JO)

How does Shading determine Shape? Bright Dark Shading (image brightness) indicates how much light on each surface patch  gives surface patch orientation  overall shape

Very Idealized! Uniformly bright surface (no paint!) (else brightness doesn’t indicate orientation) Other idealizations as well –no shadows –smooth surface –no objects in front of others –no glossiness or mirror reflections –known light source –light from one direction only

Approach 2: Psychology/Neuroscience Derive insights from human/animal vision Example: processing at multiple scales True for people; useful for computers

(Try squinching your eyes from far)

Approach 3: Engineering Limited goals, application-oriented. Exploit domain constraints! Problem: May not generalize to other tasks

Example: Image Mosaics + + … += Goal: Stitch together images into composite image Composite has to look real, taken from one place: may have to warp original images

Approach 4 Bayesian inference + Learning Given the image, what 3D scene produced it? Impossible! Image is 2D, has too little information about scene since it’s 3D. Bayesian solution: Learn: accumulate experience about what types of 3D scenes and images are likely to occur. Use this experience to help in interpreting new images. (i.e., tune algorithm based on experience).

Approach 4 Bayesian inference + Learning Usually based on probabilities –How likely is this object to appear? –How likely is it that this image patch shows the object? Finding the probability for all possibilities often very hard, can lead to huge computations.

Recognize objects (Bayesian learning) Recognize parts (eyes, nose,…) and their spatial arrangement. Learning: Automatically tune algorithm from its success on trial runs

Approach 4A: Learning from millions of pictures

Theory of Vision David Marr (1980s) –Visual understanding is a computation –It proceeds in well defined stages Primal Sketch 2½D Sketch 3D Representations –Wrong in details Gestalt, Gibson ecological theory, geons… Now: no general theory of vision

The State of Computer Vision Technology –Applications Surveillance Road monitoring Computer driven cars Football Movies Medicine Face Recognition/BiometricsSpace HCI (Human Computer Interface); sign language recognition Remote Sensing –Successful companies Largest ~ million in revenues. In-house applications.

The State of Computer Vision Science –More progress in engineering –Interesting theory for specific problems (e.g., estimating 3D shape of objects from images) –Beginnings of progress on “intelligent “vision (i.e., recognizing objects)

The State of Computer Vision Sociology –Engineers (dominant group) –Applied math –Computer science –Visual Psychology, neuroscience

Related Fields Learning (can computers teach themselves to see?) + Artificial Intelligence (AI) Graphics. “Vision is inverse graphics” Visual perception + Neuroscience Math (eg., geometry, statistics/probability) + Physics Operation research, optimization

History (very rough) “Those who cannot remember the past are condemned to repeat it” –Toy models/algorithms (line drawings of blocks) –AI Recognition Systems. –Segmentation. Break images up into regions that could be objects –Low level vision. Detecting brightness boundaries, estimating 3D shapes of objects –Neural nets. David Marr. 1990s –Estimating camera motion from movies. Projective geometry, –Model-based recognition. Use specific object models to find them in images –Represent 2D shapes by their “skeletons” –Tracking –Classifying pixels from appearance ( blue  sky or water, green  leaf, …) 2000s –Learning: internet scale data –More reliable appearance descriptors  better recognition of objects –More math Graph theory, Monte Carlo, level sets. –Robust Statistics: recovering from mistakes of low level modules

Tools Needed for Course Math –Linear Algebra (to be taught) –Signal Processing (to be taught). –Calculus –Some geometry –Probability Computer Science –Algorithms –Programming (matlab)