What is “Image Processing and Computer Vision”?

Slides:



Advertisements
Similar presentations
CS Spring 2009 CS 414 – Multimedia Systems Design Lecture 4 – Digital Image Representation Klara Nahrstedt Spring 2009.
Advertisements

November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
5/13/2015CAM Talk G.Kamberova Computer Vision Introduction Gerda Kamberova Department of Computer Science Hofstra University.
Computer Vision - A Modern Approach Set: Model-based Vision Slides by D.A. Forsyth Recognition by Hypothesize and Test General idea –Hypothesize object.
Vision Based Control Motion Matt Baker Kevin VanDyke.
 INTRODUCTION  STEPS OF GESTURE RECOGNITION  TRACKING TECHNOLOGIES  SPEECH WITH GESTURE  APPLICATIONS.
Computer Vision - A Modern Approach Set: Introduction to Vision Slides by D.A. Forsyth Why study Computer Vision? Images and movies are everywhere Fast-growing.
Oleh Tretiak © Computer Vision Lecture 1: Introduction.
Quadtrees, Octrees and their Applications in Digital Image Processing
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.
Lecture 4 Edge Detection
Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.
A Study of Approaches for Object Recognition
Overview of Computer Vision CS491E/791E. What is Computer Vision? Deals with the development of the theoretical and algorithmic basis by which useful.
Processing Digital Images. Filtering Analysis –Recognition Transmission.
Motion based Correspondence for Distributed 3D tracking of multiple dim objects Ashok Veeraraghavan.
Quadtrees, Octrees and their Applications in Digital Image Processing
Vision Computing An Introduction. Visual Perception Sight is our most impressive sense. It gives us, without conscious effort, detailed information about.
Introduction to Computer Vision 3D Vision Topic 9 Stereo Vision (I) CMPSCI 591A/691A CMPSCI 570/670.
Interactive Systems Technical Design
Computer Vision Marc Pollefeys COMP 256 Administrivia Classes: Mon & Wed, 11-12:15, SN115 Instructor: Marc Pollefeys (919) Room.
CS292 Computational Vision and Language Visual Features - Colour and Texture.
Introduction What is “image processing and computer vision”? Image Representation.
November 29, 2004AI: Chapter 24: Perception1 Artificial Intelligence Chapter 24: Perception Michael Scherger Department of Computer Science Kent State.
Tal Mor  Create an automatic system that given an image of a room and a color, will color the room walls  Maintaining the original texture.
CIS 601 Fall 2004 Introduction to Computer Vision and Intelligent Systems Longin Jan Latecki Parts are based on lectures of Rolf Lakaemper and David Young.
A Brief Overview of Computer Vision Jinxiang Chai.
Copyright © 2012 Elsevier Inc. All rights reserved.
Components of a computer vision system
Active Vision Key points: Acting to obtain information Eye movements Depth from motion parallax Extracting motion information from a spatio-temporal pattern.
IIIT-B Computer Vision, Fall 2006 Lecture 1 Introduction to Computer Vision Arvind Lakshmikumar Technology Manager, Sarnoff Corporation Adjunct Faculty,
Perception Introduction Pattern Recognition Image Formation
CS654: Digital Image Analysis Lecture 3: Data Structure for Image Analysis.
CSCE 5013 Computer Vision Fall 2011 Prof. John Gauch
Introduction EE 520: Image Analysis & Computer Vision.
December 4, 2014Computer Vision Lecture 22: Depth 1 Stereo Vision Comparing the similar triangles PMC l and p l LC l, we get: Similarly, for PNC r and.
Computer Vision Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications –building representations.
CS 8690: Computer Vision Ye Duan. CS8690 Computer Vision University of Missouri at Columbia Instructor Ye Duan (209 Engr West)
Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.
Quadtrees, Octrees and their Applications in Digital Image Processing.
Template matching and object recognition. CS8690 Computer Vision University of Missouri at Columbia Matching by relations Idea: –find bits, then say object.
Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)
1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.
Autonomous Robots Vision © Manfred Huber 2014.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Vision Overview  Like all AI: in its infancy  Many methods which work well in specific applications  No universal solution  Classic problem: Recognition.
1 Machine Vision. 2 VISION the most powerful sense.
Computer Vision Overview Marc Schlosberg CS 175 – Spring 2015.
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
1Ellen L. Walker 3D Vision Why? The world is 3D Not all useful information is readily available in 2D Why so hard? “Inverse problem”: one image = many.
Robotics Chapter 6 – Machine Vision Dr. Amit Goradia.
Introduction to Scale Space and Deep Structure. Importance of Scale Painting by Dali Objects exist at certain ranges of scale. It is not known a priory.
CONTENTS:  Introduction.  Face recognition task.  Image preprocessing.  Template Extraction and Normalization.  Template Correlation with image database.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
Processing visual information for Computer Vision
- photometric aspects of image formation gray level images
A Forest of Sensors: Using adaptive tracking to classify and monitor activities in a site Eric Grimson AI Lab, Massachusetts Institute of Technology
Introduction to Computer and Human Vision
Why study Computer Vision?
Common Classification Tasks
Eric Grimson, Chris Stauffer,
Brief Review of Recognition + Context
Cognitive Processes PSY 334
CMSC 426: Image Processing (Computer Vision)
Fast Forward, Part II Multi-view Geometry Stereo Ego-Motion
CSE (c) S. Tanimoto, 2007 Image Understanding
CSE (c) S. Tanimoto, 2001 Image Understanding
Presentation transcript:

What is “Image Processing and Computer Vision”? manipulate image data generate another image Computer Vision process image data generate symbolic data Computer Vision : CISC4/689

Computer Vision : CISC4/689 Reconstruction Recover 3D information from data Recognition Detect and identify objects Understanding What is happening in the scene? Computer Vision : CISC4/689

Computer Vision : CISC4/689 Historical overview 1920s Coding images for transmission by telegraph (3 hours) 1960s Computers powerful enough to store images and process in realistic times Space program Computer Vision : CISC4/689

Computer Vision : CISC4/689 1960s - 1970s Applications Medical imaging Remote sensing Astronomy Computer Vision : CISC4/689

Computer Vision : CISC4/689 Today DTV Image interpretation Biometry GIS Tele-surgery Computer Vision : CISC4/689

Computer Vision : CISC4/689

Computer Vision : CISC4/689 System Overview Enhancement Feature Extraction Feature Recognition Image Recognition Captured data Labels Computer Vision : CISC4/689

Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications building representations of the 3D world from pictures automated surveillance (who’s doing what) movie post-processing face finding Various deep and attractive scientific mysteries how does object recognition work? Greater understanding of human vision Computer Vision : CISC4/689

Part I: The Physics of Imaging How images are formed Cameras What a camera does How to tell where the camera was Light How to measure light What light does at surfaces How the brightness values we see in cameras are determined Color The underlying mechanisms of color How to describe it and measure it Computer Vision : CISC4/689

Part II: Early Vision in One Image Simple inferences from individual pixel values Representing small patches of image For three reasons We wish to establish correspondence between (say) points in different images, so we need to describe the neighborhood of the points Sharp changes are important in practice --- known as “edges” Representing texture by giving some statistics of the different kinds of small patch present in the texture. Tigers have lots of bars, few spots Leopards are the other way Computer Vision : CISC4/689

Representing an image patch Filter outputs essentially form a dot-product between a pattern and an image, while shifting the pattern across the image strong response -> image locally looks like the pattern e.g. derivatives measured by filtering with a kernel that looks like a big derivative (bright bar next to dark bar) Computer Vision : CISC4/689

Computer Vision : CISC4/689 Convolve this image To get this With this kernel At this point, one should point out that the image is represented with largest value bright and smallest value dark. The kernel is large and positive at light points, large and negative at dark points, and mid-grey at zeros. The output has the same properties. Notice that a strong response comes when the image looks like the kernel. The output is a representation that tells us when there are large x-derivatives in the image. Worth getting the class to guess what derivative is being estimated. Computer Vision : CISC4/689

Computer Vision : CISC4/689 Texture Many objects are distinguished by their texture Tigers, cheetahs, grass, trees We represent texture with statistics of filter outputs For tigers, bar filters at a coarse scale respond strongly For cheetahs, spots at the same scale For grass, long narrow bars For the leaves of trees, extended spots Objects with different textures can be segmented The variation in textures is a cue to shape Here one points out that a filter is basically a template. If we have lots of different templates, and count the relative proportions of the different types, we have a texture representation. Computer Vision : CISC4/689

Computer Vision : CISC4/689 Texture determines surface “properties” (wetness, roughness, sliminess, etc.) and gives us shape cues as well Computer Vision : CISC4/689

Computer Vision : CISC4/689 The basics of texture representation: vertical and horizontal bars, count the mean response, and classify on that statistic. Of course, for more detail one would have more filters and more statistics. Computer Vision : CISC4/689

Computer Vision : CISC4/689 Shape from texture Markings on a surface distort when seen in a camera; the pattern of distortions is a cue to the normal of the surface. The details remain vague, but the effect is very widespread and useful (e.g. the surface graph). Computer Vision : CISC4/689

Part III: Early Vision in Multiple Images The geometry of multiple views Where could it appear in camera 2 (3, etc.) given it was here in 1 (1 and 2, etc.)? Stereopsis What we know about the world from having 2 eyes Structure from motion What we know about the world from having many eyes or, more commonly, our eyes moving. Computer Vision : CISC4/689

3D Reconstruction from multiple views Multiple views arise from stereo motion Strategy “triangulate” from distinct measurements of the same thing Issues Correspondence: which points in the images are projections of the same 3D point? The representation: what do we report? Noise: how do we get stable, accurate reports Computer Vision : CISC4/689

Part IV: Mid-Level Vision Impose some order on groups of pixels to separate them from each other and infer shape information Finding coherent structure so as to break the image or movie into big units Segmentation: Breaking images and videos into useful pieces E.g. finding video sequences that correspond to one shot E.g. finding image components that are coherent in internal appearance Tracking: Keeping track of a moving object through a long sequence of views Computer Vision : CISC4/689

Part V: High Level Vision (Geometry) The relations between object geometry and image geometry Model based vision find the position and orientation of known objects Smooth surfaces and outlines how the outline of a curved object is formed, and what it looks like Aspect graphs how the outline of a curved object moves around as you view it from different directions Range data Computer Vision : CISC4/689

Part VI: High Level Vision (Probabilistic) Using classifiers and probability to recognize objects Templates and classifiers how to find objects that look the same from view to view with a classifier Relations break up objects into big, simple parts, find the parts with a classifier, and then reason about the relationships between the parts to find the object. Geometric templates from spatial relations extend this trick so that templates are formed from relations between much smaller parts Computer Vision : CISC4/689

Part VII: Some Applications in Detail Finding images in large collections searching for pictures browsing collections of pictures Image based rendering often very difficult to produce models that look like real objects surface weathering, etc., create details that are hard to model Solution: make new pictures from old Computer Vision : CISC4/689

Some applications of recognition Digital libraries Find me the pic of a certain posture from skating video Surveillance Warn me if there is a mugging in the grove HCI Do what I show you Military Shoot this, not that Computer Vision : CISC4/689

What are the problems in recognition? Which bits of image should be recognised together? Segmentation. How can objects be recognised without focusing on detail? Abstraction. How can objects with many free parameters be recognised? No popular name, but it’s a crucial problem anyhow. How do we structure very large modelbases? again, no popular name; abstraction and learning come into this Computer Vision : CISC4/689

Computer Vision : CISC4/689 Segmentation Which image components “belong together”? Belong together=lie on the same object Cues similar colour similar texture not separated by contour form a suggestive shape when assembled Computer Vision : CISC4/689

Computer Vision : CISC4/689 Image Segmentation Note that the camouflaged Squirrel is detected. The background is still broken due the lack in oriented-texture measurements which we are currently adding into our algorithm. Computer Vision : CISC4/689

Computer Vision : CISC4/689 Image Segmentation Computer Vision : CISC4/689

Computer Vision : CISC4/689

Computer Vision : CISC4/689

Computer Vision : CISC4/689 Matching templates Some objects are 2D patterns e.g. faces Build an explicit pattern matcher discount changes in illumination by using a parametric model changes in background are hard changes in pose are hard Computer Vision : CISC4/689

Computer Vision : CISC4/689 http://www.ri.cmu.edu/projects/project_271.html

Relations between templates e.g. find faces by finding eyes, nose, mouth finding assembly of the three that has the “right” relations Computer Vision : CISC4/689

Computer Vision : CISC4/689

Computer Vision : CISC4/689 http://www.ri.cmu.edu/projects/project_320.html

Computer Vision : CISC4/689 Tracking Use a model to predict next position and refine using next image Model: simple dynamic models (second order dynamics) kinematic models etc. Face tracking and eye tracking now work rather well Computer Vision : CISC4/689

Computer Vision : CISC4/689 Application results Rigid Motion Reconstruction: 2D 3D 2D+3D Clouds: Interpolation Clouds: Reconstruction Tongue: Reconstruction Computer Vision : CISC4/689

Computer Vision : CISC4/689 More.. Tongue Tracking Face Tracking Stereo: Human Body Stereo: Ice (Hard) Bio-medical: Tongue-head Tongue-skull Computer Vision : CISC4/689

Computer Vision : CISC4/689 Few More.. ACCESS Stereo-Face Tracker Computer Vision : CISC4/689

Computer Vision : CISC4/689 Project on Image Guided Surgery: A collaboration between the MIT AI Lab and Brigham and Women's Surgical Planning Laboratory The Computer Vision Group of the MIT Artificial Intelligence Lab has been collaborating closely for several years with the Surgical Planning Laboratory of Brigham and Women's Hospital. As part of the collaboration, tools are being developed to support image guided surgery. Such tools will enable surgeons to visualize internal structures through an automated overlay of 3D reconstructions of internal anatomy on top of live video views of a patient. We are developing image analysis tools for leveraging the detailed three-dimensional structure and relationships in medical images. Sample applications are in preoperative surgical planning, intraoperative surgical guidance, navigation, and instrument tracking. Computer Vision : CISC4/689

Computer Vision : CISC4/689 A view showing the patient and model details superimposed Figures by kind permission of Eric Grimson; further information can be obtained from his web site http://www.ai.mit.edu/people/welg/welg.html. Computer Vision : CISC4/689

Computer Vision : CISC4/689 A patient with a view of the tissue the surgeon needs to remove superimposed; the view is registered to the patient, so its “in the right place” Figures by kind permission of Eric Grimson; further information can be obtained from his web site http://www.ai.mit.edu/people/welg/welg.html. Computer Vision : CISC4/689

Computer Vision : CISC4/689 Some Results MRI data Rotate Model Peel Computer Vision : CISC4/689