Vision Overview  Like all AI: in its infancy  Many methods which work well in specific applications  No universal solution  Classic problem: Recognition.

Slides:

Advertisements

Similar presentations

Course Overview  What is AI?  What are the Major Challenges?  What are the Main Techniques?  Where are we failing, and why?  Step back and look at.

Advertisements

Computer Vision - A Modern Approach Set: Recognition by relations Slides by D.A. Forsyth Matching by relations Idea: –find bits, then say object is present.

By: Mani Baghaei Fard.  During recent years number of moving vehicles in roads and highways has been considerably increased.

Computer Vision Radiometry. Bahadir K. Gunturk2 Radiometry Radiometry is the part of image formation concerned with the relation among the amounts of.

Face Alignment with Part-Based Modeling

Recognition by finding patterns

Computer Vision - A Modern Approach Set: Model-based Vision Slides by D.A. Forsyth Recognition by Hypothesize and Test General idea –Hypothesize object.

Object Inter-Camera Tracking with non- overlapping views: A new dynamic approach Trevor Montcalm Bubaker Boufama.

Computer Vision - A Modern Approach Set: Introduction to Vision Slides by D.A. Forsyth Why study Computer Vision? Images and movies are everywhere Fast-growing.

Robust Moving Object Detection & Categorization using self- improving classifiers Omar Javed, Saad Ali & Mubarak Shah.

CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.

Lecture 5 Template matching

Tracking Objects with Dynamics Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/21/15 some slides from Amin Sadeghi, Lana Lazebnik,

Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.

A Study of Approaches for Object Recognition

Processing Digital Images. Filtering Analysis –Recognition Transmission.

Classifiers for Recognition Reading: Chapter 22 (skip 22.3) Slide credits for this chapter: Frank Dellaert, Forsyth & Ponce, Paul Viola, Christopher Rasmussen.

2007Theo Schouten1 Introduction. 2007Theo Schouten2 Human Eye Cones, Rods Reaction time: 0.1 sec (enough for transferring 100 nerve.

Texture Reading: Chapter 9 (skip 9.4) Key issue: How do we represent texture? Topics: –Texture segmentation –Texture-based matching –Texture synthesis.

CS 223B Assignment 1 Help Session Dan Maynes-Aminzade.

Highlights Lecture on the image part (10) Automatic Perception 16

What is “Image Processing and Computer Vision”?

Computer Vision Marc Pollefeys COMP 256 Administrivia Classes: Mon & Wed, 11-12:15, SN115 Instructor: Marc Pollefeys (919) Room.

CS292 Computational Vision and Language Visual Features - Colour and Texture.

Object recognition under varying illumination. Lighting changes objects appearance.

Computer Vision - A Modern Approach Set: Segmentation Slides by D.A. Forsyth Segmentation and Grouping Motivation: not information is evidence Obtain a.

CIS 601 Fall 2004 Introduction to Computer Vision and Intelligent Systems Longin Jan Latecki Parts are based on lectures of Rolf Lakaemper and David Young.

A Brief Overview of Computer Vision Jinxiang Chai.

G52IIP, School of Computer Science, University of Nottingham What we will learn … Topics relate to the use of computer to Acquire/generate Process/manipulate/store.

Introduction to Computer Vision Olac Fuentes Computer Science Department University of Texas at El Paso El Paso, TX, U.S.A.

Perception Introduction Pattern Recognition Image Formation

Chapter 14: SEGMENTATION BY CLUSTERING 1. 2 Outline Introduction Human Vision & Gestalt Properties Applications – Background Subtraction – Shot Boundary.

CSCE 5013 Computer Vision Fall 2011 Prof. John Gauch

CS-424 Gregory Dudek Today’s Lecture Computational Vision –Images –Image formation in brief (+reading) –Image processing: filtering Linear filters Non-linear.

Computer Vision Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications –building representations.

CS 8690: Computer Vision Ye Duan. CS8690 Computer Vision University of Missouri at Columbia Instructor Ye Duan (209 Engr West)

Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.

DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.

Quadtrees, Octrees and their Applications in Digital Image Processing.

Template matching and object recognition. CS8690 Computer Vision University of Missouri at Columbia Matching by relations Idea: –find bits, then say object.

Computer Vision Michael Isard and Dimitris Metaxas.

MACHINE VISION Machine Vision System Components ENT 273 Ms. HEMA C.R. Lecture 1.

Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)

Learning the Appearance and Motion of People in Video Hedvig Sidenbladh, KTH Michael Black, Brown University.

MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.

School of Engineering and Computer Science Victoria University of Wellington Copyright: Peter Andreae, VUW Image Recognition COMP # 18.

1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.

Rick Parent - CIS681 Motion Analysis – Human Figure Processing video to extract information of objects Motion tracking Pose reconstruction Motion and subject.

Autonomous Robots Vision © Manfred Huber 2014.

Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.

Human Activity Recognition at Mid and Near Range Ram Nevatia University of Southern California Based on work of several collaborators: F. Lv, P. Natarajan,

Team Members Ming-Chun Chang Lungisa Matshoba Steven Preston Supervisors Dr James Gain Dr Patrick Marais.

Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.

PENGENALAN POLA DAN VISI KOMPUTER PENDAHULUAN. Vision Vision is the process of discovering what is present in the world and where it is by looking.

Local Illumination and Shading

1Ellen L. Walker 3D Vision Why? The world is 3D Not all useful information is readily available in 2D Why so hard? “Inverse problem”: one image = many.

Chapter 24: Perception April 20, Introduction Emphasis on vision Feature extraction approach Model-based approach –S stimulus –W world –f,

Psychology 3680 Illusion Lecture. What is Vision?

  Computer vision is a field that includes methods for acquiring,prcessing, analyzing, and understanding images and, in general, high-dimensional data.

Over the recent years, computer vision has started to play a significant role in the Human Computer Interaction (HCI). With efficient object tracking.

Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.

Tracking Objects with Dynamics

Why study Computer Vision?

Machine Vision Acquisition of image data, followed by the processing and interpretation of these data by computer for some useful application like inspection,

Segmentation and Grouping

Introduction Computer vision is the analysis of digital images

Parallel Integration of Video Modules

CMSC 426: Image Processing (Computer Vision)

Introduction Computer vision is the analysis of digital images

Presentation transcript:

Vision Overview  Like all AI: in its infancy  Many methods which work well in specific applications  No universal solution  Classic problem: Recognition problem  Recognise a type of object  Identify an instance (e.g. a person)  Easy for human  Computers limited:  Specific objects: faces, characters, vehicles  Specific situations: lighting, background, orientation

Vision Hierarchy 4. High level Models 3. Mid level Segmentation 2. Putting together Multiple images 1. Low level processing on a single image 0. The physics of image formation

Camera  Lens focuses light  Charge-coupled device (CCD) detects  Bayer filter for colour  Individual spots in the digital image are “pixels”

Physics of Light  Important to know how light behaves  To guess the objects that generated what you see  Light travels straight  Can assume it is constant along a straight line  When it shines on a surface  Absorbed  Transmitted  Scattered  Combination  Simplifying assumptions  Light leaving a surface only due to light arriving  Light leaving of a specific colour only due to that colour arriving

Physics of Light  In general the amount reflected in some direction depends on  Direction of incoming and reflecting light  But simpler in some special cases:  Lambertian surfaces, e.g. cotton, matt surfaces  Specular surfaces, like a mirror  Modelled by combination

Shadows, Shading…Shading models  Shading model explains brightness of surfaces  allows you to reconstruct the objects in the scene  Local shading model  Surface light due only to sources visible at each point  Shadows appear when a patch can’t see sources  Advantages: easy to extract shape information  Global shading model  Also consider light reflected from other surfaces  Accurate, but too hard to extract shape information

Colour Perception  Color appearance is affected by  other nearby colors  adaptation to previous views  “state of mind”

Colour Perception  Humans have remarkable ability…  Know the colour a surface would have in white light  Know the colour of light arriving at eye  Know the colour of light falling on surface (colour constancy)  Colour should help computers recognise objects, but difficult

Vision Hierarchy 4. High level Models 3. Mid level Segmentation 2. Putting together Multiple images 1. Low level processing on a single image 0. The physics of image formation

Edge Detection  Edges useful, could indicate  Visible sharp edge on object  Object boundary  Shadow  Pattern on object  First smooth to remove noise  Then edge detect

Computer Vision - A Modern Approach Set: Color Slides by D.A. Forsyth

fine scale high threshold Computer Vision - A Modern Approach Set: Color Slides by D.A. Forsyth

coarse scale, high threshold Computer Vision - A Modern Approach Set: Color Slides by D.A. Forsyth

coarse scale low threshold Computer Vision - A Modern Approach Set: Color Slides by D.A. Forsyth

Texture  Depends on scale, can include: grass pebbles, hair  Segment image into areas of different texture  Advanced vision  Reconstruct shape from texture  Assume real texture is same on surface  Hence change is due to shape change  Texture elements get squashed or separated, or a different side visible  Humans very good at using this

Vision Hierarchy 4. High level Models 3. Mid level Segmentation 2. Putting together Multiple images 1. Low level processing on a single image 0. The physics of image formation

Multiple Views  Gives information about 3D distance  Methods  Two cameras (like human)  More cameras – 3 even better  Moving camera – same effect as multiple cameras  Maybe moving and zooming  “Structure from motion” problem  Can extract  shape of scene  Position of cameras (remember robot localisation)  Kinect has been a major development – widely used

Vision Hierarchy 4. High level Models 3. Mid level Segmentation 2. Putting together Multiple images 1. Low level processing on a single image 0. The physics of image formation

Segmentation  Group parts that are similar  Difficult problem  No comprehensive theory as yet  Combine high and low level  Top down – combine because same object  Bottom up – combine because locally similar  Example problems  Summarise video (similar sequences)  Find machined parts (lines, circles)  Find people (bodies, faces)  Find buildings by satellite (edge points, lines, polygons)  Example approaches  Find regions that have same texture/colour  Find blobs of same texture/colour/motion that look like limbs  Fit lines to edge points (grouping things that belong together)

Computer Vision - A Modern Approach Set: Color Slides by D.A. Forsyth

Segmentation  Group parts that are similar  Difficult problem  No comprehensive theory as yet  Combine high and low level  Top down – combine because same object  Bottom up – combine because locally similar  Example problems  Summarise video (similar sequences)  Find machined parts (lines, circles)  Find people (bodies, faces)  Find buildings by satellite (edge points, lines, polygons)  Example approaches  Find regions that have same texture/colour – works well  Find blobs of same texture/colour/motion that look like limbs  Fit lines to edge points (grouping things that belong together)

Human Approach  Gestalt (Psychology)  View as a whole group

Segmentation – Fit a Model  Group parts that are similar  Fit points to a line  Fit points to a curve  Fit to a movement in video (tracking)  Motion capture  Recognition  Surveillance  Targetting  Use high level knowledge for models also…

Vision Hierarchy 4. High level Models 3. Mid level Segmentation 2. Putting together Multiple images 1. Low level processing on a single image 0. The physics of image formation

Object Models  Modelbase  Collection of models of objects to be recognised  e.g. aeroplane, building, nuts and bolts  Method:  Look at features and guess what object they come from  Use the position of features to guess the pose (position & orientation) of the object  Generate a rendering of the object in that pose  Compare with the object seen and see how good your guess was  What are features?  Should be the same from different points of view  Lines  Circles/ellipses  curves

Figure from “Efficient model library access by projectively invariant indexing functions,” by C.A. Rothwell et al., Proc. Computer Vision and Pattern Recognition, 1992, copyright 1992, IEEE

Template matching  Look for parts of an image that match some template  Faces: oval, dark bar for eyes, bright bar for nose  Problem: test if some oval is a face  Solution: Classifiers  Computer can be automatically trained from a set of examples  Neural Networks is a good method

Figure from A Statistical Method for 3D Object Detection Applied to Faces and Cars, H. Schneiderman and T. Kanade, Proc. Computer Vision and Pattern Recognition, 2000, copyright 2000, IEEE

Figure from, “A general framework for object detection,” by C. Papageorgiou, M. Oren and T. Poggio, Proc. Int. Conf. Computer Vision, 1998, copyright 1998, IEEE

Template matching  Look for parts of an image that match some template  Faces: oval, dark bar for eyes, bright bar for nose  Problem: test if some oval is a face  Solution: Classifiers  Computer can be automatically trained from a set of examples  Neural Networks is a good method  Improvement: relations among templates  For face: recognise eyes, nose, mouth  Good for animal faces  For body: recognise arms legs head body  e.g. a horse is made of cylinders

Horses

Figure from “Efficient Matching of Pictorial Structures,” P. Felzenszwalb and D.P. Huttenlocher, Proc. Computer Vision and Pattern Recognition2000, copyright 2000, IEEE

Summing up Object Recognition  Much progress recently  Cheaper computation  Better understanding of component problems  Many techniques – which best? Probably combine  Templates work well,  but more work needed on how to group what’s seen, and template relations  Human comparison  Can recognise a huge number of objects  Robust to changing pattern/design  Robust to different backgrounds  Recognise at an abstract level  Can learn to recognise new object from very few examples

Practical Computer Vision  Controlling processes  e.g. an industrial robot or an autonomous vehicle  Detecting events  e.g. for visual surveillance  Finding images in large collections  Web (indexing, organising), military, copyright, stock photos  Difficult to deal with meaning  Interaction  e.g. as the input to a device for computer-human interaction  Modelling objects or environments  e.g. industrial inspection, medical image analysis or topographical modelling  Image based rendering  Difficult to produce models that look real  e.g. texture, dirt, weathering  Rebuild new scene from existing