Computer Vision CS 223b Jana Kosecka CA: Sanaz Mohatari

Slides:



Advertisements
Similar presentations
Introduction to Computer Vision Dr. Chang Shu COMP 4900C Winter 2008.
Advertisements

Sebastian Thrun CS223B Computer Vision, Winter Stanford CS223B Computer Vision, Winter 2005 Lecture 1 Intro and Image Formation Sebastian Thrun,
5/13/2015CAM Talk G.Kamberova Computer Vision Introduction Gerda Kamberova Department of Computer Science Hofstra University.
Visual Event Detection & Recognition Filiz Bunyak Ersoy, Ph.D. student Smart Engineering Systems Lab.
Oleh Tretiak © Computer Vision Lecture 1: Introduction.
Computer and Robot Vision I
CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.
Exchanging Faces in Images SIGGRAPH ’04 Blanz V., Scherbaum K., Vetter T., Seidel HP. Speaker: Alvin Date: 21 July 2004.
Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.
Math. Meth. in Vision and Imaging cmput 613/498 Lecture 1: Introduction and course overview Martin Jagersand.
CS 561, Sessions 27 1 Towards intelligent machines Thanks to CSCI561, we now know how to… - Search (and play games) - Build a knowledge base using FOL.
Overview of Computer Vision CS491E/791E. What is Computer Vision? Deals with the development of the theoretical and algorithmic basis by which useful.
Introduction to Computer and Human Vision Shimon Ullman, Ronen Basri, Michal Irani Assistants: Tal Hassner Eli Shechtman.
Vision Computing An Introduction. Visual Perception Sight is our most impressive sense. It gives us, without conscious effort, detailed information about.
Computing With Images: Outlook and applications
3D Computer Vision and Video Computing Introduction Instructor: Zhigang Zhu CSc Spring 2006 Reading in 3D Computer Vision and.
2007Theo Schouten1 Introduction. 2007Theo Schouten2 Human Eye Cones, Rods Reaction time: 0.1 sec (enough for transferring 100 nerve.
Computer Vision Dan Witzner Hansen Course web page:
Computational Vision Jitendra Malik University of California at Berkeley Jitendra Malik University of California at Berkeley.
Sebastian Thrun & Jana Kosecka CS223B Computer Vision, Winter 2007 Stanford CS223B Computer Vision, Winter 2007 Lecture 1 Course Overview Image Formation.
Introduction to Computer Vision CS223B, Winter 2005.
Computer Vision Marc Pollefeys COMP 256 Administrivia Classes: Mon & Wed, 11-12:15, SN115 Instructor: Marc Pollefeys (919) Room.
Multiple View Geometry Marc Pollefeys University of North Carolina at Chapel Hill Modified by Philippos Mordohai.
Computer Vision (CSE P576) Staff Prof: Steve Seitz TA: Jiun-Hung Chen Web Page
Jochen Triesch, UC San Diego, 1 COGS Visual Modeling Jochen Triesch & Martin Sereno Dept. of Cognitive Science UC.
1 Computer Vision Instructor: Prof. Sei-Wang Chen, PhD Office: Applied Science Building, Room 101 Communication: Tel:
Convergence of vision and graphics Jitendra Malik University of California at Berkeley Jitendra Malik University of California at Berkeley.
The University of Ontario CS 4487/9587 Algorithms for Image Analysis n Web page: Announcements, assignments, code samples/libraries,
Jason Li Jeremy Fowers Ground Target Following for Unmanned Aerial Vehicles.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
Computer Vision Lecture #1 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
CIS 601 Fall 2004 Introduction to Computer Vision and Intelligent Systems Longin Jan Latecki Parts are based on lectures of Rolf Lakaemper and David Young.
A Brief Overview of Computer Vision Jinxiang Chai.
G52IIP, School of Computer Science, University of Nottingham What we will learn … Topics relate to the use of computer to Acquire/generate Process/manipulate/store.
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 35 – Review for midterm.
A Bayesian Approach For 3D Reconstruction From a Single Image
CIS 601 Fall 2003 Introduction to Computer Vision Longin Jan Latecki Based on the lectures of Rolf Lakaemper and David Young.
Perceptual and Sensory Augmented Computing Visual Object Recognition Tutorial Visual Object Recognition Bastian Leibe & Computer Vision Laboratory ETH.
CSCE 5013 Computer Vision Fall 2011 Prof. John Gauch
CIS 489/689: Computer Vision Instructor: Christopher Rasmussen Course web page: vision.cis.udel.edu/cv February 12, 2003  Lecture 1.
Visual Scene Understanding (CS 598) Derek Hoiem Course Number: Instructor: Derek Hoiem Room: Siebel Center 1109 Class Time: Tuesday and Thursday.
Computer Vision Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications –building representations.
Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.
Conceptual and Experimental Vision Introduction R.Bajcsy, S.Sastry and A.Yang Fall 2006.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
12/7/10 Looking Back, Moving Forward Computational Photography Derek Hoiem, University of Illinois Photo Credit Lee Cullivan.
MASKS © 2004 Invitation to 3D vision. MASKS © 2004 Invitation to 3D vision Lecture 1 Overview and Introduction.
Vision Overview  Like all AI: in its infancy  Many methods which work well in specific applications  No universal solution  Classic problem: Recognition.
Computer Vision, CS766 Staff Instructor: Li Zhang TA: Yu-Chi Lai
Colour and Texture. Extract 3-D information Using Vision Extract 3-D information for performing certain tasks such as manipulation, navigation, and recognition.
Announcements Final is Thursday, March 18, 10:30-12:20 –MGH 287 Sample final out today.
MASKS © 2004 Invitation to 3D vision. MASKS © 2004 Invitation to 3D vision Lecture 1 Overview and Introduction.
What is Multimedia Anyway? David Millard and Paul Lewis.
Robot Vision SS 2009 Matthias Rüther ROBOT VISION 2VO 1KU Matthias Rüther.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
- photometric aspects of image formation gray level images
CS201 Lecture 02 Computer Vision: Image Formation and Basic Techniques
Instructor: Mircea Nicolescu
Computer Vision, Robotics, Machine Learning and Control Lab
Can computers match human perception?
Multiple-View Geometry for Image-Based Modeling (Course 42)
Introduction to Computer and Human Vision
Multiple View Geometry in Computer Vision
Professor Sebastian Thrun CAs: Dan Maynes-Aminzade and Mitul Saha
Computer Vision (CSE 490CV, EE400B)
CMSC 426: Image Processing (Computer Vision)
CSE (c) S. Tanimoto, 2007 Image Understanding
Filtering An image as a function Digital vs. continuous images
Wrap-up Computer Vision Spring 2019, Lecture 26
Presentation transcript:

Computer Vision CS 223b Jana Kosecka CA: Sanaz Mohatari Han-I Su

Logistics Grading: Homeworks (about every 2 weeks) 30% Midterm: 30% Final project: 40% Prerequisites: basic statistical concepts, geometry, linear algebra, calculus Recommended text: Introductory Techniques for 3D Computer Vision (E. Trucco, A. Verri, Prentice Hall, 1998) From Images to Geometric Models: Y. Ma, S. Soatto, J.Kosecka and S. Sastry, Springer Verlag 2003 Computer Vision: Algorithms and Applications R. Szeliski (preprint) Computer Vision a Modern Approach (D. Forsyth, J. Ponce, Prentice Hall 2002) Required Software MATLAB (with Image Processing toolbox) Open CV library

The Texts

Project Deadlines Check Web site for proposals, or develop your own Team up! Dates –Feb 13: Project proposals due –Mar 12: Final report due –Mar 15: Mini-workshop on projects

To define your own project… Find a mentor Generate project description for the Class Web site Gather data, process data Write suitable project proposal Examples: Learn to find sports videos on youtube.com Match images of same location at flickr.com Fly autonomous helicopter with camera

Today’s Goals Learn about CS223b Get Excited about Computer Vision Learn about image formation (Part 1)

What is vision? From the 3-D world to 2-D images: image formation (physics). –Domain of artistic reproduction (synthesis): painting, graphics. From 2-D images to the 3-D world: image analysis (mathematical modeling, inference). –Domain of vision: biological (eye+brain), computational

IMAGE SYNTHESIS: simulation of the image-formation process Pinhole (perspective) imaging in most ancient civilizations. Euclid, perspective projection, 4th century B.C., Alexandria (Egypt) Pompeii frescos, 1st century A.D. (ubiquitous). Geometry understood very early on, then forgotten. Image courtesy of C. Taylor

PERSPECTIVE IMAGING (geometry) Image courtesy of C. Taylor Re-discovered and formalized in the Renaissance: Fillippo Brunelleschi, first Renaissance artist to paint with correct perspective,1413 “Della Pictura”, Leon Battista Alberti, 1435, first treatise Leonardo Da Vinci, stereopsis, shading, color, 1500s Raphael, 1518

Biological motivations Understanding visual sensing modality and its role - We have no difficulties to navigate, manipulate objects recognize familiar places and faces - How can we successfully carry out all these everyday tasks ? - How does the visual perception mediates these activities ? - Overall system Sensation and perception – integrate and interpret sensory readings Behavior – control muscles and glands to mediate some behavior Memory – long term memory (declarative – faces, places, events) short term memory (procedural – associations, skills) Higher level functions – internal models and representations of the environments

Goals of Computer Vision Build machines and develop algorithms which can automatically replicate some functionalities of biological visual system -Systems which navigate in cluttered environments -Systems which can recognize objects, activities -Systems which can interact with humans/world Synergies with other disciplines and various applications Artificial Intelligence - robotics, natural language understanding Vision as a sensor – medical imaging, Geospatial Imaging, robotics, visual surveillance, inspection

Computer Vision Visual Sensing Images I(x,y) – brightness patterns - image appearance depends on structure of the scene - material and reflectance properties of the objects - position and strength of light sources

[Felleman & Van Essen, 1991] This is the part of your brain that processes visual information VISUAL INFORMATION PROCESSING

This is how a computer represents it

And so is this …

And so are these! We need to extract some “invariant”, i.e. what is common to all these images (they are all images of my office)

BUMMER! THIS IS IMPOSSIBLE! –THM: [Weiss, 1991]: There exists NO generic viewpoint invariant! –THM: [Chen et al., 2003]: There exists NO photometric invariant!! So, how do we (primates) solve the problem?

EXAMPLE OF A (VERY COMMON) SENSOR NETWORK Retina performs distributed computation: –Contrast adaptation (lateral inhibition) –Enhancement/edgels (e.g. Mach bands) –Motion detection (leap frog) Does the human visual system really solve the problem? Not really!

Optical Illusion

Checker A and B are of the same gray-level value

SIDE EFFECTS OF LATERAL INHIBITION THESE ARE NOT SPIRALSTHESE ARE NOT SPIRALS AND THEY ARE NOT MOVING!AND THEY ARE NOT MOVING! Cause Peripheral Drift

Challenges/Issues About 40% of our brain is devoted to vision We see immediately and can form and understand images instantly Several strategies for forming a representation of the scenes Task dependent attentional mechanisms Detailed representations are often not necessary Different approaches in the past Animate Vision (biologically inspired), Purposive Vision (depending on the task/purpose) - have many shortcomings

Recovery of the properties of the environment Recovery of the properties of the environment from single or multiple views using various visual cues from single or multiple views using various visual cues Vision problems Segmentation Segmentation Recognition Recognition Reconstruction Reconstruction Vision Based Control - Action Vision Based Control - Action stereo stereo motion motion shading shading texture texture brightness, color brightness, color Visual Cues

Example 1:Stereo See

Example 2: Structure From Motion

Example 2: Structure From Motion

Example 2: Structure From Motion

Example 3: 3D Modeling

Example 5: Segmentation

Example 6: Classification

Example 9: Human Vision

Segmentation – partition image into separate objects Clustering and search algorithms in the space of visual cues Clustering and search algorithms in the space of visual cues

Figure from, “A general framework for object detection,” by C. Papageorgiou, M. Oren and T. Poggio, Proc. ICCV, copyright 1998, IEEE Recognition by finding patterns Figure from A Statistical Method for 3D Object Detection Applied to Faces and Cars, H. Schneiderman and T. Kanade, CVPR, Face detection Human detection

Samples from motorbikes appearance model (Fergus’s et. al results) Recognition by finding patterns

Focus of this course The focus of the course : 1. Geometry of Single and Multiple Views Shape and Motion Recovery, Matching, Alignment Problems Reconstruction (from 2D to 3D) Computation of Pictorial cues – shading, texture, blur, contour, stereo, motion cues 2. Object Detection and Recognition Object representation, detection in cluttered scenes Recognition of object categories

How to reliably recover and represent the geometric model from single image or video and camera motion/pose Representation issues depends on the task/applications Image-based rendering, Computer Graphics Virtual and Augmented Reality Vision based control, surveillance, target tracking Human computer interaction Medical imaging (alignment, monitoring of change) Video Analysis 1. Geometry of Single and Multiple Views, Video

Vision and Computer Graphics - image based rendering techniques - 3D reconstruction from multiple views or video - single view modeling - view morphing (static and dynamic case)

Modeling with Multiple Images University High School, Urbana, Illinois Three of Twelve Images, courtesy Paul Debevec

Final Model

Visual surveillance wide area surveillance, traffic monitoring Interpretation of different activities

Virtual and Augmented Reality, Human computer Interaction Virtual object insertion various gesture based interfaces Interpretation of human activities Enabling technologies of intelligent homes, smart spaces

Topics Image Formation, Representation of Camera Motion, Camera Calibration Image Features – filtering, edge detection, point feature detection Image alignment, 3D structure and motion recovery, stereo Analysis of dynamic scenes, detection, tracking Object detection and object recognition

Surveillance

Image Morphing, Mosaicing, Alignment Images of CSL, UIUC

3D RECONSTRUCTION FROM MULTIPLE VIEWS: GEOMETRY AND PHOTOMETRY jin-soatto-yezzi; image courtesy: j-y bouguet - intel

estimated shape laser-scanned, manually polished jin-soatto-yezzi

with h. jin

APPLICATIONS – Unmanned Aerial Vehicles (UAVs) Berkeley Aerial Robot (BEAR) Project Rate: 10Hz Accuracy: 5cm, 4 o