Computer Vision Prof. Trevor Darrell (Berleley)

Slides:



Advertisements
Similar presentations
C280, Computer Vision Prof. Trevor Darrell Lecture 2: Image Formation.
Advertisements

CP411 Computer Graphics, Wilfrid Laurier University Introduction # 1 Welcome to CP411 Computer Graphics 2012 Instructor: Dr. Hongbing Fan Introduction.
5/13/2015CAM Talk G.Kamberova Computer Vision Introduction Gerda Kamberova Department of Computer Science Hofstra University.
Geometry 1: Projection and Epipolar Lines Introduction to Computer Vision Ronen Basri Weizmann Institute of Science.
Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.
CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.
C280, Computer Vision Prof. Trevor Darrell
Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.
Geometry of Images Pinhole camera, projection A taste of projective geometry Two view geometry:  Homography  Epipolar geometry, the essential matrix.
Overview of Computer Vision CS491E/791E. What is Computer Vision? Deals with the development of the theoretical and algorithmic basis by which useful.
A Brief Overview of Computer Vision Jinxiang Chai Many slides are borrowed from James Hays and Steve Seitz.
Algorithms and Applications in Computer Vision
Computing With Images: Outlook and applications
Computer Vision (CSE 576) Staff Steve Seitz Rick Szeliski ( ) Ian Simon? (
Introduction to Computer Vision CS223B, Winter 2005.
Computer Vision CS302 Data Structures Dr. George Bebis
Computer Vision (CSE P576) Staff Prof: Steve Seitz TA: Jiun-Hung Chen Web Page
Computer Vision (CSE/EE 576) Staff Prof: Steve Seitz TA: Aseem Agarwala Web Page
Instructors TAs Web Page CSE 455: Computer Vision Neel Joshi Ian Simon Ira Kemelmacher
EEM 561 Machine Vision Week 10 :Image Formation and Cameras
What is Computer Vision?
Image formation How are objects in the world captured in an image?
UW Computer Vision (CSE 576)
Staff Web Page Handouts overload list intro slides image filtering slides Computer Vision (CSE.
Prepared by: - Mr. T.R.Shah, Lect., ME/MC Dept., U. V. Patel College of Engineering. Ganpat Vidyanagar. Digital Image Processing & Machine Vision – An.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
CMSC 426: Image Processing (Computer Vision) David Jacobs.
Computer Vision ECE CS 543 / ECE 549 University of Illinois Instructors: Derek Hoiem, David Forsyth TA: Varsha Hedau Presenter: Derek Hoiem.
Staff Web Page Handouts signup sheet intro slides image filtering slides Computer Vision (CSE.
EECS 274 Computer Vision Introduction. What is computer vision? Terminator 2.
CIS 601 Fall 2004 Introduction to Computer Vision and Intelligent Systems Longin Jan Latecki Parts are based on lectures of Rolf Lakaemper and David Young.
CSCE 643: Introduction to Computer Vision
A Brief Overview of Computer Vision Jinxiang Chai.
CS 1114: Introduction to Computing Using MATLAB and Robotics Prof. Noah Snavely CS1114
Introduction to Computer Vision
CSE 803: Computer Vision Naveed Sarfraz Khattak. What is Computer Vision?
Program Studi S-1 Teknik Informatika FMIPA Universitas Padjadjaran
CS-498 Computer Vision Week 1, Day 1 Computer Vision Examples Overview of the Course Introduction to Images 1.
Seminar on Media Technology Computer Vision Albert Alemany Font.
CIS 601 Fall 2003 Introduction to Computer Vision Longin Jan Latecki Based on the lectures of Rolf Lakaemper and David Young.
CSCE 5013 Computer Vision Fall 2011 Prof. John Gauch
Computer Vision cmput 499 Lecture 1: Introduction and course overview Martin Jagersand.
Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.
National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Using Image Data in Your Research Kenton McHenry, Ph.D. Research.
Why is computer vision difficult?
12/7/10 Looking Back, Moving Forward Computational Photography Derek Hoiem, University of Illinois Photo Credit Lee Cullivan.
Computer Vision cmput 428/615 Lecture 1: Introduction and course overview Martin Jagersand.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Subject Name: Computer Graphics Subject Code: Textbook: “Computer Graphics”, C Version By Hearn and Baker Credits: 6 1.
Computer Vision, CS766 Staff Instructor: Li Zhang TA: Yu-Chi Lai
Instructor: Guodong Guo
DSP Laboratory – TELKOM UNIVERSITY
Staff Web Page Handouts signup sheet intro slides image filtering slides Computer Vision (CSE.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
Lecture 1: Introduction
Computer Graphics.
Photorealistic Rendering vs. Interactive 3D Graphics
The Camera : Computational Photography
Instructor: Mircea Nicolescu
CS4670 / 5670: Computer Vision Lecture 12: Cameras Noah Snavely
Study the mathematical relations between corresponding image points.
Lecture 13: Cameras and geometry
Lecturer: Dr. A.H. Abdul Hafez
吴心筱 计算感知 吴心筱
Computer Vision (CSE 490CV, EE400B)
CMSC 426: Image Processing (Computer Vision)
Computer Vision (CSE 455) Staff Web Page Handouts
Computer Vision (CSE 455) Staff Web Page Handouts
Wrap-up Computer Vision Spring 2019, Lecture 26
Presentation transcript:

Computer Vision Prof. Trevor Darrell (Berleley) Modified by: Dr. Mohammad Javad Abdollahifard mj_abdollahi@aut.ac.ir

Today Administrivia “What is vision?” Schedule Introductions Image formation

Prerequisites This course is appropriate as a first course for graduate students with a EECS background, which should have prepared the students with these essential prerequisites: Data structures A good working knowledge of MATLAB programming (or willingness and time to pick it up quickly!) Linear algebra Vector calculus The course does not assume prior imaging experience, computer vision, image processing, or graphics

Text The primary course text will be Rick Szeliski’s draft Computer Vision: Algorithms and Applications; we will use an online copy of the June 19th draft. The secondary text is Forsyth and Ponce, Computer Vision: A Modern Approach.

Primary Text

Secondary Text

Matlab Problem sets and projects will involve Matlab programming (you are free to use alternative packages). Matlab runs on all the Instructional Windows and UNIX systems. Instructions and toolkits are described in http://inst.eecs.berkeley.edu/cgi-bin/pub.cgi?file=matlab.help.

Grading There will be three components to the course grade Computer exercises (20%) Final project (including evaluation of proposal document, implementation, presentation, and final report) (30%) Final Exam (50%) Additional (up to 10%) In addition, strong class participation can offset negative performance in any one of the above components.

Problem sets Pset0 – Basic Image Manipulation in Matlab Pset1 – Filtering and Features Pset2 – Geometry and Calibration Pset3 – Recognition Pset4 – Stereo and Motion Can discuss, but must submit individual work

Final project Significant novel implementation of technique related to course content Teams of 2 encouraged (document role!) Or journal length review article (no teams) Three components: proposal document (no more than 5 pages) in class results presentation (10 minutes) final write-up (no more than 15 pages)

Final Project (my suggestions) Real time face detection and recognition Cross correlation based conditional texture synthesis Sparse representation and manifold models for conditional texture synthesis Super-resolution land cover mapping via sparse representation Using Bnadlet and Grouplet for Conditional texture synthesis

Class Participation Class participation includes showing up being able to articulate key points from last lecture having read assigned sections and being able to “fill in the blank” during the lecture I won’t cold call, but will solicit volunteers Strong in-class participation can offset poor performance in one of the other grade components.

*Course goals….(broadly speaking) principles of image formation convolution and image pyramids local feature analysis multi-view geometry image warping and stitching structure from motion visual recognition image-based rendering

What is computer vision? Done? Is it enough to connect a cam to a PC?

What is computer vision? Automatic understanding of images and video Computing properties of the 3D world from visual data (measurement) Algorithms and representations to allow a machine to recognize objects, people, scenes, and activities. (perception and interpretation)

Vision for measurement Multi-view stereo for community photo collections Real-time stereo Structure from motion NASA Mars Rover Pollefeys et al. Goesele et al. Slide credit: L. Lazebnik

Vision for perception, interpretation Objects Activities Scenes Locations Text / writing Faces Gestures Motions Emotions… amusement park sky The Wicked Twister Cedar Point Ferris wheel ride ride 12 E Lake Erie water ride tree tree people waiting in line people sitting on ride umbrellas tree maxair carousel deck bench tree pedestrians 17

Related disciplines Computer vision Artificial intelligence Graphics Machine learning Computer vision Cognitive science study of the mind and its processes Image processing Algorithms

Vision and graphics Images Model Inverse problems: analysis and synthesis. Machine vision is complex partially because it is an underdetermined inverse problem which can be solved using a model (either probabilistic or structural)

Why vision? As image sources multiply, so do applications Relieve humans of boring, easy tasks Enhance human abilities: human-computer interaction, visualization Perception for robotics / autonomous agents Organize and give access to visual content

Why vision? Images and video are everywhere! Personal photo albums Movies, news, sports Surveillance and security Medical and scientific images Slide credit; L. Lazebnik

*Again, what is computer vision? Mathematics of geometry of image formation? Statistics of the natural world? Models for neuroscience? Engineering methods for matching images? Science Fiction?

Vision Demo? we’re not quite there yet…. Terminator 2 Clips: terminator 2, enemy of the state (from UCSD “Fact or Fiction” DVD) we’re not quite there yet…. Terminator 2

Every picture tells a story Goal of computer vision is to write computer programs that can interpret images

Can computers match (or beat) human vision? If you can write a formula for it, computers can excel Computer vision can’t solve the whole problem (yet), so breaks it down into pieces. Many of the pieces have important applications. Yes and no (but mostly no!) humans are much better at “hard” things computers can be better at “easy” things

Human perception has its shortcomings… Looks like Ex-President Bill Clinton and his Vice President Al Gore, right? Wrong... It's Clinton's face twice, with two different haircuts Example where humans make mistakes that computers can avoid Sinha and Poggio, Nature, 1996 More illusions on : http://resources.mhs.vic.edu.au/business/Documents/Illusions.htm

Copyright A.Kitaoka 2003

Current state of the art The next slides show some examples of what current vision systems can do

Earth viewers (3D modeling) Image from Microsoft’s Virtual Earth (see also: Google Earth)

3D reconstruction based on a collection of images Photosynth 3D reconstruction based on a collection of images http://labs.live.com/photosynth/ Based on Photo Tourism technology developed by Noah Snavely, Steve Seitz, and Rick Szeliski

Photo Tourism overview Photo Explorer Scene reconstruction Input photographs Relative camera positions and orientations Point cloud Sparse correspondence System for interactive browsing and exploring large collections of photos of a scene. Computes viewpoint of each photo as well as a sparse 3d model of the scene.

Photo Tourism overview

Optical character recognition (OCR) Technology to convert scanned docs to text If you have a scanner, it probably came with OCR software Digit recognition, AT&T labs http://www.research.att.com/~yann/ License plate readers http://en.wikipedia.org/wiki/Automatic_number_plate_recognition

Face detection Many new digital cameras now detect faces Why would this be useful? Main reason is focus. Also enables “smart” cropping. Many new digital cameras now detect faces Canon, Sony, Fuji, …

Smile detection? Sony Cyber-shot® T70 Digital Still Camera

Object recognition (in supermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching for items. When an item is detected and recognized, the cashier verifies the quantity of items that were found under the basket, and continues to close the transaction. The item can remain under the basket, and with LaneHawk,you are assured to get paid for it… “

Face recognition Who is she?

Vision-based biometrics “How the Afghan Girl was Identified by Her Iris Patterns” Read the story

Login without a password… Face recognition systems now beginning to appear more widely http://www.sensiblevision.com/ Fingerprint scanners on many new laptops, other devices

Object recognition (in mobile phones) This is becoming real: Microsoft Research Point & Find, Nokia SnapTell.com (now amazon)

Snaptell http://snaptell.com/demos/DemoLarge.htm

Nokia Point and Tell… http://conversations.nokia.com/home/2008/09/point-and-fin-1.html

Special effects: shape capture The Matrix movies, ESC Entertainment, XYZRGB, NRC

Special effects: motion capture Pirates of the Carribean, Industrial Light and Magic Click here for interactive demo

Sports Sportvision first down line Nice explanation on www.howstuffworks.com

Slide content courtesy of Amnon Shashua Smart cars Mobileye Vision systems currently in high-end BMW, GM, Volvo models By 2010: 70% of car manufacturers. Video demo Slide content courtesy of Amnon Shashua

Slide content courtesy of Amnon Shashua Smart cars Mobileye Vision systems currently in high-end BMW, GM, Volvo models By 2010: 70% of car manufacturers. Video demo Slide content courtesy of Amnon Shashua

Vision-based interaction (and games) Digimask: put your face on a 3D avatar. Nintendo Wii has camera-based IR tracking built in. See Lee’s work at CMU on clever tricks on using it to create a multi-touch display! “Game turns moviegoers into Human Joysticks”, CNET Camera tracking a crowd, based on this work.

Vision in space Vision systems (JPL) used for several tasks NASA'S Mars Exploration Rover Spirit captured this westward view from atop a low plateau where Spirit spent the closing months of 2007. Vision systems (JPL) used for several tasks Panorama stitching 3D terrain modeling Obstacle detection, position tracking For more, read “Computer Vision on Mars” by Matthies et al.

NASA’s Mars Spirit Rover Robotics NASA’s Mars Spirit Rover http://en.wikipedia.org/wiki/Spirit_rover http://www.robocup.org/

Medical imaging Image guided surgery 3D imaging Grimson et al., MIT MRI, CT

Current state of the art You just saw examples of current systems. Many of these are less than 5 years old This is a very active research area, and rapidly changing Many new apps in the next 5 years To learn more about vision applications and companies David Lowe maintains an excellent overview of vision companies http://www.cs.ubc.ca/spider/lowe/vision.html

Syllabus / Schedule (see handout) http://tinyurl.com/UCBC280CAL Image Formation Color Image Filtering Pyramids & Regularization Feature Detection and Matching Geometric Alignment Geometric Image Stitching Photometric Image Stitching Recognition Stereo Optic Flow Dense Motion Models Shape from Silhouettes Shape from Shading and Texture Surface Models Segmentation SFM IBR & HDR…

And now, who are you? And what do you expect to get out of this class? Previous experience in vision, learning, graphics? Research agenda? (Project topics?)

Let’s get started: Image formation How are objects in the world captured in an image?

Physical parameters of image formation Geometric Type of projection Camera pose Optical Sensor’s lens type focal length, field of view, aperture Photometric Type, direction, intensity of light reaching sensor Surfaces’ reflectance properties

Image formation Let’s design a camera Idea 1: put a piece of film in front of an object Do we get a reasonable image? Slide by Steve Seitz

Pinhole camera Add a barrier to block off most of the rays This reduces blurring The opening is known as the aperture How does this transform the image? Slide by Steve Seitz

Pinhole camera Pinhole camera is a simple model to approximate imaging process, perspective projection. Image plane Virtual image pinhole If we treat pinhole as a point, only one ray from any given point can enter the camera. Fig from Forsyth and Ponce

Camera obscura In Latin, means ‘dark room’ "Reinerus Gemma-Frisius, observed an eclipse of the sun at Louvain on January 24, 1544, and later he used this illustration of the event in his book De Radio Astronomica et Geometrica, 1545. It is thought to be the first published illustration of a camera obscura..." Hammond, John H., The Camera Obscura, A Chronicle http://www.acmi.net.au/AIC/CAMERA_OBSCURA.html

Camera obscura An attraction in the late 19th century Jetty at Margate England, 1898. An attraction in the late 19th century Around 1870s http://brightbytes.com/cosite/collection2.html Adapted from R. Duraiswami

Camera obscura at home http://blog.makezine.com/archive/2006/02/how_to_room_sized_camera_obscu.html Sketch from http://www.funsci.com/fun3_en/sky/sky.htm

Perspective effects

Perspective effects Far away objects appear smaller Forsyth and Ponce

Perspective effects

Perspective effects Parallel lines in the scene intersect in the image Converge in image on horizon line Image plane (virtual) pinhole Scene

Slide Credits Slides 14-21, 55-66: Kristen Grauman Slides 23-40,43-52: Steve Seitz and others, as marked…

Next time Continue with Image Formation Readings for today: Szeliski, Ch. 1 Readings for next lecture: Szeliski 2.1-2.3.1, Forsyth and Ponce 1.1, 1.4 (optional). Pset 0 released tomorrow, due following Friday