Computer Vision cmput 428/615 Lecture 1: Introduction and course overview Martin Jagersand.

Slides:



Advertisements
Similar presentations
CP411 Computer Graphics, Wilfrid Laurier University Introduction # 1 Welcome to CP411 Computer Graphics 2012 Instructor: Dr. Hongbing Fan Introduction.
Advertisements

5/13/2015CAM Talk G.Kamberova Computer Vision Introduction Gerda Kamberova Department of Computer Science Hofstra University.
Check web page often T,R 12:30-1:50pm PSCB (Phy Sci Class Blg) 140 Course intro handout.
Stereo.
University of British Columbia CPSC 314 Computer Graphics Jan-Apr 2005 Tamara Munzner Introduction Week 1, Wed Jan 5.
CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.
C280, Computer Vision Prof. Trevor Darrell
Graphics-1 Gentle Introduction to Computer Graphics Based on: –David Brogan’s “Introduction to Computer Graphics” Course Slides, University of Virginia.
Math. Meth. in Vision and Imaging cmput 613/498 Lecture 1: Introduction and course overview Martin Jagersand.
A Brief Overview of Computer Vision Jinxiang Chai Many slides are borrowed from James Hays and Steve Seitz.
Algorithms and Applications in Computer Vision
Virtual Reality. What is virtual reality? a way to visualise, manipulate, and interact with a virtual environment visualise the computer generates visual,
Hand-Eye Coordination and Vision-based Interaction Martin Jagersand Collaborators: Zach Dodds, Greg Hager, Andreas Pichler.
BPC: Art and Computation – Spring 2007 Overview of Spring Semester Tools and Technologies Glenn Bresnahan
Computing With Images: Outlook and applications
Sebastian Thrun & Jana Kosecka CS223B Computer Vision, Winter 2007 Stanford CS223B Computer Vision, Winter 2007 Lecture 1 Course Overview Image Formation.
Computer Vision CS302 Data Structures Dr. George Bebis
Computer Vision (CSE P576) Staff Prof: Steve Seitz TA: Jiun-Hung Chen Web Page
Instructors TAs Web Page CSE 455: Computer Vision Neel Joshi Ian Simon Ira Kemelmacher
The University of Ontario CS 4487/9587 Algorithms for Image Analysis n Web page: Announcements, assignments, code samples/libraries,
What is Computer Vision?
UW Computer Vision (CSE 576)
Staff Web Page Handouts overload list intro slides image filtering slides Computer Vision (CSE.
Mosaics CSE 455, Winter 2010 February 8, 2010 Neel Joshi, CSE 455, Winter Announcements  The Midterm went out Friday  See to the class.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
CMSC 426: Image Processing (Computer Vision) David Jacobs.
Computer Vision Lecture #1 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
Computer Vision ECE CS 543 / ECE 549 University of Illinois Instructors: Derek Hoiem, David Forsyth TA: Varsha Hedau Presenter: Derek Hoiem.
Staff Web Page Handouts signup sheet intro slides image filtering slides Computer Vision (CSE.
EECS 274 Computer Vision Introduction. What is computer vision? Terminator 2.
CSCE 643: Introduction to Computer Vision
A Brief Overview of Computer Vision Jinxiang Chai.
Introduction to ECE432 Instructor: Ying Wu Dept. Electrical & Computer Engr. Northwestern University Evanston, IL 60208
CS 1114: Introduction to Computing Using MATLAB and Robotics Prof. Noah Snavely CS1114
Introduction to Computer Vision
CSE 803: Computer Vision Naveed Sarfraz Khattak. What is Computer Vision?
Program Studi S-1 Teknik Informatika FMIPA Universitas Padjadjaran
CS-498 Computer Vision Week 1, Day 1 Computer Vision Examples Overview of the Course Introduction to Images 1.
Seminar on Media Technology Computer Vision Albert Alemany Font.
CIS 601 Fall 2003 Introduction to Computer Vision Longin Jan Latecki Based on the lectures of Rolf Lakaemper and David Young.
CSCE 5013 Computer Vision Fall 2011 Prof. John Gauch
Robotics cmput 412 Intro to Real World Robotics Real World Robotics Upcoming course project Upcoming course project Martin Jagersand This presentation.
Computer Graphics Lecture 01 Fasih ur Rehman. About Instructor MSc in Electronics and MSc in Systems Engineering in 1994 and 1996 respectively from Quaid-e-Azam.
Computer Vision cmput 499 Lecture 1: Introduction and course overview Martin Jagersand.
Visual Scene Understanding (CS 598) Derek Hoiem Course Number: Instructor: Derek Hoiem Room: Siebel Center 1109 Class Time: Tuesday and Thursday.
Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.
Conceptual and Experimental Vision Introduction R.Bajcsy, S.Sastry and A.Yang Fall 2006.
National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Using Image Data in Your Research Kenton McHenry, Ph.D. Research.
12/7/10 Looking Back, Moving Forward Computational Photography Derek Hoiem, University of Illinois Photo Credit Lee Cullivan.
CS576 Computer Vision Instructor: Dr. Yu-Wing Tai No official TA Tuesday and Thursday 4:00pm – 5:30pm Rm 3444 E3-1 Building 1.
Hand-Eye Coordination and Vision-based Interaction / cmput610 Martin Jagersand.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Subject Name: Computer Graphics Subject Code: Textbook: “Computer Graphics”, C Version By Hearn and Baker Credits: 6 1.
Computer Vision, CS766 Staff Instructor: Li Zhang TA: Yu-Chi Lai
Instructor: Guodong Guo
DSP Laboratory – TELKOM UNIVERSITY
Staff Web Page Handouts signup sheet intro slides image filtering slides Computer Vision (CSE.
MASKS © 2004 Invitation to 3D vision. MASKS © 2004 Invitation to 3D vision Lecture 1 Overview and Introduction.
What is Multimedia Anyway? David Millard and Paul Lewis.
Instructor: Dr. Shereen Aly Taie 1.  Broad introduction to Computer Graphics ◦ Software ◦ Hardware ◦ Applications  CAD برامج التصميم 2.
COS 429: Computer Vision Thanks to Chris Bregler.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
Computer Vision Prof. Trevor Darrell (Berleley)
Intro to Real World Robotics Upcoming course project Martin Jagersand
Lecture 1: Introduction
CS201 Lecture 02 Computer Vision: Image Formation and Basic Techniques
Image Based Modeling and Rendering (PI: Malik)
Computer Vision (CSE 490CV, EE400B)
CMSC 426: Image Processing (Computer Vision)
Introduction to spagetti and meatballs
Presentation transcript:

Computer Vision cmput 428/615 Lecture 1: Introduction and course overview Martin Jagersand

Administrivia Classes: Tue, Thu 2-3:20pm,Classes: Tue, Thu 2-3:20pm, Labs: csc-235 Tue, Wed 5-7:50pm, tutorials, Q/A, student demos)Labs: csc-235 Tue, Wed 5-7:50pm, tutorials, Q/A, student demos) Instructor: Martin Jagersand Martin Jagersand Labs: Karteek Popuri Karteek Popuri Prereq: Compulsory math, numerical methods (e.g. 340, 418)Prereq: Compulsory math, numerical methods (e.g. 340, 418) Textbooks:Textbooks: –Multiple View Geometry in Computer Vision Richard Hartley and Andrew Zisserman Cambridge University Press, Multiple View Geometry in Computer VisionMultiple View Geometry in Computer Vision –Computer Vision: Algorithms and Applications –Computer Vision: Algorithms and Applications Richard Szeliski, Springer 2011 (on-line or printed) Webpage:Webpage: –

On-line Text

Administrivia Grading (undergrad) 40% Assignments (6-7%) and in-lab demos/quiz (3-4%) throughout term 25% Midterm Tue Feb 24 35% Final Graduate 24% Assign/demo, 16% Midterm, 30% Final 5% research article reading and presentation 5%+20% Project proposal P1 and final report/demo P2 Project has a survey and learning part where you read the literature, try some softwares, propose a particular topic and write report P1. Second part, P2, is a report and demo on an implementation you did. Groups are encouraged. Each individual identifiable part in a bigger project. Lab: Room csc2-35, SW: Matlab + Mexvision, Linux. PtGrey Grasshopper Cameras

What is Computer Vision? How to infer salient properties of 3-D world from 2-D images or video ¤ What is salient? ¤ How to deal with loss of information going from 3-D to 2-D projection?

Biological vision For humans and animals vision is powerful yet effortless.For humans and animals vision is powerful yet effortless. A large part of the brain is devoted to visual processing (40%)A large part of the brain is devoted to visual processing (40%)

Computer Vision There are methods for several subdomains e.g.There are methods for several subdomains e.g. 1.2D, 3D, hom Pose tracking in video 2.3D modeling from 2D im/vid 3.Vision-based motion control (arm, hand, walk, (robots 412) But general, human-like vision unsolved/hardBut general, human-like vision unsolved/hard Easier subdomainsEasier subdomains 1.Industrial machine vision Object recognition cmput 399

How much “3D” do we really see? Will the scissors cut the paper in the middle?Will the scissors cut the paper in the middle?

ambiguity Will the scissors cut the paper in the middle? NO!Will the scissors cut the paper in the middle? NO! ambiguity

Size illusions

Tasks in Vision Perception: What: Label what is in a scene: What people, animals, objects…What: Label what is in a scene: What people, animals, objects…Action: Where: Determine the coordinates of where something is. (Mishkin, Ungleider)Where: Determine the coordinates of where something is. (Mishkin, Ungleider) How: Use vision to perform some physical manipulation task. (Goodale etc.)How: Use vision to perform some physical manipulation task. (Goodale etc.)

Compare in Human vision: Dorsal and Ventral Pathways

The structure of ambient light

Only in simple settings does physics based reflectance modeling work

Measuring light vs. measuring scene properties

The “Vision Problem” InputOutput Vision Algorithm

The “Vision Problem” InputOutput Vision Algorithm

What is an image? What is an object? Recognizing objects

Allies and Inspiration Math/geometry Image Processing Cognitive Science Physics, Optics InputOutput Vision Algorithm Engineering Biology, Psychology Computer Science Graphics, AI, NNets,...

What: Recognizing people and objects How do we determine that these are the same objects? Two approaches: 1.Shape 2.Texture

Model based recognition: Find and match the shape Example: Find the outlines of objects. Try to generate 3D or some pseudo 3D geometric description Check database for a similar geometric object

Model based recognition: Techniques. Stereo: From two (or more) images, determine the geometry of the scene by matching corresponding areas of the imagesFrom two (or more) images, determine the geometry of the scene by matching corresponding areas of the images RIGHT IMAGE PLANE LEFT IMAGE PLANE RIGHT FOCAL POINT LEFT FOCAL POINT BASELINE d FOCAL LENGTH f

Apperance based recognition Match the 2D visual “texture” Methods: 1.Spatial: Determine “likeness” by convolution. (Like solving a puzzle) 2.“Frequency”: Transform and measure in fourier or KL Basis space.

RIGHT IMAGE PLANE LEFT IMAGE PLANE RIGHT FOCAL POINT LEFT FOCAL POINT BASELINE d FOCAL LENGTH f Where Use stereo to determine 3D location with respect to camera.Use stereo to determine 3D location with respect to camera. If we know where the camera is we can determine 3D world positionIf we know where the camera is we can determine 3D world position

How (to act in the world)?

Commercial products integrates tidbits of computer vision Many new digital cameras now detect facesMany new digital cameras now detect faces –Canon, Sony, Fuji, …

Smile detection? Sony Cyber-shot® T70 Digital Still Camera

Object recognition (in supermarkets) LaneHawk by EvolutionRobotics “A smart camera is flush-mounted in the checkout lane, continuously watching for items. When an item is detected and recognized, the cashier verifies the quantity of items that were found under the basket, and continues to close the transaction. The item can remain under the basket, and with LaneHawk,you are assured to get paid for it… “

Login without a password… Fingerprint scanners on many new laptops, other devices Face recognition systems now beginning to appear more widely

Object recognition (in mobile phones) This is becoming real:This is becoming real: – Microsoft Research –Point & Find, Nokia Point & FindNokiaPoint & FindNokia –SnapTell.com (now amazon)

Snaptell

The Matrix movies, ESC Entertainment, XYZRGB, NRC Special effects: shape capture

Pirates of the Carribean, Industrial Light and Magic Click here for interactive demo Special effects: motion capture

Sports Sportvision first down line Nice explanation on

Smart cars MobileyeMobileyeMobileye –Vision systems currently in high-end BMW, GM, Volvo models –By 2010: 70% of car manufacturers. –Video demo Video demoVideo demo Slide content courtesy of Amnon Shashua

Smart cars MobileyeMobileyeMobileye –Vision systems currently in high-end BMW, GM, Volvo models –By 2010: 70% of car manufacturers. –Video demo Video demoVideo demo Slide content courtesy of Amnon Shashua

Vision-based interaction (and games) Nintendo Wii has camera-based IR tracking built in. See Lee’s work at CMU on clever tricks on using it to create a multi-touch display!Lee’s work at CMU multi-touch display DigimaskDigimask: put your face on a 3D avatar. “Game turns moviegoers into Human Joysticks”, “Game turns moviegoers into Human Joysticks”, CNET Camera tracking a crowd, based on this work.this work

Vision in space Vision systems (JPL) used for several tasks Panorama stitching 3D terrain modeling Obstacle detection, position tracking For more, read “Computer Vision on Mars” by Matthies et al.Computer Vision on Mars NASA'S Mars Exploration Rover Spirit NASA'S Mars Exploration Rover Spirit captured this westward view from atop a low plateau where Spirit spent the closing months of 2007.

Medical imaging Image guided surgery Grimson et al., MIT 3D imaging MRI, CT

Applications of Computer Vision: Medical Imaging

Applications of Computer Vision: Image Databases From a search for horse pix in 100 horse images and 1086 non-horse images (Courtesy D. Forsyth & J. Ponce)

Applications of Computer Vision: Data Acquisition

3D Computer vision uses Use increases every year. Current and emerging applications are e.g.: 1.Scientific, engineering, industrial 2.Movie special effects 3.Computer games 4.Tele-presence 5.Capture and visualization of cultural and natural heritage. etc… =+

Vision-based modeling and rendering 1.Collect input views 2.Compute 3D geometry and appearance 3.Integrate models 4.Render new views See: poses structure Structure from motion algorithm

Case Study: Virtual heritage Modeling Inuit Artifacts Results: 1.A collection of 3D models of each component 2.Assembly of the individual models into animations and Internet web study material. animationsInternet web study material animationsInternet web study material

Capturing non-rigid animatable models current PhD project, Neil Birkbeck

Questions? More information: System demo videoSystem demo videowww.cs.ualberta.ca/~vis/ibmr/movies/capsys_1min.avi Downloadable renderer+modelsDownloadable renderer+modelswww.cs.ualberta.ca/~vis/ibmr IEEE VR tutorial text + Capturing softwareIEEE VR tutorial text + Capturing softwarewww.cs.ualberta.ca/~vis/VR2003tut Related papers references:Related papers references: Jagersand et al “Three Tier Model” 3DPVT 2008 …. Jagersand et al “Three Tier Model” 3DPVT 2008 …. Jagersand “Image-based Animation…” CVPR 1997 Jagersand “Image-based Animation…” CVPR 1997 More papers: papers:

Questions? More information: Downloadable renderer+modelsDownloadable renderer+modelswww.cs.ualberta.ca/~vis/ibmr Capturing software + IEEE VR tutorial textCapturing software + IEEE VR tutorial textwww.cs.ualberta.ca/~vis/VR2003tut Main references for this talk:Main references for this talk: Jagersand et al “Three Tier Model” 3DPVT 2008 …. Jagersand et al “Three Tier Model” 3DPVT 2008 …. Jagersand “Image-based Animation…” CVPR 1997 Jagersand “Image-based Animation…” CVPR 1997 More papers: papers: