CIS 489/689: Computer Vision Instructor: Christopher Rasmussen Course web page: vision.cis.udel.edu/cv February 12, 2003  Lecture 1.

Slides:



Advertisements
Similar presentations
5/13/2015CAM Talk G.Kamberova Computer Vision Introduction Gerda Kamberova Department of Computer Science Hofstra University.
Advertisements

Oleh Tretiak © Computer Vision Lecture 1: Introduction.
Computer and Robot Vision I
CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.
A new approach for modeling and rendering existing architectural scenes from a sparse set of still photographs Combines both geometry-based and image.
Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.
Math. Meth. in Vision and Imaging cmput 613/498 Lecture 1: Introduction and course overview Martin Jagersand.
CS/CMPE 535 – Machine Learning Outline. CS Machine Learning (Wi ) - Asim LUMS2 Description A course on the fundamentals of machine.
Overview of Computer Vision CS491E/791E. What is Computer Vision? Deals with the development of the theoretical and algorithmic basis by which useful.
3D Computer Vision and Video Computing Introduction Instructor: Zhigang Zhu CSc Spring 2006 Reading in 3D Computer Vision and.
Computer Vision Dan Witzner Hansen Course web page:
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
Computational Vision Jitendra Malik University of California at Berkeley Jitendra Malik University of California at Berkeley.
Stockman MSU Fall Computing Motion from Images Chapter 9 of S&S plus otherwork.
Sebastian Thrun & Jana Kosecka CS223B Computer Vision, Winter 2007 Stanford CS223B Computer Vision, Winter 2007 Lecture 1 Course Overview Image Formation.
Computer Vision Marc Pollefeys COMP 256 Administrivia Classes: Mon & Wed, 11-12:15, SN115 Instructor: Marc Pollefeys (919) Room.
Computer Vision (CSE P576) Staff Prof: Steve Seitz TA: Jiun-Hung Chen Web Page
1 MA375 Introduction To Numerical Computing Fall 2003 Prof. Tim Warburton
1 Computer Vision Instructor: Prof. Sei-Wang Chen, PhD Office: Applied Science Building, Room 101 Communication: Tel:
Digtial Image Processing, Spring ECES 682 Digital Image Processing Oleh Tretiak ECE Department Drexel University.
Convergence of vision and graphics Jitendra Malik University of California at Berkeley Jitendra Malik University of California at Berkeley.
David Luebke Modeling and Rendering Architecture from Photographs A hybrid geometry- and image-based approach Debevec, Taylor, and Malik SIGGRAPH.
CSCE 3110 Data Structures and Algorithm Analysis.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
CIS-305: Data Structures Fall Organizational Details Class Meeting: 4 :00-6:45pm, Tuesday, Room SCIT215 Instructor: Dr. Igor Aizenberg Office:
A Brief Overview of Computer Vision Jinxiang Chai.
An Introduction to Computer Vision George J. Grevera, Ph.D.
CS 376b Introduction to Computer Vision 04 / 29 / 2008 Instructor: Michael Eckmann.
COMP Introduction to Programming Yi Hong May 13, 2015.
Human-Computer Interaction Human-Computer Interaction Tracking Hanyang University Jong-Il Park.
IIIT-B Computer Vision, Fall 2006 Lecture 1 Introduction to Computer Vision Arvind Lakshmikumar Technology Manager, Sarnoff Corporation Adjunct Faculty,
Instructor: Tina Tian. About me Office: RLC 203A Office Hours: Wednesday 1:30 - 4:30 PM or .
CSCE 5013 Computer Vision Fall 2011 Prof. John Gauch
November 10, 2004 Prof. Christopher Rasmussen Lab web page: vision.cis.udel.edu.
Tactic Analysis in Football Instructors: Nima Najafzadeh Mahdi Oraei Spring
Introduction EE 520: Image Analysis & Computer Vision.
CSE 581: Interactive Computer Graphics Spring 2012, UG 4 Tuesday, Thursday – 9:00AM – 10:18AM DL 0317 Raghu Machiraju Slides: Courtesy - Prof. Huamin Wang,
Computer Vision Why study Computer Vision? Images and movies are everywhere Fast-growing collection of useful applications –building representations.
CS 8690: Computer Vision Ye Duan. CS8690 Computer Vision University of Missouri at Columbia Instructor Ye Duan (209 Engr West)
Research Activities at Florida State Vision Group Xiuwen Liu Florida State Vision Group Department of Computer Science Florida State University
Stereo Course web page: vision.cis.udel.edu/~cv April 11, 2003  Lecture 21.
Conceptual and Experimental Vision Introduction R.Bajcsy, S.Sastry and A.Yang Fall 2006.
Single View Geometry Course web page: vision.cis.udel.edu/cv April 9, 2003  Lecture 20.
Object Oriented Programming (OOP) Design Lecture 1 : Course Overview Bong-Soo Sohn Associate Professor School of Computer Science and Engineering Chung-Ang.
June 19, Liang-Jun Zhang MTWRF 9:45-11:15 am Sitterson Hall 011 Comp 110 Introduction to Programming.
12/7/10 Looking Back, Moving Forward Computational Photography Derek Hoiem, University of Illinois Photo Credit Lee Cullivan.
IST 210: Organization of Data
MASKS © 2004 Invitation to 3D vision. MASKS © 2004 Invitation to 3D vision Lecture 1 Overview and Introduction.
ITIS 4510/5510 Web Mining Spring Overview Class hour 5:00 – 6:15pm, Tuesday & Thursday, Woodward Hall 135 Office hour 3:00 – 5:00pm, Tuesday, Woodward.
Final Review Course web page: vision.cis.udel.edu/~cv May 21, 2003  Lecture 37.
Computer Vision, CS766 Staff Instructor: Li Zhang TA: Yu-Chi Lai
An Introduction to Computer Vision George J. Grevera, Ph.D.
Course Information CSE 2031 Fall Instructor U. T. Nguyen /new-yen/ Office: CSEB Office hours:  Tuesday,
IST 210: ORGANIZATION OF DATA Introduction IST210 1.
MASKS © 2004 Invitation to 3D vision. MASKS © 2004 Invitation to 3D vision Lecture 1 Overview and Introduction.
Camera Calibration Course web page: vision.cis.udel.edu/cv March 24, 2003  Lecture 17.
Introduction to Machine Learning Original Version by Prof. Nati Srebro-Bartom Modifications by Nir Ailon Lecture 0: What is Machine Learning? What.
COS 429: Computer Vision Thanks to Chris Bregler.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
Processing visual information for Computer Vision
Visual Information Retrieval
ECE 533 Digital Image Processing
A Forest of Sensors: Using adaptive tracking to classify and monitor activities in a site Eric Grimson AI Lab, Massachusetts Institute of Technology
CIS Introduction to Computer Vision
Course Overview Juan Carlos Niebles and Ranjay Krishna
Multiple View Geometry in Computer Vision
Multiple View Geometry for Robotics
Computer Vision (CSE 490CV, EE400B)
CMSC 426: Image Processing (Computer Vision)
Presentation transcript:

CIS 489/689: Computer Vision Instructor: Christopher Rasmussen Course web page: vision.cis.udel.edu/cv February 12, 2003  Lecture 1

Course description An introduction to the analysis of images and video in order to recognize, reconstruct, model, and otherwise infer static and dynamic properties of objects in the three-dimensional world. We will study the geometry of image formation; basic concepts in image processing such as smoothing, edge and feature detection, color, and texture; segmentation; shape representation including deformable templates; stereo vision; motion estimation and tracking; techniques for 3-D reconstruction; and probabilistic approaches to recognition and classification.

Outline What is Vision? Course outline Applications About the course

The Vision Problem How to infer salient properties of 3-D world from time-varying 2-D image projection

Computer Vision Outline Image formation Low-level –Single image processing –Multiple views Mid-level –Estimation, segmentation High-level –Recognition –Classification

Image Formation 3-D geometry Physics of light Camera properties –Focal length –Distortion Sampling issues –Spatial –Temporal

Low-level: Single Image Processing Filtering –Edge –Color –Local pattern similarity Texture –Appearance characterization from the statistics of applying multiple filters 3-D structure estimation from… –Shading –Texture

Low-level: Multiple Views Stereo –Structure from two views Structure from motion –What can we learn in general from many views, whether they were taken simultaneously or sequentially?

Mid-Level: Estimation, Segmentation Estimation: Fitting parameters to data –Static (e.g., shape) –Dynamic (e.g., tracking) Segmentation/clustering –Breaking an image or image sequence into a few meaningful pieces with internal similarity

High-level: Recognition, Classification Recognition: Finding and parametrizing a known object Classification –Assignment to known categories using statistics/probability to make best choice

Applications Inspection –Factory monitoring: Analyze components for deviations –Character recognition for mail delivery, scanning Biometrics (face recognition, etc.), surveillance Image databases: Image search on Google, etc. Medicine –Segmentation for radiology –Motion capture for gait analysis Entertainment –1 st down line in football, virtual advertising –Matchmove, rotoscoping in movies –Motion capture for movies, video games Architecture, archaeology: Image-based modeling, etc. Robot vision –Obstacle avoidance, object recognition –Motion compensation/image stabilization

Applications: Factory Inspection Cognex’s “CapInspect” system

Applications: Face Detection courtesy of H. Rowley

Applications: Text Detection & Recognition from J. Zhang et al.

Applications: MRI Interpretation Coronal slice of brainSegmented white matter from W. Wells et al.

Detection and Recognition: How? Build models of the appearance characteristics (color, texture, etc.) of all objects of interest Detection: Look for areas of image with sufficiently similar appearance to a particular object Recognition: Decide which of several objects is most similar to what we see Segmentation: “Recognize” every pixel

Applications: Football First-Down Line courtesy of Sportvision

Applications: Virtual Advertising courtesy of Princeton Video Image

First-Down Line, Virtual Advertising: How? Sensors that measure pan, tilt, zoom and focus are attached to calibrated cameras at surveyed positions Knowledge of the 3-D position of the line, advertising rectangle, etc. can be directly translated into where in the image it should appear for a given camera The part of the image where the graphic is to be inserted is examined for occluding objects like the ball, players, and so on. These are recognized by being a sufficiently different color from the background at that point. This allows pixel-by-pixel compositing.

Applications: Inserting Computer Graphics with a Moving Camera Opening titles from the movie “Panic Room”

Applications: Inserting Computer Graphics with a Moving Camera courtesy of 2d3

CG Insertion with a Moving Camera: How? This technique is often called matchmove Once again, we need camera calibration, but also information on how the camera is moving—its egomotion. This allows the CG object to correctly move with the real scene, even if we don’t know the 3-D parameters of that scene. Estimating camera motion: –Much simpler if we know camera is moving sideways (e.g., some of the “Panic Room” shots), because then the problem is only 2-D –For general motions: By identifying and following scene features over the entire length of the shot, we can solve retrospectively for what 3-D camera motion would be consistent with their 2-D image tracks. Must also make sure to ignore independently moving objects like cars and people.

Applications: Rotoscoping 2d3’s Pixeldust

Applications: Motion Capture Vicon software: 12 cameras, 41 markers for body capture; 6 zoom cameras, 30 markers for face

Applications: Motion Capture without Markers courtesy of C. Bregler

Motion Capture: How? Similar to matchmove in that we follow features and estimate underlying motion that explains their tracks Difference is that the motion is not of the camera but rather of the subject (though camera could be moving, too) –Face/arm/person has more degrees of freedom than camera flying through space, but still constrained Special markers make feature identification and tracking considerably easier Multiple cameras gather more information

Applications: Image-Based Modeling courtesy of P. Debevec Façade project: UC Berkeley Campanile

Image-Based Modeling: How? 3-D model constructed from manually- selected line correspondences in images from multiple calibrated cameras Novel views generated by texture- mapping selected images onto model

Applications: Robotics Autonomous driving: Lane & vehicle tracking (with radar)

Applications: Mosaicing for Image Stabilization from a UAV courtesy of S. Srinivasan

Course Prerequisites Background in/comfort with: –Linear algebra –Multi-variable calculus –Statistics, probability Homeworks will use Matlab, so an ability to program in: –Matlab, C/C++, Java, or equivalent

Grading 60%: 6 programming assignments/ problem sets with 9-14 days to finish each one 15%: Midterm exam (on March 28, the Friday before spring break) 25%: Final exam

Readings Textbook: Computer Vision: A Modern Approach, by D. Forsyth and J. Ponce Supplemental readings will be available online as PDF files Try to complete assigned reading before corresponding lecture

Details Homework –Mostly programming in Matlab –Some math-type problems to solve Must turn in typeset PDF files—not hand-written. I’ll explain this on Wednesday –Upload through course web page –Lateness policy Accepted up to 5 days after deadline; 10% penalty per day subtracted from on-time mark you would have gotten Exams –Closed book –Material will be from lectures plus assigned reading

More Details Instructor – –Office hours (by appointment in Smith 409): Mondays, 3:30-4:30 pm Tuesdays, 9-10 am TA: Qi Li – –Office hours (Pearson 115A) Fridays 3:30-5:30 pm

Announcements me ASAP to get ID code; then register for homework submission on the class web page Try to get Matlab & LaTeX running in some form –Register for account on Evans 133 machines if you need one Read Matlab & LaTeX primers for Friday

More questions? First try the web page: vision.cis.udel.edu/cv The TA should be able to help with many procedural and content issues, but feel free to me if necessary