3D Computer Vision: CSc 83020.

Slides:



Advertisements
Similar presentations
Sebastian Thrun CS223B Computer Vision, Winter Stanford CS223B Computer Vision, Winter 2005 Lecture 1 Intro and Image Formation Sebastian Thrun,
Advertisements

September 2, 2014Computer Vision Lecture 1: Human Vision 1 Welcome to CS 675 – Computer Vision Fall 2014 Instructor: Marc Pomplun Instructor: Marc Pomplun.
Structured Lighting Guido Gerig, Univ. of Utah
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean 5403 T-R 3:00pm – 4:20pm Lecture #17.
Vision Sensing. Multi-View Stereo for Community Photo Collections Michael Goesele, et al, ICCV 2007 Venus de Milo.
5/13/2015CAM Talk G.Kamberova Computer Vision Introduction Gerda Kamberova Department of Computer Science Hofstra University.
Oleh Tretiak © Computer Vision Lecture 1: Introduction.
Computer and Robot Vision I
CSc D Computer Vision – Ioannis Stamos 3-D Computer Vision CSc Camera Calibration.
CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.
Exchanging Faces in Images SIGGRAPH ’04 Blanz V., Scherbaum K., Vetter T., Seidel HP. Speaker: Alvin Date: 21 July 2004.
Advanced Computer Vision Introduction Goal and objectives To introduce the fundamental problems of computer vision. To introduce the main concepts and.
Overview of Computer Vision CS491E/791E. What is Computer Vision? Deals with the development of the theoretical and algorithmic basis by which useful.
Vision Computing An Introduction. Visual Perception Sight is our most impressive sense. It gives us, without conscious effort, detailed information about.
Introduction to Computer Vision 3D Vision Topic 9 Stereo Vision (I) CMPSCI 591A/691A CMPSCI 570/670.
Computing With Images: Outlook and applications
3D Computer Vision and Video Computing Introduction Instructor: Zhigang Zhu CSc Spring 2006 Reading in 3D Computer Vision and.
Sebastian Thrun & Jana Kosecka CS223B Computer Vision, Winter 2007 Stanford CS223B Computer Vision, Winter 2007 Lecture 1 Course Overview Image Formation.
Introduction to Computer Vision CS223B, Winter 2005.
Computer Vision Marc Pollefeys COMP 256 Administrivia Classes: Mon & Wed, 11-12:15, SN115 Instructor: Marc Pollefeys (919) Room.
Multiple View Geometry Marc Pollefeys University of North Carolina at Chapel Hill Modified by Philippos Mordohai.
Computer Vision (CSE P576) Staff Prof: Steve Seitz TA: Jiun-Hung Chen Web Page
CSCE 641: Computer Graphics Image-based Rendering Jinxiang Chai.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
The University of Ontario CS 4487/9587 Algorithms for Image Analysis n Web page: Announcements, assignments, code samples/libraries,
Introduction to Machine Vision Systems
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
Computer Vision Spring ,-685 Instructor: S. Narasimhan Wean Hall 5409 T-R 10:30am – 11:50am.
CIS 601 Fall 2004 Introduction to Computer Vision and Intelligent Systems Longin Jan Latecki Parts are based on lectures of Rolf Lakaemper and David Young.
Structured light and active ranging techniques Class 8
A Brief Overview of Computer Vision Jinxiang Chai.
Sensory Information Processing Shinsaku HIURA Division of Systems Science and Applied Informatics.
A Bayesian Approach For 3D Reconstruction From a Single Image
3D Surface Reconstruction from 2D Images (Survey)
Perception Introduction Pattern Recognition Image Formation
Visual Perception PhD Program in Information Technologies Description: Obtention of 3D Information. Study of the problem of triangulation, camera calibration.
CIS 601 Fall 2003 Introduction to Computer Vision Longin Jan Latecki Based on the lectures of Rolf Lakaemper and David Young.
CSCE 5013 Computer Vision Fall 2011 Prof. John Gauch
Recap from Monday Image Warping – Coordinate transforms – Linear transforms expressed in matrix form – Inverse transforms useful when synthesizing images.
Image-based rendering Michael F. Cohen Microsoft Research.
CSC 461: Lecture 3 1 CSC461 Lecture 3: Models and Architectures  Objectives –Learn the basic design of a graphics system –Introduce pipeline architecture.
1 Finding depth. 2 Overview Depth from stereo Depth from structured light Depth from focus / defocus Laser rangefinders.
Conceptual and Experimental Vision Introduction R.Bajcsy, S.Sastry and A.Yang Fall 2006.
Why is computer vision difficult?
Spring 2015 CSc 83020: 3D Photography Prof. Ioannis Stamos Mondays 4:15 – 6:15
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
CS576 Computer Vision Instructor: Dr. Yu-Wing Tai No official TA Tuesday and Thursday 4:00pm – 5:30pm Rm 3444 E3-1 Building 1.
Tutorial Visual Perception Towards Computer Vision
Computer Vision, CS766 Staff Instructor: Li Zhang TA: Yu-Chi Lai
Intelligent Vision Systems Image Geometry and Acquisition ENT 496 Ms. HEMA C.R. Lecture 2.
Computer Vision No. 1 What is the Computer Vision?
MASKS © 2004 Invitation to 3D vision. MASKS © 2004 Invitation to 3D vision Lecture 1 Overview and Introduction.
  Computer vision is a field that includes methods for acquiring,prcessing, analyzing, and understanding images and, in general, high-dimensional data.
COS 429: Computer Vision Thanks to Chris Bregler.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
Introduction Computational Photography Seminar: EECS 395/495
제 5 장 스테레오.
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
Instructor: Mircea Nicolescu
CIS Introduction to Computer Vision
Sebastian Thrun, Stanford Rick Szeliski, Microsoft
Image Based Modeling and Rendering (PI: Malik)
Chapter 1: Image processing and computer vision Introduction
Multiple View Geometry in Computer Vision
Professor Sebastian Thrun CAs: Dan Maynes-Aminzade and Mitul Saha
Computer Vision (CSE 490CV, EE400B)
CMSC 426: Image Processing (Computer Vision)
Wrap-up Computer Vision Spring 2019, Lecture 26
No. 1 What is the Computer Vision?
Presentation transcript:

3D Computer Vision: CSc 83020

3D Computer Vision: CSc 83020 Instructor: Ioannis Stamos istamos (at) hunter.cuny.edu http://www.cs.hunter.cuny.edu/~ioannis Office Hours: Tuesdays 4-6 (at Hunter) or by appoitnment Office: 1090G Hunter North (69th street bw. Park and Lex.) Computer Vision Lab: 1090E Hunter North Course web page: http://www.cs.hunter.cuny.edu/~ioannis/3D_f12.html

Goals To familiarize you with basic the techniques and jargon in the field To enable you to solve computer vision problems To let you experience (and appreciate!) the difficulties of real-world computer vision To get you excited!

Class Policy You have to Late policy Teaming Turn in all assignments (60% of grade) Complete a final project (30% of grade) Actively participate in class (10% of grade) Late policy Six late days (but not for final project) Teaming For final project you can work in groups of 2

About me 11th year at Hunter and the Graduate Center Graduated from Columbia in ’01 CS Ph.D. Research Areas: Computer Vision 3D Modeling Computer Graphics

Books Computer Vision: Algorithms and Applications, Richard Szeliski, 2010 (available online for free) Robot Vision B. K. P. Horn, The MIT Press (great classic book) Introductory Techniques for 3-D Computer Vision Emanuele Trucco and Alessandro Verri, Prentice Hall, 1998 (algorithmic perspective) Computer Vision A Modern Approach David A. Forsyth, Jean Ponce, Prentice Hall 2003 An Invitation to 3-D Vision Yi Ma, Stefano Soatto, Jana Kosecka, S. Shankar Sastry Springer 2004. Three-Dimensional Computer Vision: A Geometric Viewpoint Olivier Faugeras The MIT Press, 1996.

Journals/Web International Journal of Computer Vision. Computer Vision and Image Understanding. IEEE Trans. on Pattern Analysis and Machine Intelligence. SIGGRAPH (mostly Graphics) http://www.ri.cmu.edu/ (CMU’s Robotic Institute) http://www.cs.cmu.edu/~cil/vision.html (The Vision Home Page) http://www.dai.ed.ac.uk/CVonline/ (CV Online) http://iris.usc.edu/Vision-Notes/bibliography/contents.html (Annotated CV Bibliography)

Class History Based on class taught at Columbia University by Prof. Shree Nayar. New material reflects modern approach. Taught similar class at Hunter Taught “3D Photography” class at the Graduate Center of CUNY. My active research area Funded by the National Science Foundation

Class Schedule Check class website Final project proposals Due Nov. 7 Design your own or check list of possible projects on class website Final project presentations and report May 16 (last class)

What is Computer Vision? Sensors Images or Video Illumination Vision System Physical 3D World Scene Description Measuring Visual Information

Computer Graphics Output Image Model Synthetic Camera (slides courtesy of Michael Cohen)

Computer Vision Output Model Real Scene Real Cameras (slides courtesy of Michael Cohen)

Combined Output Image Model Real Scene Synthetic Camera Real Cameras (slides courtesy of Michael Cohen)

Cont. Vision is automating visual processes (Ball & Brown). Vision is an information processing task (Marr). Vision is inverting image formation (Horn). Vision is inverse graphics. Vision looks easy, but is difficult. Vision is difficult, but it is fun (Kanade). Vision is useful.

Some Applications Industrial Material Handling Inspection Assembly

Some Applications Autonomous Navigation

Some Applications Vision for Graphics Film Industry Urban Planning E-commerce Virtual Reality

Some Applications Realistic 3D experience Google Earth http://earth.google.com/ Microsoft Photosynth http://labs.live.com/photosynth/

More Applications! Optical Character Recognition (OCR) Visual Databases (images or movies) Searching for image content Face Recognition (security) Iris Recognition (security) Traffic Monitoring Systems Many more…

Vision deals with images

Images Look Nice…

Images Look Nice… Ioannis Stamos – CSc 83020 Spring 2007

...Essentially a 2D array of numbers 107 132 107 107 132 99 132 107 132 99 107 132 99 107 132 91 107 132 99 132 99 107 107 132 99 132 107 132 107 132 91 107 132 107 132 99 107 132 107 132 107 99 132 99 132 99 132 99 132 124 132 99 132 107 132 132 107 132 124 132 132 124 132 150 107 150 150 132 150 132 150 132 150 107 150 132 124 132 132 150 107 99 132 132 107 132 107 132 150 132 150 99 132 107 150 132 107 150 132 124 132 132 107 150 99 150 107 150 132 107 150 132 124 132 150 115 124 132 150 107 132 150 132 150 150 107 132 116 132 124 132 107 99 150 132 107 132 150 132 124 132 150 107 150 107 132 99 132 107 150 132 150 107 150 132 150 150 107 107 150 150 150 150 115 167 107 150 107 132 150 107 150 132 124 132 124 132 124 132 124 132 150 107 150 107 107 132 116 132 150 132 150 107 150 150 132 150 132 116 132 124 132 150 132 150 150 150 132 116 132 116 107 132 99 150 150 132 107 132 150 107 150 132 124 132 116 132 107 150 132 107 150 132 150 107 150 107 132

Low-Level or “Early” Vision Considers local properties of an image “There’s an edge!” From: Szymon Rusinkiewicz, Princeton.

“There’s an object and a background!” Mid-Level Vision Grouping and segmentation “There’s an object and a background!”

High-Level Vision Recognition “It’s a chair!”

Humans Vision is easy for us. But how do we do it?

Human Vision: Illusions Fraser’s spiral (Fraser 1908)

Illusions Zölner Illusion (1860) Hering Illusion (1861) Wundt Illusion (1896)

Visual Ambiguities Young-Girl/Old-Woman

Visual Ambiguities From NALWA.

Visual Ambiguities

Visual Ambiguities Two lava cones with craters, perceived to be the image of two craters with mounds. Implicit assumption that lighting is from above.

Visual Ambiguities

Seeing and Thinking Kanizsa (1979)

Syllabus Overview

Image Formation and Optics Light Source p Surface normal CCD Array Lens P Object Surface Projection of 3-D World on a 2-D plane

Lenses Ray of light Optical Axis

Image Sensors/Camera Models Typical 512x512 CCD array Imaging Area 262,144 pixels One Pixel 20μm 20μm 512 (10.25mm) Convert Optical Images To Electrical Signals. 512 (10.25mm)

Filtering = h g f

Image Features Detecting intensity changes in the image Ioannis Stamos – CSc 83020 Spring 2007

Grouping image features Finding continuous lines from edge segments Ioannis Stamos – CSc 83020 Spring 2007

Camera Calibration Camera Coordinate Frame Zc Pixel Coordinates Yc Intrinsic Parameters Xc Extrinsic Parameters Image Coordinate Frame Zw Yw World Coordinate Frame Xw

Shape from X Shape from X Stereo Motion Shading Texture foreshortening

Binocular Stereo depth map

Active Sensing Sheet of light CCD array Lens Sources of error: 1) grazing angle, 2) object boundaries. Sheet of light CCD array Lens

Shape from Shading Three-dimensional shape from a single image. Ioannis Stamos – CSc 83020 Spring 2007

Motion (optical flow) Determining the movement of scene objects Ioannis Stamos – CSc 83020 Spring 2007

Reflectance and Color Why do these spheres look different?

Object Recognition Learning visual appearance. Real-time object recognition.

Template-Based Methods Cootes et al.

Some Vision Systems…

Example 2: Structure From Motion Slide courtesy of Sebastian Thrun http://cs223b.stanford.edu Stanford http://www.cs.unc.edu/Research/urbanscape

Example 2: Structure From Motion Slide courtesy of Sebastian Thrun http://cs223b.stanford.edu Stanford http://www.cs.unc.edu/Research/urbanscape

Example 2: Structure From Motion Slide courtesy of Sebastian Thrun http://cs223b.stanford.edu Stanford http://www.cs.unc.edu/Research/urbanscape

Example 2: Structure From Motion Slide courtesy of Sebastian Thrun http://cs223b.stanford.edu Stanford http://www.cs.unc.edu/Research/urbanscape

Example 2: Structure From Motion Slide courtesy of Sebastian Thrun http://cs223b.stanford.edu Stanford http://www.cs.unc.edu/Research/urbanscape

Example 4: 3D Modeling Slide courtesy of Sebastian Thrun http://cs223b.stanford.edu Stanford Drago Anguelov

Example 5: Segmentation Slide courtesy of Sebastian Thrun http://cs223b.stanford.edu Stanford http://elib.cs.berkeley.edu/photos/classify/

Example 6: Classification Slide courtesy of Sebastian Thrun http://cs223b.stanford.edu Stanford

Example 6: Classification Slide courtesy of Sebastian Thrun http://cs223b.stanford.edu Stanford

Real-world Applications Osuna et al:

Range Scanning Outdoor Structures Ioannis Stamos – CSc 83020 Spring 2007

Data Acquisition Spot laser scanner. Time of flight. Max Range: 100m. Scanning time: 20 minutes for 1000 x1000 points. Accuracy: 6mm.

Video

Latest Video

Inserting models in Google Earth

Image sequence (CMU, Virtualized Reality Project) Dynamic Scenes Image sequence (CMU, Virtualized Reality Project)

Dynamic Scenes Dynamic 3D model.

Dynamic texture-mapped model. Dynamic Scenes Dynamic texture-mapped model.

Scanning the David Marc Levoy, Stanford 2:30 through closeup of eye 0:40 So here we are scanning our “piece de resistance” the gantry is extended to nearly its full height If you’ve looked at our web site you know the little story about the gantry being too short initially because we relied on art history books for the height of the David and they're all wrong – by 3 feet !! so at the last minute we had to add a piece to the gantry…here and load another few hundred pounds of ballast to the base…here – to keep the gantry from tipping over on the statue But as you can see from the photo on the right, we did finally reach the top of the statue height of gantry: 7.5 meters weight of gantry: 800 kilograms 4:00 through architectural reps 0:45

Statistics about the scan 0:20 So here’s a rendering of our computer model of the David we rolled the gantry, which now weighed about a ton, 480 times 2 billion polygons you can read the rest It took 30 days to scan the statue but we were only allowed to scan at night, when the museum was closed so it was basically 30 all-nighters in a row 480 individually aimed scans 2 billion polygons 7,000 color images 32 gigabytes 30 nights of scanning 22 people Ioannis Stamos – CSc 83020 Spring 2007 0:30

Head of Michelangelo’s David 0:20 We haven’t assembled the David at full-resolution yet Here’s a 1 mm model of the head watertight with color The photograph at the left was taken with uncalibrated lighting so the two images don’t match exactly but hopefully you agree that we’ve basically captured the statue’s appearance photograph 1.0 mm computer model

David’s left eye 0.25 mm model photograph 0:30 Here’s a closeup of David’s left eye at the maximum resolution of our model – ¼ mm we can see some interesting things… These bumps are holes from Michelangelo’s drill so that’s real geometry are artifacts from space carving, the hole-filling part of our range image merging algorithm smoothing out those artifacts, while preserving the observed surfaces, is future work Finally, these bumps are noise from laser subsurface scattering Let’s zoom in some more and look at the triangle mesh noise from laser scatter holes from Michelangelo’s drill artifacts from space carving 0.25 mm model photograph Ioannis Stamos – CSc 83020 Spring 2007 0:45

What do you think?