No. 1 What is the Computer Vision?
Instructor Katsushi Ikeuchi Pointers: 03-5452-6242 cvl-sec@iis.u-tokyo.ac.jp 4-6-1 Komaba Meguro-ku http://www.cvl.iis.u-tokyo.ac.jp
Evaluation attendance 50% report 50%
Schedule Shape-from-X Interpretation Special topics Analysis of line-drawing Shape-from-shading Binocular stereo Interpretation Interpolation Representation Special topics Modeling from reality
Katsu Ikeuchi U. Tokyo ETL U. Tokyo MIT AI CMU Human visual system Object recognition U. Tokyo Virtual heritage MIT AI Shape-from-shading CMU Assembly plan from observation Modeling from reality 1978 1980 1986 1996
Demonstration Videos
Photometric Stereo (1980) Brightness difference -> 3D shape 3D shape -> 3D Pose determination 3DPose -> Grasping
Bin Picking
Assembly Plan from Observation (1990)
Recent Result Assembly plan from observation
Learning Human Dance
Motion Capture Data
Robot Dancing
Modeling Cultural Heritage
Virtual City Probe Info
Virtual City Speed:10km/h Vehicle Pedestrian Vehicle Near Yoyogi park ampm-1.png Vehicle Pedestrian Speed:10km/h Near Yoyogi park Vehicle
Computer Vision (CV) To make a computer to recognize the 3D world as we do To generate 3D representations from 2D images
CV and related areas Image Understanding (AI) Pattern Recognition (Mathematical theories) Image Processing (Signal processing)
CV and related areas Image Understanding (AI) Pattern Recognition (Mathematical theories) Image Processing (Signal processing)
To get better images: 2D-to-2D Image Processing To get better images: 2D-to-2D
CV and related areas Image Understanding (AI) Pattern Recognition (Mathematical theories) Image Processing (Signal processing)
Decision making: mathematical theories Pattern Recognition Decision making: mathematical theories
CV and related areas Image Understanding (AI) Pattern Recognition (Mathematical theories) Image Processing (Signal processing)
Image Understanding Scene description
Why difficult ? A lot of data Ambiguity Projection of a 3D world to a 2D image Many factors to influence the image Illumination condition Object shape Camera characteristics
Image Foggy golden triangle in Pittsburgh
But …
A lot of data Landsat image Color TV image 1scene: 3300 x 2300 x 4 = 30000000 bytes 200 scenes/ day Color TV image 512 x 512 x 3 x 30 = 25000000 bytes/sec
Why difficult ? A lot of data Ambiguity Projection of a 3D world to a 2D image Many factors to influence the image Illumination condition Object shape Camera characteristics
Illusion due to the projection
Why difficult ? A lot of data Ambiguity Projection of a 3D world to a 2D image Many factors to influence to the image Illumination condition Object shape Camera characteristics
Image A image is a matrix of pixels Each pixel brightness Color Distance
Inside and Outside (Gestalt)
Common sense To formulate the common sense → research topics
Current issues A lot of data Ambiguity Many factors Computational sensor Vision board Ambiguity Projective geometry constraints Many factors Physics-based vision
Application areas
Application areas
What is Computer Vision? Vision is … an information processing task that constructs efficient symbolic descriptions of the world from images. (Marr) Vision is … inverse graphics. Vision is … looks easy, but is difficult. Vision is … difficult, but is fun. (Kanade) Vision is an engineering science to create an alternative of human visual systems on computers (Ikeuchi)
References Journals Inter. J. Computer Vision IEEE Trans. Pattern Analysis and Machine Intelligence IEICE D-2 IPSJ Trans CVIM International conferences Inter. Conf. Computer Vision (ICCV) Computer Vision and Pattern Recognition (CVPR) Asian Conf. Computer Vision (ACCV) Special interest groups IPSJ CVIM IEICE PRMU
Schedule (April-May) 4/12 Introduction 4/19 Line drawing 4/26 Perspective projection 5/3 Holiday 5/10 Shape from Shading 5/17 Color Dr. Miyazaki 5/24 Stereo#1 5/31 Stereo#2 Dr. Vanno and Dr. Ogawara
Schedule (June-July) 6/7 Motion analysis 6/14 No class 6/21 EPI, IBR & MBR Dr. Ono 6/28 Interpolation 7/5 Object representation#1 Dr.Takamatsu 7/12 Object representation#2