Where computer vision needs help from computer science (and machine learning) Bill Freeman Electrical Engineering and Computer Science Dept. Massachusetts.

Slides:



Advertisements
Similar presentations
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Advertisements

Microsoft Office Illustrated Fundamentals Unit N: Polishing and Running a Presentation.
3D Graphics Rendering and Terrain Modeling
1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.
IMAGE RESTORATION AND REALISM MILLIONS OF IMAGES SEMINAR YUVAL RADO.
AAM based Face Tracking with Temporal Matching and Face Segmentation Dalong Du.
Large dataset for object and scene recognition A. Torralba, R. Fergus, W. T. Freeman 80 million tiny images Ron Yanovich Guy Peled.
Proceedings of the IEEE 2010 Antonio Torralba, MIT Jenny Yuen, MIT Bryan C. Russell, MIT.
Event prediction CS 590v. Applications Video search Surveillance – Detecting suspicious activities – Illegally parked cars – Abandoned bags Intelligent.
Small Codes and Large Image Databases for Recognition CVPR 2008 Antonio Torralba, MIT Rob Fergus, NYU Yair Weiss, Hebrew University.
Fast and Compact Retrieval Methods in Computer Vision Part II A. Torralba, R. Fergus and Y. Weiss. Small Codes and Large Image Databases for Recognition.
Direct Methods for Visual Scene Reconstruction Paper by Richard Szeliski & Sing Bing Kang Presented by Kristin Branson November 7, 2002.
Let Computer Draw Qingyuan Kong. Goal Give me a picture “Obama stands in front of a pyramid”
Object Recognition: History and Overview Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Jean Ponce.
Detecting Image Region Duplication Using SIFT Features March 16, ICASSP 2010 Dallas, TX Xunyu Pan and Siwei Lyu Computer Science Department University.
Image Stitching and Panoramas
Highlights Lecture on the image part (10) Automatic Perception 16
Opportunities of Scale Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.
Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.
Creating and Exploring a Large Photorealistic Virtual Space INRIA / CSAIL / Adobe First IEEE Workshop on Internet Vision, associated with CVPR 2008.
Digital Image Processing & Pattern Analysis (CSCE 563) Course Outline & Introduction Prof. Amr Goneid Department of Computer Science & Engineering The.
Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.
Quick Overview of Robotics and Computer Vision. Computer Vision Agent Environment camera Light ?
A Brief Overview of Computer Vision Jinxiang Chai.
G52IIP, School of Computer Science, University of Nottingham What we will learn … Topics relate to the use of computer to Acquire/generate Process/manipulate/store.
Technology and Historical Overview. Introduction to 3d Computer Graphics  3D computer graphics is the science, study, and method of projecting a mathematical.
Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.
Dense correspondences across scenes and scales Tal Hassner The Open University of Israel CVPR’14 Tutorial on Dense Image Correspondences for Computer Vision.
High-Resolution Interactive Panoramas with MPEG-4 발표자 : 김영백 임베디드시스템연구실.
STREET VIEW Wenyang Liu GMAT9205: Fundamentals of Geo-Positioning Session 1, 2010.
CS654: Digital Image Analysis Lecture 8: Stereo Imaging.
Scale-less Dense Correspondences Tal Hassner The Open University of Israel ICCV’13 Tutorial on Dense Image Correspondences for Computer Vision.
Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
IBM QBIC: Query by Image and Video Content Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC 28223
ECE 172A SIMPLE OBJECT DETECTOR WITH INDICATOR WHEN A NEW OBJECT HAS BEEN ADDED TO OR MISSING IN A ROOM Presented by by Hugo Groening.
UNBIASED LOOK AT DATASET BIAS Antonio Torralba Massachusetts Institute of Technology Alexei A. Efros Carnegie Mellon University CVPR 2011.
Handwritten Hindi Numerals Recognition Kritika Singh Akarshan Sarkar Mentor- Prof. Amitabha Mukerjee.
Implementing GIST on the GPU. Refrence Original Work  Aude Oliva, Antonio Torralba  Modeling the shape of the scene: a holistic representation of the.
Scene Completion Using Millions of Photographs James Hays, Alexei A. Efros Carnegie Mellon University ACM SIGGRAPH 2007.
Sparse Bayesian Learning for Efficient Visual Tracking O. Williams, A. Blake & R. Cipolloa PAMI, Aug Presented by Yuting Qi Machine Learning Reading.
Volume Visualization Presented by Zhao, hai. What’ volume visualization Volume visualization is the creation of graphical representations of data sets.
Visual Data on the Internet With slides from Alexei Efros, James Hays, Antonio Torralba, and Frederic Heger : Computational Photography Jean-Francois.
Histograms of Oriented Gradients for Human Detection(HOG)
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
112/5/ :54 Graphics II Image Based Rendering Session 11.
Workshop on The Transformation of Science Max Planck Society, Elmau, Germany June 1, 1999 TOWARDS INFORMATIONAL SCIENCE Indexing and Analyzing the Knowledge.
Tracking Turbulent 3D Features Lu Zhang Nov. 10, 2005.
Course14 Dynamic Vision. Biological vision can cope with changing world Moving and changing objects Change illumination Change View-point.
Photo VR Editor: A Panoramic and Spherical Environment Map Authoring Tool for Image-Based VR Browsers Jyh-Kuen Horng, Ming Ouhyoung Communications and.
1 Perception and VR MONT 104S, Fall 2008 Lecture 20 Computer Graphics and VR.
Representing Moving Images with Layers J. Y. Wang and E. H. Adelson MIT Media Lab.
High-Level Vision Object Recognition.
Markov Networks: Theory and Applications Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208
Tentative Future Courses Fall `11 : Computer Vision – emphasis on recognition Spring `11 : Graduate seminar Fall `12 : Computational Photography.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
WLD: A Robust Local Image Descriptor Jie Chen, Shiguang Shan, Chu He, Guoying Zhao, Matti Pietikäinen, Xilin Chen, Wen Gao 报告人:蒲薇榄.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
Processing visual information for Computer Vision
Graphcut Textures:Image and Video Synthesis Using Graph Cuts
Visual Information Retrieval
Representing Moving Images with Layers
CS 1674: Intro to Computer Vision Scene Recognition
Representing Moving Images with Layers
Brief Review of Recognition + Context
Image processing and computer vision
Paper Reading Dalong Du April.08, 2011.
Local features and image matching May 7th, 2019
I think that live teachers will soon be replaced by robots that have a training program. They will only teach what is laid in them.
Presentation transcript:

Where computer vision needs help from computer science (and machine learning) Bill Freeman Electrical Engineering and Computer Science Dept. Massachusetts Institute of Technology September 9, 2009

Outline My background Computer vision applications Computer vision techniques and problems: – Low-level vision: underdetermined problems – High-level vision: combinatorial problems – Miscellaneous problems

At Photokina, in Cologne, Germany

Me (Foreign Expert) and my wife (English teacher), riding from the Foreigners’ Cafeteria at the Taiyuan University of Technology, Shanxi, China

While in China, I read this book (to be re-issued by MIT Press this year), and got very excited about computer vision. Studied for PhD at MIT.

Worked for 9 years at Mitsubishi Electric Research Labs, an industrial research lab doing fundamental research across the street from MIT.

2001 – present, MIT

Infinite images Joint work with: Biliana Kaneva Josef Sivic Shai Avidan Antonio Torralba

A computer graphics application of belief propagation for optimal seam finding

The image database We have collected ~6 million images from Flickr based on keyword and group searches – typical image size is 500x375 pixels – 720GB of disk space (jpeg compressed)

Image representation Color layout GIST [Oliva and Torralba’01] Original image

Obtaining semantically coherent themes We further break-up the collection into themes of semantically coherent scenes: Train SVM-based classifiers from 1-2k training images [Oliva and Torralba, 2001]

Basic camera motions Forward motionCamera rotation Camera pan Starting from a single image, find a sequence of images to simulate a camera motion:

3. Find a match to fill the missing pixels Scene matching with camera view transformations: Translation 1. Move camera 2. View from the virtual camera 4. Locally align images 5. Find a seam 6. Blend in the gradient domain

4. Stitched rotation Scene matching with camera view transformations: Camera rotation 1. Rotate camera 2. View from the virtual camera 3. Find a match to fill-in the missing pixels 5. Display on a cylinder

More “infinite” images – camera translation

Virtual space as an image graph Forward Rotate (left/right) Pan (left/right) Nodes represent Images Edges represent particular motions: Edge cost is given by the cost of the image match under the particular transformation Image graph Kaneva, Sivic, Torralba, Avidan, and Freeman, Infinite Images, to appear in Proceedings of IEEE.

Virtual image space laid out in 3D Kaneva, Sivic, Torralba, Avidan, and Freeman, Infinite Images, to appear in Proceedings of IEEE.