Viola 1999 MIT AI Laboratory Progress on: Variable Viewpoint Reality Image Database Paul Viola & Eric Grimson Jeremy DeBonet, Aparna Lakshmiratan, William.

Slides:



Advertisements
Similar presentations
Information and Software Technology Option: Artificial intelligence, simulation and modelling.
Advertisements

Free-viewpoint Immersive Networked Experience February 2010.
Kien A. Hua Division of Computer Science University of Central Florida.
MIT AI Lab Paul Viola NTT Visit: Image Database Retrieval Variable Viewpoint Reality Professor Paul Viola Collaborators: Professor Eric Grimson, Jeremy.
Page 1 SIXTH SENSE TECHNOLOGY Presented by: KIRTI AGGARWAL 2K7-MRCE-CS-035.
Vision Based Control Motion Matt Baker Kevin VanDyke.
Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee.
Robust Object Tracking via Sparsity-based Collaborative Model
Broadcast Court-Net Sports Video Analysis Using Fast 3-D Camera Modeling Jungong Han Dirk Farin Peter H. N. IEEE CSVT 2008.
HCI Final Project Robust Real Time Face Detection Paul Viola, Michael Jones, Robust Real-Time Face Detetion, International Journal of Computer Vision,
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
Computational Photography: Image-based Modeling Jinxiang Chai.
ADVISE: Advanced Digital Video Information Segmentation Engine
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
CS 561, Sessions 27 1 Towards intelligent machines Thanks to CSCI561, we now know how to… - Search (and play games) - Build a knowledge base using FOL.
Advanced Computer Graphics (Fall 2010) CS 283, Lecture 4: 3D Objects and Meshes Ravi Ramamoorthi
Real-time Computer Vision with Scanning N-Tuple Grids Simon Lucas Computer Science Dept.
1 Image-Based Visual Hulls Paper by Wojciech Matusik, Chris Buehler, Ramesh Raskar, Steven J. Gortler and Leonard McMillan [
Multi-view stereo Many slides adapted from S. Seitz.
2007Theo Schouten1 Introduction. 2007Theo Schouten2 Human Eye Cones, Rods Reaction time: 0.1 sec (enough for transferring 100 nerve.
Computer Vision for Interactive Computer Graphics Mrudang Rawal.
Project 1 grades out Announcements. Multiview stereo Readings S. M. Seitz and C. R. Dyer, Photorealistic Scene Reconstruction by Voxel Coloring, International.
Shading Languages By Markus Kummerer. Markus Kummerer 2 / 19 State of the Art Shading.
Oral Defense by Sunny Tang 15 Aug 2003
Research & Innovation 1 An Industry Perspective on VVG Research Oliver Grau BBC Research & Innovation VVG SUMMER SCHOOL '07.
November 29, 2004AI: Chapter 24: Perception1 Artificial Intelligence Chapter 24: Perception Michael Scherger Department of Computer Science Kent State.
Sana Naghipour, Saba Naghipour Mentor: Phani Chavali Advisers: Ed Richter, Prof. Arye Nehorai.
Electronic Visualization Laboratory University of Illinois at Chicago Interaction between Real and Virtual Humans: Playing Checkers R. Torre, S. Balcisoy.
Knowledge Systems Lab JN 8/24/2015 A Method for Temporal Hand Gesture Recognition Joshua R. New Knowledge Systems Laboratory Jacksonville State University.
Deep Green System for real-time tracking and playing the board game Reversi Nadav Erell Intro to Computational and Biological Vision, CS department, Ben-Gurion.
Volume Graphics (graduate course) Bong-Soo Sohn School of Computer Science and Engineering Chung-Ang University.
An Introduction to Computer Vision George J. Grevera, Ph.D.
Themes in Computer Vision Carlo Tomasi. Applications autonomous cars, planes, missiles, robots,... space exploration aid to the blind, ASL recognition.
GIP: Computer Graphics & Image Processing 1 1 Medical Image Processing & 3D Modeling.
The MPEG-7 Color Descriptors
Multimedia Databases (MMDB)
EADS DS / SDC LTIS Page 1 7 th CNES/DLR Workshop on Information Extraction and Scene Understanding for Meter Resolution Image – 29/03/07 - Oberpfaffenhofen.
SPIE'01CIRL-JHU1 Dynamic Composition of Tracking Primitives for Interactive Vision-Guided Navigation D. Burschka and G. Hager Computational Interaction.
KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.
Presentation by: K.G.P.Srikanth. CONTENTS  Introduction  Components  Working  Applications.
Exploitation of 3D Video Technologies Takashi Matsuyama Graduate School of Informatics, Kyoto University 12 th International Conference on Informatics.
Submitted by:- Vinay kr. Gupta Computer Sci. & Engg. 4 th year.
MIT AI Lab Viola & Grimson Variable Viewpoint Reality Professor Paul Viola & Professor Eric Grimson Collaborators: J eremy De Bonet, John Winn, Owen Ozier,
DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.
Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)
Action and Gait Recognition From Recovered 3-D Human Joints IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 40, NO. 4, AUGUST.
MIT AI Lab / LIDS Laboatory for Information and Decision Systems & Artificial Intelligence Laboratory Massachusetts Institute of Technology A Unified Multiresolution.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
An Introduction to Computer Vision George J. Grevera, Ph.D.
Knowledge Systems Lab JN 1/15/2016 Facilitating User Interaction with Complex Systems via Hand Gesture Recognition MCIS Department Knowledge Systems Laboratory.
High Resolution Surface Reconstruction from Overlapping Multiple-Views
1/7/99 Virtual Viewpoint Reality Virtual Viewpoint Reality NTT: Visit 1/7/99.
Image-Based Rendering Geometry and light interaction may be difficult and expensive to model –Think of how hard radiosity is –Imagine the complexity of.
Camera surface reference images desired ray ‘closest’ ray focal surface ‘closest’ camera Light Field Parameterization We take a non-traditional approach.
Face Detection 蔡宇軒.
11/25/03 3D Model Acquisition by Tracking 2D Wireframes Presenter: Jing Han Shiau M. Brown, T. Drummond and R. Cipolla Department of Engineering University.
MIT Artificial Intelligence Laboratory — Research Directions Intelligent Perceptual Interfaces Trevor Darrell Eric Grimson.
MIT AI Lab Paul Viola NTT Visit: Image Database Retrieval Variable Viewpoint Reality Professor Paul Viola Collaborators: Professor Eric Grimson, Jeremy.
Visual Information Processing. Human Perception V.S. Machine Perception  Human perception: pictorial information improvement for human interpretation.
REAL-TIME DETECTOR FOR UNUSUAL BEHAVIOR
Learning Patterns of Activity
Digital Video Library - Jacky Ma.
A Unified Multiresolution Framework for Automatic Target Recognition
Image Database Retrieval NTT: Visit 1/7/99
Coding Approaches for End-to-End 3D TV Systems
Announcements Midterm due now Project 2 artifacts: vote today!
BOOSTING IMAGE RETRIEVAL
COMPUTER GRAPHICS with OpenGL (3rd Edition) Donald Hearn M
Announcements Project 3 out today (help session at end of class)
Presentation transcript:

Viola 1999 MIT AI Laboratory Progress on: Variable Viewpoint Reality Image Database Paul Viola & Eric Grimson Jeremy DeBonet, Aparna Lakshmiratan, William Wells, Kinh Tieu,Dan Snow, Owen Ozier, John Winn, Mike Ross MIT Artificial Intelligence Lab

Viola 1999 MIT AI Laboratory Overview of Presentation Variable Viewpoint Reality –Overview –Progress at MIT Image Database Retrieval –Overview –Progress

Viola 1999 MIT AI Laboratory VVR: Motivating Scenario Construct a system that will allow each/every user to observe any viewpoint of a sporting event. Provide high level commentary/statistics –Analyze plays

Viola 1999 MIT AI Laboratory For example … Computed using a single view… some steps by hand

Viola 1999 MIT AI Laboratory VVR Spectator Environment Build an exciting, fun, high-profile system –Sports: Soccer, Hockey, Tennis, Basketball –Drama, Dance, Ballet Leverage MIT technology in: –Vision/Video Analysis Tracking, Calibration, Action Recognition Image/Video Databases –Graphics Build a system that provides data available nowhere else… –Record/Study Human movements and actions –Motion Capture / Motion Generation

Viola 1999 MIT AI Laboratory Window of Opportunity cameras in a stadium –Soon there will be many more US HDTV is digital –Flexible, very high bandwidth digital transmissions Future Televisions will be Computers –Plenty of extra computation available –3D Graphics hardware will be integrated Economics of sports –Dollar investments by broadcasters is huge (Billions) Computation is getting cheaper

Viola 1999 MIT AI Laboratory Progress at MIT Simple intersection of silhouettes (Visual Hull) –Efficient but limited Tomographic reconstruction –Based on medical reconstruction Probabilistic Voxel Analysis (Poxels) –Handles occlusion & transparency Parametric Human Forms

Viola 1999 MIT AI Laboratory Visual Hull in 2D

Viola 1999 MIT AI Laboratory Visual Hull: Segment

Viola 1999 MIT AI Laboratory Visual Hull: Segment

Viola 1999 MIT AI Laboratory Visual Hull: Segment

Viola 1999 MIT AI Laboratory Visual Hull: Intersection

Viola 1999 MIT AI Laboratory Idea in 2D: Visual Hull

Viola 1999 MIT AI Laboratory Real Data: Tweety Data acquired on a turntable –180 views are available… not all are used.

Viola 1999 MIT AI Laboratory Intersection of Frusta Intersection of 18 frusta –Computations are very fast perhaps real-time

Viola 1999 MIT AI Laboratory New Apparatus Twelve cameras, computers, digitizers Parallel software for acquisition/processing

Viola 1999 MIT AI Laboratory Current System Real-time image acquisition Silhouettes computed in parallel Silhouettes sent to a central machine –15 per second Real-time Intersection and Visual Hull –In progress

Viola 1999 MIT AI Laboratory Visual Hull is very coarse … Agreement provides additional information Agreement provides additional information

Viola 1999 MIT AI Laboratory Tomographic Reconstruction Motivated by medical imaging –CT - Computed Tomography –Measurements are line integrals in a volume –Reconstruction is by back-projection & deconvolution

Viola 1999 MIT AI Laboratory Back-projection of image intensities

Viola 1999 MIT AI Laboratory Volume Render... Captures shape very well Intensities are not perfect

Viola 1999 MIT AI Laboratory Poxels: An improvement to tomography Tomography confuses color with transparency –Does not model occlusion... The Probabilistic Voxel Approach: Poxel –Estimates both color and transparency –Models occlusion –Much better results Though slower –Work submitted to ICCV 99

Viola 1999 MIT AI Laboratory Occlusion causes disagreement

Viola 1999 MIT AI Laboratory Initial agreement is not enough… Agreement is high “Opaque” Agreement is poor

Viola 1999 MIT AI Laboratory Second pass uses information about occlusion Unoccluded Cameras Occluded camera is ignored Agreement becomes good “Opaque”

Viola 1999 MIT AI Laboratory Poxels Algorithm: Definitions Images I k (u,v) Ray Casting Grid of color & transparency v(x,y,z)

Viola 1999 MIT AI Laboratory Poxels: Model of Transparency Eye Voxels

Viola 1999 MIT AI Laboratory Poxels Algorithm: Agreement (Step 1) Point of highest agreement Point of highest agreement

Viola 1999 MIT AI Laboratory Results… Rendering of reconstructed shape.

Viola 1999 MIT AI Laboratory From ICCV paper... Input Image Reconstructed Volume

Viola 1999 MIT AI Laboratory … additional results

Viola 1999 MIT AI Laboratory Image Databases: Motivating Scenario Image Databases are proliferating –The Web –Commercial Image Databases –Video Databases –Catalog Databases “Find me a bag that looks like a Gucci.” –Virtual Museums “Find me impressionist portraits.” –Travel Information “Find me towns with Gothic architecture.” –Real-estate “Find me a home that is sunny and open.”

Viola 1999 MIT AI Laboratory But, the problem is very hard… There are a very wide variety of images...

Viola 1999 MIT AI Laboratory We have made good progress... Query: “Waterfall Images”

Viola 1999 MIT AI Laboratory Search for cars? Trained by example

Viola 1999 MIT AI Laboratory Complex Feature Representation Motivated by the Human brain… –Infero-temporal cortex computes many thousand selective features –Features are selective yet insensitive to unimportant variations –Every object/image has some but not all of these features Retrieval involves matching the most salient features

Viola 1999 MIT AI Laboratory Image Database Retrieval NTT: Visit 1/7/99

Viola 1999 MIT AI Laboratory Overview of IDB Meeting Motivation from MIT... Discuss current and related work –Flexible Templates –Complex Features –Demonstrations Related NTT Efforts Discussion of collaboration Future work Dinner

Viola 1999 MIT AI Laboratory Motivating Scenario Image Databases are proliferating –The Web –Commercial Image Databases –Video Databases –Catalog Databases “Find me a bag that looks like a Gucci.” –Virtual Museums “Find me impressionist portraits.” –Travel Information “Find me towns with Gothic architecture.” –Real-estate “Find me a home that is sunny and open.”

Viola 1999 MIT AI Laboratory There is a very wide variety of images...

Viola 1999 MIT AI Laboratory Search for images containing waterfalls?

Viola 1999 MIT AI Laboratory Search for cars?

Viola 1999 MIT AI Laboratory What makes IDB hard? Finding the right features –Insensitive to movement of components –Sensitive to critical properties Focussing attention –Not everything matters Generalization based on class –Given two images Small black dog & Large white dog (Don’t have much in common…) –Return other dogs

Viola 1999 MIT AI Laboratory Overview of IDB Meeting Motivation from MIT... Discuss current and related work –Flexible Templates –Complex Features –Demonstrations Related NTT Efforts Discussion of collaboration Future work Dinner

Viola 1999 MIT AI Laboratory Complex Feature Representation Motivated by the Human brain… –Infero-temporal cortex computes many thousand selective features –Features are selective yet insensitive to unimportant variations –Every object/image has some but not all of these features Retrieval involves matching the most salient features

Viola 1999 MIT AI Laboratory Features are extracted with many Convolution Filters source image x2x2 x2x2 x2x2 x2x2 Filters Vertical Horizontal

Viola 1999 MIT AI Laboratory * x2x2 x2x2 x2x2 * * x2x2 * x2x2 * x2x2 * x2x2 * * x2x2 x2x2 x2x2 * * x2x2 * x2x2 * x2x2 * x2x2 * Characteristic signature

Viola 1999 MIT AI Laboratory Feature Value Resolution is reduced at each step…

Viola 1999 MIT AI Laboratory projection 1projection 2 Query 1 (variation of object location) Query 2 (variation of lighting) * Not every feature is useful for a query Poor Features for query 1 Good Features for query 1 Poor Features for query 2 Features: A,B Features: C,D

Viola 1999 MIT AI Laboratory * * normalization average signature Query images Normalization brings some image closer to the mean Normalization of Signature Space

Viola 1999 MIT AI Laboratory q t 2 Distance/Similarity Measure Diagonal Mahalanobis Distance

Viola 1999 MIT AI Laboratory Image Database Progress at MIT Better learning algorithms to select features Developed a very compact feature representation –Fewer features required –2-3 bits per feature Pre-segmentation of images –Better learning –More selective queries Construction of object models: –Faces, people, cars, etc. (ICCV 99) factor of 72

Viola 1999 MIT AI Laboratory

Viola 1999 MIT AI Laboratory

Viola 1999 MIT AI Laboratory

Viola 1999 MIT AI Laboratory

Viola 1999 MIT AI Laboratory Conclusions Variable Viewpoint Reality –Prototypes constructed –New approaches Image Database Retrieval –New more efficient representations –Improved performance