Motivation Where is my W-2 Form?. Video-based Tracking Camera view of the desk Camera Overhead video camera.

Slides:



Advertisements
Similar presentations
Distinctive Image Features from Scale-Invariant Keypoints
Advertisements

Feature extraction: Corners
Feature Detection. Description Localization More Points Robust to occlusion Works with less texture More Repeatable Robust detection Precise localization.
Presented by Xinyu Chang
CSE 473/573 Computer Vision and Image Processing (CVIP)
By: Ryan Wendel.  It is an ongoing analysis in which videos are analyzed frame by frame  Most of the video recognition is pulled from 3-D graphic engines.
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor
Outline Feature Extraction and Matching (for Larger Motion)
Kiyoshi Irie, Tomoaki Yoshida, and Masahiro Tomono 2011 IEEE International Conference on Robotics and Automation Shanghai International Conference Center.
ECE 562 Computer Architecture and Design Project: Improving Feature Extraction Using SIFT on GPU Rodrigo Savage, Wo-Tak Wu.
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
CPSC 425: Computer Vision (Jan-April 2007) David Lowe Prerequisites: 4 th year ability in CPSC Math 200 (Calculus III) Math 221 (Matrix Algebra: linear.
CMPT-884 Jan 18, 2010 Video Copy Detection using Hadoop Presented by: Cameron Harvey Naghmeh Khodabakhshi CMPT 820 December 2, 2010.
Feature extraction: Corners 9300 Harris Corners Pkwy, Charlotte, NC.
Affine Linear Transformations Prof. Graeme Bailey (notes modified from Noah Snavely, Spring 2009)
A Bayesian algorithm for tracking multiple moving objects in outdoor surveillance video Department of Electrical Engineering and Computer Science The University.
SIFT Guest Lecture by Jiwon Kim
Scale Invariant Feature Transform
Distinctive Image Feature from Scale-Invariant KeyPoints
Feature extraction: Corners and blobs
Distinctive image features from scale-invariant keypoints. David G. Lowe, Int. Journal of Computer Vision, 60, 2 (2004), pp Presented by: Shalomi.
Feature-based object recognition Prof. Noah Snavely CS1114
Object Recognition Using Distinctive Image Feature From Scale-Invariant Key point D. Lowe, IJCV 2004 Presenting – Anat Kaspi.
Scale Invariant Feature Transform (SIFT)
UIST 2004Kim, Seitz, Agrawala Video-Based Document Tracking: Unifying Your Physical and Electronic Desktops Jiwon KimSteven M. SeitzManeesh Agrawala University.
Blob detection.
Collaborative Spaces Presented by Jonathan Effrat David Tu.
Hand Signals Recognition from Video Using 3D Motion Capture Archive Tai-Peng Tian Stan Sclaroff Computer Science Department B OSTON U NIVERSITY I. Introduction.
Lecture 6: Feature matching and alignment CS4670: Computer Vision Noah Snavely.
Scale-Invariant Feature Transform (SIFT) Jinxiang Chai.
Electronic Visualization Laboratory University of Illinois at Chicago Interaction between Real and Virtual Humans: Playing Checkers R. Torre, S. Balcisoy.
Research Area B Leif Kobbelt. Communication System Interface Research Area B 2.
Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.
A Brief Overview of Computer Vision Jinxiang Chai.
Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.
AUTOMATIC ANNOTATION OF GEO-INFORMATION IN PANORAMIC STREET VIEW BY IMAGE RETRIEVAL Ming Chen, Yueting Zhuang, Fei Wu College of Computer Science, Zhejiang.
Lecture 4: Feature matching CS4670 / 5670: Computer Vision Noah Snavely.
Institute for Visualization and Perception Research 1 © Copyright 2000 Haim Levkowitz Introduction (Foley & Van Dam Ch 1) Uses of computer graphics … Some.
Computer Science Department Pacific University Artificial Intelligence -- Computer Vision.
CSCE 643 Computer Vision: Extractions of Image Features Jinxiang Chai.
Feature extraction: Corners 9300 Harris Corners Pkwy, Charlotte, NC.
Lecture 7: Features Part 2 CS4670/5670: Computer Vision Noah Snavely.
Puzzle Solver Sravan Bhagavatula EE 638 Project Stanford ECE.
Department of Psychology & The Human Computer Interaction Program Vision Sciences Society’s Annual Meeting, Sarasota, FL May 13, 2007 Jeremiah D. Still,
Efficient Visual Object Tracking with Online Nearest Neighbor Classifier Many slides adapt from Steve Gu.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Feature extraction: Corners and blobs. Why extract features? Motivation: panorama stitching We have two images – how do we combine them?
Virtual Image Peephole By Kyle Patience Supervisor: Reg Dodds Co Supervisor: Mehrdad Ghaziasgar.
Lecture 5: Image Interpolation and Features
Features Jan-Michael Frahm.
IEEE International Conference on Multimedia and Expo.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
What is Multimedia Anyway? David Millard and Paul Lewis.
Distinctive Image Features from Scale-Invariant Keypoints Presenter :JIA-HONG,DONG Advisor : Yen- Ting, Chen 1 David G. Lowe International Journal of Computer.
Keypoint extraction: Corners 9300 Harris Corners Pkwy, Charlotte, NC.
WLD: A Robust Local Image Descriptor Jie Chen, Shiguang Shan, Chu He, Guoying Zhao, Matti Pietikäinen, Xilin Chen, Wen Gao 报告人:蒲薇榄.
Lecture 5: Feature detection and matching CS4670 / 5670: Computer Vision Noah Snavely.
3D Puppetry: A Kinect-based Interface for 3D Animation
SIFT Scale-Invariant Feature Transform David Lowe
Lecture 07 13/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
CS201 Lecture 02 Computer Vision: Image Formation and Basic Techniques
Modeling the world with photos
Features Readings All is Vanity, by C. Allan Gilbert,
Aim of the project Take your image Submit it to the search engine
KFC: Keypoints, Features and Correspondences
Adaboost for faces. Material
ECE734 Project-Scale Invariant Feature Transform Algorithm
Lecture VI: Corner and Blob Detection
Jiwon Kim Steve Seitz Maneesh Agrawala
Presentation transcript:

Motivation Where is my W-2 Form?

Video-based Tracking Camera view of the desk Camera Overhead video camera

Example 40 minutes, 15 fps

System Overview User Input Video

System Overview User Internal Representation Video Analysis Vision Engine Desk TT+1 Input Video

Query Interface System Overview Query (where is my W-2 form?) Internal Representation Input Video User Video Analysis Desk TT+1

System Overview Query (where is my W-2 form?) Internal Representation Input Video User Video Analysis Answer Query Interface Desk TT+1

System Overview User Internal Representation Video Analysis Input Video Vision Engine Desk TT+1

Vision Problem … …

Event … …

Vision Problem … Event … … Desk

Vision Problem Desk … … Event … … Desk

Vision Problem … … tut-article.pdf sanders01.pdf objectspaces.pdfkidd94.pdf lowe04sift.pdf Event … … Desk

Vision Problem … … Event Scene Graph (DAG) … … tut-article.pdf sanders01.pdf objectspaces.pdfkidd94.pdf lowe04sift.pdf Desk

Assumptions Simplifying –Corresponding electronic copy exists

Assumptions Simplifying –Corresponding electronic copy exists –3 event types: move/entry/exit –One document at a time –Only topmost document can move –No duplicate copies of same document

Assumptions Simplifying –Corresponding electronic copy exists –3 event types: move/entry/exit –One document at a time –Only topmost document can move –No duplicate copies of same document Other –Desk need not be initially empty

Algorithm Overview Input Frames … … Event Detection Event Interpretation Document Recognition Scene Graph Update

Algorithm Overview Input Frames … … Event Detection Event Interpretation Document Recognition beforeafter Scene Graph Update

Algorithm Overview Input Frames … … Event Detection Event Interpretation “A document moved from (x 1,y 1 ) to (x 2,y 2 )” Document Recognition beforeafter Scene Graph Update

Algorithm Overview Input Frames … … Event Detection Event Interpretation “A document moved from (x 1,y 1 ) to (x 2,y 2 )” Document Recognition beforeafter File1.pdf File2.pdf File3.pdf Scene Graph Update

Algorithm Overview Input Frames … … Event Detection Event Interpretation “A document moved from (x 1,y 1 ) to (x 2,y 2 )” Document Recognition beforeafter File1.pdf File2.pdf File3.pdf Scene Graph Update

Algorithm Overview Input Frames … … Event Detection Event Interpretation “A document moved from (x 1,y 1 ) to (x 2,y 2 )” Document Recognition beforeafter File1.pdf File2.pdf File3.pdf Scene Graph Update

Event Detection … …

time Frame Difference … …

Event Detection time Threshold Event Frames time … … Frame Difference

Event Detection beforeafter Event Frames … …

Algorithm Overview Input Frames … … Event Detection Event Interpretation “A document moved from (x 1,y 1 ) to (x 2,y 2 )” Document Recognition beforeafter File1.pdf File2.pdf File3.pdf Scene Graph Update

Event Interpretation Move beforeafter

Event Interpretation Move Entry beforeafter

Event Interpretation Move Entry Exit beforeafter

Event Interpretation Move Entry Exit Motion: (x,y,θ) beforeafter

Event Interpretation Move Entry Exit 1. Move vs. Entry/Exit beforeafter

Event Interpretation Move Entry Exit 2. Entry vs. Exit beforeafter

Event Interpretation Use SIFT [Lowe 99] –Scale Invariant Feature Transform –Distinctive feature descriptor –Reliable object recognition

Move vs. Entry/Exit before after

Move vs. Entry/Exit before after

Move vs. Entry/Exit before after

Move vs. Entry/Exit before after

Move vs. Entry/Exit before after

Move vs. Entry/Exit before after

Move vs. Entry/Exit before after

Move vs. Entry/Exit before after

Move vs. Entry/Exit before after

Move vs. Entry/Exit before after

Move vs. Entry/Exit before after

Move vs. Entry/Exit before after Motion: (x,y,θ)

Entry vs. Exit beforeafter Example 1 (entry)

Entry vs. Exit beforeafter Example 1 (entry)

Entry vs. Exit beforeafter Example 1 (entry)

Entry vs. Exit beforeafter … File1.pdf File2.pdfFile3.pdfFile4.pdfFile5.pdfFile6.pdf …

Entry vs. Exit beforeafter Example 2 (entry)

Entry vs. Exit beforeafter … File1.pdf File2.pdfFile3.pdfFile4.pdfFile5.pdfFile6.pdf …

Entry vs. Exit beforeafter

Entry vs. Exit beforeafter ?

Entry vs. Exit beforeafter ?

Entry vs. Exit beforeafter ……

Entry vs. Exit beforeafter …… Amount of change time

Algorithm Overview Input Frames … … Event Detection Event Interpretation “A document moved from (x 1,y 1 ) to (x 2,y 2 )” Document Recognition beforeafter File1.pdf File2.pdf File3.pdf Scene Graph Update

Document Recognition … File1.pdf File2.pdfFile3.pdfFile4.pdfFile5.pdfFile6.pdf Match against PDF image database …

Document Recognition … File1.pdf File2.pdfFile3.pdfFile4.pdfFile5.pdfFile6.pdf Match against PDF image database Performance –Can differentiate between ~100 (or more) documents –~200x300 pixels per document for reliable match …

Algorithm Overview Input Frames … … Event Detection Event Interpretation “A document moved from (x 1,y 1 ) to (x 2,y 2 )” Document Recognition beforeafter File1.pdf File2.pdf File3.pdf Scene Graph Update

before after Motion: (x,y,θ) Desk

Scene Graph Update before after Motion: (x,y,θ) Desk

Scene Graph Update before after Motion: (x,y,θ) Desk

Scene Graph Update before after Motion: (x,y,θ) Desk ?

Scene Graph Update before after Motion: (x,y,θ) Desk

Scene Graph Update Desk …

Scene Graph Update Desk …

Photo Sorting Example

Current Directions Handle more realistic desktops Speed up processing time Other useful functionalities –Written annotation –Version management –Bookmark –Multi-user queries

For More Information Our publications –Jiwon Kim, Steven M. Seitz, Maneesh Agrawala. The Office of the Past. IEEE Workshop on Real-time Vision for HCI, –Jiwon Kim, Steven M. Seitz, Maneesh Agrawala. Video-based Document Tracking: Unifying Your Physical and Electronic Desktops. To appear in Proceedings of UIST, Other related work –David G. Lowe. Distinctive image features from scale invariant keypoints. International Journal of Computer Vision, –Pierre Wellner. Interacting with paper on the DigitalDesk. Communications of the ACM, 36(7):86.97,1993. –D. Rus and P. deSantis. The self-organizing desk. In Proceedings of International Joint Conference on Artificial Intelligence, 1997.

For More Information Our publications –Jiwon Kim, Steven M. Seitz, Maneesh Agrawala. The Office of the Past. IEEE Workshop on Real-time Vision for HCI, –Jiwon Kim, Steven M. Seitz, Maneesh Agrawala. Video-based Document Tracking: Unifying Your Physical and Electronic Desktops. To appear in Proceedings of UIST, Other related work –David G. Lowe. Object recognition from local scale-invariant features. International Conference on Computer Vision, –Pierre Wellner. Interacting with paper on the DigitalDesk. Communications of the ACM, 36(7):86.97,1993. –D. Rus and P. deSantis. The self-organizing desk. In Proceedings of International Joint Conference on Artificial Intelligence, 1997.