Motivation Where is my W-2 Form?. Video-based Tracking Camera view of the desk Camera Overhead video camera.

Motivation Where is my W-2 Form?

Video-based Tracking Camera view of the desk Camera Overhead video camera

Example 40 minutes, 1024x768 @ 15 fps

System Overview User Input Video

System Overview User Internal Representation Video Analysis Vision Engine Desk TT+1 Input Video

Query Interface System Overview Query (where is my W-2 form?) Internal Representation Input Video User Video Analysis Desk TT+1

System Overview Query (where is my W-2 form?) Internal Representation Input Video User Video Analysis Answer Query Interface Desk TT+1

System Overview User Internal Representation Video Analysis Input Video Vision Engine Desk TT+1

Vision Problem … …

Event … …

Vision Problem … Event … … Desk

Vision Problem Desk … … Event … … Desk

Vision Problem … … tut-article.pdf sanders01.pdf objectspaces.pdfkidd94.pdf lowe04sift.pdf Event … … Desk

Vision Problem … … Event Scene Graph (DAG) … … tut-article.pdf sanders01.pdf objectspaces.pdfkidd94.pdf lowe04sift.pdf Desk

Assumptions Simplifying –Corresponding electronic copy exists

Assumptions Simplifying –Corresponding electronic copy exists –3 event types: move/entry/exit –One document at a time –Only topmost document can move –No duplicate copies of same document

Assumptions Simplifying –Corresponding electronic copy exists –3 event types: move/entry/exit –One document at a time –Only topmost document can move –No duplicate copies of same document Other –Desk need not be initially empty

Algorithm Overview Input Frames … … Event Detection Event Interpretation Document Recognition Scene Graph Update

Algorithm Overview Input Frames … … Event Detection Event Interpretation Document Recognition beforeafter Scene Graph Update

Algorithm Overview Input Frames … … Event Detection Event Interpretation “A document moved from (x 1,y 1 ) to (x 2,y 2 )” Document Recognition beforeafter Scene Graph Update

Algorithm Overview Input Frames … … Event Detection Event Interpretation “A document moved from (x 1,y 1 ) to (x 2,y 2 )” Document Recognition beforeafter File1.pdf File2.pdf File3.pdf Scene Graph Update

Event Detection … …

time Frame Difference … …

Event Detection time Threshold Event Frames time … … Frame Difference

Event Detection beforeafter Event Frames … …

Event Interpretation Move beforeafter

Event Interpretation Move Entry beforeafter

Event Interpretation Move Entry Exit beforeafter

Event Interpretation Move Entry Exit Motion: (x,y,θ) beforeafter

Event Interpretation Move Entry Exit 1. Move vs. Entry/Exit beforeafter

Event Interpretation Move Entry Exit 2. Entry vs. Exit beforeafter

Event Interpretation Use SIFT [Lowe 99] –Scale Invariant Feature Transform –Distinctive feature descriptor –Reliable object recognition

Move vs. Entry/Exit before after

Move vs. Entry/Exit before after Motion: (x,y,θ)

Entry vs. Exit beforeafter Example 1 (entry)

Entry vs. Exit beforeafter … File1.pdf File2.pdfFile3.pdfFile4.pdfFile5.pdfFile6.pdf …

Entry vs. Exit beforeafter Example 2 (entry)

Entry vs. Exit beforeafter … File1.pdf File2.pdfFile3.pdfFile4.pdfFile5.pdfFile6.pdf …

Entry vs. Exit beforeafter

Entry vs. Exit beforeafter ?

Entry vs. Exit beforeafter ……

Entry vs. Exit beforeafter …… Amount of change time

Document Recognition … File1.pdf File2.pdfFile3.pdfFile4.pdfFile5.pdfFile6.pdf Match against PDF image database …

Document Recognition … File1.pdf File2.pdfFile3.pdfFile4.pdfFile5.pdfFile6.pdf Match against PDF image database Performance –Can differentiate between ~100 (or more) documents –~200x300 pixels per document for reliable match …

before after Motion: (x,y,θ) Desk

Scene Graph Update before after Motion: (x,y,θ) Desk

Scene Graph Update before after Motion: (x,y,θ) Desk ?

Scene Graph Update before after Motion: (x,y,θ) Desk

Scene Graph Update Desk …

Photo Sorting Example

Current Directions Handle more realistic desktops Speed up processing time Other useful functionalities –Written annotation –Version management –Bookmark –Multi-user queries

For More Information Our publications –Jiwon Kim, Steven M. Seitz, Maneesh Agrawala. The Office of the Past. IEEE Workshop on Real-time Vision for HCI, 2004. –Jiwon Kim, Steven M. Seitz, Maneesh Agrawala. Video-based Document Tracking: Unifying Your Physical and Electronic Desktops. To appear in Proceedings of UIST, 2004. Other related work –David G. Lowe. Distinctive image features from scale invariant keypoints. International Journal of Computer Vision, 2004. –Pierre Wellner. Interacting with paper on the DigitalDesk. Communications of the ACM, 36(7):86.97,1993. –D. Rus and P. deSantis. The self-organizing desk. In Proceedings of International Joint Conference on Artificial Intelligence, 1997.

For More Information Our publications –Jiwon Kim, Steven M. Seitz, Maneesh Agrawala. The Office of the Past. IEEE Workshop on Real-time Vision for HCI, 2004. –Jiwon Kim, Steven M. Seitz, Maneesh Agrawala. Video-based Document Tracking: Unifying Your Physical and Electronic Desktops. To appear in Proceedings of UIST, 2004. Other related work –David G. Lowe. Object recognition from local scale-invariant features. International Conference on Computer Vision, 1999. –Pierre Wellner. Interacting with paper on the DigitalDesk. Communications of the ACM, 36(7):86.97,1993. –D. Rus and P. deSantis. The self-organizing desk. In Proceedings of International Joint Conference on Artificial Intelligence, 1997.

Motivation Where is my W-2 Form?. Video-based Tracking Camera view of the desk Camera Overhead video camera.

Similar presentations

Presentation on theme: "Motivation Where is my W-2 Form?. Video-based Tracking Camera view of the desk Camera Overhead video camera."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Motivation Where is my W-2 Form?. Video-based Tracking Camera view of the desk Camera Overhead video camera.

Similar presentations

Presentation on theme: "Motivation Where is my W-2 Form?. Video-based Tracking Camera view of the desk Camera Overhead video camera."— Presentation transcript:

Similar presentations

About project

Feedback