IIIT HYDERABAD Techniques for Organization and Visualization of Community Photo Collections Kumar Srijan Faculty Advisor : Dr. C.V. Jawahar.

Slides:



Advertisements
Similar presentations
Image Retrieval with Geometry-Preserving Visual Phrases
Advertisements

Location Recognition Given: A query image A database of images with known locations Two types of approaches: Direct matching: directly match image features.
Aggregating local image descriptors into compact codes
Registration for Robotics Kurt Konolige Willow Garage Stanford University Patrick Mihelich JD Chen James Bowman Helen Oleynikova Freiburg TORO group: Giorgio.
Three things everyone should know to improve object retrieval
Presented by Xinyu Chang
Building Rome in a Day Sameer Agarwal1 Noah Snavely2 Ian Simon1 Steven M. Seitz1 Richard Szeliski3 1University of Washington 2Cornell University 3Microsoft.
TP14 - Local features: detection and description Computer Vision, FCUP, 2014 Miguel Coimbra Slides by Prof. Kristen Grauman.
Recent work in image-based rendering from unstructured image collections and remaining challenges Sudipta N. Sinha Microsoft Research, Redmond, USA.
Low Complexity Keypoint Recognition and Pose Estimation Vincent Lepetit.
Discrete-Continuous Optimization for Large-scale Structure from Motion David Crandall, Andrew Owens, Noah Snavely, Dan Huttenlocher Presented by: Rahul.
Localization in indoor environments by querying omnidirectional visual maps using perspective images Miguel Lourenco, V. Pedro and João P. Barreto ICRA.
Image alignment Image from
CVPR 2008 James Philbin Ondˇrej Chum Michael Isard Josef Sivic
Bag of Features Approach: recent work, using geometric information.
Robust and large-scale alignment Image from
Structure from motion.
WISE: Large Scale Content-Based Web Image Search Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley 1.
Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.
Lecture 23: Structure from motion and multi-view stereo
Lecture 11: Structure from motion, part 2 CS6670: Computer Vision Noah Snavely.
Lecture 11: Structure from motion CS6670: Computer Vision Noah Snavely.
Lecture 22: Structure from motion CS6670: Computer Vision Noah Snavely.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Lecture 3a: Feature detection and matching CS6670: Computer Vision Noah Snavely.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.
Global Alignment and Structure from Motion Computer Vision CSE455, Winter 2008 Noah Snavely.
Lecture 6: Feature matching and alignment CS4670: Computer Vision Noah Snavely.
Lecture 12: Structure from motion CS6670: Computer Vision Noah Snavely.
Matthew Brown University of British Columbia (prev.) Microsoft Research [ Collaborators: † Simon Winder, *Gang Hua, † Rick Szeliski † =MS Research, *=MS.
Nonlinear Dimensionality Reduction Approaches. Dimensionality Reduction The goal: The meaningful low-dimensional structures hidden in their high-dimensional.
Overview Introduction to local features
Yuping Lin and Gérard Medioni.  Introduction  Method  Register UAV streams to a global reference image ▪ Consecutive UAV image registration ▪ UAV to.
Local invariant features Cordelia Schmid INRIA, Grenoble.
Lecture 4: Feature matching CS4670 / 5670: Computer Vision Noah Snavely.
1 Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval Ondrej Chum, James Philbin, Josef Sivic, Michael Isard and.
Fast Similarity Search for Learned Metrics Prateek Jain, Brian Kulis, and Kristen Grauman Department of Computer Sciences University of Texas at Austin.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
Large Scale Discovery of Spatially Related Images Ondřej Chum and Jiří Matas Center for Machine Perception Czech Technical University Prague.
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.
Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.
IIIT Hyderabad Document Image Retrieval using Bag of Visual Words Model Ravi Shekhar CVIT, IIIT Hyderabad Advisor : Prof. C.V. Jawahar.
Lecture 7: Features Part 2 CS4670/5670: Computer Vision Noah Snavely.
Local invariant features Cordelia Schmid INRIA, Grenoble.
IIIT HYDERABAD Image-based walkthroughs from partial and incremental scene reconstructions Kumar Srijan Syed Ahsan Ishtiaque C. V. Jawahar Center for Visual.
Scene Reconstruction Seminar presented by Anton Jigalin Advanced Topics in Computer Vision ( )
Overview Introduction to local features Harris interest points + SSD, ZNCC, SIFT Scale & affine invariant interest point detectors Evaluation and comparison.
Local features: detection and description
Bundling Features for Large Scale Partial-Duplicate Web Image Search Zhong Wu ∗, Qifa Ke, Michael Isard, and Jian Sun Microsoft Research.
Geometry-aware Feature Matching for Structure from Motion Applications Rajvi Shah, Vanshika Srivastava, P J Narayanan Center for Visual Information Technology.
776 Computer Vision Jan-Michael Frahm Spring 2012.
Lecture 22: Structure from motion CS6670: Computer Vision Noah Snavely.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.
Invariant Local Features Image content is transformed into local feature coordinates that are invariant to translation, rotation, scale, and other imaging.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching Link: singhashwini.mesinghashwini.me.
Lecture 07 13/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Capturing, Processing and Experiencing Indian Monuments
TP12 - Local features: detection and description
Approximate Models for Fast and Accurate Epipolar Geometry Estimation
Modeling the world with photos
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Structure from motion Input: Output: (Tomasi and Kanade)
Features Readings All is Vanity, by C. Allan Gilbert,
Geometric Hashing: An Overview
Noah Snavely.
KFC: Keypoints, Features and Correspondences
CSE 185 Introduction to Computer Vision
Computational Photography
Structure from motion Input: Output: (Tomasi and Kanade)
Lecture 15: Structure from motion
Presentation transcript:

IIIT HYDERABAD Techniques for Organization and Visualization of Community Photo Collections Kumar Srijan Faculty Advisor : Dr. C.V. Jawahar

IIIT HYDERABAD Community Photo Collections Anyone can take photographs! Sharing photographs is easy! Searching for photographs is easy!

IIIT HYDERABAD Community Photo Collections Golkonda Fort (Google Images + Flickr) – > 50 K images

IIIT HYDERABAD Photo Tourism Noah Snavely, Steven M. Seitz, Richard Szeliski – Photo tourism: Exploring photo collections in 3D Photo tourism: Exploring photo collections in 3D – Photosynth

IIIT HYDERABAD Photo Tourism Computing Correspondences Feature Extraction Pairwise Feature Matching Match Refinement Track Creation Incremental SfM Seeding Add new images and triangulate new points Bundle adjust Snavely et. al, Photo Tourism: Exploring image collections in 3D Input Images Full Scene Reconstruction

IIIT HYDERABAD Bottlenecks and Issues Quadratic Image Matching cost Global scene reconstruction – O(N 4 ) in the worst case – Sensitivity to the choice of the initial pair – Cascading of errors Image credits: Snavely et. al, Photo Tourism: Exploring image collections in 3D

IIIT HYDERABAD Bottlenecks and Issues Timing Breakdown Snavely et. al, Photo Tourism: Exploring image collections in 3D Full Scene Reconstruction for Trafalgar Square with 8000 images took > 50 days

IIIT HYDERABAD Motivation CPCs are unstructured collections – Different resolutions, viewpoint, lighting conditions… – Very limited number of images match Contribution 1 : Matching – Exhaustive pairwise matching w/o quadratic cost Contribution 2 : Visualization – Framework for bypassing the issues faced with Incremental Sfm.

IIIT HYDERABAD Image Matching Problem Compute Image Match Graph – Images  Nodes – Image Match  Edges Queries: – Connected components – Shortest path

IIIT HYDERABAD Discovering Matching Images Object Retrieval with Large Vocabularies and Fast Spatial Matching – Philbin et al. Image Retrieval 1. Indexing Image Database – Quantization : Image Features  Visual Words(VW) – Inverted Index : over VWs 2. Querying Image Database – Filtering  Shortlist of Top Scoring matches – Verification  of shortlist O(N) time for a single querying

IIIT HYDERABAD Discovering Matching Images Large Scale Discovery of Spatially Related Images - Chum, O. and Matas. J

IIIT HYDERABAD Our Solution : Overview Exhaustive Pairwise Matching – Query each image in turn Goal : O(1) per query Addressing Exhaustiveness – Verify all potential matches : No shortlists – Verification doable from Index retrievals Our Main Result : Indexing geometry allows both!

IIIT HYDERABAD Indexing Geometry High Order Features – Combine nearby features Primary with Secondary Features Encode Affine Invariants – Relative Orientation and Scale – Normalized distance – Baseline orientation – HOF is a Tuple Huge Feature Space

IIIT HYDERABAD Constant Time Queries using HOFs Regular Inverted Index – Posting lists grow with Database size O(N) HOF => Huge Feature Space ( > ) – Reproject with Hash Functions! – Range α Database size Constant sized posting lists Result : Constant time queries

IIIT HYDERABAD Spatial Verification Computable from index retrievals – For a query primary feature Search all secondary features in database images Pass if R features are found.

IIIT HYDERABAD Solution : Summary Extract HoF in the N database images Select Reprojection size as CN Initialize an Index of size CN Indexing – Key : Hash value of HoF – Value : Image Id Query : Each image in turn – Record matches in adjacency list Result : Image Match Graph

IIIT HYDERABAD Results UK benchmark – 2550 categories x 4 = images – 73.2 % recall – Large Scale Discovery of Spatially Related Images (Min Hash based solution) 49.6 % recall

IIIT HYDERABAD Results Oxford 5KOxford 105K #HOF78 Mn1480 Mn Index Size2^252^292^30 Feature Extraction Time 27 min8 hours Query Time per Image sec0.085 sec0.061 sec Query Time2 min2.5 hours1.8 hours Clusters Found Images Registered

IIIT HYDERABAD Results Small Clusters Errors

IIIT HYDERABAD Visualizing CPCs

IIIT HYDERABAD Problem Statement Efficiently browse and keep Incorporating incoming stream of images

IIIT HYDERABAD Our Solution : Overview Observation : In a walkthrough, users primarily see nearby overlapping images. Advantages: – Robustness to errors in incremental SfM module – Worst case linear running time – Scalable – Incremental Independent Partial Scene Reconstructions instead of Global Scene Reconstruction

IIIT HYDERABAD Partial Reconstructions Image Match Compute MatchesRefine Matches Compute partial Reconstructions Standard SfM Correct Match Incorrect Match

IIIT HYDERABAD Visualization Interface User interface and navigation Input images Verified neighbors Sample image Partial reconstruction

IIIT HYDERABAD Incremental insertion New Image Match Geometric Verification Compute Partial Scene Reconstruction Improve Connectivity

IIIT HYDERABAD Dataset Fort Dataset 5989 images Golconda Fort, Hyderabad

IIIT HYDERABAD Results

IIIT HYDERABAD Results

IIIT HYDERABAD Results Courtyard Dataset with 687 images Initialized with 200 images Added 487 image one by one Largest CC of 674 images.

IIIT HYDERABAD Conclusions Image Matching : HOFs gives a larger feature space which can be reprojected to obtain sparse posting lists making Exhaustive Pairwise Matching feasible. CPCs Visualization : Partial scene reconstructions can effectively be used to navigate through large collections of images. – Bypasses issues faced by standard Sfm.

IIIT HYDERABAD Thank you! QUESTIONS ?! Take Home Message : 2 ideas – For information retrieval using a inverted index, combining features gives a larger feature space which can be reprojected to control the average lengths of posting lists, and thus the query time. – For a very complex algorithm O(N > 2), it may sometimes be meaningful to fragment the dataset into O(N) groups, each of finite size, there by reducing the overall complexity to O(N).

IIIT HYDERABAD Thank You! Questions

IIIT HYDERABAD Backup Slides

IIIT HYDERABAD Photo Tourism Annotation Transfer

IIIT HYDERABAD Matching images Correspondence computation Match Verification – RANSAC based epipolar geometry estimation – Expensive

IIIT HYDERABAD Establishing Correspondences SIFT features : D. Lowe – Scale Invariant Feature Transform – Key points Detection Description : 128D Correspondence – Key points with Similar descriptors Alternatives : SURF, Brisk..

IIIT HYDERABAD Image Retrieval Feature Quantization – Visual Words A B C D E F G A B C D E F G

IIIT HYDERABAD Image Retrieval Feature Quantization – Visual Words Inverted Indexing Visual WordImage Ids A0, 1, 3, 4, 7 B0, 1, 2, 5, 8, 9 C1, 3, 6, 8 D1, 2, 4, 6, 8 E2, 4, 6, 9 F3, 4, 6, 8, 9... Query visual Word (E)

IIIT HYDERABAD Image Retrieval Feature Quantization – Visual Words Inverted Indexing Geometric verification – Epipolar Geometry

IIIT HYDERABAD Bloom Filters Bloom Filter – Set Membership – Bit array(m) – Hash Functions(k) – Elements(n) Insert(A) H1 H2 H3 A

IIIT HYDERABAD Bloom Filters Bloom Filter – Set Membership – Bit array(m) – Hash Functions(k) – Elements(n) Insert(A) H1 H2 H3 A

IIIT HYDERABAD Bloom Filters Bloom Filter – Set Membership – Bit array(m) – Hash Functions(k) – Elements(n) Insert(A) Insert(B) H1 H2 H3 B

IIIT HYDERABAD Bloom Filters Bloom Filter – Set Membership – Bit array(m) – Hash Functions(k) – Elements(n) Insert(A) Insert(B) Query(C) – Not present H1 H2 H3 C Set = {A,B}

IIIT HYDERABAD Bloom Filters Bloom Filter – Set Membership – Bit array(m) – Hash Functions(k) – Elements(n) Insert(A) Insert(B) Query(C) – Not present Query(D) – False positive H1 H2 H3 D Set = {A,B}

IIIT HYDERABAD Global vs. Partial Global : Allows transition to any image Partial : Allows transition to a limited number of overlapping images A -> B implies B -> A A B B A