Collective Vision: Using Extremely Large Photograph Collections Mark Lenz CameraNet Seminar University of Wisconsin – Madison February 2, 2010 Acknowledgments:

Slides:



Advertisements
Similar presentations
Trees for spatial indexing
Advertisements

Wiki-Reality: Augmenting Reality with Community Driven Websites Speaker: Yi Wu Intel Labs/vision and image processing research Collaborators: Douglas Gray,
Yan-Tao Zheng1, Ming Zhao2, Yang Song2, Hartwig Adam2 Ulrich Buddemeier2, Alessandro Bissacco2, Fernando Brucher2 Tat-Seng Chua1, and Hartmut Neven2 1.
Employing structural representation for symbol detection, symbol spotting and indexation in line drawing document images Muhammad Muzzamil Luqman
Ignas Budvytis*, Tae-Kyun Kim*, Roberto Cipolla * - indicates equal contribution Making a Shallow Network Deep: Growing a Tree from Decision Regions of.
DONG XU, MEMBER, IEEE, AND SHIH-FU CHANG, FELLOW, IEEE Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment.
Three things everyone should know to improve object retrieval
Object recognition and scene “understanding”
Presented by Xinyu Chang
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
VisualRank: Applying PageRank to Large-Scale Image Search Yushi Jing, Member, IEEE, and Shumeet Baluja, Member, IEEE.
UCB Computer Vision Animals on the Web Tamara L. Berg CSE 595 Words & Pictures.
Automatically Annotating and Integrating Spatial Datasets Chieng-Chien Chen, Snehal Thakkar, Crail Knoblock, Cyrus Shahabi Department of Computer Science.
Data Mining Techniques: Clustering
Presented by: Mingyuan Zhou Duke University, ECE September 18, 2009
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
BEYOND SIMPLE FEATURES: A LARGE-SCALE FEATURE SEARCH APPROACH TO UNCONSTRAINED FACE RECOGNITION Nicolas Pinto Massachusetts Institute of Technology David.
Tour the World: building a web-scale landmark recognition engine ICCV 2009 Yan-Tao Zheng1, Ming Zhao2, Yang Song2, Hartwig Adam2 Ulrich Buddemeier2, Alessandro.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Object-based Image Representation Dr. B.S. Manjunath Sitaram Bhagavathy Shawn Newsam Baris Sumengen Vision Research Lab University of California, Santa.
Semantic text features from small world graphs Jure Leskovec, IJS + CMU John Shawe-Taylor, Southampton.
Kyle Heath, Natasha Gelfand, Maks Ovsjanikov, Mridul Aanjaneya, Leo Guibas Image Webs Computing and Exploiting Connectivity in Image Collections.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Video/Image Fingerprinting & Search Naren Chittar CS 223-B project, Winter 2008.
Computer Vision Group University of California Berkeley Recognizing Objects in Adversarial Clutter: Breaking a Visual CAPTCHA Greg Mori and Jitendra Malik.
Spatial Pyramid Pooling in Deep Convolutional
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
Oral Defense by Sunny Tang 15 Aug 2003
© 2013 IBM Corporation Efficient Multi-stage Image Classification for Mobile Sensing in Urban Environments Presented by Shashank Mujumdar IBM Research,
IIIT Hyderabad Atif Iqbal and Anoop Namboodiri Cascaded.
Improving web image search results using query-relative classifiers Josip Krapacy Moray Allanyy Jakob Verbeeky Fr´ed´eric Jurieyy.
Indexing Techniques Mei-Chen Yeh.
Bag of Video-Words Video Representation
Collective Vision: Using Extremely Large Photograph Collections Mark Lenz CameraNet Seminar University of Wisconsin – Madison January 26, 2010 Acknowledgments:
Computer Vision James Hays, Brown
Keypoint-based Recognition Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/04/10.
Automatically Identifying Localizable Queries Center for E-Business Technology Seoul National University Seoul, Korea Nam, Kwang-hyun Intelligent Database.
PageRank for Product Image Search Kevin Jing (Googlc IncGVU, College of Computing, Georgia Institute of Technology) Shumeet Baluja (Google Inc.) WWW 2008.
Recognition using Regions (Demo) Sudheendra V. Outline Generating multiple segmentations –Normalized cuts [Ren & Malik (2003)] Uniform regions –Watershed.
Data Structures for Orthogonal Range Queries A New Data Structure and Comparison to Previous Work. Application to Contact Detection in Solid Mechanics.
Why Categorize in Computer Vision ?. Why Use Categories? People love categories!
VAST 2011 Sebastian Bremm, Tatiana von Landesberger, Martin Heß, Tobias Schreck, Philipp Weil, and Kay Hamacher Interactive-Graphics Systems TU Darmstadt,
Presenter: Lung-Hao Lee ( 李龍豪 ) January 7, 309.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li and Li Fei-Fei Dept. of Computer Science, Princeton University, USA CVPR ImageNet1.
Mingyang Zhu, Huaijiang Sun, Zhigang Deng Quaternion Space Sparse Decomposition for Motion Compression and Retrieval SCA 2012.
School of Engineering and Computer Science Victoria University of Wellington Copyright: Peter Andreae, VUW Image Recognition COMP # 18.
Computational Geometry Piyush Kumar (Lecture 5: Range Searching) Welcome to CIS5930.
Prostate Cancer CAD Michael Feldman, MD, PhD Assistant Professor Pathology University Pennsylvania.
Scene Reconstruction Seminar presented by Anton Jigalin Advanced Topics in Computer Vision ( )
An Approximate Nearest Neighbor Retrieval Scheme for Computationally Intensive Distance Measures Pratyush Bhatt MS by Research(CVIT)
Search Worms, ACM Workshop on Recurring Malcode (WORM) 2006 N Provos, J McClain, K Wang Dhruv Sharma
Lecture 8: Feature matching CS6670: Computer Vision Noah Snavely.
Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework N 工科所 錢雅馨 2011/01/16 Li-Jia Li, Richard.
KAIST TS & IS Lab. CS710 Know your Neighbors: Web Spam Detection using the Web Topology SIGIR 2007, Carlos Castillo et al., Yahoo! 이 승 민.
CS332 Visual Processing Department of Computer Science Wellesley College High-Level Vision Face Recognition I.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Meeting 8: Features for Object Classification Ullman et al.
Plan for today Introduction Graph Matching Method Theme Recognition Comparison Conclusion.
Fast Human Detection in Crowded Scenes by Contour Integration and Local Shape Estimation Csaba Beleznai, Horst Bischof Computer Vision and Pattern Recognition,
ITree: Exploring Time-Varying Data using Indexable Tree Yi Gu and Chaoli Wang Michigan Technological University Presented at IEEE Pacific Visualization.
Flow cytometry data analysis: SPADE for cell population identification and sample clustering Narahara.
Content-Based Image Retrieval Readings: Chapter 8:
Mean Shift Segmentation
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
“Bayesian Identity Clustering”
Grouping/Segmentation
Presentation transcript:

Collective Vision: Using Extremely Large Photograph Collections Mark Lenz CameraNet Seminar University of Wisconsin – Madison February 2, 2010 Acknowledgments: These slides combine and modify slides provided by Yantao Zheng et al. (National University of Singapore/Google)

Last Time Distributed Collaboration Google Goggles –Personal object recognition World-Wide Landmark Recognition – Building the engine

Today World-Wide Landmark Recognition – Querying the engine Building Rome in a Day –Distributed matching and reconstruction Discussion

Unsupervised learning of landmark images Geo- clusters Landmarks from tour articles Noisy image pool Visual clustering Premise: photos from landmark should be similar Clustering based on local features Validate and clean models Visual model validates landmarks! Photo v.s. non-photo classifer to filter out noisy images ……

Object matching based on local features Sim( ) = image match score, Image representation Interest points: Laplacian-of-Gaussian (LoG) filter Local feature: Gabor wavelets match score = Probability that match of and is false positive Probability of at least m out of n features match, if Probability of a feature match by chance

Match Region Graph Image matching Node is match region 2 types of edges: match edge: measures match confidence overlap region edge: measures spatial overlapping Agglomerative hierarchical clustering Visual clusters

False detected images Match is technically correct, but match region is not landmark Match is technically false, due to visual similarity A problem of model generation A problem of image feature and matching mechanism For positive images: 337/417 (80.8%) are correct Identification rate: 337/728 (46.3%) For negative images: False acceptance rate: 1.1%

Landmark Recognition All local features indexed in one k-d tree Match region - interest points that contribute to a match between two images

k-d trees k-dimensional binary tree Sub-trees split at median w.r.t one dim Cycle through dimensions Creates “bins” of NNs Indexing local feature for matching Query time: ~0.2 sec in a P4 computer

Landmark Recognition Detect features on query image For each feature in query image – Find NN features using k-d tree NN features link to their model image Score match regions between query and model images

Scoring Match Regions Query image interest points matching points in model image determined through NN search Match score = 1-P FPij (probability match b/w regions is false positive) – P FPij is based on the number of matched points Match threshold = total score > 5

Intuition Query image should have many interest points with matches in match region = high match score Points should have matches in multiple regions (images) - threshold

Building Rome in a Day Use photos from photo-sharing websites to build 3D models of cities Web photos less structured than automated image capture (e.g. aerial) Increased efficiency through distributed computations

Multi-Stage Parallel Matching Matching is distributed across nodes Vocabulary tree-inspired match proposals – For distributed matching Query Expansion to increase cluster density – Match proposals create only sparse clusters

Conclusion Distributed Collaboration Google Goggles –Personal object recognition World-Wide Landmark Recognition Building Rome in a Day –Distributed matching and reconstruction

Thoughts for Discussion Geo-clustering to filter out seldom traveled/photographed sites Match region graph for view comparison Pre-tag landmarks such as exits Augmented reality Distributed matching of features Ad-hoc wireless network range Other thoughts...