Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.

Slides:

Advertisements

Similar presentations

James Hays and Alexei A. Efros Carnegie Mellon University CVPR IM2GPS: estimating geographic information from a single image Wen-Tsai Huang.

Advertisements

Christie Tyler.  Online maps are searchable databases that can display various map data on a web page. ◦ Google Maps Live Search Maps (now Bing Maps)

Building Rome in a Day Sameer Agarwal1 Noah Snavely2 Ian Simon1 Steven M. Seitz1 Richard Szeliski3 1University of Washington 2Cornell University 3Microsoft.

Print meets Web 2.0. Information sharing Interoperability User-centered design Collaboration Web 2.0.

Data-driven Visual Similarity for Cross-domain Image Matching

Discrete-Continuous Optimization for Large-scale Structure from Motion David Crandall, Andrew Owens, Noah Snavely, Dan Huttenlocher Presented by: Rahul.

Large dataset for object and scene recognition A. Torralba, R. Fergus, W. T. Freeman 80 million tiny images Ron Yanovich Guy Peled.

1 Search Engines What is the Internet? The Web is only part of the Internet The Internet is a computer network connecting millions of computers.

Vision For Graphics ICCV 2005 Vision for Graphics Larry Zitnick, Sing Bing Kang, Rick Szeliski Interactive Visual Media Group Microsoft Research Steve.

Proceedings of the IEEE 2010 Antonio Torralba, MIT Jenny Yuen, MIT Bryan C. Russell, MIT.

Landmark Classification in Large- scale Image Collections Yunpeng Li David J. Crandall Daniel P. Huttenlocher ICCV 2009.

Tour the World: building a web-scale landmark recognition engine ICCV 2009 Yan-Tao Zheng1, Ming Zhao2, Yang Song2, Hartwig Adam2 Ulrich Buddemeier2, Alessandro.

Small Codes and Large Image Databases for Recognition CVPR 2008 Antonio Torralba, MIT Rob Fergus, NYU Yair Weiss, Hebrew University.

Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Fast and Compact Retrieval Methods in Computer Vision Part II A. Torralba, R. Fergus and Y. Weiss. Small Codes and Large Image Databases for Recognition.

Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman.

Lecture 26: Vision for the Internet CS6670: Computer Vision Noah Snavely.

By: Mohamad Alhamada.. Table of Content. What is PhotoSynth? Using PhotoSynth. Features. In the media. Bibliography.

Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Let Computer Draw Qingyuan Kong. Goal Give me a picture “Obama stands in front of a pyramid”

Multi-view stereo Many slides adapted from S. Seitz.

A. Frank Multimedia Multimedia/Video Search. 2 A. Frank Contents Multimedia (MM) and search/retrieval Text-based MM search in General SEs Text-based MM.

Where computer vision needs help from computer science (and machine learning) Bill Freeman Electrical Engineering and Computer Science Dept. Massachusetts.

Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.

Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.

Extreme, Non-parametric Object Recognition 80 million tiny images (Torralba et al)

Opportunities of Scale Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.

IS Today (Valacich & Schneider) 5/e Copyright © 2012 Pearson Education, Inc. Published as Prentice Hall 7/2/ Facebook is the most popular social.

Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.

Photosynth Brian Reagan MIS 304 Professor Fang Fang “Imagine being able to share the places and things you love using the cinematic quality of a movie,

Creating and Exploring a Large Photorealistic Virtual Space INRIA / CSAIL / Adobe First IEEE Workshop on Internet Vision, associated with CVPR 2008.

Matthew Brown University of British Columbia (prev.) Microsoft Research [ Collaborators: † Simon Winder, *Gang Hua, † Rick Szeliski † =MS Research, *=MS.

SBU Digital Media CSE 690 Internet Vision Organizational Meeting Tamara Berg Assistant Professor SUNY Stony Brook.

How Search Engines Work. Any ideas? Building an index Dan taylor Flickr Creative Commons.

Research Area B Leif Kobbelt. Communication System Interface Research Area B 2.

CS 1950-G Computational Photography Instructor: James Hays HTA: Patrick Doran UTA: Alex Collins.

Shin’ichi Satoh National Institute of Informatics.

Information Technology Industry Report Brown University ADSP Lab 余渊善

Lecture #32 WWW Search. Review: Data Organization Kinds of things to organize –Menu items –Text –Images –Sound –Videos –Records (I.e. a person ’ s name,

Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.

Human abilities Presented By Mahmoud Awadallah 1.

Food – Photo & Community Michael Wong 27 Nov 2012 Copyright © All rights reserved.

Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.

PICASA Erceg Aleksandra II4. About -Picasa is used for organizing, viewing and editing digital photos. It also has an integrated photo-sharing website.

T HE U SES O F C OMPUTERS The Slide Show will Begin Shortly.

Data-driven methods: Video & Texture Cs195g Computational Photography James Hays, Brown, Spring 2010 Many slides from Alexei Efros.

Scale-less Dense Correspondences Tal Hassner The Open University of Israel ICCV’13 Tutorial on Dense Image Correspondences for Computer Vision.

80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.

UNBIASED LOOK AT DATASET BIAS Antonio Torralba Massachusetts Institute of Technology Alexei A. Efros Carnegie Mellon University CVPR 2011.

Visual Data on the Internet With slides from Alexei Efros, James Hays, Antonio Torralba, and Frederic Heger : Computational Photography Jean-Francois.

Lecture 8: Feature matching CS6670: Computer Vision Noah Snavely.

Computer Vision Overview Marc Schlosberg CS 175 – Spring 2015.

Pascal Kelm Technische Universität Berlin Communication Systems Group Thursday, 2 April 2009 Video Key Frame Extraction for image-based Applications.

Optimizing through Image Sharing Sites Chris Silver Smith Head, Technology & Development Superpages.com by Idearc Media.

Search Engine Optimization SEO… In Design. Introduction: What is SEO? - Is a process of improving the visibility of a website/ webpage in search engine.

Picasa Area 2 CAT presentation by Derek Southern October 21, 2010.

Internet-scale Imagery for Graphics and Vision James Hays cs129 Computational Photography Brown University, Spring 2011.

S TORY BOARD Done by: Zahra Hamed Bani Oraba ID:

Tentative Future Courses Fall `11 : Computer Vision – emphasis on recognition Spring `11 : Graduate seminar Fall `12 : Computational Photography.

CS 4501: Introduction to Computer Vision Sparse Feature Detectors: Harris Corner, Difference of Gaussian Connelly Barnes Slides from Jason Lawrence, Fei.

Community and Social Impact

Community and Social Impact

Can computers match human perception?

Opportunities of Scale, Part 2

Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT

Modeling the world with photos

Rob Fergus Computer Vision

Overview of Multimedia Mapping Three technical sessions…

Lecture 23: Structure from motion 2

All About the Internet.

Presentation transcript:

Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010

Big issues What is out there on the Internet? How do we get it? What can we do with it? How do we compute distances between images?

The Internet as a Data Source Social Networking Sites (e.g. Facebook, MySpace) Image Search Engines (e.g. Google, Bing) Photo Sharing Sites (e.g. Flickr, Picasa, Panoramio, photo.net, dpchallenge.com) Computer Vision Databases (e.g. CalTech 256, PASCAL VOC, LabelMe, Tiny Images, image- net.org, ESP game, Squigl, Matchin)

How Big is Flickr? As of June 19 th, 2009 Total content: – 3.6 billion photographs – 100+ million geotagged images Public content: – 1.3 billion photographs – 74 million geotagged images

How Annotated is Flickr? (tag search) Party – 7,355,998 Paris – 4,139,927 Chair – 232,885 Violin – 55,015 Trashcan – 9,818

Trashcan Results gs&z=t&page=5 gs&z=t&page=5

Different ways to leverage Internet Data Aggregate Statistics (e.g. Photo collection priors, Image sequence geolocation) Text keywords, other metadata (e.g. Phototourism, Photo Clip Art, sketch2photo) Visual similarity (e.g. Tiny Images, Scene Completion, im2gps, cg2real, DB photo enhancement, Virtual Photoreal Space, Total Recall) – Scene level similarity – Instance level similarity

Statistics from Large Photo Collections

Priors for Large Photo Collections and What They Reveal about Cameras. Sujit Kuthirummal, Aseem Agarwala, Dan B Goldman, and Shree K. Nayar ECCV 2008

im2gps Geographic Photo Density

Image Sequence Geolocation with Human Travel Priors Kalogerakis, Vesselova, Hays, Efros, Hertzmann. Image Sequence Geolocation with Human Travel Priors. ICCV 2009

Internet Imagery from metadata search

Building Rome in a Day Sameer Agarwal, University of Washington Yasutaka Furukawa, University of Washington Noah Snavely, Cornell University Ian Simon, University of Washington Steve Seitz, University of Washington Richard Szeliski, Microsoft Research

Sketch2photo

Internet Imagery from visual search

Distance Metrics = Euclidian distance of 5 units = Grayvalue distance of 50 values = ? x y x y

SSD says these are not similar ?

Tiny Images 80 million tiny images: a large dataset for non- parametric object and scene recognition Antonio Torralba, Rob Fergus and William T. Freeman. PAMI 2008.

Human Scene Recognition

Tiny Images Project Page

Powers of 10 Number of images on my hard drive: 10 4 Number of images seen during my first 10 years:10 8 (3 images/second * 60 * 60 * 16 * 365 * 10 = ) Number of images seen by all humanity: ,456,367,669 humans 1 * 60 years * 3 images/second * 60 * 60 * 16 * 365 = 1 from Number of photons in the universe: Number of all 32x32 images: *32*3 ~

Scenes are unique

But not all scenes are so original

How many images are there? Torralba, Fergus, Freeman. PAMI 2008