Internet-scale Imagery for Graphics and Vision James Hays cs129 Computational Photography Brown University, Spring 2011.

Slides:



Advertisements
Similar presentations
James Hays and Alexei A. Efros Carnegie Mellon University CVPR IM2GPS: estimating geographic information from a single image Wen-Tsai Huang.
Advertisements

Yan-Tao Zheng1, Ming Zhao2, Yang Song2, Hartwig Adam2 Ulrich Buddemeier2, Alessandro Bissacco2, Fernando Brucher2 Tat-Seng Chua1, and Hartmut Neven2 1.
Christie Tyler.  Online maps are searchable databases that can display various map data on a web page. ◦ Google Maps Live Search Maps (now Bing Maps)
Publishing and the Web. What do online customers want? The Google generation expect: To find everything quickly & efficiently Websites to be easy to use.
Building Rome in a Day Sameer Agarwal1 Noah Snavely2 Ian Simon1 Steven M. Seitz1 Richard Szeliski3 1University of Washington 2Cornell University 3Microsoft.
Data-driven Visual Similarity for Cross-domain Image Matching
Photosynth is an entirely new visual medium developed by Microsoft Live Labs. Photosynth is an entirely new visual medium developed by Microsoft Live.
Discrete-Continuous Optimization for Large-scale Structure from Motion David Crandall, Andrew Owens, Noah Snavely, Dan Huttenlocher Presented by: Rahul.
Large dataset for object and scene recognition A. Torralba, R. Fergus, W. T. Freeman 80 million tiny images Ron Yanovich Guy Peled.
1 Search Engines What is the Internet? The Web is only part of the Internet The Internet is a computer network connecting millions of computers.
Vision For Graphics ICCV 2005 Vision for Graphics Larry Zitnick, Sing Bing Kang, Rick Szeliski Interactive Visual Media Group Microsoft Research Steve.
Landmark Classification in Large- scale Image Collections Yunpeng Li David J. Crandall Daniel P. Huttenlocher ICCV 2009.
Tour the World: building a web-scale landmark recognition engine ICCV 2009 Yan-Tao Zheng1, Ming Zhao2, Yang Song2, Hartwig Adam2 Ulrich Buddemeier2, Alessandro.
Lecture 26: Vision for the Internet CS6670: Computer Vision Noah Snavely.
Retrieving Graphic Images From The Internet. Images, Subject, Search.
By: Mohamad Alhamada.. Table of Content. What is PhotoSynth? Using PhotoSynth. Features. In the media. Bibliography.
Let Computer Draw Qingyuan Kong. Goal Give me a picture “Obama stands in front of a pyramid”
Multi-view stereo Many slides adapted from S. Seitz.
A. Frank Multimedia Multimedia/Video Search. 2 A. Frank Contents Multimedia (MM) and search/retrieval Text-based MM search in General SEs Text-based MM.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Opportunities of Scale Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.
IS Today (Valacich & Schneider) 5/e Copyright © 2012 Pearson Education, Inc. Published as Prentice Hall 7/2/ Facebook is the most popular social.
Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.
Computer Vision and the Web Images on the Web Movies, news, sports 916,271 titles 10 mil. videos, 65,000 added daily 350 mil. photos, 1 mil. added daily.
Photosynth Brian Reagan MIS 304 Professor Fang Fang “Imagine being able to share the places and things you love using the cinematic quality of a movie,
Review: Binocular stereo If necessary, rectify the two stereo images to transform epipolar lines into scanlines For each pixel x in the first image Find.
Creating and Exploring a Large Photorealistic Virtual Space INRIA / CSAIL / Adobe First IEEE Workshop on Internet Vision, associated with CVPR 2008.
Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.
Towards Mobile Enhanced Digital Collections Tito Sierra and Markus Wust NCSU Libraries The Second International m-Libraries Conference June 24, 2009.
SBU Digital Media CSE 690 Internet Vision Organizational Meeting Tamara Berg Assistant Professor SUNY Stony Brook.
How Search Engines Work. Any ideas? Building an index Dan taylor Flickr Creative Commons.
Research Area B Leif Kobbelt. Communication System Interface Research Area B 2.
CS 1950-G Computational Photography Instructor: James Hays HTA: Patrick Doran UTA: Alex Collins.
Shin’ichi Satoh National Institute of Informatics.
Information Technology Industry Report Brown University ADSP Lab 余 渊 善
Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.
Human abilities Presented By Mahmoud Awadallah 1.
Food – Photo & Community Michael Wong 27 Nov 2012 Copyright © All rights reserved.
Peter Sturm INRIA Grenoble – Rhône-Alpes (Institut National de Recherche en Informatique et Automatique) An overview of multi-view stereo and other topics.
Point Processing (Szeliski 3.1) cs129: Computational Photography James Hays, Brown, Fall 2012 Some figures from Alexei Efros, Steve Seitz, and Gonzalez.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
What Is SEO? Search engine optimization (SEO) is the art and science of publishing and marketing information that ranks well for valuable keywords in.
Bibliography Examples. This is the basic shape of all bibliographies. Kind of looks like Oklahoma, doesn’t it?
UNBIASED LOOK AT DATASET BIAS Antonio Torralba Massachusetts Institute of Technology Alexei A. Efros Carnegie Mellon University CVPR 2011.
Visual Data on the Internet With slides from Alexei Efros, James Hays, Antonio Torralba, and Frederic Heger : Computational Photography Jean-Francois.
Photo Sharing. A popular photo sharing site Founded 2004 by 2 Canadians Stewart Butterfield Caterina Fake Bought by Yahoo in million users Well.
Lecture 8: Feature matching CS6670: Computer Vision Noah Snavely.
Computer Vision Overview Marc Schlosberg CS 175 – Spring 2015.
Pascal Kelm Technische Universität Berlin Communication Systems Group Thursday, 2 April 2009 Video Key Frame Extraction for image-based Applications.
Announcements No midterm Project 3 will be done in pairs same partners as for project 2.
Tentative Future Courses Fall `11 : Computer Vision – emphasis on recognition Spring `11 : Graduate seminar Fall `12 : Computational Photography.
3D Motion Classification Partial Image Retrieval and Download Multimedia Project Multimedia and Network Lab, Department of Computer Science.
Sheila K. Hoffman Ph.D. Candidate in Museology, Heritage and Cultural Mediation, UQÀM Ph.D. Candidate in Art History, Université de Paris I, Panthéon-Sorbonne.
Using Social Media to Promote Events By Alan Merriman DL231 N
Visual Information Retrieval
Home entertainment.
CS 4501: Introduction to Computer Vision Sparse Feature Detectors: Harris Corner, Difference of Gaussian Connelly Barnes Slides from Jason Lawrence, Fei.
Community and Social Impact
Community and Social Impact
Multi-Perspective Panoramas
Opportunities of Scale, Part 2
Towards Mobile Enhanced Digital Collections
Modeling the world with photos
Rob Fergus Computer Vision
Ramesh Jain Visual Intelligence Ramesh Jain
Overview of Multimedia Mapping Three technical sessions…
Using Today’s Technologies to Enhance Documentation of a Project
Lecture 23: Structure from motion 2
All About the Internet.
Point Processing cs129: Computational Photography
Presentation transcript:

Internet-scale Imagery for Graphics and Vision James Hays cs129 Computational Photography Brown University, Spring 2011

Big issues What is out there on the Internet? How do we get it? What can we do with it? How do we compute distances between images?

The Internet as a Data Source Social Networking Sites (e.g. Facebook, MySpace) Image Search Engines (e.g. Google, Bing) Photo Sharing Sites (e.g. Flickr, Picasa, Panoramio, photo.net, dpchallenge.com) Computer Vision Databases (e.g. CalTech 256, PASCAL VOC, LabelMe, Tiny Images, image- net.org, ESP game, Squigl, Matchin)

How Big is Flickr? As of 2010 Total content: – 5 billion photographs – 100+ million geotagged images Public content: – about 1/3 rd of images

How Annotated is Flickr? (tag search) Party – 7,355,998 Paris – 4,139,927 Chair – 232,885 Violin – 55,015 Trashcan – 9,818

Trashcan Results gs&z=t&page=5 gs&z=t&page=5

Different ways to leverage Internet Data Aggregate Statistics (e.g. Photo collection priors, Image sequence geolocation) Text keywords, other metadata (e.g. Phototourism, Photo Clip Art, sketch2photo) Visual similarity (e.g. Tiny Images, Scene Completion, im2gps, cg2real, DB photo enhancement, Virtual Photoreal Space, Total Recall) – Scene level similarity – Instance level similarity

Statistics from Large Photo Collections

Priors for Large Photo Collections and What They Reveal about Cameras. Sujit Kuthirummal, Aseem Agarwala, Dan B Goldman, and Shree K. Nayar ECCV 2008

im2gps Geographic Photo Density

Image Sequence Geolocation with Human Travel Priors Kalogerakis, Vesselova, Hays, Efros, Hertzmann. Image Sequence Geolocation with Human Travel Priors. ICCV 2009

Internet Imagery from metadata search

Building Rome in a Day Sameer Agarwal, University of Washington Yasutaka Furukawa, University of Washington Noah Snavely, Cornell University Ian Simon, University of Washington Steve Seitz, University of Washington Richard Szeliski, Microsoft Research

Sketch2photo