Opportunities of Scale, Part 2

Slides:

Advertisements

Similar presentations

James Hays and Alexei A. Efros Carnegie Mellon University CVPR IM2GPS: estimating geographic information from a single image Wen-Tsai Huang.

Advertisements

Data-driven methods: Texture (Sz 10.5) Cs129 Computational Photography James Hays, Brown, Spring 2011 Many slides from Alexei Efros.

Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.

Patch to the Future: Unsupervised Visual Prediction

Computer Vision for Human-Computer InteractionResearch Group, Universität Karlsruhe (TH) cv:hci Dr. Edgar Seemann 1 Computer Vision: Histograms of Oriented.

Computer Vision – Image Representation (Histograms)

IMAGE RESTORATION AND REALISM MILLIONS OF IMAGES SEMINAR YUVAL RADO.

Large dataset for object and scene recognition A. Torralba, R. Fergus, W. T. Freeman 80 million tiny images Ron Yanovich Guy Peled.

Recognition: A machine learning approach

Event prediction CS 590v. Applications Video search Surveillance – Detecting suspicious activities – Illegally parked cars – Abandoned bags Intelligent.

Small Codes and Large Image Databases for Recognition CVPR 2008 Antonio Torralba, MIT Rob Fergus, NYU Yair Weiss, Hebrew University.

Fast and Compact Retrieval Methods in Computer Vision Part II A. Torralba, R. Fergus and Y. Weiss. Small Codes and Large Image Databases for Recognition.

Object Recognition with Informative Features and Linear Classification Authors: Vidal-Naquet & Ullman Presenter: David Bradley.

Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

Let Computer Draw Qingyuan Kong. Goal Give me a picture “Obama stands in front of a pyramid”

Highlights Lecture on the image part (10) Automatic Perception 16

Extreme, Non-parametric Object Recognition 80 million tiny images (Torralba et al)

Opportunities of Scale Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.

Opportunities of Scale, Part 2 Computer Vision James Hays, Brown Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba.

Understanding Faces Computational Photography

Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.

SBU Digital Media CSE 690 Internet Vision Organizational Meeting Tamara Berg Assistant Professor SUNY Stony Brook.

Computer Vision. DARPA Challenge Seeks Robots To Drive Into Disasters. DARPA's Robotics Challenge offers a $2 million prize if you can build a robot capable.

Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.

Human abilities Presented By Mahmoud Awadallah 1.

Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.

September 23, 2014Computer Vision Lecture 5: Binary Image Processing 1 Binary Images Binary images are grayscale images with only two possible levels of.

80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.

UNBIASED LOOK AT DATASET BIAS Antonio Torralba Massachusetts Institute of Technology Alexei A. Efros Carnegie Mellon University CVPR 2011.

Putting Context into Vision Derek Hoiem September 15, 2004.

School of Engineering and Computer Science Victoria University of Wellington Copyright: Peter Andreae, VUW Image Recognition COMP # 18.

Scene Completion Using Millions of Photographs James Hays, Alexei A. Efros Carnegie Mellon University ACM SIGGRAPH 2007.

Epitomic Location Recognition A generative approach for location recognition K. Ni, A. Kannan, A. Criminisi and J. Winn In proc. CVPR Anchorage,

Visual Data on the Internet With slides from Alexei Efros, James Hays, Antonio Torralba, and Frederic Heger : Computational Photography Jean-Francois.

CS 1699: Intro to Computer Vision Support Vector Machines Prof. Adriana Kovashka University of Pittsburgh October 29, 2015.

2D Texture Synthesis Instructor: Yizhou Yu. Texture synthesis Goal: increase texture resolution yet keep local texture variation.

Visual Computing Computer Vision 2 INFO410 & INFO350 S2 2015

SUN Database: Large-scale Scene Recognition from Abbey to Zoo Jianxiong Xiao *James Haysy Krista A. Ehinger Aude Oliva Antonio Torralba Massachusetts Institute.

Internet-scale Imagery for Graphics and Vision James Hays cs129 Computational Photography Brown University, Spring 2011.

Introduction to Recognition CS4670/5670: Intro to Computer Vision Noah Snavely mountain building tree banner vendor people street lamp.

Tentative Future Courses Fall `11 : Computer Vision – emphasis on recognition Spring `11 : Graduate seminar Fall `12 : Computational Photography.

Course : T Computer Vision

Announcements Project 4 out today help session at the end of class.

A Personal Tour of Machine Learning and Its Applications

CS 4501: Introduction to Computer Vision Sparse Feature Detectors: Harris Corner, Difference of Gaussian Connelly Barnes Slides from Jason Lawrence, Fei.

Data Driven Attributes for Action Detection

Lecture 26 Hand Pose Estimation Using a Database of Hand Images

Lecture 25: Introduction to Recognition

Face Recognition and Feature Subspaces

Computational Photography Derek Hoiem, University of Illinois

Lecture 14: Introduction to Recognition

Digit Recognition using SVMS

Li Fei-Fei, UIUC Rob Fergus, MIT Antonio Torralba, MIT

An opposition to: Context-Based Vision System for Place and Object Recognition Contextual Models for Object Detection Using BRFs.

Object detection as supervised classification

Enhanced-alignment Measure for Binary Foreground Map Evaluation

Computer Vision Lecture 5: Binary Image Processing

Lecture 25: Introduction to Recognition

Data-driven methods: Texture 2 (Sz 10.5)

CS 1674: Intro to Computer Vision Scene Recognition

ICCV 2009 Kyoto, Short Course, September 24

Rob Fergus Computer Vision

Brief Review of Recognition + Context

Image Quilting for Texture Synthesis & Transfer

Announcements Guest lecture next Tuesday

MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING

CS4670: Intro to Computer Vision

Midterm Exam Closed book, notes, computer Similar to test 1 in format:

Midterm Exam Closed book, notes, computer Similar to test 1 in format:

Presentation transcript:

Opportunities of Scale, Part 2 Computer Vision James Hays Many slides from James Hays, Alyosha Efros, and Derek Hoiem Graphic from Antonio Torralba

Recap Opportunities of Scale: Data-driven methods Previous Lecture The unreasonable effectiveness of data Scene completion Today Im2gps Recognition via Tiny Images Project 5 Intro

The Unreasonable Effectiveness of Math “The miracle of the appropriateness of the language of mathematics…” Eugene Wigner “The most incomprehensible thing about the universe is that it is comprehensible.” Albert Einstein “There is only one thing which is more unreasonable than the unreasonable effectiveness of mathematics in physics, and this is the unreasonable ineffectiveness of mathematics in biology.” Israel Gelfand “We should stop acting as if our goal is to author extremely elegant theories, and instead embrace complexity and make use of the best ally we have: the unreasonable effectiveness of data.” Peter Norvig

Info from Most Similar Images General Principal Input Image Images Associated Info Huge Dataset Info from Most Similar Images image matching Hopefully, If you have enough images, the dataset will contain very similar images that you can find with simple matching methods.

… 200 total 5

Graph cut + Poisson blending

im2gps (Hays & Efros, CVPR 2008) 6 million geo-tagged Flickr images http://graphics.cs.cmu.edu/projects/im2gps/

How much can an image tell about its geographic location?

Nearest Neighbors according to gist + bag of SIFT + color histogram + a few others

Im2gps

Example Scene Matches

Voting Scheme Probability map for that same image Explain the green circles and how they relate to evaluation 13

im2gps

Effect of Dataset Size

Population density ranking High Predicted Density Low Predicted Density

Where is This? [Olga Vesselova, Vangelis Kalogerakis, Aaron Hertzmann, James Hays, Alexei A. Efros. Image Sequence Geolocation. ICCV’09] 19

Where is This?

Where are These? 15:14, June 18th, 2006 16:31, June 18th, 2006 Individually each photograph is 16%, but together they reinforce each other well 15:14, June 18th, 2006 16:31, June 18th, 2006 21

Where are These? 15:14, June 18th, 2006 16:31, June 18th, 2006 If you know that brits love to vacation in greece… 15:14, June 18th, 2006 16:31, June 18th, 2006 17:24, June 19th, 2006 22

Results im2gps – 10% (geo-loc within 400 km) temporal im2gps – 56%

Tiny Images 80 million tiny images: a large dataset for non-parametric object and scene recognition Antonio Torralba, Rob Fergus and William T. Freeman. PAMI 2008. http://groups.csail.mit.edu/vision/TinyImages/

Human Scene Recognition

Humans vs. Computers: Car-Image Classification Various computer vision algorithms for full resolution images Humans for 32 pixel tall images

Powers of 10 Number of images on my hard drive: 104 Number of images seen during my first 10 years: 108 (3 images/second * 60 * 60 * 16 * 365 * 10 = 630720000) Number of images seen by all humanity: 1020 106,456,367,669 humans1 * 60 years * 3 images/second * 60 * 60 * 16 * 365 = 1 from http://www.prb.org/Articles/2002/HowManyPeopleHaveEverLivedonEarth.aspx Number of photons in the universe: 1088 so don’t listen to anyone claiming to brute force image space But most of image space is empty, never observed. If you were to run the rand() command in Matlab, how often would it generate a plausible image? Number of all 32x32 images: 107373 256 32*32*3 ~ 107373

Scenes are unique Scene and object recognition at human performance is very hard because we are able to deal with so many different kinds of scenes… Scenes are so unique.

But not all scenes are so original But most scenes are not really like that. Most of the time, scenes are boring places in which nothing really surprising happens. For instance, this street… there is nothing really special about it. In fact, I can easily find thousands of other street pictures that look just like this. Of course, these scenes are different in the details, just as two faces are different, but at a coarser level they are form by similar objects arranged in a similar spatial configuration.

Lots Of Images A. Torralba, R. Fergus, W.T.Freeman. PAMI 2008

Lots Of Images A. Torralba, R. Fergus, W.T.Freeman. PAMI 2008

Lots Of Images

Application: Automatic Colorization Color Transfer Input Color Transfer Matches (gray) Matches (w/ color) Avg Color of Match

Application: Automatic Colorization Color Transfer Input Color Transfer Matches (gray) Matches (w/ color) Avg Color of Match

Automatic Orientation Examples Based on average correlation to the top 50 nearest neighbors. Note that this doesn’t really beat just assuming that the image is upright. A. Torralba, R. Fergus, W.T.Freeman. 2008 39

Summary With billions of images on the web, it’s often possible to find a close nearest neighbor In such cases, we can shortcut hard problems by “looking up” the answer, stealing the labels from our nearest neighbor For example, simple (or learned) associations can be used to synthesize background regions, colorize, or recognize objects

Project 5 http://www.cc.gatech.edu/~hays/compvision/proj5/