3 Small Comments Alex Berg Stony Brook University I work on recognition: features – action recognition – alignment – detection – attributes – hierarchical.

Slides:



Advertisements
Similar presentations
Object recognition and scene “understanding”
Advertisements

Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
Classification using intersection kernel SVMs is efficient Joint work with Subhransu Maji and Alex Berg Jitendra Malik UC Berkeley.
Face Alignment by Explicit Shape Regression
Human Action Recognition by Learning Bases of Action Attributes and Parts.
Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.
IT’S NOT POLITE TO POINT: DESCRIBING PEOPLE WITH UNCERTAIN ATTRIBUTES CVPR 2013 Poster.
Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.
Large-Scale Object Recognition with Weak Supervision
Contour Based Approaches for Visual Object Recognition Jamie Shotton University of Cambridge Joint work with Roberto Cipolla, Andrew Blake.
Biased Normalized Cuts 1 Subhransu Maji and Jithndra Malik University of California, Berkeley IEEE Conference on Computer Vision and Pattern Recognition.
Fast intersection kernel SVMs for Realtime Object Detection
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Statistical Recognition Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and Kristen Grauman.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Object Recognition: Conceptual Issues Slides adapted from Fei-Fei Li, Rob Fergus, Antonio Torralba, and K. Grauman.
Viola and Jones Object Detector Ruxandra Paun EE/CS/CNS Presentation
PANDA: Pose Aligned Networks for Deep Attribute Modeling Ning Zhang1;2, Manohar Paluri1, Marc’Aurelio Ranzato1, Trevor Darrell2, Lubomir Bourdev1 1: Facebook.
MACHINE LEARNING AND ARTIFICIAL NEURAL NETWORKS FOR FACE VERIFICATION
SBU Digital Media CSE 690 Internet Vision Organizational Meeting Tamara Berg Assistant Professor SUNY Stony Brook.
Wang, Z., et al. Presented by: Kayla Henneman October 27, 2014 WHO IS HERE: LOCATION AWARE FACE RECOGNITION.
Exercise Session 10 – Image Categorization
Multiclass object recognition
Efficient Algorithms for Matching Pedro Felzenszwalb Trevor Darrell Yann LeCun Alex Berg.
Computer vision.
Kuan-Chuan Peng Tsuhan Chen
Studying Relationships Between Human Gaze, Description, and Computer Vision Kiwon Yun 1, Yifan Peng 1 Dimitris Samaras 1, Gregory J. Zelinsky 1,2, Tamara.
Object Bank Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 4 th, 2013.
Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.
Why Categorize in Computer Vision ?. Why Use Categories? People love categories!
Compiled By: Raj G Tiwari.  A pattern is an object, process or event that can be given a name.  A pattern class (or category) is a set of patterns sharing.
Model comparison and challenges II Compositional bias of salient object detection benchmarking Xiaodi Hou K-Lab, Computation and Neural Systems California.
Sign Classification Boosted Cascade of Classifiers using University of Southern California Thang Dinh Eunyoung Kim
Yao, B., and Fei-fei, L. IEEE Transactions on PAMI(2012)
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
2009 ML Project: Goal: Do some real machine learning… A project you are interested in works better Data is often the hard part (get it in plenty of time)
Representations for object class recognition David Lowe Department of Computer Science University of British Columbia Vancouver, Canada Sept. 21, 2006.
Deformable Part Model Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 11 st, 2013.
BAGGING ALGORITHM, ONLINE BOOSTING AND VISION Se – Hoon Park.
1 Pattern Recognition Pattern recognition is: 1. A research area in which patterns in data are found, recognized, discovered, …whatever. 2. A catchall.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
UNBIASED LOOK AT DATASET BIAS Antonio Torralba Massachusetts Institute of Technology Alexei A. Efros Carnegie Mellon University CVPR 2011.
Fitting: The Hough transform
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
Recognition Using Visual Phrases
Learning Features and Parts for Fine-Grained Recognition Authors: Jonathan Krause, Timnit Gebru, Jia Deng, Li-Jia Li, Li Fei-Fei ICPR, 2014 Presented by:
COMP24111: Machine Learning Ensemble Models Gavin Brown
Machine Learning in Practice Lecture 2 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Face detection Many slides adapted from P. Viola.
PANDA: Pose Aligned Networks for Deep Attribute Modeling Ning Zhang 1,2 Manohar Paluri 1 Marć Aurelio Ranzato 1 Trevor Darrell 2 Lumbomir Boudev 1 1 Facebook.
Week 3 Emily Hand UNR. Online Multiple Instance Learning The goal of MIL is to classify unseen bags, instances, by using the labeled bags as training.
Zhuode Liu 2016/2/13 University of Texas at Austin CS 381V: Visual Recognition Discovering the Spatial Extent of Relative Attributes Xiao and Lee, ICCV.
2. Skin - color filtering.
The Relationship between Deep Learning and Brain Function
From Vision to Grasping: Adapting Visual Networks
Supervised Time Series Pattern Discovery through Local Importance
Recognition using Nearest Neighbor (or kNN)
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
What is Pattern Recognition?
Finding Clusters within a Class to Improve Classification Accuracy
Bird-species Recognition Using Convolutional Neural Network
Attributes and Simile Classifiers for Face Verification
Image Classification.
Large Scale Image Deduplication
Objects as Attributes for Scene Classification
A Proposal Defense On Deep Residual Network For Face Recognition Presented By SAGAR MISHRA MECE
Using Natural Language Processing to Aid Computer Vision
Presentation transcript:

3 Small Comments Alex Berg Stony Brook University I work on recognition: features – action recognition – alignment – detection – attributes – hierarchical image classification & retrieval + machine learning for large scale recognition Collaborations: words & pictures – ImageNet – human visual search – neural coding of visual memory

0. Good computer vision tells us something about the structure of the visual world or about our descriptions of the visual world. 1.We should know what we are recognizing, and be able to prove it! 2.Large datasets can be gallant but doomed attempts to avoid hard representation problems. (Small datasets are worse.) 3.We should try to understand uncertainty in computer vision. Alex Berg – Stony Brook University 3 Small Comments

The structure of the visual world is complex. So are the ways we describe it. Recognition experiments probe mappings from samples of the visual world to descriptions. Alex Berg – Stony Brook University 0. Good computer vision tells us something about the structure of the visual world.

This is a PR problem! Advertise our successes and abilities, but avoid over-selling. Example Given training and evaluation of an X detector on my own dataset: IdealIf your dataset has Xs, then my algorithm can detect them. GreatIf people can detect Xs in your dataset, then my algorithm will detect them. GoodI can predict performance on your dataset based on a statistical characterization. Okay I can very roughly predict performance on your dataset based on a characterization. Now If I were to run my algorithm on your dataset, we could determine performance. Bad I don’t know how it works on my dataset! Alex Berg – Stony Brook University 1. We should know what we are recognizing, and be able to prove it!

Describable Visual Attributes for Face Verification and Image Search N. Kumar, A.C. Berg, P.N. Belhumeur, S.K. Nayar (T.PAMI 2011) Verification classifier

6 LFW Results In 2009

Describable Visual Attributes for Face Verification and Image Search N. Kumar, A.C. Berg, P.N. Belhumeur, S.K. Nayar Verification classifier

2. Large datasets can be gallant but doomed attempts to avoid hard representation problems. Alex Berg – Stony Brook University w/ Li Fei-Fei & Jia Stanford Efficient Additive Models for Detection & Classification w/ Subhransu UCB −> TTI-C Test Image Large Dataset There comes a point when it is necessary to go into more detail – this is the regime of mid-level vision. More data provides better joint statistics, but is enough only sometimes. There is some boost from looking at large data, but, to do well we still need to address hard (mid-level) representation problems. Might hope that matching/classifying a whole image / pattern / large scale “just works” for recognition. Large Scale Recognition Challenge going on now (part of PASCAL VOC) Image Classification + Object Detection! (Small datasets are worse!)

3. We should try to understand uncertainty in computer vision. Alex Berg – Stony Brook University 1.Need an explicit idea of what is possible/likely given observations. 2.We need this for low, mid, and high level vision. 3.This is a difficult representational challenge – mainly because of complex structure.

What does classifying more than 10,000 image categories tell us? J. Deng, A.C. Berg, K. Li, L. Fei-Fei (ECCV 2010) Correlation between CV classifier confusions and WordNet!

0. Good computer vision tells us something about the structure of the visual world or about our descriptions of the visual world. 1.We should know what we are recognizing, and be able to prove it! 2.Large datasets can be gallant but doomed attempts to avoid hard representation problems. (Small datasets are worse.) 3.We should try to understand uncertainty in computer vision. Alex Berg – Stony Brook University 3 Small Comments