Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li and Li Fei-Fei Dept. of Computer Science, Princeton University, USA CVPR 2009 2015-11-11ImageNet1.

Slides:



Advertisements
Similar presentations
Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos.
Advertisements

Scalable Multi-Label Annotation Jia Deng Olga Russakovsky Jonathan Krause, Michael Bernstein Alexander Berg Li Fei-Fei.
Incentivize Crowd Labeling under Budget Constraint
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets Chun Chen 1, Feng Li 2, Beng Chin Ooi 2, and Sai Wu 2 1 Zhejiang University, 2 National.
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
Data Mining Classification: Alternative Techniques
3 Small Comments Alex Berg Stony Brook University I work on recognition: features – action recognition – alignment – detection – attributes – hierarchical.
Large Scale Visual Recognition Challenge 2011 Alex BergStony Brook Jia DengStanford & Princeton Sanjeev SatheeshStanford Hao SuStanford Fei-Fei LiStanford.
›SIFT features [3] were computed for 100 images (from ImageNet [4]) for each of our 48 subordinate-level categories. ›The visual features of an image were.
Parsing Clothing in Fashion Photographs
Stephan Gammeter, Lukas Bossard, Till Quack, Luc Van Gool.
Large dataset for object and scene recognition A. Torralba, R. Fergus, W. T. Freeman 80 million tiny images Ron Yanovich Guy Peled.
Li-Jia Li Yongwhan Lim Li Fei-Fei Chong Wang David M. Blei B UILDING AND U SING A S EMANTIVISUAL I MAGE H IERARCHY CVPR, 2010.
Presented by Zeehasham Rasheed
Predicting Matchability - CVPR 2014 Paper -
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
ImageNet: A Large-Scale Hierarchical Image Database
Joint Image Clustering and Labeling by Matrix Factorization
Yin Yang (Hong Kong University of Science and Technology) Nilesh Bansal (University of Toronto) Wisam Dakka (Google) Panagiotis Ipeirotis (New York University)
School of Electronic Information Engineering, Tianjin University Human Action Recognition by Learning Bases of Action Attributes and Parts Jia pingping.
Mining and Summarizing Customer Reviews
Language Identification of Search Engine Queries Hakan Ceylan Yookyung Kim Department of Computer Science Yahoo! Inc. University of North Texas 2821 Mission.
Human abilities Presented By Mahmoud Awadallah 1.
The identification of interesting web sites Presented by Xiaoshu Cai.
Accessing the Deep Web Bin He IBM Almaden Research Center in San Jose, CA Mitesh Patel Microsoft Corporation Zhen Zhang computer science at the University.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
Tag Ranking Present by Jie Xiao Dept. of Computer Science Univ. of Texas at San Antonio.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
Presenter: Lung-Hao Lee ( 李龍豪 ) January 7, 309.
A Language Independent Method for Question Classification COLING 2004.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Collective Vision: Using Extremely Large Photograph Collections Mark Lenz CameraNet Seminar University of Wisconsin – Madison February 2, 2010 Acknowledgments:
Web Image Retrieval Re-Ranking with Relevance Model Wei-Hao Lin, Rong Jin, Alexander Hauptmann Language Technologies Institute School of Computer Science.
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Automatic Set Instance Extraction using the Web Richard C. Wang and William W. Cohen Language Technologies Institute Carnegie Mellon University Pittsburgh,
Understanding User’s Query Intent with Wikipedia G 여 승 후.
School of Engineering and Computer Science Victoria University of Wellington Copyright: Peter Andreae, VUW Image Recognition COMP # 18.
Study of Protein Prediction Related Problems Ph.D. candidate Le-Yi WEI 1.
C. Lawrence Zitnick Microsoft Research, Redmond Devi Parikh Virginia Tech Bringing Semantics Into Focus Using Visual.
Truth Discovery with Multiple Conflicting Information Providers on the Web KDD 07.
Algorithmic Detection of Semantic Similarity WWW 2005.
1 CSC 594 Topics in AI – Text Mining and Analytics Fall 2015/16 3. Word Association.
Topic Models Presented by Iulian Pruteanu Friday, July 28 th, 2006.
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
Image Classification for Automatic Annotation
Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
Recognition Using Visual Phrases
PhD Dissertation Defense Scaling Up Machine Learning Algorithms to Handle Big Data BY KHALIFEH ALJADDA ADVISOR: PROFESSOR JOHN A. MILLER DEC-2014 Computer.
Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework N 工科所 錢雅馨 2011/01/16 Li-Jia Li, Richard.
The Cross Language Image Retrieval Track: ImageCLEF Breakout session discussion.
Annotation Framework & ImageCLEF 2014 JAN BOTOREK, PETRA BUDÍKOVÁ
From Words to Senses: A Case Study of Subjectivity Recognition Author: Fangzhong Su & Katja Markert (University of Leeds, UK) Source: COLING 2008 Reporter:
2/10/2016Semantic Similarity1 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Refined Online Citation Matching and Adaptive Canonical Metadata Construction CSE 598B Course Project Report Huajing Li.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň,
CiteData: A New Multi-Faceted Dataset for Evaluating Personalized Search Performance CIKM’10 Advisor : Jia-Ling, Koh Speaker : Po-Hsien, Shih.
Semi-Supervised Recognition of Sarcastic Sentences in Twitter and Amazon -Smit Shilu.
Using ODP Metadata to Personalize Search University of Seoul Computer Science Database Lab. Min Mi-young.
Saliency detection Donghun Yeo CV Lab..
Quanzeng You, Jiebo Luo, Hailin Jin and Jianchao Yang
a chicken and egg problem…
Giannis Varelas Epimenidis Voutsakis Paraskevi Raftopoulou
Presentation transcript:

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li and Li Fei-Fei Dept. of Computer Science, Princeton University, USA CVPR ImageNet1 Jiewen Lei

 This paper is mainly an introduction to ImageNet. The paper is organized as follows: 1. shows properties of ImageNet 2. Compare ImageNet with current related datasets 3. Constructing ImageNet - describes without concrete steps 4. ImageNet Applications  mainly focus on the constructing ImageNet.  It mostly relatives to Crawling and PageRank ImageNet2

 A dataset - Datasets and Computer Vision  Based on WordNet - Each node is depicted by images  A knowledge ontology - Taxonomy - Partonomy ImageNet3

 2-step process ImageNet4 Step 1 : Collect candidate images Via the Internet Step 2 : Clean up the candidate Images by humans

 For each synset, the queries are the set of WordNet synonyms  Accuracy of Internet Image search results: 10 % - For clean images, needs 10K images  Query expansion - Synonyms: German police dog, German shepherd dog - Appending words form ancestors: sheepdog, dog  Multiple Languages - Italian, Dutch, Spanish, Chinese e.g. 德国牧羊犬, pastore tedesco  More engines: Yahoo!, flickr, Google  Parallel downloading ImageNet5

 Rely on humans to verify each candidate image collected for a given synset  Amazon Mechanical Turk (AMT) - used for labeling vision data images: 0.02 dollar - 14,197,122 images: 946 dollars - 10 repetition: 9460 dollars - Jul Apr 2010:11 million images  Present the users with a set of candidate images and the definition of the target synset  let users select the best match ones ImageNet6

ImageNet7 Workers do annotation on AMT -Multiple annotations for each images Annotation Results - An average of > 97% accuracy

 Users Enhancement - Provide wiki and google links for definitions - Make sure workers read the definition - Definition quiz - Allow more feedback. E.g. “unimagable synset” expert opinion ImageNet8

 Human users make mistakes  Not all users follow the instructions  Users do not always agree with each other - Subtle or confusing synsets, e.g. Burmese cat  Quality Control System ImageNet9

 randomly sample an initial subset of image to users - Have multiple users independently label same image  obtain a confidence score table, indicating the probability of an image being a good image given the user votes - Different categories requires different levels of consensus  Proceed until a pre-determined confidence score threshold reached ImageNet10

 Scale: 12 subtrees,3,2 million images,5247 categories  Hierarchy: densely populated semantic hierarchy, based on WordNet ImageNet11

 Accuracy: clean dataset at all level ImageNet12  Diversity: variable appearances, positions, view points, poses, background clutter, occlusions.

 Non-parametric Object Recognition 1. NN-voting + noisy ImageNet 2. NN-voting + clean ImageNet 3. Naive Bayesian Nearest Neighbor (NBNN) 4. NBNN-100  Tree Based Image Classification  Automatic Object Localization ImageNet13

 Pros 1. Crowdsourcing 2. Benchmarking 3. Open: Download Original Images, URLs, Features, Object Attributes, API  Cons 1. Improve algorithm: PageRank 2. AMT: hierarchical users based on their ability 3. Only one tag per image ImageNet14