Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň, 16. 4. 2014.

Slides:



Advertisements
Similar presentations
Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos.
Advertisements

Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme Presented by Smitashree Choudhury.
Search in Source Code Based on Identifying Popular Fragments Eduard Kuric and Mária Bieliková Faculty of Informatics and Information.
ImageCLEF breakout session Please help us to prepare ImageCLEF2010.
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
Li-Jia Li Yongwhan Lim Li Fei-Fei Chong Wang David M. Blei B UILDING AND U SING A S EMANTIVISUAL I MAGE H IERARCHY CVPR, 2010.
Object retrieval with large vocabularies and fast spatial matching
PROBLEM BEING ATTEMPTED Privacy -Enhancing Personalized Web Search Based on:  User's Existing Private Data Browsing History s Recent Documents 
Video retrieval using inference network A.Graves, M. Lalmas In Sig IR 02.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Sentiment Lexicon Creation from Lexical Resources BIS 2011 Bas Heerschop Erasmus School of Economics Erasmus University Rotterdam
MANISHA VERMA, VASUDEVA VARMA PATENT SEARCH USING IPC CLASSIFICATION VECTORS.
Designing clustering methods for ontology building: The Mo’K workbench Authors: Gilles Bisson, Claire Nédellec and Dolores Cañamero Presenter: Ovidiu Fortu.
Presented by Zeehasham Rasheed
1 Web Query Classification Query Classification Task: map queries to concepts Application: Paid advertisement 问题:百度 /Google 怎么赚钱?
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Online Learning for Web Query Generation: Finding Documents Matching a Minority Concept on the Web Rayid Ghani Accenture Technology Labs, USA Rosie Jones.
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
Information Retrieval
Memoplex Browser: Searching and Browsing in Semantic Networks CPSC 533C - Project Update Yoel Lanir.
Information Retrieval in Practice
DOG I : an Annotation System for Images of Dog Breeds Antonis Dimas Pyrros Koletsis Euripides Petrakis Intelligent Systems Laboratory Technical University.
Information Retrieval – and projects we have done. Group Members: Aditya Tiwari ( ) Harshit Mittal ( ) Rohit Kumar Saraf ( ) Vinay.
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
1 Wikification CSE 6339 (Section 002) Abhijit Tendulkar.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Detecting Semantic Cloaking on the Web Baoning Wu and Brian D. Davison Lehigh University, USA WWW 2006.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Query Processing In Multimedia Databases Dheeraj Kumar Mekala Devarasetty Bhanu Kiran.
CROSSMARC Web Pages Collection: Crawling and Spidering Components Vangelis Karkaletsis Institute of Informatics & Telecommunications NCSR “Demokritos”
Mining the Web to Create Minority Language Corpora Rayid Ghani Accenture Technology Labs - Research Rosie Jones Carnegie Mellon University Dunja Mladenic.
Péter Schönhofen – Ad Hoc Hungarian → English – CLEF Workshop 20 Sep 2007 Performing Cross-Language Retrieval with Wikipedia Participation report for Ad.
SYMPOSIUM ON SEMANTICS IN SYSTEMS FOR TEXT PROCESSING September 22-24, Venice, Italy Combining Knowledge-based Methods and Supervised Learning for.
Describing Images using Inferred Visual Dependency Representations Authors : Desmond Elliot & Arjen P. de Vries Presentation of Paper by : Jantre Sanket.
10/22/2015ACM WIDM'20051 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis Voutsakis.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Date : 2012/10/25 Author : Yosi Mass, Yehoshua Sagiv Source : WSDM’12 Speaker : Er-Gang Liu Advisor : Dr. Jia-ling Koh 1.
Unsupervised Learning of Visual Sense Models for Polysemous Words Kate Saenko Trevor Darrell Deepak.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
Beyond Nouns Exploiting Preposition and Comparative adjectives for learning visual classifiers.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li and Li Fei-Fei Dept. of Computer Science, Princeton University, USA CVPR ImageNet1.
Wikipedia as Sense Inventory to Improve Diversity in Web Search Results Celina SantamariaJulio GonzaloJavier Artiles nlp.uned.es UNED,c/Juan del Rosal,
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
From Text to Image: Generating Visual Query for Image Retrieval Wen-Cheng Lin, Yih-Chen Chang and Hsin-Hsi Chen Department of Computer Science and Information.
Neural networks in modern image processing Petra Budíková DISA seminar,
Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Generating Query Substitutions Alicia Wood. What is the problem to be solved?
Annotation Framework & ImageCLEF 2014 JAN BOTOREK, PETRA BUDÍKOVÁ
Using Wikipedia for Hierarchical Finer Categorization of Named Entities Aasish Pappu Language Technologies Institute Carnegie Mellon University PACLIC.
Relevance Models and Answer Granularity for Question Answering W. Bruce Croft and James Allan CIIR University of Massachusetts, Amherst.
Combining Text and Image Queries at ImageCLEF2005: A Corpus-Based Relevance-Feedback Approach Yih-Cheng Chang Department of Computer Science and Information.
2/10/2016Semantic Similarity1 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis.
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
1 Random Walks on the Click Graph Nick Craswell and Martin Szummer Microsoft Research Cambridge SIGIR 2007.
Chapter 6. Inference beyond the index 2007 년 1 월 30 일 부산대학교 인공지능연구실 김민호 Text : FINDING OUT ABOUT Page. 182 ~ 251.
Constructing A Yami Language Lexicon Database from Yami Archiving Projects Meng-Chien Yang(Providence University, Taiwan) D. Victoria Rau(National Chung.
Scalable Image Annotation with ConceptRank Petra Budíková, Michal Batko, Pavel Zezula.
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Information Retrieval
Presentation transcript:

Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň,

CEMI, Plzeň, Slide 2 of 11  Formalization  The annotation problem is defined by a query image I and a vocabulary V of candidate concepts  The annotation function f A assigns to each concept c ∈ V its probability of being relevant for I The annotation problem ? V = {flower, animal, person, building}  Basic possible approaches  Model-based annotation  Train classifiers  Suitable for tasks with smaller dictionaries and available training images (e.g. medical image classification)  Search-based annotation  Exploit results of similarity search in annotated images  Suitable for tasks with wide dictionaries (e.g. image annotation for web search)

CEMI, Plzeň, Slide 3 of 11 Search-based annotation in a nutshell

CEMI, Plzeň, Slide 4 of 11 Our vision  Next generation of similarity-based annotation  Similarity searching  Text cleaning  Semantic information extraction  Classifiers  Relevance feedback

CEMI, Plzeň, Slide 5 of 11 MUFIN Image Annotation  Already done (paper IDEAS 2013):  Modular framework for annotation processing  Implementation of basic modules  Similarity search, text cleaning, basic WordNet-based semantic processing  Working system for keyword annotation with % precision  Vocabulary V = all English words  Problems  Not precise enough  Results too unstructured for practical use  Difficult to evaluate

CEMI, Plzeň, Slide 6 of 11 Current focus  Hierarchical approach  Vocabulary hierarchically organized  WordNet hypernymy/hyponymy tree, ontology  Semantics-aware processing of similar images’ descriptions  Study and exploit suitable resources of semantic information  Determine the relevance of candidate concepts with respect to semantic relationships  ImageCLEF evaluation  ImageCLEF2014: scalability-oriented, no manually labelled training data  100 test concepts, provided with links to WordNet synsets

CEMI, Plzeň, Slide 7 of 11 ConceptRank  Inspiration: PageRank  Importance of a page is derived from the importance of pages that link to it  Linear iterated process, modelled as a Markov system  Random restarts to avoid “rank sinks”  ConceptRank idea: Semantic ranking of WordNet synsets  A Markov system, nodes are formed by WordNet synsets  Links between nodes connected by some WordNet relationship  Weighted according to the type of the relationship  Random restarts are not weighted uniformly, but reflect the initial weights of synsets as determined by similarity searching

CEMI, Plzeň, Slide 8 of 11 ConceptRank illustration

CEMI, Plzeň, Slide 9 of 11 ConceptRank Resources  Content-based image retrieval  powered by MUFIN  20M Profiset collection, 250K ImageCLEF training data  WordNet  Standard relationships (hypernymy, antonymy, part-whole, gloss overlap, …)  Word similarity metrics defined on top of hyponymy/hypernymy tree  the “language” point of view  Visual Concept Ontology (VCO)  Semantic hierarchy of most common visual concepts, linked to WordNet  VCO sub-trees are used to limit the search for WordNet relationships  Co-occurrence lists for keywords from Profimedia dataset  Constructed from very large text corpus (linguists from MFF UK)  Corpus size approximately 1 billion words  “human/database” point of view

CEMI, Plzeň, Slide 10 of 11 Cooperation with other CEMI teams  UFAL  Information about keyword co-occurrence in text corpora  Already part of MUFIN Image Annotation processing  Other semantic resources: WikiNet  Being studied at UFAL  ČVUT  High-precision classifier for 1000 ImageNet concepts  Todo: compare performance of this classifier and MUFIN search-based solution; if complementary, try to combine  Image similarity measure derived from the classifier  Todo: compare it to MPEG7 similarity utilized by MUFIN Image Annotation

CEMI, Plzeň, Slide 11 of 11 Questions, comments?