Combining Text and Image Queries at ImageCLEF2005: A Corpus-Based Relevance-Feedback Approach Yih-Cheng Chang Department of Computer Science and Information.


Combining Text and Image Queries at ImageCLEF2005: A Corpus-Based Relevance-Feedback Approach

Yih-Cheng Chang, Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan
Hsin-Hsi Chen, Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan
Wen-Cheng Lin, Department of Medical Informatics, Tzu Chi University, Hualien, Taiwan

ImageCLEF 2005

NTU NLPL

Why Combine Text and Image Queries in Cross-Language Image Retrieval?
- Text-based image retrieval
  - Translation errors in cross-language image retrieval
  - Annotation errors in automatic annotation
  - Easy to capture semantic meaning
  - Easy to construct a textual query
- Content-based image retrieval (CBIR)
  - Semantic meaning is hard to represent
  - Example images have to be found or drawn
  - Avoids translation in cross-language image retrieval
  - Annotation is not necessary

How to Combine Text and Image Features in Cross-Language Image Retrieval?
- Parallel approach: conduct text-based and content-based retrieval separately, then merge the retrieval results
- Pipeline approach: use textual or visual information for the initial retrieval, then use the other feature to filter out irrelevant images
- Transformation-based approach: mine the relations between images and text, then use the mined relations to transform textual information into visual information, and vice versa
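
The parallel approach can be illustrated with a minimal sketch. It assumes each retriever returns a mapping from image IDs to scores and that the two runs are merged by a weighted sum of min-max normalized scores. The slides do not give the merging formula, so `normalize`, `merge_results`, the weight `alpha`, and the sample runs are all hypothetical.

```python
def normalize(scores):
    """Min-max normalize a {image_id: score} dict to the range [0, 1]."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {img: 1.0 for img in scores}
    return {img: (s - lo) / (hi - lo) for img, s in scores.items()}


def merge_results(text_scores, visual_scores, alpha=0.7):
    """Late fusion: weighted sum of normalized text and visual scores.

    alpha is an assumed weight for the text run; an image missing from
    one run contributes 0 for that run.
    """
    t, v = normalize(text_scores), normalize(visual_scores)
    merged = {img: alpha * t.get(img, 0.0) + (1 - alpha) * v.get(img, 0.0)
              for img in set(t) | set(v)}
    # Highest combined score first; ties broken by image id for determinism.
    return sorted(merged.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical scores from a text-based run and a content-based run.
text_run = {"img1": 2.0, "img2": 1.0, "img3": 0.0}
visual_run = {"img2": 0.9, "img4": 0.1}
ranking = merge_results(text_run, visual_run)
```

A pipeline variant would instead keep only the images from the first run that also pass a threshold in the second run, rather than summing scores.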

Approach at ImageCLEF 2004
- Automatically transform textual queries into visual representations
- Mine the relationships between text and images
  - Divide an image into several smaller parts
  - Link the words in the caption to the corresponding parts
  - Analogous to word alignment in a sentence-aligned parallel corpus
- Build a transmedia dictionary
- Transform a textual query into a visual one using the transmedia dictionary

System at ImageCLEF2004
[Figure: system architecture. Components shown: language resources; source language textual query; query translation; target language textual query; training collection (images, image captions); text-image correlation learning; transmedia dictionary; query transformation; visual query; target collection (images, image captions); textual index; visual index; text-based image retrieval; content-based image retrieval; result merging; retrieved images.]

Learning Correlation
[Figure: an example image with the caption "Mare and foal in field, slopes of Clatto Hill, Fife". Caption words: hill, mare, foal, field, slope. Segmentation yields image parts B01, B02, B03, B04.]

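Text-Based Image Retrieval at ImageCLEF2004
Columns: run, query translation, backward transliteration, mean average precision [precision values not preserved]
- WCO: backward transliteration: no
- WCO+NT: query translation: WCO; backward transliteration: yes
- F2hf: query translation: first-two-highest-frequency; backward transliteration: no
- F2hf+NT: query translation: first-two-highest-frequency; backward transliteration: yes
- Mono
Using similarity-based backward transliteration improves performance.
69.71%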

Cross-Language Experiments at ImageCLEF2004
Query type and mean average precision [precision values not preserved]:
- Textual query (F2hf+NT)
- Generated visual query (18 topics): poor
- Textual query + generated visual query (N+V+A, n=30, t=0.02): +0.46%, an insignificant performance increase

Analyses of These Approaches
- Parallel approach and pipeline approach
  - Simple and useful
  - Do not exploit the relations between visual and textual features
- Transformation-based approach
  - Textual and visual queries can be translated into each other using the relations between visual and textual features
  - It is hard to learn all the relations between visual and textual features
  - The degree of ambiguity of the relations is usually high

Our Approach at ImageCLEF2005: A Corpus-Based Relevance Feedback Method
- A corpus-based relevance feedback approach
  - Initiate a content-based retrieval
  - Treat the retrieved images and their text descriptions as aligned documents
  - Adopt a corpus-based method to select key terms from the text descriptions and generate a new query
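
The feedback cycle above can be sketched roughly as follows. The slides only say that a corpus-based method selects key terms, so the scoring rule here (how many of the top-ranked descriptions contain a term) is an assumption, as are the names `tokenize` and `select_key_terms`, the small stopword list, and the sample descriptions.

```python
from collections import Counter

# A tiny illustrative stopword list (an assumption, not from the slides).
STOPWORDS = {"a", "an", "and", "at", "in", "of", "on", "the", "with"}


def tokenize(text):
    """Lowercase a description and keep alphabetic, non-stopword tokens."""
    words = (w.strip(".,;:()").lower() for w in text.split())
    return [w for w in words if w.isalpha() and w not in STOPWORDS]


def select_key_terms(feedback_descriptions, num_terms=3):
    """Pick the terms that occur in the most feedback descriptions.

    Document frequency within the feedback set is an assumed scoring
    rule; the slides do not specify the actual selection method.
    """
    doc_freq = Counter()
    for description in feedback_descriptions:
        doc_freq.update(set(tokenize(description)))
    # Sort by descending frequency, then alphabetically for determinism.
    ranked = sorted(doc_freq.items(), key=lambda kv: (-kv[1], kv[0]))
    return [term for term, _ in ranked[:num_terms]]


# Hypothetical descriptions of the top images returned by the visual run.
top_descriptions = [
    "Aeroplane on the airfield.",
    "Military aeroplane at air base.",
    "Aeroplane and crew on airfield.",
]
new_query = select_key_terms(top_descriptions)
```

The selected terms would then be issued as a textual query against the caption index, optionally merged with the original textual query.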

Fundamental Concepts of a Corpus-Based Relevance Feedback Approach

(Aircraft on the ground) VIPER system

Bilingual Ad Hoc Retrieval Task
- 28,133 photographs from St. Andrews University Library's photographic collection
- The collection is in English and the queries are in different languages
  - In our experiments, the queries are in Chinese
- All images are accompanied by a textual description written in English by librarians working at St. Andrews Library
- The test set contains 28 topics; each topic has a text description and an example image

An Example – An Image and Its Description

An Example – A Topic in Chinese
- A Chinese title
- An English title

Some Models in Formal Runs

Experiment Results at ImageCLEF
[Results chart: three percentage labels; values not preserved]
Performance of EE+EX > CE+EX → EE > EX > CE > Visual run

Lessons Learned
- Compared with the initial visual retrieval, average precision increases from 8.29% to 34.25% after the feedback cycle
- Combining textual and visual information can improve performance

Example: Aircraft on the Ground ( )
- Text only (monolingual)
- Text only (cross-lingual)
- The top 2 images in the cross-lingual run are non-relevant because of a query translation problem: clear ( ), above ( ), floor ( )

Example: Aircraft on the Ground (after integration)
- Text (monolingual) + Visual
- The Text+Visual run is better than the monolingual run because it expands the query with useful words, e.g., aeroplane, military air base, airfield

ImageCLEF2004 vs. ImageCLEF2005
- Text-based IR (monolingual case): (2004) vs. (2005)
  - This year's topics are a little harder
- Text+Image IR (monolingual case): (2004) vs. (2005)
- Text+Image IR (cross-lingual case): (2004) vs. (2005)
  - 70.45% vs. %

Automatic Annotation Task
- The automatic annotation task in ImageCLEF 2005 can be seen as a classification task, since each image can only be annotated with one word (i.e., a category)
- We propose several methods to measure the similarity between a test image and a category; a test image is assigned to the most similar category
- The proposed methods use the same image features but different classification approaches

Image Feature Extraction
- Resize images to 256 x 256 pixels
- Segment each image into 32 x 32 blocks (each block is 8 x 8 pixels)
- Compute the average gray value of each block to construct a vector with 1,024 elements
- Measure the similarity between two images with the cosine formula
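
The feature extraction steps can be sketched in plain Python as follows. The sketch assumes the image has already been resized to 256 x 256 and converted to grayscale, and that it is given as a list of pixel rows. The function names and the two synthetic test images are illustrative, not taken from the slides.

```python
import math


def block_average_features(image, block=8):
    """Average gray value of each block x block tile, in row-major order.

    `image` is assumed to be a square grayscale image already resized to
    256 x 256, given as a list of rows of pixel values.
    """
    size = len(image)
    features = []
    for top in range(0, size, block):
        for left in range(0, size, block):
            total = sum(image[y][x]
                        for y in range(top, top + block)
                        for x in range(left, left + block))
            features.append(total / (block * block))
    return features


def cosine_similarity(u, v):
    """Cosine of the angle between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Two synthetic 256 x 256 images: a horizontal gradient and a flat gray image.
gradient = [[x for x in range(256)] for _ in range(256)]
flat = [[128] * 256 for _ in range(256)]
f_gradient = block_average_features(gradient)
f_flat = block_average_features(flat)
similarity = cosine_similarity(f_gradient, f_flat)
```

With 8 x 8 blocks on a 256 x 256 image, each feature vector has 32 x 32 = 1,024 elements, matching the vector length stated on the slide.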

Some Models and Experimental Results
- NTU-annotate05-1NN: baseline model; uses the 1-NN method to classify each image
- NTU-annotate05-Top2: computes the similarity between a test image and a category using the top 2 nearest images in each category, and assigns the test image to the most similar category
- NTU-annotate05-SC: the training data is clustered using the k-means algorithm (k=1000); we compute the centroid of each category in each cluster and assign a test image to the category of the nearest centroid
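
The first two models can be sketched as follows, under stated assumptions. Training data is a list of (feature vector, category) pairs and similarity is the cosine measure. The slides do not say how the two nearest similarities are combined in the Top2 run, so averaging them is an assumption. The function names and the toy 2-D vectors are hypothetical, and the k-means based SC run is not covered.

```python
import math
from collections import defaultdict


def cosine(u, v):
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def classify_1nn(test_vec, training):
    """Return the category of the single most similar training image."""
    _, category = max(training, key=lambda item: cosine(test_vec, item[0]))
    return category


def classify_top2(test_vec, training):
    """Score each category by its two most similar training images.

    Averaging the two similarities is an assumption; the slides do not
    say how the top-2 similarities are combined.
    """
    sims = defaultdict(list)
    for vec, category in training:
        sims[category].append(cosine(test_vec, vec))

    def score(category):
        top = sorted(sims[category], reverse=True)[:2]
        return sum(top) / len(top)

    return max(sims, key=score)


# Toy 2-D feature vectors standing in for the 1,024-element image features.
training = [
    ([1.0, 0.0], "A"), ([0.0, 1.0], "A"),
    ([1.0, 0.5], "B"), ([1.0, 0.6], "B"),
]
test_vec = [1.0, 0.1]
label_1nn = classify_1nn(test_vec, training)
label_top2 = classify_top2(test_vec, training)
```

In this toy data the two rules disagree: the single nearest image belongs to category A, while category B has the two closest images on average, which shows how the Top2 rule can smooth over one atypical neighbour.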

Conclusion: Bilingual Ad Hoc Retrieval Task
- An approach that combines textual and image features is proposed for Chinese-English image retrieval → a corpus-based feedback cycle from CBIR
- Compared with the performance of monolingual IR (0.3952), integrating visual and textual queries achieves better performance in cross-language image retrieval (0.3977) → resolves part of the translation errors
- The integration of visual and textual queries also improves the performance of monolingual IR from to → provides more information
- The improvement is the best among all the groups → 78.2% of the best monolingual text retrieval

Conclusion: Automatic Annotation Task
- A feature extraction algorithm is proposed, and several classification approaches are explored using the same image features
- The 1-NN and top-2 approaches, which have an error rate of 21.7%, outperform the centroid-based approach (error rate 22.5%)
- Our method is about 9 percentage points worse than the best-performing group (error rate 12.6%), but better than most of the groups in this task

Thank You and Comments