1 Knowledge-Based Medical Image Indexing and Retrieval Caroline LACOSTE Joo Hwee LIM Jean-Pierre CHEVALLET Daniel RACOCEANU Nicolas Maillot Image Perception,

Slides:



Advertisements
Similar presentations
Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Improvements and extras Paul Thomas CSIRO. Overview of the lectures 1.Introduction to information retrieval (IR) 2.Ranked retrieval 3.Probabilistic retrieval.
Chapter 5: Introduction to Information Retrieval
Introduction to Information Retrieval
Content-Based Image Retrieval
Ke Liu1, Junqiu Wu2, Shengwen Peng1,Chengxiang Zhai3, Shanfeng Zhu1
Chapter 8 Content-Based Image Retrieval. Query By Keyword: Some textual attributes (keywords) should be maintained for each image. The image can be indexed.
Addressing the Medical Image Annotation Task using visual words representation Uri Avni, Tel Aviv University, Israel Hayit GreenspanTel Aviv University,
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
Li-Jia Li Yongwhan Lim Li Fei-Fei Chong Wang David M. Blei B UILDING AND U SING A S EMANTIVISUAL I MAGE H IERARCHY CVPR, 2010.
Morris LeBlanc.  Why Image Retrieval is Hard?  Problems with Image Retrieval  Support Vector Machines  Active Learning  Image Processing ◦ Texture.
Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.
2 IndexingRankingClustering …… Recommendation Annotation Multimedia Information Retrieval.
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
February, Content-Based Image Retrieval Saint-Petersburg State University Natalia Vassilieva Il’ya Markov
WORD-PREDICTION AS A TOOL TO EVALUATE LOW-LEVEL VISION PROCESSES Prasad Gabbur, Kobus Barnard University of Arizona.
Advance Information Retrieval Topics Hassan Bashiri.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Video Search Engines and Content-Based Retrieval Steven C.H. Hoi CUHK, CSE 18-Sept, 2006.
A structured learning framework for content- based image indexing and visual Query (Joo-Hwee, Jesse S. Jin) Presentation By: Salman Ahmad (270279)
Chapter 5: Information Retrieval and Web Search
Information Retrieval in Practice
DOG I : an Annotation System for Images of Dog Breeds Antonis Dimas Pyrros Koletsis Euripides Petrakis Intelligent Systems Laboratory Technical University.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
A Thousand Words in a Scene P. Quelhas, F. Monay, J. Odobez, D. Gatica-Perez and T. Tuytelaars PAMI, Sept
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics – Bag of concepts – Semantic distance between two words.
Thien Anh Dinh1, Tomi Silander1, Bolan Su1, Tianxia Gong
1 Using Ontology Dimension and Negative Expansion In CLEF05 Jean-Pierre Chevallet (IPAL CNRS) Joo Hwee Lim (I2R A-STAR) Saïd Radhouani (CLIPS CUI)
Text- and Content-based Approaches to Image Retrieval for the ImageCLEF 2009 Medical Retrieval Track Matthew Simpson, Md Mahmudur Rahman, Dina Demner-Fushman,
Image and Video Retrieval INST 734 Doug Oard Module 13.
“How much context do you need?” An experiment about context size in Interactive Cross-language Question Answering B. Navarro, L. Moreno-Monteagudo, E.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
Watch, Listen and Learn Sonal Gupta, Joohyun Kim, Kristen Grauman and Raymond Mooney -Pratiksha Shah.
Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos Yiming Liu, Dong Xu, Ivor W. Tsang, Jiebo Luo Nanyang Technological.
Content-Based Image Retrieval
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
A Study on Query Expansion Methods for Patent Retrieval Walid MagdyGareth Jones Centre for Next Generation Localisation School of Computing Dublin City.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Image Retrieval in Radiology: The ImageCLEF Challenge Charles E. Kahn, Jr. Medical College of Wisconsin Jayashree Kalpathy-Cramer Oregon Health & Science.
Classifying Images with Visual/Textual Cues By Steven Kappes and Yan Cao.
Chapter 6: Information Retrieval and Web Search
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
IPAL at ImageClef 2007 Mixing Features, Models and Knowledge Sheng Gao IPAL French-Singaporean Joint Lab Institute for Infocomm Research Singapore.
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
From Text to Image: Generating Visual Query for Image Retrieval Wen-Cheng Lin, Yih-Chen Chang and Hsin-Hsi Chen Department of Computer Science and Information.
Information Retrieval using Word Senses: Root Sense Tagging Approach Sang-Bum Kim, Hee-Cheol Seo and Hae-Chang Rim Natural Language Processing Lab., Department.
Selecting Good Expansion Terms for Pseudo-Relevance Feedback Guihong Cao, Jian-Yun Nie, Jianfeng Gao, Stephen Robertson 2008 SIGIR reporter: Chen, Yi-wen.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
1/12/ Multimedia Data Mining. Multimedia data types any type of information medium that can be represented, processed, stored and transmitted over.
+ Speed Up Texture Classification in Clothing Retrieval System 電機三 吳瑋凌.
Generating Query Substitutions Alicia Wood. What is the problem to be solved?
Combining Text and Image Queries at ImageCLEF2005: A Corpus-Based Relevance-Feedback Approach Yih-Cheng Chang Department of Computer Science and Information.
Relevance Feedback in Image Retrieval System: A Survey Tao Huang Lin Luo Chengcui Zhang.
Analysis of Experiments on Hybridization of different approaches in mono and cross-language information retrieval DAEDALUS – Data, Decisions and Language,
Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.
Image Retrieval and Ranking using L.S.I and Cross View Learning Sumit Kumar Vivek Gupta
Personalized Social Image Recommendation
and Knowledge Graphs for Query Expansion Saeid Balaneshinkordan
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Aim of the project Take your image Submit it to the search engine
Multimedia Information Retrieval
CSE 635 Multimedia Information Retrieval
Color Image Retrieval based on Primitives of Color Moments
Information Retrieval and Web Design
Presentation transcript:

1 Knowledge-Based Medical Image Indexing and Retrieval Caroline LACOSTE Joo Hwee LIM Jean-Pierre CHEVALLET Daniel RACOCEANU Nicolas Maillot Image Perception, Access & Language (IPAL) French-Singaporean Joint Lab (CNRS,I2R,NUS,UJF)

2 Approach  Indexing both image and text using medical concepts from the NLM’s Unified Medical Language System (UMLS) To incorporate expert knowledge To work at a higher semantic level To unify text and image  Structured learning approach to extract medical semantics from images (e.g. modality, anatomy, pathology)  Fusion between textual and visual information

3 UMLS-based text indexing  UMLS (Unified Medical Language System) Multilingual: meta-thesaurus 17 languages Large : More than concepts, 5,5 millions of terms Consistent: categorization of concepts in semantic types  Why using concepts and not terms ? Remove the problem of term variation / synonymy Ex: “Fracture” and “Broken Bones” “Natural” multi-lingual indexing  Using Meta-Thesaurus Structure Vector space model does not take into account this structure Using Semantic Dimension (Given by the UMLS)  Tested this year Dimension filtering: At least one matching according one of the three dimension Dimension weighting: Re-weight similarity according to the number of matched dimension

4 Semantic Image Indexing Low-level features - Color - Texture - Shape High level features - Modality - Anatomy - Pathology Computed Tomography Chest Nodules ? Supervised learning framework Global indexing to access image modality Local indexing to access semantic local features related to anatomy, modality, and pathology concepts

5 Angiography X-ray Micro Global Indexing Learning set: 4000 images Medical Image I Learning Semantic Classification Low-level feature extraction Low-level feature extraction 32 VMTs (modality, anatomy, spatial UMLS concepts, and color percepts) MRI Head Sagittal Color: histogram Texture: Gabor features Spatial: Thumbnails VMT indexes P(VMT|I) Modality-anatomy probabilities SVM VMT:Visual Medical Terms

6 X-ray Bone Fracture Local indexing Learning set: 3631 patches Learning Low-level feature extraction 64 local VMTs (modality, anatomy, pathology concepts, and color percepts) Color: first moments Texture: Gabor features VMT indexes per block Low-level feature extraction per patch Semantic Classification per patch Histograms of local VMTs Spatial Aggregation in grid layout (3x3, 3x1, 3x2, 1x3, 2x3) SVM

7 Visual retrieval  Retrieval using the global visual indexing Retrieval based on the Manhattan distance between 2 indexes (~histogram of modality-anatomy concepts) Modality filtering according to the textual query modality concepts I admissible if P( mod Q | I) > T  Retrieval using the local indexing Retrieval based on the mean of Manhattan distances between local VMT histograms  Late Fusion : mean of the two similarity measures

8 Medical Image Retrieval Typical query Show me x-ray images with fractures of the femur. UMLS concepts Low-level features UMLS-based features FUSION Index Matching Textual retrieval Matching Visual retrieval Mixed retrieval FUSION MetaMap SVM classifiers Index Matching Mixed retrieval

9 Comparative Results on CLEF2005 MethodTextualVisualMAPRank Concept, Dimension re-weighting (1)x26.46%1/31 Concept, Pseudo-Rel. FB, Dim filtx22.94%2/31 Concepts, Dimension filteringx22.70%3/31 Terms, Dimension filteringx20.88%5/31 Concept, semantic doc expansionx18.56%10/31 Fusion Local+Global Indexing (2)x06.41%3/11 Global Indexing according to modalityx05.66%4/11 Local Indexing with VMTx04.84%6/11 Late fusion (1) + (2)xx30.95%1/37 Concept fusion (1) + (2) adaptive threshold xx28.78%2/37 Concept fusion (1) + (2) Fixed Threshold 0.15 xx28.45%3/37

10 Conclusion  CLEF multilingual medical image retrieval task: Very difficult task: large collection, “precise” and “semantic” queries  Concept Indexing: It works !! Inter-media common indexing Multilingual  Image and text: complementary Text: closer to the meaning Efficiency of the visual filtering to remove aberrant images  Importance of semantic dimensions for text visual terms and learning  The UMLS indexing induces a lot of perspectives: Early fusion Semantic-based retrieval

11 Inter-Media Pseudo-Relevance Feedback Application to ImageCLEF Photo 2006 Nicolas MAILLOT Jean-Pierre CHEVALLET Joo Hwee LIM Image Perception, Access & Language (IPAL) French-Singaporean Joint Lab (CNRS,I2R,NUS,UJF)

12 Inter-Media Pseudo-Relevance Feedback  Problem: Multi-modal (text+image) Information Indexing and Retrieval Multi-modal documents Multi-modal queries Application to the ImageCLEF Photo Task  Objectives: Inter-media enrichment dealing with synonymy Re-using existing mono-media image and text retrieval systems

13 Related Works  Mono-modal pseudo-relevance feedback [Xu and Croft 96]  Translation models [Lin et al. 05] Automatic translation of the textual query into a visual query Requires prior mining of textual/visual relationships  Late Fusion [Chevallet et al. 05] Text and image retrieval are done in parallel and the results are merged (weighted sum) No mutual enrichment  Latent Semantic Indexing (text+visual keywords) Computationally expensive Inter-Media Pseudo-Relevance Feedback

14 Text Processing  Morpho-Syntax Part of Speech Tagging Unknown Proper Nouns detection Word Normalization & Spelling correction  Index terms Noun Phrase detection using WordNet Geographic named entities detected using Wordnet or the tag  Concept Indexing Selection based on the most frequent sense provided by Wordnet Inter-Media Pseudo-Relevance Feedback

15 Image Processing  Image Segmentation Meanshift Segmentation Patch Based Tessellation  Feature Extraction Color histograms Bags of SIFT Scale Invariant Feature Transform Gabor Features (Texture) Inter-Media Pseudo-Relevance Feedback Extraction of one local orientation histogram per location

16 Image Retrieval Engine Text Retrieval Engine ImageText ImageText Image Text Text Query Expansion Ranked Documents Index Documents Query Top k Retrieved Documents Inter-Media Pseudo-Relevance Feedback Overview Expanded Query

17 ModalityA/MMAPP10P20P30Rank Inter Media Pseudo Relevance FeedbackText+VisualAutomatic /43 Best RunText+VisualManual /43 Inter-Media Pseudo-Relevance Feedback Results

18 Conclusion  Using the image modality to expand the text query Use of the text associated with the top k images retrieved Assumption: these k images are relevant  Future work: Advanced conceptual filtering and reasoning some concepts are not characterized by the visual appearance Comparison with the translation model Inter-Media Pseudo-Relevance Feedback

19 Thanks !

20 Supervised learning framework Learning set: VMT instances SVM classifiers Image/Region Learning Semantic Classification VMT indexes P(VMT|I) Low-level feature extraction Low-level feature extraction