LEARNING SEMANTICS OF WORDS AND PICTURES TEJASWI DEVARAPALLI.

Slides:



Advertisements
Similar presentations
Clustering Art & Learning the Semantics of Words and Pictures Manigantan Sethuraman.
Advertisements

Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Mustafa Cayci INFS 795 An Evaluation on Feature Selection for Text Clustering.
PARTITIONAL CLUSTERING
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
Image Retrieval Basics Uichin Lee KAIST KSE Slides based on “Relevance Models for Automatic Image and Video Annotation & Retrieval” by R. Manmatha (UMASS)
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Date:2011/06/08 吳昕澧 BOA: The Bayesian Optimization Algorithm.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
Search and Retrieval: More on Term Weighting and Document Ranking Prof. Marti Hearst SIMS 202, Lecture 22.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.
Expectation Maximization Method Effective Image Retrieval Based on Hidden Concept Discovery in Image Database By Sanket Korgaonkar Masters Computer Science.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman ICCV 2003 Presented by: Indriyati Atmosukarto.
Simulation.
MANISHA VERMA, VASUDEVA VARMA PATENT SEARCH USING IPC CLASSIFICATION VECTORS.
WORD-PREDICTION AS A TOOL TO EVALUATE LOW-LEVEL VISION PROCESSES Prasad Gabbur, Kobus Barnard University of Arizona.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
1 An Empirical Study on Large-Scale Content-Based Image Retrieval Group Meeting Presented by Wyman
Presented by Zeehasham Rasheed
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
ICME 2004 Tzvetanka I. Ianeva Arjen P. de Vries Thijs Westerveld A Dynamic Probabilistic Multimedia Retrieval Model.
Latent Semantic Analysis (LSA). Introduction to LSA Learning Model Uses Singular Value Decomposition (SVD) to simulate human learning of word and passage.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
Computer Vision - A Modern Approach Set: Segmentation Slides by D.A. Forsyth Segmentation and Grouping Motivation: not information is evidence Obtain a.
Information Retrieval in Practice
Image Segmentation Image segmentation is the operation of partitioning an image into a collection of connected sets of pixels. 1. into regions, which usually.
Image Annotation and Feature Extraction
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
Memory Bounded Inference on Topic Models Paper by R. Gomes, M. Welling, and P. Perona Included in Proceedings of ICML 2008 Presentation by Eric Wang 1/9/2009.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Introduction to Digital Libraries hussein suleman uct cs honours 2003.
Contextual Ranking of Keywords Using Click Data Utku Irmak, Vadim von Brzeski, Reiner Kraft Yahoo! Inc ICDE 09’ Datamining session Summarized.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
A Model for Learning the Semantics of Pictures V. Lavrenko, R. Manmatha, J. Jeon Center for Intelligent Information Retrieval Computer Science Department,
CS654: Digital Image Analysis
Non-Photorealistic Rendering and Content- Based Image Retrieval Yuan-Hao Lai Pacific Graphics (2003)
Evolutionary Algorithms for Finding Optimal Gene Sets in Micro array Prediction. J. M. Deutsch Presented by: Shruti Sharma.
C. Lawrence Zitnick Microsoft Research, Redmond Devi Parikh Virginia Tech Bringing Semantics Into Focus Using Visual.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Externally growing self-organizing maps and its application to database visualization and exploration.
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
Object Recognition Part 2 Authors: Kobus Barnard, Pinar Duygulu, Nado de Freitas, and David Forsyth Slides by Rong Zhang CSE 595 – Words and Pictures Presentation.
A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER
Image Emotional Semantic Query Based On Color Semantic Description Wei-Ning Wang, Ying-Lin Yu Department of Electronic and Information Engineering, South.
Effective Automatic Image Annotation Via A Coherent Language Model and Active Learning Rong Jin, Joyce Y. Chai Michigan State University Luo Si Carnegie.
Exploiting Ontologies for Automatic Image Annotation Munirathnam Srikanth, Joshua Varner, Mitchell Bowden, Dan Moldovan Language Computer Corporation SIGIR.
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
Towards Total Scene Understanding: Classification, Annotation and Segmentation in an Automatic Framework N 工科所 錢雅馨 2011/01/16 Li-Jia Li, Richard.
Relation Strength-Aware Clustering of Heterogeneous Information Networks with Incomplete Attributes ∗ Source: VLDB.
A Framework to Predict the Quality of Answers with Non-Textual Features Jiwoon Jeon, W. Bruce Croft(University of Massachusetts-Amherst) Joon Ho Lee (Soongsil.
1 Random Walks on the Click Graph Nick Craswell and Martin Szummer Microsoft Research Cambridge SIGIR 2007.
Enhanced hypertext categorization using hyperlinks Soumen Chakrabarti (IBM Almaden) Byron Dom (IBM Almaden) Piotr Indyk (Stanford)
Content-Based Image Retrieval Using Color Space Transformation and Wavelet Transform Presented by Tienwei Tsai Department of Information Management Chihlee.
Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň,
1 Dongheng Sun 04/26/2011 Learning with Matrix Factorizations By Nathan Srebro.
Singular Value Decomposition and its applications
Self-Organizing Maps for Content-Based Image Database Retrieval
Matching Words with Pictures
Matching Words and Pictures
Text Categorization Berlin Chen 2003 Reference:
Relevance and Reinforcement in Interactive Browsing
Presentation transcript:

LEARNING SEMANTICS OF WORDS AND PICTURES TEJASWI DEVARAPALLI

CONTENT INTRODUCTION MODELING IMAGE DATASET STATISTICS HIERARCHICAL MODEL TESTING AND USING BASIC MODEL AUTO ILLUSTRATION AUTO ANNOTATION RESULTS DISCUSSIONS

SEMANTICS LANGUAGE USES A SYSTEM OF LINGUISTIC SIGNS, EACH OF WHICH IS A COMBINATION OF MEANING AND PHONOLOGICAL AND/OR ORTHOGRAPHIC FORMS. SEMANTICS IS TRADITIONALLY DEFINED AS THE STUDY OF MEANING IN LANGUAGE.

ABSTRACT A STATISTICAL MODEL FOR ORGANIZING IMAGE COLLECTIONS. INTEGRATES SEMANTIC INFORMATION PROVIDED BY ASSOCIATED TEXT AND VISUAL INFORMATION PROVIDED BY IMAGE FEATURES. PROMISING MODEL FOR INFORMATION RETRIEVAL TASKS LIKE DATABASE BROWSING, SEARCHING FOR IMAGES. USED FOR NOVEL APPLICATIONS.

INTRODUCTION METHOD FOR ORGANIZING IMAGE DATABASES. INTEGRATES TWO KINDS OF INFORMATION DURING MODEL CONSTRUCTION. LEARNS LINKS BETWEEN IMAGE FEATURES AND SEMANTICS. LEARNINGS USEFUL IN BETTER BROWSING BETTER SEARCH NOVEL APPLICATIONS

INTRODUCTION(CONTINUED) MODELS STATISTICS ABOUT OCCURRENCE AND CO-OCCURRENCE OF WORD AND FEATURES. HIERARCHICAL STRUCTURE. GENERATIVE MODEL, IMPLICITLY CONTAINS PROCESSES FOR PREDICTING IMAGE COMPONENTS WORDS AND FEATURES

COMPARISON THIS MODEL SUPPORTS BROWSING FOR THE IMAGE RETRIEVAL PURPOSES SYSTEMS FOR SEARCHING IMAGE DATABASES INCLUDES SEARCH BY QUERY. TEXT IMAGE FEATURE SIMILARITY SEGMENT FEATURES IMAGE SKETCH

MODELING IMAGE DATASET STATISTICS GENERATIVE HIERARCHICAL MODEL COMBINATION OF ASYMMETRIC CLUSTERING MODEL (MAPS DOCUMENTS INTO CLUSTERS) SYMMETRIC CLUSTERING MODEL(MODELS JOINT DISTRIBUTION OF DOCUMENTS AND FEATURES). DATA MODELED AS FIXED HIERARCHY OF NODES. NODES GENERATE WORD IMAGE SEGMENT

ILLUSTRATION DOCUMENTS MODELED AS SEQUENCE OF WORDS AND SEQUENCE OF SEGMENTS USING BLOBWORLD REPRESENTATION. "BLOBWORLD" REPRESENTATION IS CREATED BY CLUSTERING PIXELS IN A JOINT COLOR-TEXTURE-POSITION FEATURE SPACE. THE DOCUMENT IS MODELED BY SUM OVER THE CLUSTERS, TAKING ALL CLUSTERS INTO CONSIDERATION.

Higher level nodes emit more general words and blobs. (e. g. sky) Moderately general words and blobs. (e. g. Sun,sea) Lower level nodes emit more specific words and blobs. (e. g. Waves) Sun Sky Sea Waves HIERARCHICAL MODEL EACH NODE HAS A PROBABILITY OF GENERATING A WORD/ IMAGE W.R.T THE DOCUMENT UNDER CONSIDERATION. CLUSTER DEFINES THE PATH. CLUSTER, LEVEL IDENTIFIES THE NODE.

Mathematical Process for generating set of observations D associated with a document d is described by C – clusters, i – items, l – levels.

GAUSSIAN DISTRIBUTIONS NUMBER OF FEATURES LIKE ASPECTS OF SIZE, POSITION, COLOR, TEXTURE AND SHAPE ALL TOGETHER FORM FEATURE VECTOR X. PROBABILITY DISTRIBUTION OVER IMAGE SEGMENTS BY USUAL FORMULA:-

MODELING IMAGE DATASET STATISTICS THIS MODEL USES HIERARCHICAL MODEL AS IT BEST SUPPORTS BROWSING OF LARGE COLLECTIONS OF IMAGES COMPACT REPRESENTATION PROVIDES IMPLEMENTATION DETAILS FOR AVOIDING OVER TRAINING. THE TRAINING PROCEDURE CLUSTERS A FEW THOUSAND IMAGES IN A FEW HOURS ON A STATE OF THE ART PC.

MODELING IMAGE DATASET STATISTICS RESOURCE REQUIREMENTS LIKE MEMORY INCREASE RAPIDLY WITH NO.OF IMAGES. SO WE NEED EXTRA CARE. THERE ARE DIFFERENT APPROACHES FOR AVOIDING OVER-TRAINING AND RESOURCE USAGE.

FIRST APPROACH WE TRAIN ON RANDOMLY SELECTED SUBSET OF IMAGES UNTIL LOG LIKELYHOOD FOR HELD OUT DATA, RANDOMLY SELECTED FROM REMAINING DATA BEGINS TO DROP. THE MODEL SO FOUND IS USED AS A STARTING POINT FOR NEXT TRAINING ROUND USING SECOND RANDOM SET OF IMAGES.

SECOND APPROACH SECOND METHOD FOR REDUCING RESOURCE USAGE IS TO LIMIT CLUSTER MEMBERSHIP. FIRST COMPUTE APPROXIMATE CLUSTERING BY TRAINING ON A SUBSET. THEN CLUSTER ON ENTIRE DATASET, MAINTAIN PROBABILITY THAT A POINT IS IN A CLUSTER FOR TOP TWENTY CLUSTERS. REST OF THE MEMBERSHIP PROBABILITIES ASSUMED TO BE ZERO FOR NEXT FEW ITERATIONS.

TESTING AND USING BASIC MODEL METHOD STABILITY IS TESTED BY RUNNING FITTING PROCESS. FITTING PROCESS IS RUN ON SAME DATA SEVERAL TIMES WITH DIFFERENT INITIAL CONDITIONS AS EXPECTATION MAXIMIZATION(EM) PROCESS IS SENSITIVE TO THE STARTING POINT. THE CLUSTERING POINT DEPENDS MORE ON STARTING POINT THAN ON EXACT IMAGES CHOSEN FOR TRAINING. THE SECOND TEST IS TO VERIFY WHETHER CLUSTERING ON BOTH IMAGE AND TEXT HAS ADVANTAGE OR NOT.

TESTING AND USING THE BASIC MODEL THIS FIGURE SHOWS 16 IMAGES FROM A CLUSTER FOUND USING TEXT ONLY

TESTING AND USING THE BASIC MODEL THIS FIGURE SHOWS 16 IMAGES FROM A CLUSTER FOUND USING ONLY IMAGE FEATURES

TESTING AND USING THE BASIC MODEL

BROWSING MOST IMAGE RETRIEVAL SYSTEMS DO NOT SUPPORT BROWSING. THEY FORCE USER TO SPECIFY A QUERY. THE ISSUE IS WHETHER THE CLUSTERS FOUND THROUGH BROWSING MAKE SENSE TO THE USER. IF THE USER FINDS THE CLUSTERS COHERENT THEN THEY CAN BEGIN TO INTERNALIZE THE KIND OF STRUCTURE THEY REPRESENT.

BROWSING USER STUDY GENERATE 64 CLUSTERS FOR 3000 CLUSTERS. GENERATE 64 RANDOM CLUSTERS FROM THE SAME IMAGES. PRESENT RANDOM CLUSTER TO USER, ASK TO RATE COHERENCE(YES/NO). 94% ACCURACY

IMAGE SEARCH SUPPLY A COMBINATION OF TEXT AND IMAGE FEATURES. APPROACH : COMPUTE FOR EACH CANDIDATE IMAGE, THE PROBABILITY OF EMITTING THE QUERY ITEMS. Q = SET OF QUERY ITEMS D= CANDIDATE DOCUMENT.

IMAGE SEARCH THE FIGURE SHOWS THE RESULTS OF THE RIVER AND TIGER QUERY.

IMAGE SEARCH SECOND APPROACH FINDING THE PROBABILITY THAT EACH CLUSTER GENERATES A QUERY AND THEN SAMPLE ACCORDING TO WEIGHTED CLUSTERS. CLUSTER MEMBERSHIP PLAYS IMPORTANT ROLE IN GENERATING DOCUMENTS, WE CAN SAY CLUSTERS ARE COHERENT.

IMAGE SEARCH PROVIDING MORE FLEXIBLE METHOD OF SPECIFYING IMAGE FEATURES IS AN IMPORTANT NEXT STEP. THIS IS AS EXPLORED IN MANY QUERY BY EXAMPLE IMAGE RETRIEVAL SYSTEMS. EXAMPLE :- WE CAN QUERY FOR A DOG WITH WORD DOG AND IF WE WANT BLUE SKY THEN WE CAN GET IT BY ADDING IMAGE SEGMENT FEATURE TO THE QUERY.

PICTURES FROM WORDS AND WORDS FROM PICTURES THERE ARE TWO TYPES OF APPROACHES FOR LINKING WORDS TO PICTURES AND PICTURES TO WORDS. AUTO ILLUSTRATION AUTO ANNOTATION

AUTO ILLUSTRATION AUTO ILLUSTRATION – THE PROCESS OF LINKING PICTURES TO WORDS. GIVEN A SET OF QUERY ITEMS, Q AND A CANDIDATE DOCUMENT D, WE CAN EXPRESS THE PROBABILITY THAT A DOCUMENT PRODUCES THE QUERY BY:

AUTO ANNOTATION GENERATE WORDS FOR A GIVEN IMAGE CONSIDER THE PROBABILITY OF THE IMAGE BELONGING TO THE CURRENT CLUSTER. CONSIDER THE PROBABILITY OF THE ITEMS IN THE IMAGE BEING GENERATED BY THE NODES AT VARIOUS LEVELS IN THE PATH ASSOCIATED TO THE CLUSTER. WORK THE ABOVE OUT FOR ALL CLUSTERS.

AUTO ANNOTATION WE ARE COMPUTING THE PROBABILITY THAT AN IMAGE EMITS A PROPOSED WORD, GIVEN THE OBSERVED SEGMENTS, B:

AUTO ANNOTATION THE FIGURE SHOWS SOME ANNOTATION RESULTS SHOWING THE ORIGINAL IMAGE, THE BLOBWORLD SEGMENTATION, THE COREL KEYWORDS, AND THE PREDICTED WORDS IN RANK ORDER.

AUTO ANNOTATION THE TEST IMAGES WERE NOT IN THE TRAINING SET, BUT THEY COME FROM SAME SET OF CDS USED FOR TRAINING. THE KEYWORDS IN UPPER-CASE ARE IN THE VOCABULARY.

AUTO ANNOTATION TESTING THE ANNOTATION PROCEDURE: WE USE THE MODEL TO PREDICT THE IMAGE WORDS BASED ONLY ON THE SEGMENTS, THEN COMPARE THE WORDS WITH SEGMENTS. PERFORM TEST ON TRAINING DATA AND TWO DIFFERENT TEST SETS. THEY ARE 1 ST SET - RANDOMLY SELECTED HELD OUT SET FROM PROPOSED TRAINING DATA COMING FROM COREL CDS. 2 ND SET - IMAGES FROM OTHER CDS

AUTO ANNOTATION QUANTITATIVE PERFORMANCE USE 160 COREL CDS, EACH WITH 100 IMAGES(GROUPED BY THEME) SELECT 80 OF THE CDS, SPLIT INTO TRAINING (75%) AND TEST (25%). REMAINING 80 CDS ARE A HARDER TEST SET. MODEL SCORING: N = NUMBER OF WORDS FOR THE IMAGE, R= NUMBER OF WORDS RECTLY.

RESULTS ANNOTATION RESULTS ON THREE KINDS OF TEST DATA, WITH THREE DIFFERENT SCORING METHODS.

RESULTS THE ABOVE TABLE SUMMARIZES THE ANNOTATION RESULT USING THE THREE SCORING METHODS AND THE THREE HELD OUT SETS. WE AVERAGE THE RESULTS OF 5 SEPARATE RUNS WITH DIFFERENT HELD OUT SETS. USING THE COMPARISON OF SAMPLING FROM THE WORD PRIOR, WE SCORE 3.14 ON THE TRAINING DATA, 2.70 ON NON-TRAINING DATA FROM THE SAME CD SET AS THE TRAINING DATA AND 1.65 FOR TEST DATA TAKEN FROM COMPLETELY DIFFERENT SET OF CDS.

DISCUSSION PERFORMANCE OF THE SYSTEM CAN BE MEASURED BY TAKING ADVANTAGE OF ITS PREDICTIVE CAPABILITIES. WORDS WITH NO RELEVANCE TO VISUAL CONTENT CAUSE RANDOM NOISE, BY TAKING AWAY PROBABILITY FROM MORE RELEVANT WORDS. SUCH WORDS CAN BE REMOVED BY OBSERVING THEIR EMISSION PROBABILITIES ARE SPREAD OUT OVER THE NODES. THIS IS AUTOMATIC IMAGE REDUCTION METHOD WORKS DEPENDING ON THE NATURE OF THE DATA SET.

REFERENCES LEARNING SEMANTICS OF WORDS AND PICTURES BY KOBUS BARNARD AND DAVID FORSYTH, COMPUTER DIVISION, UNIVERSITY OF CALIFORNIA, BERKELEY C.CARSON, S.BELONGE, H. GREENSPAN AND J.MALIK, BLOBWORLD: IMAGE SEGMENTATION USING EXPECTATION MAXIMIZATION AND ITS APPLICATION TO IMAGE QUERYING, IN REVIEW.

QUERIES

THANK YOU