2 IndexingRankingClustering …… Recommendation Annotation Multimedia Information Retrieval.

Slides:



Advertisements
Similar presentations
Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos.
Advertisements

DONG XU, MEMBER, IEEE, AND SHIH-FU CHANG, FELLOW, IEEE Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment.
Content-Based Image Retrieval
Multimedia Answer Generation for Community Question Answering.
Bring Order to Your Photos: Event-Driven Classification of Flickr Images Based on Social Knowledge Date: 2011/11/21 Source: Claudiu S. Firan (CIKM’10)
EventCube Aviation Safety Data Analysis System Fangbo Tao, Xiao Yu, Jiawei Han 08/10/13.
Visualization and Cluster
1 Texmex – November 15 th, 2005 Strategy for the future Global goal “Understand” (= structure…) TV and other MM documents Prepare these documents for applications.
1 Content-Based Retrieval (CBR) -in multimedia systems Presented by: Chao Cai Date: March 28, 2006 C SC 561.
Recommender Systems Aalap Kohojkar Yang Liu Zhan Shi March 31, 2008.
Li-Jia Li Yongwhan Lim Li Fei-Fei Chong Wang David M. Blei B UILDING AND U SING A S EMANTIVISUAL I MAGE H IERARCHY CVPR, 2010.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Vector Space Information Retrieval Using Concept Projection Presented by Zhiguo Li
IR Models: Latent Semantic Analysis. IR Model Taxonomy Non-Overlapping Lists Proximal Nodes Structured Models U s e r T a s k Set Theoretic Fuzzy Extended.
QueryAnnotationsImages Search Result Text Based : Annotation by surrounding text Content Based : Annotation by the content of images Social Based.
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
Information Retrieval in Practice
Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics – Bag of concepts – Semantic distance between two words.
Classifying Tags Using Open Content Resources Simon Overell, Borkur Sigurbjornsson & Roelof van Zwol WSDM ‘09.
Multimedia Databases (MMDB)
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab
Content-Based Image Retrieval
Beyond Co-occurrence: Discovering and Visualizing Tag Relationships from Geo-spatial and Temporal Similarities Date : 2012/8/6 Resource : WSDM’12 Advisor.
No Title, yet Hyunwoo Kim SNU IDB Lab. September 11, 2008.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
Research Projects 6v81 Multimedia Database Yohan Jin, T.A.
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
Unsupervised Learning of Visual Sense Models for Polysemous Words Kate Saenko Trevor Darrell Deepak.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
1 A Web Search Engine-Based Approach to Measure Semantic Similarity between Words Presenter: Guan-Yu Chen IEEE Trans. on Knowledge & Data Engineering,
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
Text Analytics in Action: Using Text Analytics as a Toolset TBC 4:15 p.m. - 5:00 p.m. Marjorie Hlava Semantic enrichment / Semantic Fingerprinting.
What Is Text Mining? Also known as Text Data Mining Process of examining large collections of unstructured textual resources in order to generate new.
Image Classification for Automatic Annotation
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
Unsupervised Auxiliary Visual Words Discovery for Large-Scale Image Object Retrieval Yin-Hsi Kuo1,2, Hsuan-Tien Lin 1, Wen-Huang Cheng 2, Yi-Hsuan Yang.
Annotation Framework & ImageCLEF 2014 JAN BOTOREK, PETRA BUDÍKOVÁ
Automatic Labeling of Multinomial Topic Models
Duc-Tien Dang-Nguyen, Giulia Boato, Alessandro Moschitti, Francesco G.B. De Natale Department to Information and Computer Science –University of Trento.
Data Mining for Surveillance Applications Suspicious Event Detection Dr. Bhavani Thuraisingham.
MULTIMEDIA DATA MODELS AND AUTHORING
Automatic Labeling of Multinomial Topic Models Qiaozhu Mei, Xuehua Shen, and ChengXiang Zhai DAIS The Database and Information Systems Laboratory.
Relevance Feedback in Image Retrieval System: A Survey Tao Huang Lin Luo Chengcui Zhang.
1 Knowledge-Based Medical Image Indexing and Retrieval Caroline LACOSTE Joo Hwee LIM Jean-Pierre CHEVALLET Daniel RACOCEANU Nicolas Maillot Image Perception,
Designed by Jennifer Yong
3D Motion Classification Partial Image Retrieval and Download Multimedia Project Multimedia and Network Lab, Department of Computer Science.
Sports Flash Cards track golf gymnastics
Cross-modal Hashing Through Ranking Subspace Learning
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics Semantic distance between two words.
From Frequency to Meaning: Vector Space Models of Semantics
Visual Information Retrieval
3D Motion Classification Partial Image Retrieval and Download
SAMT 2006.
Personalized Social Image Recommendation
Accounting for the relative importance of objects in image retrieval
Unit 5 : Do you like baseball?
Multimedia Information Retrieval
CSE 635 Multimedia Information Retrieval
Do you have a soccer ball?
Presentation transcript:

2 IndexingRankingClustering …… Recommendation Annotation Multimedia Information Retrieval

3 Image Similarity/ Distance Concept Similarity/ Distance Annotation Indexing Ranking Clustering …… Recommendation

4 Image Similarity/ Distance Concept Similarity/ Distance Image Similarity/Distance

5 Numerous efforts have been made. Concept Similarity/ Distance Concept Similarity/Distance

Image Similarity/Distance 6 Concept Similarity/Distance Olympic Numerous efforts have been made. SportsCat TigerPaw More and more used, but not well studied.

7 WordNet Distance Google Distance Tag Concurrence Distance

8 Built by human experts, so close to human perception Coverage is limited and difficult to extend

9 Easy to get and huge coverage Only reflects concurrency in textual documents. Not really concept distance (semantic relationship)

10

11 Images are taken into account a)Tags are sparse so visual concurrency is not well reflected b)Training data is difficult to get similarity matrix: 500 tags similarity matrix: 50 tags

12

13 Synonymy different words but the same meaning table tennis ping- pong — Visually Similar similar things or things of same type horsedonkey — Meronymy part and the whole carwheel — Concurrency exist at the same scene/place airplaneairport —

14 Image tag concurrence distance Image tag concurrence distance implicitly uses image information, but tags are too sparse Google distance Google distance’s coverage is very high, but it is for text domain Concept Distance WordNet distance WordNet distance is good, but coverage is too low Mine from ontology Mine from text documents Mine from image tags

15 Can we mine concept distance from image content?

16 To mine concept distance from a large tagged image collection based on image content

17 Concept A: Airplane Concept B: Airport Concept Model A Concept Model B Flickr Distance (A, B)

18 Flickr Distance is able to cover the four different semantic relationships Synonymy, Visually Similar, Meronymy, and Concurrency

19

20 SVM, Boosting, … Discriminative Generative Global Feature Local Feature w/o Spatial Relation w/ Spatial Relation Bag-of-Words (pLSA, LDA), … 2D HMM, MRF, … Concept Models

21 SVM, Boosting, … Discriminative Generative Global Feature Local Feature w/o Spatial Relation w/ Spatial Relation Bag-of-Words, … 2D HMM, MRF, … Concept Models VLM – Visual Language Model Spatial-relation sensitive Efficient Efficient Can handle object variations Can handle object variations

22 Iamtalkingaboutstatisticallanguagemodel. Unigram ModelBigram ModelTrigram Model

23 Unigram ModelBigram ModelTrigram Model Image  Patch Patch  Gradient Texture Histogram Hashing  Visual Word Visual Word Generation

24

25

26

27

28

29 Concept A: Airplane Concept B: Airport Concept Model A Concept Model B Flickr Distance (A, B) Tag search in Flickr Jensen-ShannonDivergence LT-VLM

30

31

32

33

34 Normalized Google Distance Tag Concurrence Distance Flickr Distance Group1Group2Group3 Group 1 Group2Group3Group1Group2Group3 bears horses moon space bowling dolphin donkey Saturn sharks snake softball spiders turtle Venus whale wolf baseball basketball football golf soccer tennis volleyball moon space Venus whale baseball donkey softball wolf basketball bears bowling dolphin football golf horses Saturn sharks soccer spiders tennis turtle volleyball moon Saturn space Venus bears dolphin donkey golf horses sharks spiders tennis whale wolf baseball basketball football snake soccer bowling softball volleyball

35 The number of correctly annotated keywords at the first N words

36

37

38 If we find similar patterns in the images associated with different concepts, the corresponding concept relationships can be discovered.

39

40

41

42

43

44 International Network for Social Network Analysis

45

46

47 Flickr Distance is able to cover the four different semantic relationships Synonymy, Visually Similar, Meronymy, and Concurrency

48

Image  Patch Patch  GradientTexture Histogram Hashing  Visual Word

50

51 compared with ground-truth distance pair NGD Ground- Truth

52

53