Flickr Tag Recommendation based on Collective Knowledge Hyunwoo Kim SNU IDB Lab. August 27, 2008 Borkur Sigurbjornsson, Roelof van Zwol Yahoo! Research.

Slides:

Advertisements

Similar presentations

Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London.

Advertisements

Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos.

Towards Methods for the Collective Gathering and Quality Control of Relevance Assessments SIGIR´09, July 2009.

Search in Source Code Based on Identifying Popular Fragments Eduard Kuric and Mária Bieliková Faculty of Informatics and Information.

Bring Order to Your Photos: Event-Driven Classification of Flickr Images Based on Social Knowledge Date: 2011/11/21 Source: Claudiu S. Firan (CIKM’10)

Stephan Gammeter, Lukas Bossard, Till Quack, Luc Van Gool.

Large dataset for object and scene recognition A. Torralba, R. Fergus, W. T. Freeman 80 million tiny images Ron Yanovich Guy Peled.

1 Entity Ranking Using Wikipedia as a Pivot (CIKM 10’) Rianne Kaptein, Pavel Serdyukov, Arjen de Vries, Jaap Kamps 2010/12/14 Yu-wen,Hsu.

Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)

Query Operations: Automatic Local Analysis. Introduction Difficulty of formulating user queries –Insufficient knowledge of the collection –Insufficient.

Video retrieval using inference network A.Graves, M. Lalmas In Sig IR 02.

Mobile Web Search Personalization Kapil Goenka. Outline Introduction & Background Methodology Evaluation Future Work Conclusion.

Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.

J. Chen, O. R. Zaiane and R. Goebel An Unsupervised Approach to Cluster Web Search Results based on Word Sense Communities.

Mobile Photos April 17, Auto Extraction of Flickr Tags Unstructured text labels Extract structured knowledge Place and event semantics Scale-structure.

Quality-Aware Collaborative Question Answering: Methods and Evaluation Maggy Anastasia Suryanto, Ee-Peng Lim, Aixin Sun, and Roger H. L. Chiang. In Proceedings.

Finding Wormholes with Flickr Geotags Maarten Clements Marcel Reinders Arjen de Vries Pavel Serdyukov December 3 rd, 2009 GIS.

Tag-based Social Interest Discovery

Web 2.0: Concepts and Applications 4 Organizing Information.

2008/06/06 Y.H.Chang Towards Effective Browsing of Large Scale Social Annotations1 Towards Effective Browsing of Large Scale Social Annotations WWW 2007.

Tag Clouds Revisited Date : 2011/12/12 Source : CIKM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh. Jia-ling 1.

Classifying Tags Using Open Content Resources Simon Overell, Borkur Sigurbjornsson & Roelof van Zwol WSDM ‘09.

Citation Recommendation 1 Web Technology Laboratory Ferdowsi University of Mashhad.

An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.

1 Wikification CSE 6339 (Section 002) Abhijit Tendulkar.

Reyyan Yeniterzi Weakly-Supervised Discovery of Named Entities Using Web Search Queries Marius Pasca Google CIKM 2007.

Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab

Beyond Co-occurrence: Discovering and Visualizing Tag Relationships from Geo-spatial and Temporal Similarities Date : 2012/8/6 Resource : WSDM’12 Advisor.

No Title, yet Hyunwoo Kim SNU IDB Lab. September 11, 2008.

PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.

When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.

ON INCENTIVE-BASED TAGGING Xuan S. Yang, Reynold Cheng, Luyi Mo, Ben Kao, David W. Cheung {xyang2, ckcheng, lymo, kao, The University.

A Probabilistic Graphical Model for Joint Answer Ranking in Question Answering Jeongwoo Ko, Luo Si, Eric Nyberg (SIGIR ’ 07) Speaker: Cho, Chin Wei Advisor:

Date: 2013/8/27 Author: Shinya Tanaka, Adam Jatowt, Makoto P. Kato, Katsumi Tanaka Source: WSDM’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Estimating.

TOPIC CENTRIC QUERY ROUTING Research Methods (CS689) 11/21/00 By Anupam Khanal.

Developing Trust Networks based on User Tagging Information for Recommendation Making Touhid Bhuiyan et al. WISE May 2012 SNU IDB Lab. Hyunwoo Kim.

80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.

Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Extracting meaningful labels for WEBSOM text archives Advisor.

Detecting Dominant Locations from Search Queries Lee Wang, Chuang Wang, Xing Xie, Josh Forman, Yansheng Lu, Wei-Ying Ma, Ying Li SIGIR 2005.

Flickr the framework of Flickr. Observe them  How many photos does each user offer?  How many tags does each photo have?  The tag hot-list  How many.

You Are What You Tag Yi-Ching Huang and Chia-Chuan Hung and Jane Yung-jen Hsu Department of Computer Science and Information Engineering Graduate Institute.

Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.

Wikipedia as Sense Inventory to Improve Diversity in Web Search Results Celina SantamariaJulio GonzaloJavier Artiles nlp.uned.es UNED,c/Juan del Rosal,

How Useful are Your Comments? Analyzing and Predicting YouTube Comments and Comment Ratings Stefan Siersdorfer, Sergiu Chelaru, Wolfgang Nejdl, Jose San.

Authors: Marius Pasca and Benjamin Van Durme Presented by Bonan Min Weakly-Supervised Acquisition of Open- Domain Classes and Class Attributes from Web.

Flickr Tag Recommendation based on Collective Knowledge BÖrkur SigurbjÖnsson, Roelof van Zwol Yahoo! Research WWW Summarized and presented.

From Text to Image: Generating Visual Query for Image Retrieval Wen-Cheng Lin, Yih-Chen Chang and Hsin-Hsi Chen Department of Computer Science and Information.

Semi-Automatic Image Annotation Liu Wenyin, Susan Dumais, Yanfeng Sun, HongJiang Zhang, Mary Czerwinski and Brent Field Microsoft Research.

1 Masters Thesis Presentation By Debotosh Dey AUTOMATIC CONSTRUCTION OF HASHTAGS HIERARCHIES UNIVERSITAT ROVIRA I VIRGILI Tarragona, June 2015 Supervised.

Information Retrieval using Word Senses: Root Sense Tagging Approach Sang-Bum Kim, Hee-Cheol Seo and Hae-Chang Rim Natural Language Processing Lab., Department.

Tagging Systems and Their Effect on Resource Popularity Austin Wester.

Comparing Document Segmentation for Passage Retrieval in Question Answering Jorg Tiedemann University of Groningen presented by: Moy’awiah Al-Shannaq

Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -

Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.

DivQ: Diversification for Keyword Search over Structured Databases Elena Demidova, Peter Fankhauser, Xuan Zhou and Wolfgang Nejfl L3S Research Center,

Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)

Learning in a Pairwise Term-Term Proximity Framework for Information Retrieval Ronan Cummins, Colm O’Riordan Digital Enterprise Research Institute SIGIR.

Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.

ENHANCING CLUSTER LABELING USING WIKIPEDIA David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab SIGIR’09.

NN k Networks for browsing and clustering image collections Daniel Heesch Communications and Signal Processing Group Electrical and Electronic Engineering.

1 Knowledge-Based Medical Image Indexing and Retrieval Caroline LACOSTE Joo Hwee LIM Jean-Pierre CHEVALLET Daniel RACOCEANU Nicolas Maillot Image Perception,

CiteData: A New Multi-Faceted Dataset for Evaluating Personalized Search Performance CIKM’10 Advisor : Jia-Ling, Koh Speaker : Po-Hsien, Shih.

Question Answering Passage Retrieval Using Dependency Relations (SIGIR 2005) (National University of Singapore) Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan,

A content-based System for Music Recommendation and Visualization of User Preference Working on Semantic Notions Dmitry Bogdanov, Martin Haro, Ferdinand.

Personalized Ontology for Web Search Personalization S. Sendhilkumar, T.V. Geetha Anna University, Chennai India 1st ACM Bangalore annual Compute conference,

User Modeling for Personal Assistant

Neighborhood - based Tag Prediction

Personalized Social Image Recommendation

Project 3 Image Retrieval

WSExpress: A QoS-Aware Search Engine for Web Services

Presentation transcript:

Flickr Tag Recommendation based on Collective Knowledge Hyunwoo Kim SNU IDB Lab. August 27, 2008 Borkur Sigurbjornsson, Roelof van Zwol Yahoo! Research WWW 2008

Contents  Introduction  Related Work  Tag Behavior in Flickr  Tag Recommendation Strategies  Evaluation  Conclusion 2

Introduction [1/4] 3  Tagging  Action of adding keywords to objects  Tags  Meaningful descriptors of the objects  To organize and index contents  Useful with multimedia objects  – little or no textual context

Introduction [2/4] 4  Users are willing to provide semantic context through manual annotations  User annotate their photos to make them better accessible to the general public  Same photo would be annotated by another user it is possible that a different description is produced

Introduction [3/4] 5  La Sagrada Familia  Barcelona  Gaudi  Spain  Catalunya  Arcitecture  Church

Introduction [4/4] 6  How can we assist users in the tagging phase?  Two contributions 1. Analyze how users tag photos and what information is contained in the tagging 2. Evaluate tag recommendation strategies using global co- occurrence

Contents  Introduction  Related Work  Tag Behavior in Flickr  Tag Recommendation Strategies  Evaluation  Conclusion 7

Related Work [1/2] 8  Tags are useful to give improved access to photo collection using temporal information  Visualizing Tags Over Time, WWW2006  Usefulness of tagging information depends on the motivation of users  Why We Tag, SIGCHI2007

Related Work [2/2] 9  Various methods exist semi-automatically annotate photographs  Matching Words and Pictures, JMLR2003  Real-time Computerized Annotation of Pictures, MC2006  Adding semantic labels to Flickr tags  Towards Automatic Extraction of Event and Place Semantics from Flickr Tags, SIGIR2007

Contents  Introduction  Related Work  Tag Behavior in Flickr  Tag Recommendation Strategies  Evaluation  Conclusion 10

Tag Behavior in Flickr [1/7] 11  How do users tag?  What are they tagging?  Why do people tag? - Users are highly driven by social incentives

Tag Behavior in Flickr [2/7] Flickr Photo Collection 12  Flickr contains hundreds of millions of photos  More than 8.5 million users  12,000 photos served per second  2 million photos uploaded per day

Tag Behavior in Flickr [3/7] General Tag Characteristics 13  How users tag their photos  3.7 million unique tags

Tag Behavior in Flickr [4/7] General Tag Characteristics 14  Top 5 most frequent tags  2005, 2006, wedding, party, and 2004  The infrequent tags  Ambrose tompkins, ambient vector  15.7 million tags occur only once  Highly specific tags will only be useful in exceptional cases  3.7 million unique tags

Tag Behavior in Flickr [5/7] General Tag Characteristics 15  Less than 3 tagged photos covers 64% of all  Tag recommendation to be useful

Tag Behavior in Flickr [6/7] Tag Categorization 16  What are users tagging?  Mapping Flickr tags onto WordNet ex) London According to WordNet, London belongs to noun.person and noun.location

Tag Behavior in Flickr [7/7] Tag Categorization 17  Not only visual contents, also broader context ex) location, time, actions

Contents  Introduction  Related Work  Tag Behavior in Flickr  Tag Recommendation Strategies  Evaluation  Conclusion 18

Tag Recommendation Strategies [1/8] Tag Recommendation System 19

Tag Recommendation Strategies [2/8] Tag Co-occurrence 20  Method to calculate co-occurrence coefficients between of two tags  The co-occurrence between two tags : the number of photos where both tags are used

Tag Recommendation Strategies [3/8] Tag Co-occurrence 21  Symmetric measures  Asymmetric measures

Tag Recommendation Strategies [4/8] Tag Co-occurrence 22  The difference between symmetric and asymmetric ex) Eiffel Tower Symmetric method: Tour Eiffel, Eiffel, Seine, La Tour Eiffel, Paris Asymmetric method: Paris, France, Tour Eiffel, Eiffel, Europe  Asymmetric tag co-occurrence provides more suitable diversity of candidate tags

Tag Recommendation Strategies [5/8] Tag Aggregation and Promotion 23  Tag aggregation step is needed to merge the list into a single ranking  Two aggregation methods  Voting - It doesn’t take the co-occurrence values  Summing - It takes the co-occurrence values to produce final ranking

Tag Recommendation Strategies [6/8] Tag Aggregation and Promotion 24  Voting  Summing

Tag Recommendation Strategies [7/8] Tag Aggregation and Promotion 25  Promotion  The head and the tail of the power law is not good tags for recommendation  Stability-promotion  Descriptiveness-promotion  Rank-promotion

Tag Recommendation Strategies [8/8] Tag Aggregation and Promotion 26

Contents  Introduction  Related Work  Tag Behavior in Flickr  Tag Recommendation Strategies  Evaluation  Conclusion 27

Evaluation [1/3]  Evaluation metrics  Mean Reciprocal Rank (MRR) : the ability to return a relevant tag at the top ranking  Success at rank k : the probability of finding a good descriptive tag among the top k recommended tags  Precision at rank k : the proportion of retrieved tags that is relevant 28

Evaluation [2/3] 29

Evaluation [3/3]  The recommended tags contain useful additions to the user-defined tags  Promotion function has a positive effect on the performance in general  Best strategy has a stable performance over different classes of photos  System is particularly good at recommending locations, artifacts, and objects 30

Contents  Introduction  Related Work  Tag Behavior in Flickr  Tag Recommendation Strategies  Evaluation  Conclusion 31

Conclusion [1/2] 32  Tag behavior in Flickr  Mid section of power law contained the most interesting candidates for tag recommendation  The majority of the photos is being annotated with only a few tags  Users annotate where their photos are taken, who or what is on the photo, and when the photo was taken

Conclusion [2/2] 33  Extending Flickr photo annotations  Collective knowledge  Tag aggregation strategies are effective  Promotion function is an effective way to incorporate the ranking of tags  Best strategy shows to be a very stable approach for different types of tag-classes