+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January.

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
HT06, Position Paper, Tagging, Taxonomy, Flickr, Academic Article, ToRead, Presentation Cameron Marlow, Mor Naaman, danah boyd, Marc Davis Yahoo! Research.
Search Engines Information Retrieval in Practice All slides ©Addison Wesley, 2008.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
1 Entity Ranking Using Wikipedia as a Pivot (CIKM 10’) Rianne Kaptein, Pavel Serdyukov, Arjen de Vries, Jaap Kamps 2010/12/14 Yu-wen,Hsu.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
Evaluating Search Engine
6/16/20151 Recent Results in Automatic Web Resource Discovery Soumen Chakrabartiv Presentation by Cui Tao.
Instructional Information in Adaptive Spatial Hypertext Luis Francisco-Revilla and Frank Shipman Presented By : Ananda Man Shrestha.
LinkSelector: A Web Mining Approach to Hyperlink Selection for Web Portals Xiao Fang University of Arizona 10/18/2002.
Recommender Systems; Social Information Filtering.
Information Retrieval
Overview of Web Data Mining and Applications Part I
SEARCH ENGINE By Ms. Preeti Patel Lecturer School of Library and Information Science DAVV, Indore E mail:
“ The Initiative's focus is to dramatically advance the means to collect,store,and organize information in digital forms,and make it available for searching,retrieval,and.
For REAL MEN REAL STYLE.  Search Engine Optimization  SEO is strategies, techniques and tactics to improve or promote a website in order to get a.
Business and Management Research
1 Hierarchical Tag visualization and application for tag recommendations CIKM’11 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
Tag-based Social Interest Discovery
«Tag-based Social Interest Discovery» Proceedings of the 17th International World Wide Web Conference (WWW2008) Xin Li, Lei Guo, Yihong Zhao Yahoo! Inc.,
Web 2.0: Concepts and Applications 4 Organizing Information.
Personalization in Local Search Personalization of Content Ranking in the Context of Local Search Philip O’Brien, Xiao Luo, Tony Abou-Assaleh, Weizheng.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Web Usage Mining with Semantic Analysis Date: 2013/12/18 Author: Laura Hollink, Peter Mika, Roi Blanco Source: WWW’13 Advisor: Jia-Ling Koh Speaker: Pei-Hao.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
23 Participants Average years in education = Average age = males; 14 females Most used applications = Events; Social Networks; Music Least.
JENNIE MATHEWS ST. JOHN’S UNIVERSITY LIS 239 Can the Addition of Social Software Tools & Tags Improve the Productivity of an Academic Library OPAC? 1.
Improved search for Socially Annotated Data Authors: Nikos Sarkas, Gautam Das, Nick Koudas Presented by: Amanda Cohen Mostafavi.
Integrating Technology for Instruction and Learning Jennifer Verschoor & Evelyn Izquierdo April 3, 2009.
Tag Data and Personalized Information Retrieval 1.
Web Search. Structure of the Web n The Web is a complex network (graph) of nodes & links that has the appearance of a self-organizing structure  The.
WHAT IS A SEARCH ENGINE A search engine is not a physical engine, instead its an electronic code or a software programme that searches and indexes millions.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
Social Bookmarking and Linking Strategies Page 1 Linda Fisk and Mark Robertson Freedom Interactive Marketing.
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Exploring Online Social Activities for Adaptive Search Personalization CIKM’10 Advisor : Jia Ling, Koh Speaker : SHENG HONG, CHUNG.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
ON INCENTIVE-BASED TAGGING Xuan S. Yang, Reynold Cheng, Luyi Mo, Ben Kao, David W. Cheung {xyang2, ckcheng, lymo, kao, The University.
ON THE SELECTION OF TAGS FOR TAG CLOUDS (WSDM11) Advisor: Dr. Koh. Jia-Ling Speaker: Chiang, Guang-ting Date:2011/06/20 1.
 Social and Technological developments that led to Web 2.0 Internet connectivity available almost everywhere Personal computer evolved into homes, schools,
WISER: Gadgets and Widgets Jane Rawson, Vere Harmsworth Library Emma Cragg, Sainsbury Library.
Date: 2012/4/23 Source: Michael J. Welch. al(WSDM’11) Advisor: Jia-ling, Koh Speaker: Jiun Jia, Chiou Topical semantics of twitter links 1.
Let's play “tag”. what is a tag? A tag is a keyword or descriptive term associated with an item as means of classification by means of a folksonomy...
Information Retrieval Effectiveness of Folksonomies on the World Wide Web P. Jason Morrison.
Discovering Computers Fundamentals, Third Edition CGS 1000 Introduction to Computers and Technology Spring 2007.
CS315-Web Search & Data Mining. A Semester in 50 minutes or less The Web History Key technologies and developments Its future Information Retrieval (IR)
Improving Search Engines Using Human Computation Games (CIKM09) Date: 2011/3/28 Advisor: Dr. Koh. Jia-Ling Speaker: Chiang, Guang-ting 1.
Chapter 8 Evaluating Search Engine. Evaluation n Evaluation is key to building effective and efficient search engines  Measurement usually carried out.
Retroactive Answering of Search Queries Beverly Yang Glen Jeh.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
By Andrew McDaniel. Bloom’s Revised Taxonomy Remembering Understanding Applying Analyzing Evaluating Creating.
Independence Middle School MEDIA CENTER /7.
Company LOGO In the Name of Allah,The Most Gracious, The Most Merciful King Khalid University College of Computer and Information System Websites Programming.
Blogging. Website and blog A website, also written as web site,or simply site, is a set of related web pages typically served from a single web domain.
DivQ: Diversification for Keyword Search over Structured Databases Elena Demidova, Peter Fankhauser, Xuan Zhou and Wolfgang Nejfl L3S Research Center,
Augmenting (personal) IR Readings Review Evaluation Papers returned & discussed Papers and Projects checkin time.
Adaptive Faceted Browsing in Job Offers Danielle H. Lee
Hybrid Content and Tag-based Profiles for recommendation in Collaborative Tagging Systems Latin American Web Conference IEEE Computer Society, 2008 Presenter:
Web Design Vocabulary #3. HTML Hypertext Markup Language - The coding scheme used to format text for use on the World Wide Web.
Weaving a Personal Web: Using online technologies to create customized, connected, and dynamic learning environments.
WEB STRUCTURE MINING SUBMITTED BY: BLESSY JOHN R7A ROLL NO:18.
Data mining in web applications
Search Engine Optimization
User Modeling for Personal Assistant
Neighborhood - based Tag Prediction
Evaluation of IR Systems
Business and Management Research
Presentation transcript:

+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January 2009

+ 2 Preview Introduction Collaborative tagging User-Induced hyperlinks Similarity of Assigned Tags Association Rule Mining Analysis of User-induced links Tag Prediction Discussion Conclusion 2

+ 3 Introduction Hyper links Makes navigation through the web possible The author decides the document to link to Due to the limited links that authors give, has lead to user- contributed content on the web. In social bookmarking sites, e.g. Delicious Users can maintain a collection of documents URLs are identified by their chosen tags 3

+ 4 Collaborative tagging (1/2) Popular Tagging systems e.g. Delicious and LibraryThing Allows users describe their favorite online resources using their own words Eg tags new, tv, sports weather, travelhttp:/// Advantages over traditional methods Flexibility and freedom offered by these systems Systems are quick to adapt to changes in the vocabulary among the users. 4

+ 5 Collaborative tagging (2/2) Collaborative tagging activities of participating user results in scheme called folksonomy Folksonomy is divided into three types of elements Users Assign tags to the Web Tags Keywords chosen by users to describe and categorize a web document Documents Object tagged by the user 5

+ 6 User-Induced hyperlinks Two types of hyperlinks For Navigation For recommendation Directs users to other documents that contain related information Two different approached to discover implicit relations in folksonomy Calculating the similarity between the sets of tags assigned to the document Analyzing the collective behavior of the user who have tagged the document User-induced Links are implicit links in a folksonomy as resulted from collaborative tagging activities by users 6

+ 7 Similarity of Assigned Tags (1/4) First approach of discovering user-induced links Calculate the pair-wise similarity between documents based on their tags Jaccard Coefficient In IR, Cosine Similarity 7

+ 8 Similarity of Assigned Tags (2/4) 8 Cosine Similarity

+ 9 Similarity of Assigned Tags (3/4) 9 Second similarity function The normalized discounted cumulative gain (NDCG) used to evaluate ranking of documents according to their relevance score Firstly list the tags of the two documents Secondly, calculate the DCG at position p

+ 10 Similarity of Assigned Tags (4/4) 10 Thirdly, iDCG Finally, calculate the NDCG Use a function

+ 11 Association Rule Mining 11 Second approach of discovering user-induced links Finding out pairs of Web documents that have both been tagged by the same group of users Aims at identifying implicit patterns within a large database of transactions Two major concepts Support confidence

+ 12 Analysis of User-Induced Links (1/3) 12 Two methods described Identify user-induced links in data collected Delicious Compared them with existing hyperlinks in terms of several different aspects. Several aspects to compare Do they connect 2 documents from the same domain/website Similarity between documents on the two ends of a link Whether users are equally interested in the linked documents

+ 13 Analysis of User-Induced Links (2/3) 13 Data collection Data collected from Delicious Documents cover a wide range of topics Documents collected on per-tag basis First collected at random 130 tags, popular tags For each tag, crawl Delicious to obtain a set of documents and users that have tag the document.

+ 14 Analysis of User-Induced Links (3/3) 14 Results Identify user-induced links between the documents using the two methods For similarity, vary the similarity threshold to 0.5 For association Rule, set minimum support to 100 and vary the minimum confidence level Findings Very few user-induced links that supported confidence of 0.5 and above

+ 15 Results (1/8) 15

+ 16 Results (2/8) on Same Domain 16 One important function of hyperlinks allow users to navigate from one hypertext document to another More beneficial if the links point to some document outside external to the current website Check whether the documents at the ends are from the same domain

+ 17 Results (3/8) on Same Domain 17

+ 18 Results (4/8) on Coincidence between existing hyperlinks and user-induced links 18 See whether such links already exist between the documents If user-induced links coincide with existing hyperlinks means that users are satisfied with the existing hyperlinks If user-induces are mostly new, means that there are user interests and perspectives that existing hyperlinks have note captures

+ 19 Results (5/8) on Coincidence between existing hyperlinks and user-induced links 19

+ 20 Results (6/8) on similarity and user preferences 20 Look at documents that are connected by user-induce links Between blog posts of highly related topics News articles on the same topics Websites offering applications of similar functionalities Q&A pages of some portal site Two different approaches for generating user-induced links Association rule, a link is generated if enough users are interested in two documents regardless of the similarity between them Similarity based, generates links based on the tags assigned regardless of whether there are many users interested in the documents

+ 21 Results (7/8) on similarity and user preferences 21

+ 22 Results (8/8) on similarity and user preferences 22

+ 23 Tags Prediction (1/3) 23 The analysis of user-induced links shows that links generated by association rule mining of user collections usually connect documents that are highly related to each other as judged by the similarity between their tags To predict the tags Identify the other documents that have a link to this document The set of documents that have a link (d x )

+ 24 Tags Prediction (2/3) 24 Firstly, consider a simple averaging method

+ 25 Tags Prediction (3/3) 25 Secondly method of aggregation method

+ 26 Experiments (1/2) 26 Measure the performance of the predictions By using NDCG Precision at the nth Term NDCG was used To investigate whether the predictions are accurate in terms of the ordering of the tags.

+ 27 Experiments (2/2) 27

+ 28 Discussion 28 Implicit relation between web documents can be discovered by examining user preferences and document similarity embedded in a folksonomy User-induced are different from hyperlinks Collaborative tagging environment shows the differences between the perspective of Web authors and Web readers Worthwhile considering an open hypermedia structure backed by a collaborative tagging system.

+ 29 Conclusion 29 User-induced links, a form of implicit relations between documents We used Tag similarity to generate many user-induced links Association rule miming to generate very high user-induced-links

+ 30 Thank you for your attention 30