Tracking User Attention in Collaborative Tagging Communities Elizeu Santos-Neto Matei Ripeanu Univesity of British Columbia Adriana Iamnitchi University.

Slides:



Advertisements
Similar presentations
S4: A Simple Storage Service for Sciences Matei Ripeanu Adriana Iamnitchi University of British Columbia University of South Florida.
Advertisements

Influence and Passivity in Social Media Daniel M. Romero, Wojciech Galuba, Sitaram Asur, and Bernardo A. Huberman Social Computing Lab, HP Labs.
Stelios Lelis UAegean, FME: Special Lecture Social Media & Social Networks (SM&SN)
Cognitive Publish/Subscribe for Heterogeneous Clouds Šarūnas Girdzijauskas, Swedish Institute of Computer Science (SICS) Joint work with:
Elizeu Santos-Neto, Flavio Figueiredo Jussara Almeida, Miranda Mowbray Marcos Gonçalves, Matei Ripeanu The 2 nd IEEE SocialCom/SIN -- August 2010.
COLLABORATIVE FILTERING Mustafa Cavdar Neslihan Bulut.
What should you Cache? A Global Analysis on YouTube Related Video Caching Dilip Kumar Krishnappa, Michael Zink and Carsten Griwodz NOSSDAV 2013.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
„IP“ is not always „Internet Protocol“ A long and a very short example for IP problems in Web 2.0 research Ralf Schenkel Joint work with Tom Crecelius,
Small-World File-Sharing Communities Adriana Iamnitchi, Matei Ripeanu and Ian Foster,
Recsplorer: Recommendation Algorithms Based on Precedence Mining Aditya Parameswaran Stanford University (Joint work with G. Koutrika, B. Bercovitz & H.
Tagging Systems Austin Wester. Tags A keywords linked to a resource (image, video, web page, blog, etc) by users without using a controlled vocabulary.
Tagging Systems Mustafa Kilavuz. Tags A tag is a keyword added to an internet resource (web page, image, video) by users without relying on a controlled.
1 Individual and Social Behavior in Tagging Systems Elizeu Santos-Neto David Condon, Nazareno Andrade Adriana Iamnitchi, Matei Ripeanu 20th ACM International.
Expertise Networks in Online Communities: Structure and Algorithms Jun Zhang Mark S. Ackerman Lada Adamic University of Michigan WWW 2007, May 8–12, 2007,
Are P2P Data-Dissemination Techniques Viable in Today's Data- Intensive Scientific Collaborations? Samer Al-Kiswany – University of British Columbia joint.
Learning Bit by Bit Collaborative Filtering/Recommendation Systems.
Introduction Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Introduction Facebook How does Facebook use your data? Where do you think.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
The Public Library Catalogue as a Social Space: A Case Study of Social Discovery Systems in Two Canadian Public Libraries Louise Spiteri. School of Information.
Adoption and Metrics Jam 2/23/2012 © Copyright All rights reserved.
Towards Boosting Video Popularity via Tag Selection Elizeu Santos-Neto, Tatiana Pontes, Jussara Almeida, Matei Ripeanu University of British Columbia -
School of Electronics Engineering and Computer Science Peking University Beijing, P.R. China Ziqi Wang, Yuwei Tan, Ming Zhang.
Mark Levene, An Introduction to Search Engines and Web Navigation © Pearson Education Limited 2005 Slide 9.1 Chapter 9 : Social Networks What is a social.
Liang Xiang, Quan Yuan, Shiwan Zhao, Li Chen, Xiatian Zhang, Qing Yang and Jimeng Sun Institute of Automation Chinese Academy of Sciences, IBM Research.
Network Aware Resource Allocation in Distributed Clouds.
Event-Based Model for Reconciling Digital Entries Thesis Proposal Ahmet Fatih Mustacoglu 10/3/20151Ahmet.
Clustering-based Collaborative filtering for web page recommendation CSCE 561 project Proposal Mohammad Amir Sharif
Improved search for Socially Annotated Data Authors: Nikos Sarkas, Gautam Das, Nick Koudas Presented by: Amanda Cohen Mostafavi.
Personalized Interaction with Web Resources First Sino-German Symposium on KNOWLEDGE HANDLING: REPRESENTATION, MANAGEMENT AND PERSONALIZED APPLICATION.
User Profiling based on Folksonomy Information in Web 2.0 for Personalized Recommender Systems Huizhi (Elly) Liang Supervisors: Yue Xu, Yuefeng Li, Richi.
Collusion-Resistance Misbehaving User Detection Schemes Speaker: Jing-Kai Lou 2015/10/131.
Adding Value to Online Content Jean Fitzgerald Abu Dhabi Women’s College Higher Colleges of Technology United Arab Emirates
To Blog or Not to Blog: Characterizing and Predicting Retention in Community Blogs Imrul Kayes 1, Xiang Zuo 1, Da Wang 2, Jacob Chakareski 3 1 University.
1 presentation of article: Small-World File-Sharing Communities Article: Adriana Iamnitchi, Matei Ripeanu, Ian Foster Presentation: Periklis Akritidis.
IMPROVING E-COMMERCE COLLABORATIVE RECOMMENDATIONS BY SEMANTIC INFERENCE OF NEIGHBORS’ PRACTICAL EXPERTISE 6 th International Workshop on Semantic Media.
Presented By :Ayesha Khan. Content Introduction Everyday Examples of Collaborative Filtering Traditional Collaborative Filtering Socially Collaborative.
A Graph-based Friend Recommendation System Using Genetic Algorithm
ON THE SELECTION OF TAGS FOR TAG CLOUDS (WSDM11) Advisor: Dr. Koh. Jia-Ling Speaker: Chiang, Guang-ting Date:2011/06/20 1.
Event-Based Hybrid Consistency Framework (EBHCF) for Distributed Annotation Records Ahmet Fatih Mustacoglu Advisor: Prof. Geoffrey.
Mining Bulletin Board Systems Using Community Generation Ming Li, Zhongfei (Mark) Zhang, and Zhi-Hua Zhou PAKDD’08 Reporter: Che-Wei, Liang Date:
Improving Efficiencies Through Cost- Benefit Analysis of Metadata Creation Joyce Celeste Chapman NCSU Libraries Fellow Metadata and Digital Object roundtable:
User-Centric Data Dissemination in Disruption Tolerant Networks Wei Gao and Guohong Cao Dept. of Computer Science and Engineering Pennsylvania State University.
A Statistical Comparison of Tag and Query Logs Mark J. Carman, Robert Gwadera, Fabio Crestani, and Mark Baillie SIGIR 2009 June 4, 2010 Hyunwoo Kim.
Most of contents are provided by the website Introduction TJTSD66: Advanced Topics in Social Media Dr.
Algorithmic Detection of Semantic Similarity WWW 2005.
CyberInfrastructure for Network Analysis Importance of, contributions by network analysis Transformation of NA Support needed for NA.
Institute of Computing Technology, Chinese Academy of Sciences 1 A Unified Framework of Recommending Diverse and Relevant Queries Speaker: Xiaofei Zhu.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
QoS Supported Clustered Query Processing in Large Collaboration of Heterogeneous Sensor Networks Debraj De and Lifeng Sang Ohio State University Workshop.
+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January.
KNN CF: A Temporal Social Network kNN CF: A Temporal Social Network Neal Lathia, Stephen Hailes, Licia Capra University College London RecSys ’ 08 Advisor:
ASSIST: Adaptive Social Support for Information Space Traversal Jill Freyne and Rosta Farzan.
Mapping the Gnutella Network: Properties of Large-Scale Peer-to-Peer Systems and Implications for System Design Authors: Matei Ripeanu Ian Foster Adriana.
1 NETWORKING 2012 Parallel and Distributed Systems Group, Delft University of Technology, the Netherlands May 22, 2012 Reducing the History in Decentralized.
File Grouping for Scientific Data Management: Lessons from Experimenting with Real Traces Shyamala Doraimani* and Adriana Iamnitchi University of South.
A Connectivity-Based Popularity Prediction Approach for Social Networks Huangmao Quan, Ana Milicic, Slobodan Vucetic, and Jie Wu Department of Computer.
ItemBased Collaborative Filtering Recommendation Algorithms 1.
Using Blog Properties to Improve Retrieval Gilad Mishne (ICWSM 2007)
Content Reuse and Interest Sharing in Tagging Communities
Neighborhood - based Tag Prediction
Summary Presented by : Aishwarya Deep Shukla
Elizeu Santos-Neto, Tatiana Pontes, Jussara Almeida, Matei Ripeanu
Advanced Scientometrics Workshop
Ahmet Fatih Mustacoglu
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
Postdoc, School of Information, University of Arizona
Jason altmetrics an exploratory study of impact metrics based on social media Jason
Movie Recommendation System
Graph-based Security and Privacy Analytics via Collective Classification with Joint Weight Learning and Propagation Binghui Wang, Jinyuan Jia, and Neil.
Presentation transcript:

Tracking User Attention in Collaborative Tagging Communities Elizeu Santos-Neto Matei Ripeanu Univesity of British Columbia Adriana Iamnitchi University of South Florida

ACM/IEEE CAMA2007 Workshop2 Collaborative Tagging - Introduction Users collect items and mark them with tags Items can be URLs, photos, books, paintings, blog posts, etc… All tagging events are visible in this study

ACM/IEEE CAMA2007 Workshop3 The Problem Growth reduces navigability  Lack of collaborative tagging behavior models How to improve scalability in these commuities?  Improve user experience via personalization Ability to find relevant content.

ACM/IEEE CAMA2007 Workshop4 Goals How is the activity distributed among users?  User activity level is highly heterogeneous Does the interest sharing has any structure?  Several disjoint sub-communities  Large amount of singleton users Can the tracked behavior help navigation?  Interest sharing graph helps improve navigability Contributions

ACM/IEEE CAMA2007 Workshop5 Data Sets CiteULikeBibsonomy Users~6K~600 Items~200K~67K Tags~51K~21K Assignements~452K~257K Data Cleaning  Robot users: tagged ~3K items within 5min  Automated Tags: bibtex-import, no-tag

ACM/IEEE CAMA2007 Workshop6 Contributions – Part I How is the activity distributed among users?  User activity level is highly heterogeneous Does the interest sharing has any structure?  Several disjoint sub-communities  Large amount of singleton users Can the tracked behavior help navigation?  Interest sharing graph helps improve navigability

ACM/IEEE CAMA2007 Workshop7 Tagging Activity User activity level is highly heterogeneous Item and Tag set sizes - strong correlation Item Set Size DistributionTag Set Size Distribution

ACM/IEEE CAMA2007 Workshop8 Contributions – Part II How is the activity distributed among users?  User activity level is highly heterogeneous Does the interest sharing has any structure?  Several disjoint sub-communities  Large amount of singleton users Can the tracked behavior help navigation?  Interest sharing graph helps improve navigability

ACM/IEEE CAMA2007 Workshop9 An Interest Sharing Graph GeorgeTony Castro ItemsTags

ACM/IEEE CAMA2007 Workshop10 Interest Sharing – Structure Users are nodes! Zero-degree nodes are removed! At least one shared item!

ACM/IEEE CAMA2007 Workshop11 Scalable Interest Sharing Definition GeorgeTony Castro ItemsTags For example: At least 30% items are shared! 50% 20% Several similarity metrics are possible

ACM/IEEE CAMA2007 Workshop12 Finding sub-communities CiteULike Bibsonomy Several disjoint sub-communities Large amount of singleton users

ACM/IEEE CAMA2007 Workshop13 Contributions – Part III How is the activity distributed among users?  User activity level is highly heterogeneous Does the interest sharing has any structure?  Several disjoint sub-communities  Large amount of singleton users Can the tracked behavior help navigation?  Interest sharing graph helps improve navigability

ACM/IEEE CAMA2007 Workshop14 Growth reduces navigability Intuition: 1. the higher is the entropy 2. more randomness 3. the harder is to find relevant content Global Entropy: ~11.75

ACM/IEEE CAMA2007 Workshop15 Interest Sharing to Reduce Entropy Global Entropy: ~ Average Entropy Random Graph Each user owns a library!

ACM/IEEE CAMA2007 Workshop16 How useful is the reduction of entropy? Hit Rate Evaluation  Let G(t) be a graph at time t  Compare the user libraries in the graphs G(t) and G(t+1)  Time unit can be month, day or hour. Preliminary results  Predicted about 20%(hour) to 5% (month) Is the data inherently hard to predict ?  Current work: comparison against other prediction techniques

ACM/IEEE CAMA2007 Workshop17 Summary User activity level is highly heterogeneous…  … and the Hoerl function is a good model. Users do share interest…  … but they form disjoint sub-communities. The entropy can be reduced…  …thus, more relevant content can be presented Future actions can be predicted...  …a sys admin was impressed by the results.

ACM/IEEE CAMA2007 Workshop18 Thanks! Obrigado! Questions?

ACM/IEEE CAMA2007 Workshop19 Related Studies How this paper relates to the other papers presented here?

ACM/IEEE CAMA2007 Workshop20 Current Work Recommendation techniques  e.g., Top-k most pop, clustering-based similarity, reputation based Are there other structural patterns?  e.g., small-world Application of the interest-sharing graph  BitTorrent communities  Scientific Collaborations

ACM/IEEE CAMA2007 Workshop21 Hoerl Model parameters CiteULike abc Tag Assignments9, Library Size2, Vocabulary Size3, Bibsonomy Tag Assignments28, Library Size6, Vocabulary Size2,

ACM/IEEE CAMA2007 Workshop22 Tagging Activity - assignements

ACM/IEEE CAMA2007 Workshop23 Tagging Activity – vocabulary size

ACM/IEEE CAMA2007 Workshop24 Interest Sharing - Definitions A graph definition: G=(U,E)  U is the set of users and E is the set of edges Interest-Sharing Graph definition  User-Item  User-Tag  Directed-User-Item

ACM/IEEE CAMA2007 Workshop25 Interest Sharing – # nodes

ACM/IEEE CAMA2007 Workshop26 Interest Sharing – Structure

ACM/IEEE CAMA2007 Workshop27 Entropy I is the set of items P(i) is the popularity of item i

ACM/IEEE CAMA2007 Workshop28 Entropy Global Entropy: ~ This is due to the effect of the neighborhood library size.