Lecture 28 Collaborative filtering & tagging networks Slides are modified from Lada Adamic and Mustafa Kilavuz.

Slides:



Advertisements
Similar presentations
Our Digital World Second Edition
Advertisements

ICT iGCSE.  You should be able to describe the use of Internet developments such as:  Web 2.0  blogs  wikis  digital media uploading websites  social.
Creating Collaborative Partnerships
HT06, Position Paper, Tagging, Taxonomy, Flickr, Academic Article, ToRead, Presentation Cameron Marlow, Mor Naaman, danah boyd, Marc Davis Yahoo! Research.
Power Laws: Rich-Get-Richer Phenomena
Social Media Networking Sites Charlotte Jenkins Designing the Social Web
Flickr Tags Network Mustafa Kilavuz. Tags A tag is a keyword Search, spam detection, reputation systems, personal organization and metadata.
Tagging Systems Austin Wester. Tags A keywords linked to a resource (image, video, web page, blog, etc) by users without using a controlled vocabulary.
Tagging Systems Mustafa Kilavuz. Tags A tag is a keyword added to an internet resource (web page, image, video) by users without relying on a controlled.
2 4. But first  A bit more from Tuesday about Privacy Social Media Marketing, 2e© 2-2.
A Brief Overview of Social Media UW PRSSA May 21, 2009 Jessica Randazza.
TC2-Computer Literacy Mr. Sencer February 4, 2010.
| Alper Ortac | Computer Science Department | Ubiquitous Knowledge Processing Lab | © Prof. Dr. Iryna Gurevych | 1 Knowledge Management in Web.
Social Bookmarking & Research What Delicious can do for you.
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Collaborative Filtering Shaun Kaasten CPSC CSCW.
Recommender Systems; Social Information Filtering.
Recommender systems Ram Akella November 26 th 2008.
OPAL Conference, August Social Tagging, Folksonomies & Controlled Vocabularies Inviting New Access Systems to our Academic Table Margaret Maurer.
Online communities 1 Theory revision Complete some of the activities in this powerpoint and use the revision book to answer questions.
How to Analyse Social Network? : Part 2 Power Laws and Rich-Get-Richer Phenomena Thank you for all referred contexts and figures.
 2008 Pearson Education, Inc. All rights reserved What Is Web 2.0?  Web 1.0 focused on a relatively small number of companies and advertisers.
Electronic communication and social networks. Changes in elite college tuition policy Harvard, Yale Stanford, Brown, Dartmouth Example: At Stanford: No.
The Social Web: A laboratory for studying s ocial networks, tagging and beyond Kristina Lerman USC Information Sciences Institute.
ICT iGCSE.  You should be able to describe the use of Internet developments such as:  Web 2.0  blogs  wikis  digital media uploading websites  social.
Yahoo! For Teachers By Teachers, For Teachers. In July 2006 Yahoo! invited a group of educators to partner with them to build technology that addressed.
You can customize your privacy settings. The privacy page gives you control over who can view your content. At most only your friends, their friends and.
Tag-based Social Interest Discovery
© 2011 IBM Corporation Getting social in IBM. © 2011 IBM Corporation Are we having a conversation?  Today, business happens in real-time. Market shifts.
Golder and Huberman, 2006 Journal of Information Science Usage Patterns of Collaborative Tagging System.
Web 2.0: Concepts and Applications 4 Organizing Information.
 Type of site: Social annotations, highlighting and social bookmarking  Launched: July 24 th, 2006  Institutional video:
AVI/Psych 358/IE 340: Human Factors Web 2.0 November
HENRY FORD “If I’d asked my customers what they wanted, they would’ve said a faster horse.”
What is Social Media? And how best to use it.
H OW T O G O S OCIAL In the Timeshare Industry. W HAT IS S OCIAL MEDIA ? “Social media describes the online technologies and practices that people use.
1 Usage patterns of collaborative tagging systems Scott A. Golder and Bernardo A. Huberman Information Dynamics Lab, HP Labs, Palo Alto, CA, USA Journal.
Integrating Technology for Instruction and Learning Jennifer Verschoor & Evelyn Izquierdo April 3, 2009.
Blogging Transforming Writing for the 21 st Century.
James Williams e: eTutor Project SUMMARY OF KEY FINDINGS for 2 Pilot studies of the.
Understanding and Using Social Media. Attention Overload.
SOCIAL BOOKMARKING How to Use del.icio.us to Save, Recall and Share Links Jo-Anne Gibson June, 2007.
29-30 October, 2006, Estonia 1 IST4Balt Information analysis using social bookmarking and other tools IST4Balt Information analysis using social bookmarking.
WISER: Gadgets and Widgets Jane Rawson, Vere Harmsworth Library Emma Cragg, Sainsbury Library.
School of Information University of Michigan Unless otherwise noted, the content of this course material is licensed under a Creative Commons Attribution.
History Day Interpretive Web Site Category Notes on Web Site Category  Fourth year as a full NHD category.  Individual and Group entries are now.
Let's play “tag”. what is a tag? A tag is a keyword or descriptive term associated with an item as means of classification by means of a folksonomy...
Individualized Knowledge Access David Karger Lynn Andrea Stein Mark Ackerman Ralph Swick.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Blogs and Wikis Tim Bornholtz. Purpose Many new technologies are available on the internet that enable people to publish and edit content without expensive.
Social Bookmarking with del.icio.us. What is del.icio.us? Social Software Store your bookmarks online Tag your bookmarks Share your bookmarks with others.
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Phone Apps and Their Uses By: Deividas Vavilovas.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
By Andrew McDaniel. Bloom’s Revised Taxonomy Remembering Understanding Applying Analyzing Evaluating Creating.
Web Information Retrieval Prof. Alessandro Agostini 1 Context in Web Search Steve Lawrence Speaker: Antonella Delmestri IEEE Data Engineering Bulletin.
SEO for Google in Hello I'm Dave Taylor from Webmedia.
User Modeling and Recommender Systems: Introduction to recommender systems Adolfo Ruiz Calleja 06/09/2014.
Michelle Halvorsen Texas School for the Deaf, Austin NABT Conference Friday November 5, 2010.
15 Sep 2015 EunJeong Cheon i501: introduction to informatics Semiotic Dynamics and Collaborative Tagging Ciro Cattuto, Vittorio Loreto, and Luciano Pietronero.
Library Web Site 2.0 Easy Tools For Creating Interactive Sites Miranda Doyle Library Media Teacher Martin Luther King Middle School San Francisco Unified.
Social Media & Social Networking 101 Canadian Society of Safety Engineering (CSSE)
Chapter 8: Web Analytics, Web Mining, and Social Analytics
‘Net’-working for Your Own Professional Development Helen Pownall University of Manchester 15 July 2010.
 GEETHA P.  Originally coined by Tim O’Reilly Publishing Media  Second generation of services available on www.  Lets people collaborate and share.
Reviewing websites DAVE MCDONALD UNIT 13 WEBSITE DEVELOPMENT.
CLAN SOFT LEARNING VIRTUAL ENVIRONMENT September 19, 2008 Kaunas 2nd Meeting.
Create a blog Skills: create, modify and post to a blog
Delicious Social Bookmarking
Business in a Connected World
Presentation transcript:

Lecture 28 Collaborative filtering & tagging networks Slides are modified from Lada Adamic and Mustafa Kilavuz

outline motivation for collaborative filtering the Long Tail of content popularity unprecedented amount of user-generated content tagging as a tripartite network/hypergraph evolution of the tagging network pitfalls of collaborative tagging

Chris Anderson, ‘The Long Tail’, Wired, Issue October 2004Issue Need a way to discover items that match our interests & tastes among tens or hundreds of thousands The Long Tail The internet enables the distribution of niche items

That is you (plural) not you (singular)! Collaborative content tagging, and filtering is allowing the little guys (like you and me) to find audience for and discover new content Source:

when people search alone… | query | count| | how to tie a tie 92 | how to 58 | how to write a resume 47 | how to lose weight 23 | how to build a deck 23 | how to get pregnant 21 | how to write a bibliography 20 | how to gain weight 19 | how to kiss 18 | how to get a Passport 17 | how to write a cover letter 17 | How to lose a guy in 10 days 17 | how to draw 14 | how to pass a drug test 14 | how to knit 13 | how to write a book 13 | how to ask for a raise 13 | how to play guitar 13 | how to save money 13 | how to play poker 12 | how to get rid of ants 12 | how to start a business 11 | how to make money 11 | how to draw anime 11 | how to draw manga 11 | how to pray the rosary 10

Example: Yahoo music recommends similar songs/artists Instant message what you’re playing to friends service suggests ‘influencers’ who match your taste you can choose your own influencers… By rating and listening to music you let Y! Music know your tastes Y! Music customizes suggestions/radio station to match your taste Demo this interactive graph at: Source: M. R. David Gleich, Matt Rasmussen, Leonid Zhukov and K. Lang. The World of Music: SDP layout of high dimensional data. In Info Vis, 2005.

find others like you based on your writing/download history Recommendations: user centric view Source: Lada Adamic

Mapping knowledge communities from download patterns users across the organization share interests based on the documents they access each node is a user accessing the system links identify users looking at the same documents color identifies position in the organization

Tags A tag is a keyword added to a resource (web page, image, video) by users without relying on a controlled vocabulary Helps to improve search, spam detection, reputation systems, personal organization and metadata

Social tagging a method of explicit social search Social bookmarking Personal bookmarks Allows users to store and retrieve resources More than just like or dislike, download or not categorize & comment Social tagging systems Shared tags for particular resources Each tag is a link to additional resources tagged the same way by other users Folksonomy: popular tags users collectively label items which can then be retrieved by others

A model

Examples of Tagging Systems Flickr: A photo sharing system allowing users to store and tag their personal photos, as well as maintain a network of contacts and tag others photos. Del.icio.us: A “social bookmarking site,” allowing users to save and tag web pages and resources. digg: “With digg, users submit stories for review, but rather than allow an editor to decide which stories go on the homepage, the users do.” CiteULike: A site allowing users to tag citations and references, e.g. academic papers or books. Youtube: A video sharing system allowing users to upload video content and describe it with tags. ESP Game: An internet game of tagging where users are randomly paired with each other, and try to guess tags the other would use when presented with a random photo. Last.fm: A music information database allowing members to tag artists, albums, and songs

Source: digg,

Social tagging - Flickr Image search much more difficult than textual search solution: tagging One person’s nose is another person’s cat or Katze Source: polandeze, flickr;

tripartite/hypergraph tagging graphs page person tag Can project onto bipartite graphs person – tag tag – page person – page Can project onto one-mode graphs person – person tag – tag person - page

Vocabulary Problem Different users use different terms to describe the same things Different languages Polysemy: A single word has multiple meanings Synonymy: Different words have the same meaning Abstraction: Tagging a resource in different levels of abstraction Animal, cat, Persian cat, Felis silvestris catus, longhair Persian Missing context: Tags that could not be related with the images by others Holiday, me, friends, a person’s name

System Design and Attributes Tagging rights: A tag can be added or removed by the creator of the resource, a restricted group or everyone Tagging support: The mechanism of a tag entry Blind tagging: a tagging user cannot see tags added by others to the same resource Viewable tagging: all tags are visible Suggestive tagging: the system suggests the user possible tags Aggregation: Systems allow duplicate tagging (bag-model) or prevent (set- model) Type of object: web pages, images, videos, songs Source of Material: Resources can be supplied by the system or the users, or anything on the web can be tagged Resource connectivity: links, groups etc. connecting resources other than tags Social connectivity: The connection between the users may result localized folksonomies.

User Incentives Future retrieval: To mark items for personal retrieval of either the individual resource or a collection (playlists) Contribution and sharing: To add to conceptual clusters for the value of either known or unknown audiences Attract attention: to attract other users to look at their resources (common tags, spam tags) Play and competition: to produce tags based on an internal or external set of rules Self presentation: to write a user’s own identity lo leave a mark Opinion expression: to convey value judgments that they wish to share with others

Modeling the growth of tagging networks users become aware of popular items and tag them users copy others’ tags users tend to use their own tags…

All the little side effects of living digitally Find out the coolest/newest things from what people are blogging, tagging, ing, searching what’s this? what is going on in the German blogophere? Source: Most ed – The New York Times, Source: Technorati,

Brrreeeport: how long does it take for news to get around? Source: M Freitas

tag purpose – which ones are useful for social search? 1. Identifying What (or Who) it is About. identify topics. include common nouns, proper nouns (people or organizations). 2. Identifying What it Is. e.g. article, blog and book. 3. Identifying Who Owns It. e.g. a blogger 4. Refining Categories. e.g. numbers, especially round numbers (e.g. 25, 100) 5. Identifying Qualities or Characteristics. Adjectives expressing opinion such as scary, funny, stupid… 6. Self Reference. Tags beginning with “my,” like mystuff and mycomments 7. Task Organizing. grouping information together by task. Examples include toread, jobsearch.

del.icio.us (study by Golder and Huberman) some users use mostly same-old tags for everything, others create new ones at a fast rate Source: Golder, S. and Huberman, B. A. (2006) Usage patterns of collaborative tagging systems. Journal of Information Science, 32(2):

tag proportions – different tags for different people? tags’ relative proportions stable after some number of users have tagged the same URL Source: Golder, S. and Huberman, B. A. (2006) Usage patterns of collaborative tagging systems. Journal of Information Science, 32(2):

simple model of user behavior Polya’s urn (contagion model) draw a ball, note it’s color replace the ball, and place another ball of the same color in the urn del.icio.us suggests tags used by others in order of popularity Source: del.icio.us,

tagging activity Catutto et al. PNAS 2006 Source: Semiotic dynamics and collaborative tagging; Ciro Cattuto, Vittorio Loreto, and Luciano Pietronero

time evolution blog ajax Source: Semiotic dynamics and collaborative tagging; Ciro Cattuto, Vittorio Loreto, and Luciano Pietronero

tag popularity Source: Semiotic dynamics and collaborative tagging; Ciro Cattuto, Vittorio Loreto, and Luciano Pietronero

will the same tag be used? as more time elapses, probability decays Source: Semiotic dynamics and collaborative tagging; Ciro Cattuto, Vittorio Loreto, and Luciano Pietronero

tagging process Yule process: with probability p, choose new tag, with probability 1-p copy an existing tag but weigh by how long ago the tag was used… Source: Semiotic dynamics and collaborative tagging; Ciro Cattuto, Vittorio Loreto, and Luciano Pietronero

If collaborative filtering is so great, why do mediocre things sometimes become big hits, and true gems sometimes fall by the wayside? Information cascades herding behavior individual signal (knowledge, opinion) group signal (what others are saying) group can overpower individual signal things can become big hits, depending on what the word-of-mouth is I don’t think it’s all that cool, but everyone else thinks so, so it must be Those other two guys can’t be wrong I kind of liked it, I’ll give it a thumbs up I fell asleep half way through, I’ll say I liked it so people don’t find out

Source: Music Lab,

Social influence study published in Science Experimental Study of Inequality and Unpredictability in an Artificial Cultural Market Matthew J. Salganik, Peter Sheridan Dodds, Duncan J. Watts Science, Feb. 10 th, 2006 Web experiment set up site with free music downloads 14,000 participants (recruited through a teen-interest site) profile information (age, gender, music influence, knowledge)

How many people have chosen to download this song? Source: Music Lab,

Experimental setup Subjects were randomly assigned to different groups 1 ‘independent’ group: no information about downloads by others 8 ‘social influence groups’ see how many downloads were made by people in your own group participants are unaware of the existence of groups, just of ‘others’ Creates 8 different ‘worlds’ where the success or failure of a song evolves independently

Findings about social influence Best songs rarely did poorly Worst songs rarely did well Anything else was possible! The greater the social influence, the more unequal and unpredictable the collective outcomes become. Experiment 2: songs shown in order of download popularity Experiment 1: songs shown in random order In both experiments variance in song success higher in the social influence case

summary tagging networks are tripartite tagging is a process of invention and imitation imitation can skew popularity results