Feeds That Matter A Study of Bloglines Subscriptions Akshay Java Pranam Kolari, Tim Finin, Anupam Joshi, Tim Oates.

Slides:



Advertisements
Similar presentations
Web 2.0? Library 2.0? How Libraries Are Using New Web Tools Mary Page March 7, 2007.
Advertisements

Choosing a Topic and Developing Research Questions
February 6, Background: Where We Are The Internet is changing the way Americans obtain news and information 55 million blogs Explosion of social.
Introducing Calais A Thomson Reuters initiative designed to make content interoperable on the Web A free API that anyone can use An easy way to automatically.
What is Web 2.0? Communication, Collaboration & Community.
Enhancing Research Projects with Environmental Informatics and Web Technologies.
Blog Data Analysis S. Muthukrishnan, CS Rutgers & DIMACS Graham Cormode, DIMACS.
Existing tools to analyze Blogosphere. IceRocket Ice Spy – Spy on what others are searching. Blog Trends – Identifies the trend of particular terms in.
Blogging & The Blogosphere Harnessing the Power of the Blog.
THE UNIVERSITY OF HONG KONG WEB BY DANIEL CHURCHILL 2.0.
Analysing Public Science Debates through Blogs and Online News Sources Mike Thelwall Statistical Cybermetrics Research Group University of Wolverhampton,
Del.icio.us Bill G. Kelm IDS 150: Research in the Information Age April 3, 2007.
Google Tools and your Library - the Possibilities are Exponential Google CSE Google CSE Google Scholar Google Scholar Google My Library Google.
RSS F EEDS Greg Vogl, Research and Development Services Colorado State University Libraries December 9, 2008.
Blog searching and Web 2.0 Technologies: New Insights into Customers/Citizens/Voters? Mike Thelwall Statistical Cybermetrics Research Group Web Impact.
By: Wordpress.org Present by: Bora Hong Introduction to Blogging.
Web 2.0: Concepts and Applications 3 Syndicating Content.
Defining Blogs & RSS Feeds. What is a blog?  A web log  Definition by Darlene Fichter….a blog is a “web page containing brief entries arranged chronologically.”
Bloglines: LISD Brown Bag Webinar, February 23, 2010.
CONCRETE SOFTWARE SOLUTIONS PVT. LTD. A leading Digital Marketing Firm In India.
Mining Social Media Communities and Content Akshay Java Ph.D. Dissertation Defense October 16 th 2008.
Using Bloglines Presented by Bonnie Shucha © University of WI Law Library
Tag-based Social Interest Discovery
«Tag-based Social Interest Discovery» Proceedings of the 17th International World Wide Web Conference (WWW2008) Xin Li, Lei Guo, Yihong Zhao Yahoo! Inc.,
Web 2.0: Concepts and Applications 4 Organizing Information.
Overview Research Tools Turning Search into Research Darlene Fichter Data Library Services, U of S Library November 15, 2004.
Social Bookmarks and RSS Web 2.0 Tools for Educators October 7, 2008 Presented by Michael Lackner Loyola Blakefield School.
Adaptive News Access Daniel Billsus Presented by Chirayu Wongchokprasitti.
Web 2.0 Social Bookmarking and Start Pages in the Classroom Sally Todd, St John’s School Library, April 2009.
Modeling the Spread of Influence on the Blogosphere Akshay Java, Pranam Kolari, Tim Finin, and Tim Oates UMBC Tech Report 04/12/06.
PODCASTS IN SCHOOL LIBRARIES Dr. Dana Dukic West Island School Hong Kong CITE Research Symposium 2008.
Integrating Technology for Instruction and Learning Jennifer Verschoor & Evelyn Izquierdo April 3, 2009.
Raising Awareness in Library 2.0 way: The UJ Sciences Librarian Virtual Experience SANLiC Workshop, 28 May 2009.
Do's and don'ts to improve your site's ranking … Presentation by:
Using an RSS Feed Aggregator An Introduction Prepared by Liz Rodrigues.
What is RSS? and Why Should You (teacher, librarian, student) Care?” Jo Ann Ponville EBRPSS Instructional Technology Facilitator.
29-30 October, 2006, Estonia 1 IST4Balt Information analysis using social bookmarking and other tools IST4Balt Information analysis using social bookmarking.
WISER: Gadgets and Widgets Jane Rawson, Vere Harmsworth Library Emma Cragg, Sainsbury Library.
In Search of GREAT Science Web Sites Lisa Bachman.
OT Connections is AOTA’s new online community which allows occupational therapists, occupational therapy assistants and students to connect with each.
Searching the “New” Web: Bloglines Demo ORALL Annual Meeting October 13, 2005 Presented by Bonnie Shucha UW Law Library
Session 2: Using Web 2.0 Technologies to Create Teacher Web Pages.
Bringing DLESE to Your Doorstep Using RSS to distribute content and personalize the DLESE experience DLESE Annual Meeting July 10, 2004 Shelley Olds DLESE.
Let's play “tag”. what is a tag? A tag is a keyword or descriptive term associated with an item as means of classification by means of a folksonomy...
NTU Natural Language Processing Lab. 1 An Analysis of Effectiveness of Tagging in Blogs Christopher H. Brooks and Nancy Montanez University of San Francisco.
Topic Relevance through Current Events and Podcasting Presenter: Ken Baldauf Program in Interdisciplinary Computing - FSU.
Detecting Communities Via Simultaneous Clustering of Graphs and Folksonomies Akshay Java Anupam Joshi Tim Finin University of Maryland, Baltimore County.
Combating Information Overload with RSS Feeds Meghan Sitar Instruction and Outreach Librarian Library Instruction Services University of Texas Libraries.
Web SyndicationFebruary, 2006 Web Syndication: Building A Custom News Page Presented to The Columbus Computer Society February, 2006.
Social Bookmarking with del.icio.us. What is del.icio.us? Social Software Store your bookmarks online Tag your bookmarks Share your bookmarks with others.
Social Computing Social networking, Social software.
+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January.
IBM Lotus Software © 2006 IBM Corporation IBM Lotus Notes Domino Blog Template Steve Castledine.
Blogging. Website and blog A website, also written as web site,or simply site, is a set of related web pages typically served from a single web domain.
Kendra Hunter & Charde Johnson EDUC Dr. M. Kariuki.
Blog Track Open Task: Spam Blog Detection Tim Finin Pranam Kolari, Akshay Java, Tim Finin, Anupam Joshi, Justin.
Social Bookmarking Steve Evans – British Council Madrid Young Learners.
Enhanced hypertext categorization using hyperlinks Soumen Chakrabarti (IBM Almaden) Byron Dom (IBM Almaden) Piotr Indyk (Stanford)
Using RSS Readers in Education: The Google Reader.
CREATE, IMPLEMENT AND ENJOY! Blogs,Wikis & RSS Readers.
RSS: What it is, How to find it How to use it. RSS in Plain English: A CommonCraft Video find more great videos on technology at
Theresa Gabor, CCCOE Web 2.0 What You Need to Know.
Modeling Influence Opinions and Structure in Social Media
Feeds That Matter A study of Bloglines subscriptions
Generative Model To Construct Blog and Post Networks In Blogosphere
Practical RSS for Teaching and Learning
Web 2.0 Creating Content.
The likelihood of linking to a popular website is higher
Social Bookmarking Tools
4/17/2019 Blogs 4/17/2019.
Presentation transcript:

Feeds That Matter A Study of Bloglines Subscriptions Akshay Java Pranam Kolari, Tim Finin, Anupam Joshi, Tim Oates

Outline Background and Motivation Bloglines General Statistics Grouping Related Topics Applications Conclusion

Bloglines Feed Reader Folders Use folder label as approximation for topic. Group similar folders together Rank Feeds under a “topic”

Motivation Study user generated tags in feed reader subscriptions Find relevant blogs about a topic Needed labeled, training data for building text classifiers for different topics Tag Cloud generated by using folder names and merging related folders

Outline Background and Motivation Bloglines General Statistics Grouping Related Topics Applications Conclusion

Bloglines General Statistics 83K publicly listed subscribers 2.8M feeds, 500K are unique 26K users (35%) use folders to organize subscriptions Data collected in May 2006 Although there may be ~ 50M+ Blogs, only a small fraction get continued user attention in the form of subscriptions Users subscribe to Web 2.0 content such as flickr, delicious, technorati and google searches

Bloglines General Statistics Feed Subscriptions follow a power law distribution

Bloglines General Statistics Most users subscribe to modest number of feeds Most users have only a few folders User attention is limited

Bloglines General Statistics As subscriptions increase, users tend to organize them into folders.

Outline Background and Motivation Bloglines General Statistics Grouping Related Topics Applications Conclusion

Bloglines General Statistics A folksonomy emerges from the folder names. Many users use popular folder names to classify feeds. technologica Musica Weather , Mailing List, Tracking Foreign Language

Tag Cloud Before Merge

Tag Cloud After Merge Folder names are used as topics. Lower ranked folder are merged into a higher ranked folder if there is an overlap and a high cosine similarity.

Merging Tags Interesting Cases: Music vs. Musica : English and Spanish Music sites Podcasting vs. Podcasts: One refers to the tools for podcasting while the other feeds containing podcasts Regional Interests: China, Japan, India, etc. Foreign Language: Spanish, German

Feeds That Matter Top Feeds for “Politics” Merged folders: “political”, “political blogs” Talking Points Memo: by Joshua Micah MarshallTalking Points Memo: by Joshua Micah Marshall Daily Kos: State of the Nation Eschaton The Washington Monthly Wonkette, Politics for People with Dirty Minds Informed Comment Power Line AMERICAblog: Because a great nation deserves the truthAMERICAblog: Because a great nation deserves the truth Crooks and Liars Top Feeds for “Knitting” Merged folders “knitting blogs” Yarn HarlotknittingYarn Harlotknitting Wendy Knits! See Eunny Knit! the blue blog Grumperina goes to local yarn shops and Home DepotGrumperina goes to local yarn shops and Home Depot You Knit What?? Mason-Dixon Knitting knit and tonic Crazy Aunt Purl

Most Subscribed Feeds, Top Folders 1.Bloglines 2.Wired 3.Slashdot 4.BloingBoing 5.Dilbert 6.Gizmodo 7.Engadget 8.Official Google Blog 9.Alist Apart 10.News: CNN, Reuters, Moreover 1.News 2.Blogs 3.Tech 4.Comics 5.Politics 6.Podcasts 7.Design 8.Sports 9.Science 10. Business Top FeedsTop Folders

Tag Merging Folder names are used as topics. Lower ranked folder are merged into a higher ranked folder if there is an overlap and a high cosine similarity.

Outline Background and Motivation Bloglines General Statistics Grouping Related Topics Applications Conclusion

FTM! Site Explore Popular Topics Subscribe To Interesting Feeds If you like X you will like…

Feed Recommender (Method 1) Two feeds are similar if they are categorized under similar folders Politics Business Technology knitting

Feed Recommendation (Method 2) Start with a seed set from FTM! Using, graph from WWE dataset, find nodes influenced by the seed set Find other blogs frequently co-cited by the followers Blogs influenced by seed set

Feed Recommendation Using Co-citation PoliticsKnitting

Outline Background and Motivation Bloglines General Statistics Grouping Related Topics Applications Conclusion

Conclusions Folder labels can be used to produce an intuitive set of topics for feeds or blogs Subscription information combined with simple techniques can be quite effective in ranking blogs for a topic. Many useful applications such as feed recommendation and meme trackers can benefit from this data.

Thanks! “Want to find a few good feeds? Try Feeds That Matter, an interesting grouping of publicly listed feeds at Bloglines’’ delicious user skyamese skyamese Easy way to find good blogs - delicious user kc144 Provides a "swarm" with keywords on subjects which will take you to a list of blogs/sites relating to that keyword. All are rss feeds delicious user damenjoe.damenjoe kind of a meta blog delicious user frontporschefrontporsche Find how to classify your feeds and find new feeds based on tags - delicious user inf Links to loads of good RSS feeds. hmspolio …it's a great example of a technique for extracting usefulmetadata from the world - JD on EP blog …find information and resources that have already been filtered by like minded people – Tryangulation blog It brings you popular feeds from Bloglines in different categories and I found almost all the popular feeds in appropriate categories out there. Worth paying a visit – netgautam blogger Nothing better to read online? Feeds that matters gives you loads of highly rated feeds in all category ….great source for some quality content for a blog or just for browsing. Blendedblog blogger University Study Reveals Rich Data on Bloglines Feeds Feeds That MatterFeeds That Matter is a fascinating new analysis project out of UMBC and a terrific way to find new RSS feeds to subscribe to.. - Steve Rubel, Micropersuasion blog 600+ bookmarks on delicious & more…

Backup

Feed Recommendation (Method 2) Starting with a seed set from FTM! Find other influential feeds from Blogpulse data, using co- citations. Blogs influenced by seed set