Date: 2012/4/23 Source: Michael J. Welch. al(WSDM’11) Advisor: Jia-ling, Koh Speaker: Jiun Jia, Chiou Topical semantics of twitter links 1.

Slides:



Advertisements
Similar presentations
Learning to Suggest: A Machine Learning Framework for Ranking Query Suggestions Date: 2013/02/18 Author: Umut Ozertem, Olivier Chapelle, Pinar Donmez,
Advertisements

Date: 2014/05/06 Author: Michael Schuhmacher, Simon Paolo Ponzetto Source: WSDM’14 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Knowledge-based Graph Document.
Learning more about Facebook and Twitter. Introduction  What we’ve covered in the Social Media webinar series so far  Agenda for this call Facebook.
Influence and Passivity in Social Media Daniel M. Romero, Wojciech Galuba, Sitaram Asur, and Bernardo A. Huberman Social Computing Lab, HP Labs.
SOCIAL MEDIA & PHYSICAL ACTIVITY PROMOTION: MAKING THE CONNECTIONS Presented by: Sandra De Freitas
Finding Topic-sensitive Influential Twitterers Presenter 吴伟涛 TwitterRank:
Twitter Glossary. #: People use the hashtag symbol # before a relevant keyword or phrase (no spaces) in their Tweet to categorize those Tweets and help.
How to make the most of your website: It’s one of your best marketing, branding, awareness tools.
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
Twitter The Basics. What is Twitter? Tweets are: 140 characters or less Quick to follow and view updates Used to share links, photos, videos, music,hot.
TWITTER BASICS GATEHOUSE NEWS & INTERACTIVE DIVISION.
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Computing Trust in Social Networks
Social Media Motion: How to Get Started & Keep Going With Facebook, Twitter & More Presented by Eli Lilly and Company Hosted by Rob Robinson McNeely Pigott.
TwitterSearch : A Comparison of Microblog Search and Web Search
Presented by Karen Porter UM School of Business Administration & ImpactOnlineMarketing.com Google + and Twitter for Biz ImpactOnlineMarketing.com.
Tweet, Tweet, Tweet… Tweeting Assignments & Discussions Kara Damm, Technology Integration Specialist.
WARNINGBIRD: A Near Real-time Detection System for Suspicious URLs in Twitter Stream.
Using Social Media to Communicate and Support Your School A Closer Look at Twitter.
On Sparsity and Drift for Effective Real- time Filtering in Microblogs Date : 2014/05/13 Source : CIKM’13 Advisor : Prof. Jia-Ling, Koh Speaker : Yi-Hsuan.
Web Usage Mining with Semantic Analysis Date: 2013/12/18 Author: Laura Hollink, Peter Mika, Roi Blanco Source: WWW’13 Advisor: Jia-Ling Koh Speaker: Pei-Hao.
Digital Communication Report January Facebook Increase of 331 fans. In terms of fans, we had basically the same growth of the previous months. We.
Date: 2012/10/18 Author: Makoto P. Kato, Tetsuya Sakai, Katsumi Tanaka Source: World Wide Web conference (WWW "12) Advisor: Jia-ling, Koh Speaker: Jiun.
12/2014 Heidi Larson HeidiL_edc.  Setting up an account  Twitter vocabulary – With Strategy tips  How to Tweet  Why to Tweet  How to get started.
HOW-TO: Driving Traffic with Twitter Cards & Analytics 9 types of Twitter Cards to install on your site and how to measure ROI for subscription sales.
How do I decide whom to follow on Twitter ? IARank: Ranking Users on Twitter in Near Real-time, Based on their Information Amplification Potential.
Alias Detection Using Social Network Analysis Ralf Holzer, Bradley Malin, Latanya Sweeney LinkKDD 2005 Advisor: Dr. Koh Jia-Ling Reporter: Che-Wei,
Introduction to Text and Web Mining. I. Text Mining is part of our lives.
1 University of Qom Information Retrieval Course Web Search (Link Analysis) Based on:
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
1,000,000,000 This is the number of items shared on Facebook in a single day – photos, comments, thoughts and links. Increasingly, these are being shared.
Influence of Search Engines Christina Pong cs349.
Pete Bohman Adam Kunk. Real-Time Search  Definition: A search mechanism capable of finding information in an online fashion as it is produced. Technology.
Social Media Getting Social in a Digital World. (And, why it matters to your business!)
Date: 2012/3/5 Source: Marcus Fontouraet. al(CIKM’11) Advisor: Jia-ling, Koh Speaker: Jiun Jia, Chiou 1 Efficiently encoding term co-occurrences in inverted.
Feedback Effects between Similarity and Social Influence in Online Communities David Crandall, Dan Cosley, Daniel Huttenlocher, Jon Kleinberg, Siddharth.
Jargon Busters Presented by Katie Munton and Natalie Dawson.
Topical Semantics of Twitter Links MARCH 23, 2011 In-seok An SNU Internet Database Lab. Michael J. Welch, Uri Schonfeld Yahoo! Inc., UCLA Computer Science.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
CS315-Web Search & Data Mining. A Semester in 50 minutes or less The Web History Key technologies and developments Its future Information Retrieval (IR)
LOGO Summarizing Conversations with Clue Words Giuseppe Carenini, Raymond T. Ng, Xiaodong Zhou (WWW ’07) Advisor : Dr. Koh Jia-Ling Speaker : Tu.
Algorithmic Detection of Semantic Similarity WWW 2005.
LOGO 1 Corroborate and Learn Facts from the Web Advisor : Dr. Koh Jia-Ling Speaker : Tu Yi-Lang Date : Shubin Zhao, Jonathan Betz (KDD '07 )
Jiafeng Guo(ICT) Xueqi Cheng(ICT) Hua-Wei Shen(ICT) Gu Xu (MSRA) Speaker: Rui-Rui Li Supervisor: Prof. Ben Kao.
Our simulation is based on Chris Starnes. original work by Reynolds [8] on the simulation of flocks of birds (or ‘Boids‘) in a manner not subject to the.
Finding Experts Using Social Network Analysis 2007 IEEE/WIC/ACM International Conference on Web Intelligence Yupeng Fu, Rongjing Xiang, Yong Wang, Min.
LOGO Identifying Opinion Leaders in the Blogosphere Xiaodan Song, Yun Chi, Koji Hino, Belle L. Tseng CIKM 2007 Advisor : Dr. Koh Jia-Ling Speaker : Tu.
+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January.
HAWAII CLEAN ENERGY INITIATIVE ONLINE PRESENCE Cover goes here.
We.b : The web of short URLs Demetris Antoniades, lasonas Polakis, Gerogios Kontaxis, Elias Athansapoulos, Sotiris loannidis, Evangelos P.Markatos, Thomas.
CONTEXTUAL SEARCH AND NAME DISAMBIGUATION IN USING GRAPHS EINAT MINKOV, WILLIAM W. COHEN, ANDREW Y. NG SIGIR’06 Date: 2008/7/17 Advisor: Dr. Koh,
1 The EigenRumor Algorithm for Ranking Blogs Advisor: Hsin-Hsi Chen Speaker: Sheng-Chung Yen ( 嚴聖筌 )
Dominique Renault. > Groups Groups - A group can be set up by any user and can be set to private. These are generally used by smaller groups of people.
Don’t Follow me : Spam Detection in Twitter January 12, 2011 In-seok An SNU Internet Database Lab. Alex Hai Wang The Pensylvania State University International.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
Fabricio Benevenuto, Gabriel Magno, Tiago Rodrigues, and Virgilio Almeida Universidade Federal de Minas Gerais Belo Horizonte, Brazil ACSAC 2010 Fabricio.
LOGO Comments-Oriented Blog Summarization by Sentence Extraction Meishan Hu, Aixin Sun, Ee-Peng Lim (ACM CIKM’07) Advisor : Dr. Koh Jia-Ling Speaker :
Nadav Eiron, Kevin S.McCurley, JohA.Tomlin IBM Almaden Research Center WWW’04 CSE 450 Web Mining Presented by Zaihan Yang.
1 Discovering Web Communities in the Blogspace Ying Zhou, Joseph Davis (HICSS 2007)
Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha Hamed Haddadi Fabricio Benevenuto Krishna P. Gummadi.
APICS QUICK GUIDE TO TWITTER January 19, 2016 This presentation was created by the APICS Marketing Team for members, chapters, and partners who want to.
1 Link Privacy in Social Networks Aleksandra Korolova, Rajeev Motwani, Shubha U. Nabar CIKM’08 Advisor: Dr. Koh, JiaLing Speaker: Li, HueiJyun Date: 2009/3/30.
GRAPH AND LINK MINING 1. Graphs - Basics 2 Undirected Graphs Undirected Graph: The edges are undirected pairs – they can be traversed in any direction.
Your Company Competitor Report {Insert Company Logo Here}
Discover How Your Business Can Benefit from a Facebook Fanpage
Twitter Overview of today’s presentation: how to tweet
Twitter Terms Tweeps are the people who follow you.
Date: 2012/11/15 Author: Jin Young Kim, Kevyn Collins-Thompson,
Presentation transcript:

Date: 2012/4/23 Source: Michael J. Welch. al(WSDM’11) Advisor: Jia-ling, Koh Speaker: Jiun Jia, Chiou Topical semantics of twitter links 1

Outline  Introduction  Modeling Twitter  Analysis of the graph  Exploring link semantics  Experiment  Conclusion 2

A rich graphical model for Twitter with multiple semantic edges. The relationship between users and topics with respect to two types of edges. 1)Follow link: one user is reading what the other is writing. 2)Retweet link: one user reposts what another user posted. The act of repeating a user’s post carries a stronger indication of topical relevance. 3

User’s dual role on Twitter: ─ content consumer,or reader interested in what other users post. ─ content producer,or writer by publishing new posts. Follow link: one user is reading what the other is writing. ─ A user follows other users ∵ He/She interested in reading the topic(s) they write about. ─ Other users follow him/her ∵ They interested in reading the topic(s) he/she writes about. (may differ from what he/she reads.) 4

5 Recent efforts to leverage this social data to rank users by quality and topical relevance have largely focused on the “follow” relationship. Twitter’s data offers additional implicit relationships between users, however, such as “retweets” and “mentions”. mentions: username” Retweet: username :message”  Newer Style: allows a user to click and generate a “retweet” with a link to the page. Past(old style) retweet

Construct and organize a group of users referred to as a list.  Topical lists generally centered around the discussion of common interests Politics or subjects. → Politics  Classification lists generally formed to group users who share a common trait Celebrities professional athletes → Celebrities or professional athletes 6

7 Full Twitter Graph two two types of entities which could be represented as nodes: users and tweets four four types of relationships between these nodes which would be represented as directional edges: follows publishes user follows tweet publishes

user mentions retweets mentions tweet retweets UserTweet UserFollowPublish TweetMentionRetweet 8

9 Additional Twitter Information There are three important pieces of information that are not captured in this graph representation: Time timestamp information : each post was written as well as when accounts were created. ‚Hyperlinks standard hyperlinks embedded in the posts augmented: third node type ( Web page[URL] ) Difficulty: common use of URL shortening services Ex: TinyURL and bit.ly ƒPost Content textual content of a post can potentially be useful

10 The Simplified Twitter Graph(only include user nodes) The user-user follow links remain as they are from the Full Twitter graph. Add a retweet edge from user user(a) to user(b).

Follow edges celebrities writerreader celebrities 11

Retweet edges 12

13 Posting Frequency the number of posts published vs. the number of users writing that many posts

14 Overall posting behavior of a user Possible connections between the user as a reader and the user as a writer. (1) a user acts primarily as a reader (sink) with little or no posts (2) a user frequently retweets posts of interest but writes little to no original content (3) a user contributes significant new content. number of posts written by the user’s friends number of posts published by the user Size Size: User’s PageRank based on follow edge Shade Shade: originality

15 follow link on Twitter from user a to user b ─ an endorsement of quality or interest. user a, acting as a reader, is interested in user b acting as writer. retweet link ─ User a will retweet the posts of user b if he either is interested in writing about the topic or expects his readers to be interested in this post. ─ connection from user a as a writer to user b as a writer. Reader User a Writer User b Writer User a Writer User b follow retweet

16 follow links -importance or “trustworthiness”. Retweet links-topical importance or writing “interesting” posts. 14 th rank 7 th rank

17

18 Tweetmeme: The top user according to retweet-based PageRank follow links → the quality of a user being popular or well known. retweet links → the quality of being influential or producing newsworthy or topically relevant posts. the rankings appear affected by spam or “marketing” techniques. ddlovato(actress and singer Demi Lovato)

RoF(u):Retweet by Friends the users who u has seen at least one post from via a retweet. Fr(u): The set of users whom user u follows. FoF(u):Friends of Friends The set of users the friends of u follow. 19

20 u 1,u 2, u 3, u 4, u 5, u 6, u 7, u 8, u 9, u 10 uaua u a ‘s friends ubub ubub u 1 u 2 u 3 u 4 u 5 u 6 u 7 u 8 u 9 u u 1 u 2 u 3 u 4 u 5 u 6 u 7 u 8 u 9 u 10 ubub

21 users are more likely to follow people they see retweeted than those who are merely “Friends of Friends”. Next: follow Why follow links are less suited for determining topical relevance.

Starting from a seed set of users who are members of the same topical list. two sets of users: ─ all users who are exactly one follow edge away from any of the seed members (at least one seed member follows them) ─ the users who are exactly one retweet edge away from the seed members (at least one seed member has retweeted one of their posts). Selected a random sample of 25 users from each of these sets and manually assessed them for topical relevance. Experiment for two lists, one focused on “photography” and the other on “design”.  The number of relevant users in the follow-generated samples: 4 and 5  The number of relevant users in the retweet-generated samples: 19 and 20 22

23 Manually collected 9 topical lists from listorious.com, a directory of popular lists on Twitter.listorious.com Selected the 30 highest ranking users for each graph variation. Evaluate the relevance of these top ranked users to the original topic.(the content of their tweets, biography, username, and any external websites listed on profile.) A total of 12 people participated in the survey. Each list was evaluated by at least 2 people. Topics: politics, technology, economic,.……..

24 R k (U): the set of users from U judged relevant in evaluation k of a particular list. U: set of users List 1: 10 List 2: 25 List 3: 15 judged relevant Total user:

25 Precision and Relevance for follow links and retweet links averaged over the 9 different topical lists  Relevant users discovered by retweet links have, on average, fewer followers than those discovered by follows links.  The number of followers a user has is not directly related to their relevance for a particular topic.

26  Twitter’s importance stems not only from its high traffic ranking, but also the amazingly rich structure it provides and realtime information it makes available. PageRank  This paper have demonstrated important distinctions between edge types in the graph, noting that the varying semantics and properties of these edges will have significant implications on graph algorithms such as PageRank.  Shown that retweet edges preserve topical relevance significantly better than follow edges.

27

Given topic t Follower Si Tweet 1 Tweet 2 Tweet 3 Tweet 4 Si’s friends S1 S2 S3 28

Sb’s influence on Sc is two times of that of Sa. 29