Ming Yan, Jitao Sang, Tao Mei, ChangSheng Xu

Slides:



Advertisements
Similar presentations
Mining User Similarity Based on Location History Yu Zheng, Quannan Li, Xing Xie Microsoft Research Asia.
Advertisements

TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets Chun Chen 1, Feng Li 2, Beng Chin Ooi 2, and Sai Wu 2 1 Zhejiang University, 2 National.
Finding Topic-sensitive Influential Twitterers Presenter 吴伟涛 TwitterRank:
Dong Liu Xian-Sheng Hua Linjun Yang Meng Weng Hong-Jian Zhang.
Comparing Twitter Summarization Algorithms for Multiple Post Summaries David Inouye and Jugal K. Kalita SocialCom May 10 Hyewon Lim.
We Know #Tag: Does the Dual Role Affect Hashtag Adoption? Lei Yang 1, Tao Sun 2, Ming Zhang 2, Qiaozhu Mei 1 1 School of Information, the University.
Cross-linking Folksonomies Harith Alani. Multiple SNS Accounts del.icio.us.
Personalizing Web Page Recommendation via Collaborative Filtering and Topic-Aware Markov Model Qingyan Yang, Ju Fan, Jianyong Wang, Lizhu Zhou Database.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
1 1 Chenhao Tan, 1 Jie Tang, 2 Jimeng Sun, 3 Quan Lin, 4 Fengjiao Wang 1 Department of Computer Science and Technology, Tsinghua University, China 2 IBM.
Neighborhood Formation and Anomaly Detection in Bipartite Graphs Jimeng Sun Huiming Qu Deepayan Chakrabarti Christos Faloutsos Speaker: Jimeng Sun.
Context-Aware Query Classification Huanhuan Cao 1, Derek Hao Hu 2, Dou Shen 3, Daxin Jiang 4, Jian-Tao Sun 4, Enhong Chen 1 and Qiang Yang 2 1 University.
The Social Web: A laboratory for studying s ocial networks, tagging and beyond Kristina Lerman USC Information Sciences Institute.
Nonnegative Shared Subspace Learning and Its Application to Social Media Retrieval Presenter: Andy Lim.
Mining Cross-network Association for YouTube Video Promotion Ming Yan Institute of Automation, C hinese Academy of Sciences May 15, 2014.
1 1 Chenhao Tan, 1 Jie Tang, 2 Jimeng Sun, 3 Quan Lin, 4 Fengjiao Wang 1 Department of Computer Science and Technology, Tsinghua University, China 2 IBM.
Mao Ye, Peifeng Yin, Wang-Chien Lee, Dik-Lun Lee Pennsylvania State Univ. and HKUST SIGIR 11.
School of Electronics Engineering and Computer Science Peking University Beijing, P.R. China Ziqi Wang, Yuwei Tan, Ming Zhang.
Right Buddy Makes the Difference: an Early Exploration of Social Relation Analysis in Multimedia Applications Jitao Sang, Changsheng Xu*. 1 Institute of.
Growing a Tree in the Forest: Constructing Folksonomies by Integrating Structured Metadata Anon Plangprasopchok 1, Kristina Lerman 1, Lise Getoor 2 1 USC.
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
FaceBook and Your Business Women in Technology in Nigeria Presented by Mrs M.O Alade Women in Technology in Nigeria
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
FaceTrust: Assessing the Credibility of Online Personas via Social Networks Michael Sirivianos, Kyungbaek Kim and Xiaowei Yang in collaboration with J.W.
Our Twitter Profiles, Our Selves: Predicting Personality with Twitter Daniele Quercia, Michal Kosinski, David Stillwell, Jon Crowcroft COMP4332 Wong Po.
1 Formal Models for Expert Finding on DBLP Bibliography Data Presented by: Hongbo Deng Co-worked with: Irwin King and Michael R. Lyu Department of Computer.
Understanding Cross-site Linking in Online Social Networks Yang Chen 1, Chenfan Zhuang 2, Qiang Cao 1, Pan Hui 3 1 Duke University 2 Tsinghua University.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Evaluation of novelty metrics for sentence-level novelty mining Presenter : Lin, Shu-Han Authors : Flora.
No Title, yet Hyunwoo Kim SNU IDB Lab. September 11, 2008.
RecSys 2011 Review Qi Zhao Outline Overview Sessions – Algorithms – Recommenders and the Social Web – Multi-dimensional Recommendation, Context-
User Profiling based on Folksonomy Information in Web 2.0 for Personalized Recommender Systems Huizhi (Elly) Liang Supervisors: Yue Xu, Yuefeng Li, Richi.
Mining Cross-network Association for YouTube Video Promotion Ming Yan, Jitao Sang, Changsheng Xu*. 1 Institute of Automation, Chinese Academy of Sciences,
Tag Ranking Present by Jie Xiao Dept. of Computer Science Univ. of Texas at San Antonio.
Recommending Twitter Users to Follow Using Content and Collaborative Filtering Approaches John HannonJohn Hannon, Mike Bennett, Barry SmythBarry Smyth.
Using a Model of Social Dynamics to Predict Popularity of News Kristina Lerman Tad Hogg USC Information Sciences Institute HP Labs WWW 2010.
Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.
By Gianluca Stringhini, Christopher Kruegel and Giovanni Vigna Presented By Awrad Mohammed Ali 1.
Computer Science Department, Peking University
A Structured Approach to Query Recommendation With Social Annotation Data 童薇 2010/12/3.
Jiafeng Guo(ICT) Xueqi Cheng(ICT) Hua-Wei Shen(ICT) Gu Xu (MSRA) Speaker: Rui-Rui Li Supervisor: Prof. Ben Kao.
Institute of Computing Technology, Chinese Academy of Sciences 1 A Unified Framework of Recommending Diverse and Relevant Queries Speaker: Xiaofei Zhu.
Using Social Network Analysis Methods for the Prediction of Faulty Components Gholamreza Safi.
Jing Ye 1,2, Yu Hu 1, and Xiaowei Li 1 1 Key Laboratory of Computer System and Architecture Institute of Computing Technology Chinese Academy of Sciences.
+ User-induced Links in Collaborative Tagging Systems Ching-man Au Yeung, Nicholas Gibbins, Nigel Shadbolt CIKM’09 Speaker: Nonhlanhla Shongwe 18 January.
Intelligent DataBase System Lab, NCKU, Taiwan Josh Jia-Ching Ying, Eric Hsueh-Chan Lu, Wen-Ning Kuo and Vincent S. Tseng Institute of Computer Science.
Background Methods Use of free to use social media to disseminate information in healthcare has increased, but evidence of the effect of this effort is.
Hongbo Deng, Michael R. Lyu and Irwin King
More Than Relevance: High Utility Query Recommendation By Mining Users' Search Behaviors Xiaofei Zhu, Jiafeng Guo, Xueqi Cheng, Yanyan Lan Institute of.
11 A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 1, Michael R. Lyu 1, Irwin King 1,2 1 The Chinese.
Unsupervised Streaming Feature Selection in Social Media
Don’t Follow me : Spam Detection in Twitter January 12, 2011 In-seok An SNU Internet Database Lab. Alex Hai Wang The Pensylvania State University International.
Evaluating Event Credibility on Twitter Presented by Yanan Xie College of Computer Science, Zhejiang University 2012.
Mining Social Ties Beyond Homophily Hongwei Liang * Ke Wang * Feida Zhu # * Simon Fraser University, Canada # Singapore Management University, Singapore.
He Xiangnan (PhD student) 11/2/2012 Research Updates.
Link Prediction Class Data Mining Technology for Business and Society
Exploring Social Tagging Graph for Web Object Classification
Neighborhood - based Tag Prediction
Reducing controversy by connecting opposing views
Link Prediction Seminar Social Media Mining University UC3M
Collective Network Linkage across Heterogeneous Social Platforms
Dieudo Mulamba November 2017
Social Media Account Management Services
Location Recommendation — for Out-of-Town Users in Location-Based Social Network Yina Meng.
KDD Reviews 周天烁 2018年5月9日.
Jinhong Jung, Woojung Jin, Lee Sael, U Kang, ICDM ‘16
Zhengyu Deng, Jitao Sang, Changsheng Xu
Personalized Celebrity Video Search Based on Cross-space Mining
GANG: Detecting Fraudulent Users in OSNs
A Classification-based Approach to Question Routing in Community Question Answering Tom Chao Zhou 22, Feb, 2010 Department of Computer.
--WWW 2010, Hongji Bao, Edward Y. Chang
Presentation transcript:

Ming Yan, Jitao Sang, Tao Mei, ChangSheng Xu FriendTransfer: Cold-start Friend Recommendation with Cross-platform Transfer Learning of Social Knowledge Ming Yan, Jitao Sang, Tao Mei, ChangSheng Xu Good morning, everyone! I’m Ming Yan, from Chinese Academy of Sciences. Today, I will talk about a novel cold-start friend recommendation application based on cross-platform collaboration. 1 Institute of Automation, Chinese Academy of Sciences 2 Microsoft Research Asia, Beijing, P.R. China Jun 30, 2013

Outline Motivation Data Analysis Application Conclusion First I will introduce the background and our motivation. Then an in-depth data analysis on cross-platform user relation and behavior will be provided. Based on the data analysis, we further design a cold-start friend recommendation application. Finally comes the conclusion.

Outline Motivation Data Analysis Application Conclusion

possibility to perform user-centric cross-platform data analysis burst of social media sites Aggregation tools/websites 5.6 social media accounts per user Visit about 3 different social media sites per day about.me FriendFeed Google+ Nowadays, social media is very popular. Various and massive social media sites spring up like mushrooms. This in one way leads to that more and more users are engaged in multiple social media sites simultaneously. (read script on slides). In another way, some aggregation tools have been developed to aggregate user accounts in different social media sites such as about.me, FriendFeed and Google+. These both provide huge possibility to perform user-centric cross-platform data analysis. possibility to perform user-centric cross-platform data analysis

FriendTransfer Motivation: how can we exploit rich social information of users in an auxiliary platform to help another platform perform the cold-start friend recommendation task. auxiliary platform group image join upload tag tag cloud tag target platform Now let’s think about a scenario like this: Image you are an experienced user in flickr, where a friend network is carefully maintained and social activities are updated everyday (join interesting groups, upload images and tag the images). When Twitter becomes popular, it will be difficult if you desire to try this new service, since you are totally new in this platform (you don’t know who and where your friends are) and the new platform knows nothing about your interests for intelligent recommendation. Therefore, our motivation is how can we exploit rich social information of users in an auxiliary platform to help another platform perform the cold-start friend recommendation task. (with user accounts association) ? ? tweet post post tweet

Related Work Friend suggestion based on a single platform Topology / Structure (David Liben-Nowell et al. CIKM’03) Homophily (Jinfeng Zhuang et al. MM’11) Cross-platform Analysis Cross-platform collaboration (F. Abel et al. UMUAI’11, Suman D. Roy et al. MM’12) Multi-platform difference analysis (Kristina Lerman et al. ICWSM’10) Friend suggestion will fail for completely cold-start scenario. Not on user level for most cross-platform analysis. Now let’s first review some related work, mainly two parts: The first part is related with the friend suggestion methods, where topology or structure and homophily are always utilized. The second part is cross-platform analysis, where cross-platform collaboration and multi-platform difference analysis have been widely studied. For the previous work, friend suggestion will fail for completely cold-start scenario, social topology and user-centric are not well explored for most cross-platform analysis. Topology: Based on the number of common friends Homophily: suggest users whose profiles, interests, or updates have substantial overlap with the receiver of the recommendation

Our Work Two Parts Flowchart Cross-platform Data Analysis Cold-start Friend Recommendation Flowchart Relation& Behavior Observation Results Our work has analyzed what social relation and behavior can better transfer user preferences between different platforms (that is what information can better transfer from one platform to another), and how to combine them for improved applications, therefore we have mainly two parts of work: One: data analysis work and the other one: friend recommendation application with cross-platform transfer. This is the overall flowchart of our main work (data collection, data analysis and friend recommendation application based on the data analysis) and we will describe every part one by one in detail later. Data Collection Data Analysis Application

Outline Motivation Data Analysis Application Conclusion

Data Collection Flickr 40K users 1,457 users crawl filter Twitter Image set Interested groups Tag cloud person bird tree sky beach Contact list Flickr 40K users 1,457 users Friend list crawl filter To obtain a collection of users who have both accounts in Flickr and Twitter, we started from google+ website and collected about 40k users in total. Then we kept only those who have both flickr and twitter accounts and filtered the isolated users, which results in the final 1,457 user dataset. Meanwhile, the users’ rich social context information are also downloaded from their Flickr and Twitter accounts respectively (such as image set, interested groups and contact list). Most of the subsequent data analysis work is done in this dataset. Tweets Twitter Jack: I’m a fan both in Flickr and Twitter All User Accounts 40K Users With Both Accounts 3,003 Both Accounts + Not isolated 1,457 Follower list

Data Analysis Three Questions 1 Relation can transfer? 2 What type relation for easier transfer? Now let’s focus on our data analysis work. The following three questions can be the main line for our data analysis work 1. Do the user relations can transfer across different platforms? 2. What type of social relation is easier to transfer across different platforms? 3. Do the social behavior and relation can collaborate in some way? 3 Behavior and relation can collaborate?

Data Analysis Three Questions 1 Relation can transfer? Flickr Twitter Now let’s answer the questions one by one. For the first question, We analyze to what extent can relations on one platform transfer to another platform. Twitter

Social relation Analysis A global analysis High transfer ratio and more closely connected in Flickr Social Platform Bidirectional follow ratio (in friends) Cross follow ratio (in friends) Avg(Contact Num) Flickr 0.5843 0.55 111.93 Twitter 0.4503 0.315 1273.84 We made a global analysis on the user relation situation in Flickr and Twitter respectively. Because Flickr and Twitter are both directional social network, we define 4 types of social relations between users: No relation (we don’t follow each other), Unidirectional relation (I follow you, but you don’t follow back on me), bidirectional relation (I follow you, and you also follow back on me), and also a cross-platform user relation which we call cross follow relation (it means I follow you both in Flickr and in Twitter) The table shows the results, we can see that Twitter has a much larger social circle than Flickr while Flickr is more closely connected and has a higher transfer ratio. Table. Global social relation situation between Flickr and Twitter 4 types of relations: (1) No relation (2)Unidirectional relation (3) Bidirectional relation (4) Cross follow relation

Data Analysis Three Questions 2 What type relation for easier transfer? Flickr As we have different types of relations both in Flickr and Twitter, We further analyze what type of relation is easier to transfer across different social platforms (bidirectional or unidirectional?). Twitter

Social relation Analysis Bidirectional relation is more reliable This table shows the comparison of relation type transfer situation. We count how many bidirectional friend pairs and unidirectional friend pairs in Flickr become bidirectional, unidirectional or no relation in Twitter respectively. The result shows the bidirectional relation has a much higher transfer ratio, so it’s more reliable for this type of relation to transfer. Table. Comparison of relation type transfer situation

Data Analysis Three Questions 3 Behavior and relation can collaborate? group join Flickr tag join Other than multiple social relations, users also have rich social behaviors. So we further try to investigate whether social behavior and social relation can collaborate in some way. tag Twitter

Social Behavior Analysis We model the user similarity in three ways: Common Contact Number (CCN) Common Interested Group Number (CGN) Tag-based Similarity (TBS) Social behavior and relation have consistency relation or not Avg.CCN Avg.CGN Avg.TBS With relation 20.2777 4.8649 0.0550 Without relation 2.4259 1.9430 0.0211 For the social behavior analysis, we model the user similarity in the following three ways using their rich behaviors: This table shows the comparison of the social behaviors between the user pairs with and without relations. We can see that friends have a bigger common contact number, common interested group number and tag-based similarity than unknown user pairs which indicates that social behavior and relation have some sort of consistency. Table. Comparison of the social behaviors between the user pairs with and without relations

Social Behavior Analysis Common contact and tag-based profile can promote cross-follow relations relation type Avg.CCN Avg.CGN Avg.TBS Only follow in Flickr 15.7250 5.3340 0.0403 Cross follow both in Flickr and Twitter 23.0743 4.5768 0.0913 To further facilitate our cross-platform transfer analysis, we also make a comparison of the social behaviors between the user pairs with and without cross-follow relations. From the table we can see that cross follow user pairs have bigger common contact number and tag-based similarity which indicates these two kinds of behaviors can further promote cross-follow relations to some extent. Table. Comparison of the social behaviors between the user pairs with and without cross-follow relations

Data Analysis Results 1 Flickr more closely connected than Twitter 2 Bidirectional relation is more reliable 3 Social behavior and relation have some sort of consistency Finally we got the following four observations from data analysis results. Based on these observations, we further design a cold-start friend recommendation application with cross-platform boosting. 4 Common contact and tag profile can promote cross-follow

Outline Motivation Data Analysis Application Conclusion Next, I will introduce our application.

Cold-start friend recommendation Formulation We formulate our application as a random walk with restart problem on the user graph. 𝑟 𝑘 (𝑗)=𝛼 𝑖 𝑟 𝑘−1 (𝑖) 𝑝 𝑖𝑗 +(1−𝛼 𝑣 𝑗 Relevance score of node j at iteration k Initial restart score for each node Transition probability from node i to node j In our application, we deal with the top-k friend recommendation problem by leveraging user’s rich social information in another platform. And we formulate the application as a random walk with restart problem on the user graph where users are nodes and edges between them are weighted by their similarity on social behavior. We iterate the random walk process until a final stable state is arrived and the final recommendation list is obtained by sorting the stable relevance score r(j) Iterate until a final stable state: 𝑟 𝜋 =(1−𝛼) 𝐼−𝛼𝑃 −1 𝑣

Cold-start friend recommendation Formulation details Social relation transfer: initial relevance score 𝑣 𝑗 0, 𝑖𝑓 𝑢 𝑖 ℎ𝑎𝑠 𝑛𝑜 𝑟𝑒𝑙𝑎𝑡𝑖𝑜𝑛 𝑤𝑖𝑡ℎ 𝑢 𝑗 1, 𝑖𝑓 𝑢 𝑖 ℎ𝑎𝑠 𝑢𝑛𝑖𝑑𝑖𝑟𝑒𝑐𝑡𝑖𝑜𝑛𝑎𝑙 𝑟𝑒𝑙𝑎𝑡𝑖𝑜𝑛 𝑤𝑖𝑡ℎ 𝑢 𝑗 2,𝑖𝑓 𝑢 𝑖 ℎ𝑎𝑠 𝑏𝑖𝑑𝑖𝑟𝑒𝑐𝑡𝑖𝑜𝑛𝑎𝑙 𝑟𝑒𝑙𝑎𝑡𝑖𝑜𝑛 𝑤𝑖𝑡ℎ 𝑢 𝑗 𝑣 𝑗 = Social behavior transfer: transition probability 𝑝 𝑖𝑗 Based on the data analysis, we formulate the parameter (mainly initial relevance score vj and transition probability pij) in our random walk model as below: For the social relation transfer, as the social relations have high probability to transfer (especially the bidirectional relation), we formulate the initial relevance score vj with the social relation like this. For the social behavior transfer, as the common contact behavior and tag-based profile can promote cross-follow relationship, we formulate the transition probability pij mainly with these behaviors. Finally we combine the reliable social relation with the convinced social behavior to make better recommendation. (Note that these formulations are all computed by leveraging the information in another social platform since it’s cold-start in the original platform.) 𝑝 𝑖𝑗 = 𝑠 𝑖𝑗 𝑘 𝑠 𝑖𝑘

Experiment Experimental Settings Evaluation Metrics Baseline methods 328 users with accounts both in Flickr and Twitter 315k images with tags in Flickr 8.01 friends in Flickr and 13.34 friends in Twitter Evaluation Metrics Top-k average precision, recall and F-score Baseline methods Friend(Frd), Common Contact(CC), Tag, Group(Grp) Frd + CC, Frd + Tag, Frd + Grp These are our final experiment settings. And we utilize the top-k average precision, recall and F-score as our evaluation metrics. For the baseline methods, we have two kinds of baselines: The first kind uses only one kind of information to sort users, the other one directly combines two kinds of information to sort users (e.g. friend and also high common contact number then more likely higher relevance score)

Experiment Results These are our final experiment results. We can see that all the methods that combine social relation information with social behavior information (5-10th in Table 5) can achieve better results than the methods which only utilize one type of information (1-4th in Table 5) and our random walk based method which combines two kinds of information (Our1, Our2, Our3) outperforms the baseline method that directly combines them (5-7th in Table 5). We also analyzed the results to different tradeoff parameter 𝛼 (the right figure), and the performance is not sensitive to the change of this parameter which validates the robustness of our algorithm.

Outline Motivation Data Analysis Application Conclusion Finally come to our conclusion.

Conclusion Contributions Future work An in-depth data analysis on cross-platform user relation and behavior information Cold-start friend recommendation by leveraging rich information in another social platform A simple but robust random walk based fusion method for heterogeneous social information Future work Fusion of social information in different modalities Cross-platform social pattern mining To summarize, we have these contributions: We conduct an in-depth data analysis on cross-platform user level We design a cold-start friend recommendation application with cross-platform boosting We utilize a simple but robust random walk based fusion method to combine social relation with social behavior information. Our future work includes the fusion of multiple social information in different modalities across different social platforms and we are also expecting to mine some general cross-platform social patterns in nature.

Thanks! Q&A Thanks for your time and attention.