Self-introduction Name:  鲍鹏 (Peng Bao) Research Interests:  Popularity Prediction, Information Diffusion, Social Network , etc… Grade:  In the third.

Slides:



Advertisements
Similar presentations
INDIANAUNIVERSITYINDIANAUNIVERSITY GENI Global Environment for Network Innovation James Williams Director – International Networking Director – Operational.
Advertisements

Data Mining and Machine Learning Lab eTrust: Understanding Trust Evolution in an Online World Jiliang Tang, Huiji Gao and Huan Liu Computer Science and.
Collaborative QoS Prediction in Cloud Computing Department of Computer Science & Engineering The Chinese University of Hong Kong Hong Kong, China Rocky.
Influence and Passivity in Social Media Daniel M. Romero, Wojciech Galuba, Sitaram Asur, and Bernardo A. Huberman Social Computing Lab, HP Labs.
ICDM, Shenzhen, 2014 Flu Gone Viral: Syndromic Surveillance of Flu on Twitter using Temporal Topic Models Liangzhe Chen, K. S. M. Tozammel Hossain, Patrick.
Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
Analysis and Modeling of Social Networks Foudalis Ilias.
Modelling Paying Behavior in Game Social Networks Zhanpeng Fang +, Xinyu Zhou +, Jie Tang +, Wei Shao #, A.C.M. Fong *, Longjun Sun #, Ying Ding -, Ling.
COLLABORATIVE FILTERING Mustafa Cavdar Neslihan Bulut.
We Know #Tag: Does the Dual Role Affect Hashtag Adoption? Lei Yang 1, Tao Sun 2, Ming Zhang 2, Qiaozhu Mei 1 1 School of Information, the University.
Graph Data Management Lab School of Computer Science , Bristol, UK.
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
UNDERSTANDING VISIBLE AND LATENT INTERACTIONS IN ONLINE SOCIAL NETWORK Presented by: Nisha Ranga Under guidance of : Prof. Augustin Chaintreau.
Chen Cheng1, Haiqin Yang1, Irwin King1,2 and Michael R. Lyu1
Report on Intrusion Detection and Data Fusion By Ganesh Godavari.
Time-dependent Similarity Measure of Queries Using Historical Click- through Data Qiankun Zhao*, Steven C. H. Hoi*, Tie-Yan Liu, et al. Presented by: Tie-Yan.
Author: Jason Weston et., al PANS Presented by Tie Wang Protein Ranking: From Local to global structure in protein similarity network.
Influence and Correlation in Social Networks Aris Anagnostopoulos Ravi Kumar Mohammad Mahdian.
Social Context Based Recommendation Systems and Trust Inference Student: Andrea Manrique ID: ITEC810, Macquarie University1 Advisor: A/Prof. Yan.
Computer Science 1 Web as a graph Anna Karpovsky.
The Social Web: A laboratory for studying s ocial networks, tagging and beyond Kristina Lerman USC Information Sciences Institute.
Civic Engagement Partnering Through Service Learning with Curricula James Lawler Pace University March 2004.
A Measurement-driven Analysis of Information Propagation in the Flickr Social Network WWW09 报告人: 徐波.
Models of Influence in Online Social Networks
Modelling Paying Behavior in Game Social Networks Zhanpeng Fang +, Xinyu Zhou +, Jie Tang +, Wei Shao #, A.C.M. Fong *, Longjun Sun #, Ying Ding -, Ling.
Consumer Behavior, Market Research
Using Friendship Ties and Family Circles for Link Prediction Elena Zheleva, Lise Getoor, Jennifer Golbeck, Ugur Kuter (SNAKDD 2008)
Authors: Xu Cheng, Haitao Li, Jiangchuan Liu School of Computing Science, Simon Fraser University, British Columbia, Canada. Speaker : 童耀民 MA1G0222.
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
Kristina Lerman Aram Galstyan USC Information Sciences Institute Analysis of Social Voting Patterns on Digg.
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
Advanced Software Engineering PROJECT. 1. MapReduce Join (2 students)  Focused on performance analysis on different implementation of join processors.
DIGITAL COMMUNITIES Chapter Copyright © 2013 Pearson Education, Inc. publishing as Prentice Hall.
Understanding Cross-site Linking in Online Social Networks Yang Chen 1, Chenfan Zhuang 2, Qiang Cao 1, Pan Hui 3 1 Duke University 2 Tsinghua University.
Tie Strength, Embeddedness & Social Influence: Evidence from a Large Scale Networked Experiment Sinan Aral, Dylan Walker Presented by: Mengqi Qiu(Mendy)
A Graph-based Friend Recommendation System Using Genetic Algorithm
Distributed Maintenance of Cache Freshness in Opportunistic Mobile Networks Wei Gao and Guohong Cao Dept. of Computer Science and Engineering Pennsylvania.
Predicting Positive and Negative Links in Online Social Networks
Using a Model of Social Dynamics to Predict Popularity of News Kristina Lerman Tad Hogg USC Information Sciences Institute HP Labs WWW 2010.
Feedback Effects between Similarity and Social Influence in Online Communities David Crandall, Dan Cosley, Daniel Huttenlocher, Jon Kleinberg, Siddharth.
--He Xiangnan PhD student Importance Estimation of User-generated Data.
FDOT Transit Office Modeling Initiatives The Transit Office has undertaken a number of initiatives in collaboration with the Systems Planning Office and.
Xintao Wu Jan 18, 2013 Retweeting Behavior and Spectral Graph Analysis in Social Media.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Logs Files for Data-Driven System Management Advisor.
Network Community Behavior to Infer Human Activities.
Tracking Critical-Mass Outbreaks in Social Contagions (FA DEF)
We.b : The web of short URLs Demetris Antoniades, lasonas Polakis, Gerogios Kontaxis, Elias Athansapoulos, Sotiris loannidis, Evangelos P.Markatos, Thomas.
A Latent Social Approach to YouTube Popularity Prediction Amandianeze Nwana Prof. Salman Avestimehr Prof. Tsuhan Chen.
ICONIP 2010, Sydney, Australia 1 An Enhanced Semi-supervised Recommendation Model Based on Green’s Function Dingyan Wang and Irwin King Dept. of Computer.
Don’t Follow me : Spam Detection in Twitter January 12, 2011 In-seok An SNU Internet Database Lab. Alex Hai Wang The Pensylvania State University International.
Predicting Short-Term Interests Using Activity-Based Search Context CIKM’10 Advisor: Jia Ling, Koh Speaker: Yu Cheng, Hsieh.
A Connectivity-Based Popularity Prediction Approach for Social Networks Huangmao Quan, Ana Milicic, Slobodan Vucetic, and Jie Wu Department of Computer.
Exploring Social Influence via Posterior Effect of Word-of-Mouth Recommendations Junming Huang, Xue-Qi Cheng, Hua-Wei Shen, Tao Zhou, Xiaolong Jin WSDM.
Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha Hamed Haddadi Fabricio Benevenuto Krishna P. Gummadi.
Software Quality Assurance Chip Ene, February 14, 2015.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Using a model of Social Dynamics to Predict Popularity of News Kristina Lerman, Tad Hogg USC Information Sciences Institute, Institute for Molecular Manufacturing.
Hao Ma, Dengyong Zhou, Chao Liu Microsoft Research Michael R. Lyu
Stochastic Models of User-Contributory Web Sites
A Viewpoint-based Approach for Interaction Graph Analysis
Master thesis: Automatic Extraction of Design Decision Relationships from a Task Management System Kick-Off Matthias Ruppel, 8th of May 2017, Munich.
User Joining Behavior in Online Forums
Comparison of Social Networks by Likhitha Ravi
E-Commerce Theories & Practices
Predict Failures with Developer Networks and Social Network Analysis
MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.
GANG: Detecting Fraudulent Users in OSNs
GhostLink: Latent Network Inference for Influence-aware Recommendation
Yingze Wang and Shi-Kuo Chang University of Pittsburgh
Modeling Topic Diffusion in Scientific Collaboration Networks
Presentation transcript:

Self-introduction Name:  鲍鹏 (Peng Bao) Research Interests:  Popularity Prediction, Information Diffusion, Social Network , etc… Grade:  In the third year pursuing for the PhD. Group:  NASC(Network Analysis and Social Computing) Lab:  Research Center of Web Data Science & Engineering Doctoral supervisor:  Prof. Xue-Qi Cheng

popularity prediction in Microblogging Network ——An empirical study Authors: Peng Bao, Hua-Wei Shen, Junming Huang, Xue-Qi Cheng Previous Work

Outline Background & Motivation Problem definition Related works Preliminary study Structural characteristics Prediction & Results Conclusions and Discussions

Background BURST of SNS Everyone is a member in the We the Media age! Sina Weibo plays an more and more important social role. Opportunities and Challenges  Special issues in Sci./Nat.  Computational social science [D. Lazer et al. Science 323, (2009)]

Challenging An interesting and fundamental question  How to track, to understand, and to predict the information flow on the network? To predict the long-term popularity of online content is very HARD!  Popularity is unequally distributed. high interaction among users intrinsic interestingness of content external influence from traditional media active period of users

Motivation Popularity prediction is USEFUL!  From technology view Drive enterprises to design a cost-effective cache and content distribution mechanism system  From business view Help journalists, content providers, advertisers, news recommend systems to provide information services and to design viral marketing strategy  From sociology view Reveal the human collective behavior Facilitate governors to supervise and to guide public opinion Increasing availability of data increase Predictability! Increasing availability of data increase Predictability!

Problem definition Popularity prediction: Given a tweet and its forward information before an indicating time t i, We want to predict the popularity p(t r ) at a reference time t r.  Indicating time t i : The time at which we observe the information of a tweet.  Reference time t r : The time at which we intend to predict the popularity of a tweet.  Popularity p(t) : The number of times that a tweet is re-tweeted at time t.

Related works Temporal correlation based [SzaBo et al. C ACM 2010]  Strong correlation between Early and later log popularity  Linear regression Visibility and Interestingness based [Lerman et al. WWW 2010]  User behavior modeling  Estimate the interestingness

Related works cont.’ Matrix Factorization based [Cui et al. SIGIR 2011]  Estimate the latent factor of user and item Feature based [Hong et al. WWW 2011]  Formalized to classification problem  Logistic regression Temporal pattern based [Matsubara et al. KDD 2012]  Periodical  Avoid infinity  Power-law decay Existing methods mainly focused on the quality of content, the interface of the social media site, the collective behavior of users. We focus on the structural characteristics of the networks spanned by early adopters We focus on the structural characteristics of the networks spanned by early adopters

Preliminary study Popularity distribution The popularity of tweets roughly follows a power-law distribution, distributes very unequally. The popularity of tweets roughly follows a power-law distribution, distributes very unequally.

Preliminary study Lifespan of tweets Most tweets receive 80% of the final popularity in 24 hours and 90% in 48 hours. The lifespan of tweets follows a log-normal distribution. Most tweets receive 80% of the final popularity in 24 hours and 90% in 48 hours. The lifespan of tweets follows a log-normal distribution.

Preliminary study Active period We should consider the variation in hourly activity cycles The daily variation has no obvious relationship with week cycle and are event-related. We should consider the variation in hourly activity cycles The daily variation has no obvious relationship with week cycle and are event-related. “Wenzhou train collision”

Temporal correlation of logarithmic popularity The correlation is weak with large deviation. The Pearson Correlation Coefficients is 0.74 It is less reliable to predict the popularity of a tweet if we just use its earlier popularity alone. The correlation is weak with large deviation. The Pearson Correlation Coefficients is 0.74 It is less reliable to predict the popularity of a tweet if we just use its earlier popularity alone.

Structural characteristics We explore the network consisting of early adopters  Link density: the ratio of the number of existing follow- ship links and the number of all possible links.  Diffusion depth: the length of the longest path from the submitter to anyone of them.

Structural characteristics Empirical found The structural characteristics provide strong evidence to help estimate the final popularity The structural characteristics provide strong evidence to help estimate the final popularity

Prediction and Results Comparison approaches: Evaluation methods: Experiment results

Conclusions We empirically study structural characteristics, which can provide critical indicators The prediction accuracy can be significantly improved by incorporating the factor of structural diversity The conclusion capture the intuition It provides us INSIGHTS to further study

Discussions Accumulative effect Temporal characteristics Event-based prediction

Accumulative Effect of Multiple Exposure in Information Diffusion on Social Network On-Going Work

Exposures and Adoptions Exposures: Node’s neighbor exposes the node to the contagion Adoption: The node acts (e.g. re-tweet) on the contagion t3t3 t1t1 t2t2 Time: t 1 < t 2 < t 3 < … < t n

Problem definition Exposure Curve: Probability of re-tweeting a tweet for a user depends on the number of friends who have already re-tweeted. Dependence

Example Application Marketing agency: would like you to adopt/buy product X They estimate the adoption curve Should they expose you to X three times? Or, is it better to expose you X, then Y and then X again?

What we are doing Classify the TWEETS by  Has URL or not  Has Event or not  Has Multiple Events or not  Deeper analysis on the ME for different event Classify the USERS by  User’s degree  User’s active period  Local clustering coefficient

What we are doing cont.’ Structural diversity between the source of multiple exposures  Fix the number of exposure times, check Link density Number of connected components Temporal effect Temporal motif You will see the results soon!

Closing Remarks We should do MORE… We knew A LITTLE. We have done MANY! This field is a piece of WILD but Fertile mineral land.

Acknowledgement Thank to all members in the NASC group ( for helpful discussions and suggestionswww.groupnasc.org Collaborators Xue-Qi Cheng,Hua-Wei Shen,Junming Huang

Thanks! Q$A Weibo: