To Blog or Not to Blog: Characterizing and Predicting Retention in Community Blogs Imrul Kayes 1, Xiang Zuo 1, Da Wang 2, Jacob Chakareski 3 1 University.

Slides:



Advertisements
Similar presentations
Robin L. Donaldson May 5, 2010 Prospectus Defense Florida State University College of Communication and Information.
Advertisements

Modeling Blog Dynamics Speaker: Michaela Götz Joint work with: Jure Leskovec, Mary McGlohon, Christos Faloutsos Cornell University Carnegie Mellon University.
Government Uses and Benefits. Social Communications Specialist Chair, Statewide Social Media
Analysis and Modeling of Social Networks Foudalis Ilias.
Comparison of Social Networks by Likhitha Ravi. Outline What is a social network? Elements of social network Previous studies What is missing in previous.
HIRE Experience ! Sacramento Professional Network 1 How to Optimize your Use of LinkedIn and Other Social Media in your Job Search April 2, 2013.
Asking Questions on the Internet
Masters Thesis Defense Amit Karandikar Advisor: Dr. Anupam Joshi Committee: Dr. Finin, Dr. Yesha, Dr. Oates Date: 1 st May 2007 Time: 9:30 am Place: ITE.
Directional triadic closure and edge deletion mechanism induce asymmetry in directed edge properties.
Graphical Results Results The Effect of Virtual Worlds on Adolescents' Real World Lives Using an upper-division undergraduate social science course, students.
Maureen Meadows Senior Lecturer in Management, Open University Business School.
On the Structure, Properties and Utility of Internal Corporate Blogs Pranam Kolari Tim Finin, Yelena Yesha, Yaacov Yesha Kelly Lyons, Stephen Perelgut,
UNDERSTANDING VISIBLE AND LATENT INTERACTIONS IN ONLINE SOCIAL NETWORK Presented by: Nisha Ranga Under guidance of : Prof. Augustin Chaintreau.
1 Predictors of customer perceived software quality Paul Luo Li (ISRI – CMU) Audris Mockus (Avaya Research) Ping Zhang (Avaya Research)
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
BLOGS The New Contemporary Media Landscape BY LAINE ST. PAUL.
Behavior Analytics Social Media Mining. 2 Measures and Metrics 2 Social Media Mining Behavior Analytics Examples of Behavior Analytics What motivates.
How to Expand Your School’s Online Reach using Facebook, Blogs and Twitter.
Models of Influence in Online Social Networks
User generated content Professional and College Football.
Exploring the Gendered Space of Attributes that Predict Expectations of Success in STEM Gender in STEM Research Symposium: Pathways to Parity in STEM September.
Using Friendship Ties and Family Circles for Link Prediction Elena Zheleva, Lise Getoor, Jennifer Golbeck, Ugur Kuter (SNAKDD 2008)
Social Networking and On-Line Communities: Classification and Research Trends Maria Ioannidou, Eugenia Raptotasiou, Ioannis Anagnostopoulos.
University of California at Santa Barbara Christo Wilson, Bryce Boe, Alessandra Sala, Krishna P. N. Puttaswamy, and Ben Zhao.
Social Media at LISC June LISC Social Media What is it? New ways to distribute our news and stories that engages, interacts and shares. Why do it?
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
1 Computing with Social Networks on the Web (2008 slide deck) Jennifer Golbeck University of Maryland, College Park Jim Hendler Rensselaer Polytechnic.
Using Transactional Information to Predict Link Strength in Online Social Networks Indika Kahanda and Jennifer Neville Purdue University.
Our Twitter Profiles, Our Selves: Predicting Personality with Twitter Daniele Quercia, Michal Kosinski, David Stillwell, Jon Crowcroft COMP4332 Wong Po.
Understanding Cross-site Linking in Online Social Networks Yang Chen 1, Chenfan Zhuang 2, Qiang Cao 1, Pan Hui 3 1 Duke University 2 Tsinghua University.
Tie Strength, Embeddedness & Social Influence: Evidence from a Large Scale Networked Experiment Sinan Aral, Dylan Walker Presented by: Mengqi Qiu(Mendy)
By Pamela Comer EDLD 5362 Informational Systems Management.
©2015 Apigee Corp. All Rights Reserved. Preserving signal in customer journeys Joy Thomas, Apigee Jagdish Chand, Visa.
Designing Ranking Systems for Consumer Reviews: The Economic Impact of Customer Sentiment in Electronic Markets Anindya Ghose Panagiotis Ipeirotis Stern.
A Graph-based Friend Recommendation System Using Genetic Algorithm
Multiple Linear Regression. Purpose To analyze the relationship between a single dependent variable and several independent variables.
Feedback Effects between Similarity and Social Influence in Online Communities David Crandall, Dan Cosley, Daniel Huttenlocher, Jon Kleinberg, Siddharth.
NTU Natural Language Processing Lab. 1 Investment and Attention in the Weblog Community Advisor: Hsin-Hsi Chen Speaker: Sheng-Chung Yen.
Correlation and Linear Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Working with Conceptual Frameworks “We aren’t just making this all up.”
LOGO Identifying Opinion Leaders in the Blogosphere Xiaodan Song, Yun Chi, Koji Hino, Belle L. Tseng CIKM 2007 Advisor : Dr. Koh Jia-Ling Speaker : Tu.
Network Community Behavior to Infer Human Activities.
Tagging Systems and Their Effect on Resource Popularity Austin Wester.
Mining information from social media
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
MySpace & Facebook By Veronica Baca. MySpace Tom Anderson August 2003 Social Networking Website Free service Required Age: 14 & over A virtual community.
Social Networks in the Education Setting Ryan Schaaf Jason Poirier Technology Teachers, Howard County Public.
1 Discovering Web Communities in the Blogspace Ying Zhou, Joseph Davis (HICSS 2007)
A Connectivity-Based Popularity Prediction Approach for Social Networks Huangmao Quan, Ana Milicic, Slobodan Vucetic, and Jie Wu Department of Computer.
FELICIAN UNIVERSITY Creating a Learning Community Using Knowledge Management and Social Media Dr. John Zanetich, Associate Professor Felician University.
Social Media & Social Networking 101 Canadian Society of Safety Engineering (CSSE)
Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha Hamed Haddadi Fabricio Benevenuto Krishna P. Gummadi.
How Chapters Can use Social Media Mark Storace Sacramento Chapter March 2013.
Krzysztof Sobieszek, University of Warsaw Dominik Batorski, University of Warsaw Łukasz Bolikowski, University of Warsaw Opinion leaders in social media.
Applying Link-based Classification to Label Blogs Smriti Bhagat, Irina Rozenbaum Graham Cormode.
Using Blog Properties to Improve Retrieval Gilad Mishne (ICWSM 2007)
How Chapters Can use Social Media Mark Storace Sacramento Chapter Jan 2011.
Why Use Social Media?.
First-Year Experience Seminars: A Benchmark Study of Targeted Courses for Developmental Education Students.
The Spread of Media Content through the Blogosphere
Comparison of Social Networks by Likhitha Ravi
Dieudo Mulamba November 2017
Generative Model To Construct Blog and Post Networks In Blogosphere
The likelihood of linking to a popular website is higher
Predict Failures with Developer Networks and Social Network Analysis
New Mexico Broadband Program Internet Tools for Small Business
Analyzing Two Participation Strategies in an Undergraduate Course Community Francisco Gutierrez Gustavo Zurita
Introduction to Regression
“The Spread of Physical Activity Through Social Networks”
Yingze Wang and Shi-Kuo Chang University of Pittsburgh
Presentation transcript:

To Blog or Not to Blog: Characterizing and Predicting Retention in Community Blogs Imrul Kayes 1, Xiang Zuo 1, Da Wang 2, Jacob Chakareski 3 1 University of South Florida 2 Hubei University of Technology 3 University of Alabama

2 What is a blog?  A blog is a personal journal published on the Web.  Blogs are usually the work of a single individual, occasionally of a small group and often themed on a focused topic.  Blogging platform allow the creation of online profiles to link to other bloggers. This blogger to blogger declared social ties create a social network.

3 The Impacts of Blogs  Blogging has become immensely popular and been widely used. e.g., WordPress alone is used by over 14.7% out of “top 1 million” websites according to Alexa.  Citizen journalism had high impacts in major events. e.g., South Asia tsunami, London terrorist bombings and New Orleans Hurricane Katrina.  Blogosphere provides a platform for different aspects of virtual and real life. e.g., viral marketing, sales prediction and counter terrorism efforts.

4 Retention Problems in Blog Community  Participation is often sparse and uneven One-third of listed users had no interactions during a three- month observation period.  Contribution churn is high Only 11.5% of the users who posted in one month returned to post in the second month. Cummings et al., 2002 Jones et al., 2004

5 Research Questions  What motivates a user to join a blogging community? (well studied in literature)  What motivates the blogger to continue participating (retention) in the blog community? (our focus)

6 BlogSter Community  BlogSter is a blogging community that features specific-interests blogs.  It is a combination of blogging and social networking.  Spam-free blogs.

7 BlogSter Community Data Set 91% of total posts Type NodesEdgesConnected ComponentsPosts Bloggers’ profiles are public 17,43672, ,114  Data collected by using Metropolis-Hastings Random Walk algorithm.  The largest connected component has 14,323 nodes and 64,888 edges, which includes 82% of nodes in the network. Gjoka et al., 2011

8 Research Questions on Retention  Question 1: What variables predict high retention?  Question 2: How well do these variables predict user retention?

9 Analyzing Variables Affect Users’ retention in BlogSter  Predictor Variables (Five categories) Network metrics specific variables: centralities, clustering coefficient. User activity oriented variables: posts, comments, photos, network age. User physiology oriented variables: age, gender. Interactional variables: blog traffic, other users’ comments. Relational variables: social tie strength, friends retention.  Output Variable Retention = Points

10 Network metrics and retention  Observations: The majority of centralities are positively correlated with points Degree centrality has the highest correlation with points Closeness centrality has the weakest correlation with points Clustering coefficient is negatively correlated with points Correlation between network metrics and blog points

11 Activities and Retention Observations Higher number of posts, comments or photos mean higher blogger points The more active a user is, the more point she has. Correlation and distribution of users’ activities and points

12 Physiology and retention Observations Male bloggers have higher retention than female bloggers Bloggers’ age also has correlation with their retention. Correlation and distribution of physiology and points Corr.(age, point) = 0.21, p < 0.05.

13 Social Tie and Retention Observations Users who are socially close have higher retention similarity Distribution of users’ social ties and points

14 Interaction and Retention Distribution of users’ interactions and points Observations The more retention a user has, the higher number of comments she gets on her blogs.

15 Predicting users’ retention in BlogSter Question: Can we use these different types of variables (network metrics, activity metrics, physiological, interactional and relational) to predict user retention? A multiple linear regression model:

16 Prediction Results Blog traffic, degree rank and user comments are the most influential predictors. Adjusted R is 0.837, which implies the model can explain 83.7% of variation around points.

17 Summary  Retention in the blog community  Analyzing factors affect users’ retention in blogs e.g., users’ network topology attributes, users’ social behaviors, social ties and physiological factors, etc.  Predicting users’ retention with different types of factors Build a multiple linear regression model to predict user’s retention  Conclusion Male and senior bloggers who have friends with higher retention are more retained in the blog community and also get higher attention from others (reflected by interaction intensity)

18