Empirical Analysis of Implicit Brand Networks on Social Media

Slides:



Advertisements
Similar presentations
Influence and Passivity in Social Media Daniel M. Romero, Wojciech Galuba, Sitaram Asur, and Bernardo A. Huberman Social Computing Lab, HP Labs.
Advertisements

Analysis and Modeling of Social Networks Foudalis Ilias.
Social Media Intro to Business & Marketing. The most three most trusted forms of advertising are: Recommendations from people I know - 90% Consumer opinions.
CIKM’2008 Presentation Oct. 27, 2008 Napa, California
UNDERSTANDING VISIBLE AND LATENT INTERACTIONS IN ONLINE SOCIAL NETWORK Presented by: Nisha Ranga Under guidance of : Prof. Augustin Chaintreau.
Overview of Web Data Mining and Applications Part I
Models of Influence in Online Social Networks
The United States air transportation network analysis Dorothy Cheung.
Topic 13 Network Models Credits: C. Faloutsos and J. Leskovec Tutorial
A Distributed and Privacy Preserving Algorithm for Identifying Information Hubs in Social Networks M.U. Ilyas, Z Shafiq, Alex Liu, H Radha Michigan State.
Social Networking and On-Line Communities: Classification and Research Trends Maria Ioannidou, Eugenia Raptotasiou, Ioannis Anagnostopoulos.
CrossTour Fair "The Danubian Al-TOUR-native" January 2012 Innovations in E-marketing with a specific focus on SMEs Pieter Piket, Wesley Put- van.
A Media-based Social Interactions Analysis Procedure Alan Keller Gomes and Maria da Graça Campos Pimentel SAC’12 17 March 2015 Hyewon Lim.
Data Analysis in YouTube. Introduction Social network + a video sharing media – Potential environment to propagate an influence. Friendship network and.
Introduction to Text and Web Mining. I. Text Mining is part of our lives.
To Blog or Not to Blog: Characterizing and Predicting Retention in Community Blogs Imrul Kayes 1, Xiang Zuo 1, Da Wang 2, Jacob Chakareski 3 1 University.
A Graph-based Friend Recommendation System Using Genetic Algorithm
Mining Social Networks for Personalized Prioritization Shinjae Yoo, Yiming Yang, Frank Lin, II-Chul Moon [KDD ’09] 1 Advisor: Dr. Koh Jia-Ling Reporter:
How Useful are Your Comments? Analyzing and Predicting YouTube Comments and Comment Ratings Stefan Siersdorfer, Sergiu Chelaru, Wolfgang Nejdl, Jose San.
Department of Electrical Engineering and Computer Science Kunpeng Zhang, Yu Cheng, Yusheng Xie, Doug Downey, Ankit Agrawal, Alok Choudhary {kzh980,ych133,
Social Network Analysis. Outline l Background of social networks –Definition, examples and properties l Data in social networks –Data creation, flow and.
Network Community Behavior to Infer Human Activities.
Click to Add Title A Systematic Framework for Sentiment Identification by Modeling User Social Effects Kunpeng Zhang Assistant Professor Department of.
CS 590 Term Project Epidemic model on Facebook
Speaker : Yu-Hui Chen Authors : Dinuka A. Soysa, Denis Guangyin Chen, Oscar C. Au, and Amine Bermak From : 2013 IEEE Symposium on Computational Intelligence.
Page  1 Social Media in Business Session 8 Agenda Guest Speaker: Cathy Reilly, Agros.org Facebook and LinkedIn Student Presentations Class Project: Create.
A Connectivity-Based Popularity Prediction Approach for Social Networks Huangmao Quan, Ana Milicic, Slobodan Vucetic, and Jie Wu Department of Computer.
Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha Hamed Haddadi Fabricio Benevenuto Krishna P. Gummadi.
More than words: Social network’s text mining for consumer brand sentiments Expert Systems with Applications 40 (2013) 4241–4251 Mohamed M. Mostafa Reporter.
 DM-Group Meeting Liangzhe Chen, Oct Papers to be present  RSC: Mining and Modeling Temporal Activity in Social Media  KDD’15  A. F. Costa,
1 New metrics for characterizing the significance of nodes in wireless networks via path-based neighborhood analysis Leandros A. Maglaras 1 Dimitrios Katsaros.
Social Networks Some content from Ding-Zhu Du, Lada Adamic, and Eytan Adar.
Uncovering Social Spammers: Social Honeypots + Machine Learning
Fan Engagement Solution
Hot Off the Press: 2013 Ranking Factors
Finding Dense and Connected Subgraphs in Dual Networks
Topical Authority Detection and Sentiment Analysis on Top Influencers
A Viewpoint-based Approach for Interaction Graph Analysis
Social Networks Analysis
The Spread of Media Content through the Blogosphere
Groups of vertices and Core-periphery structure
Tutorial: Big Data Algorithms and Applications Under Hadoop
Retailing Techniques For China SESSION #05
The comparison of a web content mining study with an online survey.
Discover How Your Business Can Benefit from a Facebook Fanpage
A Network Science Approach to Fake News Detection on Social Media
Comparison of Social Networks by Likhitha Ravi
Empirical analysis of Chinese airport network as a complex weighted network Methodology Section Presented by Di Li.
Traffic Audit Industry: Internet of Things (IoT) Ted Politidis Head of SEO
E-Commerce Theories & Practices
Network analysis.
Community detection in graphs
Dieudo Mulamba November 2017
Generative Model To Construct Blog and Post Networks In Blogosphere
chapter 9 : social media marketing 1 : Introduction
Stop Data Wrangling, Start Transforming Data to Intelligence
Apache Spark & Complex Network
Postdoc, School of Information, University of Arizona
A Locality Model of the Evolution of Blog Networks
Social Media and Networking: What it is & why it’s important
Online Tool Screen shots
Department of Computer Science University of York
A Network Science Approach to Fake News Detection on Social Media
Binghui Wang, Le Zhang, Neil Zhenqiang Gong
Video Ad Mining for Predicting Revenue using Random Forest
Social Media Management
PolyAnalyst Web Report Training
Knowledge Sharing Mechanism in Social Networking for Learning
“The Spread of Physical Activity Through Social Networks”
Yingze Wang and Shi-Kuo Chang University of Pittsburgh
Presentation transcript:

Empirical Analysis of Implicit Brand Networks on Social Media Authors: Kunpeng Zhang, Sid Bhattacharya, SudhaRam September 2, 2014

Introduction – three parties User-generated content about social brands on social media platforms Textual: comments, posts, tweets, etc. Actions: becoming fan, following, like, share, etc. Networks Explicit: user friendship, user following, etc. Implicit: brand-brand, and others. Social media platforms Social brands Social users

Research questions User generated social content and user interactions on social media are employed to construct implicit brand-brand networks; Research Question I: What is the structure of a brand-brand network? Research Question II: What is the relationship between an influential brand the number of fans for the brand? Research Question III: What is the relationship between an influential brand and sentiment of social users/fans?

Related work Consumer-brand interactions K. de Valck, G. H. van Bruggen, and B. Wierenga. Virtual communities: A marketing perspective. Decis. Support Syst., 47(3):185–203, June 2009. A. M. Turri, K. H. Smith, and E. Kemp. Developing Affective Brand Commitment Through Social Media. Journal of Electronic Commerce Research, 14(3):201–214, 2013. Information diffusion over consumer networks S. Hill, F. Provost, and C. Volinsky. Network-based marketing: Identifying likely adopters via consumer networks. Statistical Science, 22(2):256–275, 2006. R. Iyengar, C. Van den Bulte, and T. W. Valente. Opinion leadership and social contagion in new product diffusion. Marketing Science, 30(2):195–212, Mar. 2011. S. Nam, P. Manchanda, and P. K. Chintagunta. The effect of signal quality and contiguous word of mouth on customer acquisition for a video-on-demand service. Marketing Science, 29(4):690–700, 2010. Network studies M. J. Newman. A measure of betweenness centrality based on random walks. Social Networks, 27(1):39 – 54, 2005.

Why study implicit brand-brand networks? Explicit networks ignore interactions among users and brands Useful for Identifying influential brands Facilitating targeted online advertising

Overall framework 1. Data collection 4.1 Network measures Research question I 4.2 Influential brand identification Research question II 4.3 Textual sentiment identification Research question III 2. Data cleansing 3. Brand-brand network extraction

Description and statistics of raw dataset Data collection Facebook data (Graph API) For each brand, download posts, comments, likes, and public user profile information Time frame: 01/01/2009 – 01/01/2013 Approximately 2 TB Description and statistics of raw dataset Number of downloaded brands 13,806 Number of unique users 286,862,823 Number of unique brand countries 122 Number of unique brand categories defined by Facebook 172

Data cleansing Remove brands for which most posts and comments are non-English; Simple spam user removal

Spam user removal Users connecting to an extremely large number of brands are likely to be spam users or bots. Users tend to Comment on 4,5 brands on average Like 7,8 brands on average Users making many duplicate comments containing URL links

Dataset after Cleansing Description and statistics before and after data cleansing. Cleaned dataset containing top 2,000 brands. After cleaning After selecting top brands Number of brands 7,580 2,000 Number of unique users 97,699,832 16,306,977 Number of comments 2, 327, 635, 302 470, 742, 158 Number of positive comments 651, 231, 870 179, 009, 470 Number of negative comments 234, 571, 177 60, 613, 968 Number of brand categories 150 118 Number of posts 13, 206, 402 3, 793, 941

Brand-brand network Weighted and undirected brand-brand network (B) A node is a brand A link between two brands is created if the same user commented on or liked posts made by both brands Network generation using Hadoop (MapReduce algorithm)

Network normalization (B Bn) A comparison across brands requires normalization of link weights. Global maximum weight based technique will lose global network semantics such as the distribution of connection strength among links of a brand relative to the size of a brand: Connection (b1,b3) vs. connection (b1,b2), (100%) of b3 users connected to b1;only 10% of b2 users interested in b1.

Network normalization (B Bn) Two step normalization strategy: Step I: normalize each individual link between two brands bi, bj by setting Step II: normalize all by setting , where fi and fj are number of fans for brand i and brand j, respectively.

Network measures Property Network Bn Number of nodes 2,000 Number of links 965,605 Average weighted degree 0.662 Network density 0.483 Network diameter 4 Average clustering coefficient 0.785 Average weighted clustering coefficient 0.882 Average path length 1.503

Network Measures: Centrality Degree centrality measures the connectivity of a node Closeness centrality Measures how far a node is from the rest of nodes Betweeness centrality A node acts as a bridge connecting two communities Eigenvector centrality Measures the influence of a node

Influential brand identification Eigenvector centrality

Influential brands Top 10 influential brands Rank Brand name Category Barack Obama Politician 2 CNN Media news publishing 3 Starbucks Food beverages 4 Coca-cola 5 Victoria’s secret Clothing 6 True blood TV show 7 Dexter 8 Tack bell 9 Lady Gaga Musician band 10 Pepsi

Influential brand identification Category distribution of top 100 influential brands

Further Analysis: brand-brand network Sentiment identification (random forest machine learning on features using 3 components) Sentiment classified as: Positive, negative, neutral Sentiment of a brand Relationships using Spearman Rank Correlation: Sentiment of a brand VS. eigenvector centrality of a brand Size of a brand VS. eigenvector centrality of a brand

Results and Implications Sentiment vs. eigenvector centrality Size vs. eigenvector centrality -0.282 0.676 Size of brand has high positive correlation (.676) with its influence: Big brand likely to influence other brands in the network. The influence/importance of a brand within the network has a low but negative correlation (-0.282) with its sentiment. Implication: negative comments on brands are likely to propagate much faster and get more attention than positive comments.

Conclusion and Future Work Implicit Brand-Brand network using social interactions and its structure Scalable (MapReduce) algorithms for large scale network construction and analysis Understanding Relationship between size/influence, sentiment/influence Targeted Online marketing/advertising Spread of sentiment and brand communities Evolution of network over time/location: Dynamic network analysis

Questions? Thank you