Prediction of Influencers from Word Use Chan Shing Hei.

Slides:



Advertisements
Similar presentations
Deema Abdal Hafeth MSc student by research School of Computer Science, University of Lincoln Dr Amr Ahmed Supervisor Dr David Cobham supervisor.
Advertisements

Influence and Passivity in Social Media Daniel M. Romero, Wojciech Galuba, Sitaram Asur, and Bernardo A. Huberman Social Computing Lab, HP Labs.
Identifying Sarcasm in Twitter: A Closer Look
Towards Twitter Context Summarization with User Influence Models Yi Chang et al. WSDM 2013 Hyewon Lim 21 June 2013.
Distant Supervision for Emotion Classification in Twitter posts 1/17.
BIA 660 Web Analytics - Midterm Akshta Chougule Hao Han Di Huo Xi Lu Laura Sills Bank Of America.
Open access and data processing of Social Media (Twitter) data – a new and valuable consumer research instrument Thierry Worch, Anne Hasted & Hal MacFie.
BEHAVIORAL PREDICTION OF TWITTER USERS BASED ON TEXTUAL INFORMATION Shiyao Wang.
Sarcasm Detection on Twitter A Behavioral Modeling Approach
Predicting the Semantic Orientation of Adjective Vasileios Hatzivassiloglou and Kathleen R. McKeown Presented By Yash Satsangi.
What is Learning All about ?  Get knowledge of by study, experience, or being taught  Become aware by information or from observation  Commit to memory.
Forecasting with Twitter data Presented by : Thusitha Chandrapala MARTA ARIAS, ARGIMIRO ARRATIA, and RAMON XURIGUERA.
More than words: Social networks’ text mining for consumer brand sentiments A Case on Text Mining Key words: Sentiment analysis, SNS Mining Opinion Mining,
Opinion mining in social networks Student: Aleksandar Ponjavić 3244/2014 Mentor: Profesor dr Veljko Milutinović.
Opinion Mining Using Econometrics: A Case Study on Reputation Systems Anindya Ghose, Panagiotis G. Ipeirotis, and Arun Sundararajan Leonard N. Stern School.
Our Twitter Profiles, Our Selves: Predicting Personality with Twitter Daniele Quercia, Michal Kosinski, David Stillwell, Jon Crowcroft COMP4332 Wong Po.
Learning user preferences for 2CP-regression for a recommender system Alan Eckhardt, Peter Vojtáš Department of Software Engineering, Charles University.
Can We Predict Eat Out Behavior of a Person from Tweets and Check-ins? Md. Taksir Hasan Majumder ( ) Md. Mahabur Rahman ( ) Department of Computer.
Predicting the Future With Social Media. Introduction Goal – How buzz and attention is created for different movies and how that changes over time.
Reduction of Training Noises for Text Classifiers Rey-Long Liu Dept. of Medical Informatics Tzu Chi University Taiwan.
Today Ensemble Methods. Recap of the course. Classifier Fusion
Predicting Personality from Twitter 1 Predicting Personality with Social Media 2 Jennifer Golbeck, Cristina Robles, Michon Edmondson 1, Karen Turner SocialCom.
By Gianluca Stringhini, Christopher Kruegel and Giovanni Vigna Presented By Awrad Mohammed Ali 1.
PIER Research Methods Protocol Analysis Module Hua Ai Language Technologies Institute/ PSLC.
META-LEARNING FOR AUTOMATIC SELECTION OF ALGORITHMS FOR TEXT CLASSIFICATION Karol Furdík, Ján Paralič, Gabriel Tutoky {Jan.Paralic,
Author Age Prediction from Text using Linear Regression Dong Nguyen Noah A. Smith Carolyn P. Rose.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Sentiment Analysis with Incremental Human-in-the-Loop Learning and Lexical Resource Customization Shubhanshu Mishra 1, Jana Diesner 1, Jason Byrne 2, Elizabeth.
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
Date: 2015/11/19 Author: Reza Zafarani, Huan Liu Source: CIKM '15
Computational Approaches for Biomarker Discovery SubbaLakshmiswetha Patchamatla.
Click to Add Title A Systematic Framework for Sentiment Identification by Modeling User Social Effects Kunpeng Zhang Assistant Professor Department of.
This week on social media Oct 29 th -Nov 4 th. General stats.
Post-Ranking query suggestion by diversifying search Chao Wang.
Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Carl Vondrick, Aditya Khosla, Tomasz Malisiewicz, Antonio Torralba Massachusetts Institute of Technology
 Effective Multi-Label Active Learning for Text Classification Bishan yang, Juan-Tao Sun, Tengjiao Wang, Zheng Chen KDD’ 09 Supervisor: Koh Jia-Ling Presenter:
Fabricio Benevenuto, Gabriel Magno, Tiago Rodrigues, and Virgilio Almeida Universidade Federal de Minas Gerais Belo Horizonte, Brazil ACSAC 2010 Fabricio.
 Hailey Maurer and Liya Zalaltdinova Lying Words: Predicting Deception From Linguistic Styles by Matthew L. Newman, James W. Pennebaker, Diane S. Berry.
Measuring User Influence in Twitter: The Million Follower Fallacy Meeyoung Cha Hamed Haddadi Fabricio Benevenuto Krishna P. Gummadi.
Antisocial Behavior in Online Discussion Communities Authors: Justin Cheng, Cristian Danescu-Niculescu-Mizily, Jure Leskovec Presented by: Ananya Subburathinam.
Alvin CHAN Kay CHEUNG Alex YING Relationship between Twitter Events and Real-life.
Unveiling Zeus Automated Classification of Malware Samples Abedelaziz Mohaisen Omar Alrawi Verisign Inc, VA, USA Verisign Labs, VA, USA
Finnish sentiment analysis and a tool for social media tracking
PREDICTION ON TWEET FROM DYNAMIC INTERACTION Group 19 Chan Pui Yee Wong Tsz Wing Yeung Chun Kit.
Opinion spam and Analysis 소프트웨어공학 연구실 G 최효린 1 / 35.
Introduction Sara Ahmadian, & Delroy L. Paulhus University of British Columbia Predicting Success among Republican Candidates: Personality and Verbal Style.
IDENTIFYING GREAT TEACHERS THROUGH THEIR ONLINE PRESENCE Evanthia Faliagka, Maria Rigou, Spiros Sirmakessis.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Machine Learning with Spark MLlib
Sentiment Analysis of Twitter Messages Using Word2Vec
Beyond the Words: Predicting User Personality from Heterogeneous Information Presenter: Benyi Gong.
By : Namesh Kher Big Data Insights – INFM 750
Learning Mid-Level Features For Recognition
A GACP and GTMCP company
Introduction to Azure Machine Learning Studio
Proportion of Original Tweets
Microsoft Ignite NZ October 2016 SKYCITY, Auckland.
The Assistive System Progress Report 2 Shifali Kumar Bishwo Gurung
Predicting Prevalence of Influenza-Like Illness From Geo-Tagged Tweets
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Multivariate Methods Berlin Chen
Basics of ML Rohan Suri.
Machine Learning – a Probabilistic Perspective
Analyzing Influence of Social Media Through Twitter
THE ASSISTIVE SYSTEM SHIFALI KUMAR BISHWO GURUNG JAMES CHOU
Presentation transcript:

Prediction of Influencers from Word Use Chan Shing Hei

Detecting influential users before they show observable signals of influence Method: psycholinguistic category scores from word usage Twitter Data  built predictive models of influence from such category based features Introduction

Measuring Influence measure influence score  the average number of retweets generated from a user's tweets

Dataset Twitter's streaming API Nov 1, 2013 to Nov 14, 2013 randomly sample 1000 users Tweets from last one month (Oct 2013) historical tweets (max 200)  average influence score: SD: 0.098

Psycholinguistic Analysis from text How to measure word use ?  users’ historical tweets with the Linguistic Inquiry and Word Count (LIWC) 2001 dictionary How to computed his/her LIWC based scores?  in each category as the ratio of the number of occurrences of words in that category in one’s tweets and the total number of words in his/her tweets

Influence and Word Use

Finding from analysis LIWC category that negatively correlated with influence score: Negative emotion physical states inhibition

LIWC category that positively correlated with influence score: more interactive positive feelings or emotion determination and desires for the future Finding from previous analysis

Prediction Models Using Weka regression analysis and a classification  influence score Evaluate the performance

Regression analysis linear regressions  predict influence score using LIWC measures

Classification study 1 supervised binary machine learning algorithms

divide influence scores into 10 equal sized bins, and trained supervised classifiers with 10 classes Classification study 2

Suggestion Adding more criteria for measure influence Other social media Real application – political campaigns

Conclusion Correlations of word usage with influence behavior discovers a set of psycholinguistic categories identify users early to be an influencer