Online Bayesian Models for Personal Analytics in Social Media Svitlana Volkova and Benjamin Van Durme

Slides:



Advertisements
Similar presentations
1 ©2009 MeeMix MeeMix – A personalized Experience.
Advertisements

1 ©2009 MeeMix MeeMix – A personalized Experience.
Systematic Data Selection to Mine Concept Drifting Data Streams Wei Fan IBM T.J.Watson.
Inferring User Political Preferences from Streaming Communications Svitlana Volkova 1, Glen Coppersmith 2 and Benjamin Van Durme 1,2 1 Center for Language.
COMP423 Intelligent Agents. Recommender systems Two approaches – Collaborative Filtering Based on feedback from other users who have rated a similar set.
1 RegionKNN: A Scalable Hybrid Collaborative Filtering Algorithm for Personalized Web Service Recommendation Xi Chen, Xudong Liu, Zicheng Huang, and Hailong.
An RG theory of cultural evolution Gábor Fáth Hungarian Academy of Sciences Budapest, Hungary in collaboration with Miklos Sarvary - INSEAD, Fontainebleau,
Introduction to Supervised Machine Learning Concepts PRESENTED BY B. Barla Cambazoglu February 21, 2014.
2. Introduction Multiple Multiplicative Factor Model For Collaborative Filtering Benjamin Marlin University of Toronto. Department of Computer Science.
Computing Trust in Social Networks
Neural Technology and Fuzzy Systems in Network Security Project Progress 2 Group 2: Omar Ehtisham Anwar Aneela Laeeq
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Collaborative Filtering Shaun Kaasten CPSC CSCW.
Exploration & Exploitation in Adaptive Filtering Based on Bayesian Active Learning Yi Zhang, Jamie Callan Carnegie Mellon Univ. Wei Xu NEC Lab America.
Recommender systems Ram Akella November 26 th 2008.
Extracting Interest Tags from Twitter User Biographies Ying Ding, Jing Jiang School of Information Systems Singapore Management University AIRS 2014, Kuching,
12 -1 Lecture 12 User Modeling Topics –Basics –Example User Model –Construction of User Models –Updating of User Models –Applications.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Perceptions of Behavioral Advertising among CMU Community Ashwini Rao April 21, 2014.
Large-Scale Cost-sensitive Online Social Network Profile Linkage.
PNC 2011: Pacific Neighborhood Consortium S-Sense: An Opinion Mining Tool for Market Intelligence Choochart Haruechaiyasak and Alisa Kongthon Speech and.
1 Efficiently Learning the Accuracy of Labeling Sources for Selective Sampling by Pinar Donmez, Jaime Carbonell, Jeff Schneider School of Computer Science,
Internet Marketing. What is Marketing? n The strategies and actions firms take to establish a relationship with a consumer and encourage purchases of.
AGES 2.0: Activating and Guiding the Engagement of Seniors through Social Media 1.
On Sparsity and Drift for Effective Real- time Filtering in Microblogs Date : 2014/05/13 Source : CIKM’13 Advisor : Prof. Jia-Ling, Koh Speaker : Yi-Hsuan.
TOGETHER, WE CAN FIND BETTER. DIGITAL. AGENDA  The Bismarck Tribune and bismarcktribune.com readership  Bismarcktribune.com Advertising Options  Advanced.
1 Information Filtering & Recommender Systems (Lecture for CS410 Text Info Systems) ChengXiang Zhai Department of Computer Science University of Illinois,
Modeling Relationship Strength in Online Social Networks Rongjing Xiang: Purdue University Jennifer Neville: Purdue University Monica Rogati: LinkedIn.
Mean Field Inference in Dependency Networks: An Empirical Study Daniel Lowd and Arash Shamaei University of Oregon.
Data Mining and Machine Learning Lab Network Denoising in Social Media Huiji Gao, Xufei Wang, Jiliang Tang, and Huan Liu Data Mining and Machine Learning.
Tie Strength, Embeddedness & Social Influence: Evidence from a Large Scale Networked Experiment Sinan Aral, Dylan Walker Presented by: Mengqi Qiu(Mendy)
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
User Models for Personalization Josh Alspector Chief Technology Officer.
 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
Personalizing Search on Shared Devices Ryen White and Ahmed Hassan Awadallah Microsoft Research, USA Contact:
Confidence-Aware Graph Regularization with Heterogeneous Pairwise Features Yuan FangUniversity of Illinois at Urbana-Champaign Bo-June (Paul) HsuMicrosoft.
Collaborative Information Retrieval - Collaborative Filtering systems - Recommender systems - Information Filtering Why do we need CIR? - IR system augmentation.
The Database and Info. Systems Lab. University of Illinois at Urbana-Champaign User Profiling in Ego-network: Co-profiling Attributes and Relationships.
Exploiting Wikipedia Categorization for Predicting Age and Gender of Blog Authors K Santosh Aditya Joshi Manish Gupta Vasudeva Varma
Improving Search Results Quality by Customizing Summary Lengths Michael Kaisser ★, Marti Hearst  and John B. Lowe ★ University of Edinburgh,  UC Berkeley,
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Usability & Prototyping Uday M. Shankar Yahoo! Bangalore India.
CoCQA : Co-Training Over Questions and Answers with an Application to Predicting Question Subjectivity Orientation Baoli Li, Yandong Liu, and Eugene Agichtein.
Recommender Systems. Recommender Systems (RSs) n RSs are software tools providing suggestions for items to be of use to users, such as what items to buy,
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Four or More: The New Demographic Mary Madden Pew Internet & American Life Project LITA President’s Program ALA – June 27, 2010 (and a bunch of other really.
Recognizing Stances in Online Debates Unsupervised opinion analysis method for debate-side classification. Mine the web to learn associations that are.
Socialbots and its implication On ONLINE SOCIAL Networks Md Abdul Alim, Xiang Li and Tianyi Pan Group 18.
Mining information from social media
Xiangnan Kong,Philip S. Yu An Ensemble-based Approach to Fast Classification of Multi-label Data Streams Dept. of Computer Science University of Illinois.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Predicting Consensus Ranking in Crowdsourced Setting Xi Chen Mentors: Paul Bennett and Eric Horvitz Collaborator: Kevyn Collins-Thompson Machine Learning.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Introduction Web analysis includes the study of users’ behavior on the web Traffic analysis – Usage analysis Behavior at particular website or across.
Google Confidential and Proprietary 1 1 Google Consumer Surveys Monetization Through Microsurveys Amy Adams Harding Head of Publisher Development
RESEARCH MOTHODOLOGY SZRZ6014 Dr. Farzana Kabir Ahmad Taqiyah Khadijah Ghazali (814537) SENTIMENT ANALYSIS FOR VOICE OF THE CUSTOMER.
Twitter as a Corpus for Sentiment Analysis and Opinion Mining
Applying Link-based Classification to Label Blogs Smriti Bhagat, Irina Rozenbaum Graham Cormode.
@nmoneypenny Innovating New Products & Services with Enterprise Social Graphing: Naomi Moneypenny.
Predicting and Adapting to Poor Speech Recognition in a Spoken Dialogue System Diane J. Litman AT&T Labs -- Research
DOWeR Detecting Outliers in Web Service Requests Master’s Presentation of Christian Blass.
Inferring Perceived Demographics from User Emotional Tone and User-Environment Emotional Svitlana Volkova 1, Yoram Bachrach 2 1 Center for Language and.
The Price of Free Privacy Leakage in Personalized Mobile In-App Ads
Contextual Intelligence as a Driver of Services Innovation
Web Mining Ref:
Summary Presented by : Aishwarya Deep Shukla
Aspect-based sentiment analysis
Personalizing Search on Shared Devices
Quantifying Deception Propagation on Social Networks
Presentation transcript:

Online Bayesian Models for Personal Analytics in Social Media Svitlana Volkova and Benjamin Van Durme Center for Language and Speech Processing, Johns Hopkins University, Human Language Technology Center of Excellence

Social Media Predictive Analytics Personalized, diverse and timely data Can reveal user interests, preferences and opinions Social Network Prediction App - DemographicsPro – WolphralAlpha Analytics –

User Attribute Prediction Task Political Preference Rao et al., 2010; Conover et al., 2011, Pennacchiotti and Popescu, 2011; Zamal et al., 2012; Cohen and Ruths, 2013; Volkova et. al, Communications Gender Garera and Yarowsky, 2009; Rao et al., 2010; Burger et al., 2011; Van Durme, 2012; Zamal et al., 2012; Bergsma and Van Durme, 2013 Age Rao et al., 2010; Zamal et al., 2012; Cohen and Ruth, 2013; Nguyen et al., 2011, 2013; Sap et al., 2014 … … … … AAAI 2015 Demo (joint work with Microsoft Research) Income, Education Level, Ethnicity, Life Satisfaction, Optimism, Personality, Showing Off, Self-Promoting

Outline I.Our Approach II.Dynamic (Streaming) Models III.Experimental Results IV.Practical Recommendations

Existing Approaches ~1K Tweets* …. … How long does it take for an average Twitter user to produce thousands of tweets? *Rao et al., 2010; Conover et al., 2011; Pennacchiotti and Popescu, 2011a; Burger et al., 2011; Zamal et al., 2012; Nguyen et al., 2013 Tweets as a document What if we want to make reliable predictions immediately after 10 tweets?

Attributed Social Networks *Conover et al., 2011; Pennacchiotti and Popescu, 2011a; Zamal et al., 2012; Volkova et al., 2014.

Our Approach Static (Batch) Predictions Streaming (Online) Inference Dynamic (Iterative) Learning and Prediction Offline training Offline predictions No or limited network information Offline training Online predictions in time (ACL’14) Exploring 6 types of neighborhoods ①Streaming nature of SM: dynamic training and prediction ②Network structure: joint user-neighbour streams ③Trade-off between prediction time vs. model quality Online predictions Relying on neighbors +Iterative re-training +Active learning +Interactive rationale annotation

Online Predictions: Iterative Bayesian Updates Time … ? ?

Iterative Batch Learning Time R D ? ? t1t1 … t1t1 Labeled Unlabeled t1t1 t1t1  Iterative Batch Retraining (IB)  Iterative Batch with Rationale Filtering (IBR) ? tmtm … tmtm t2t2 … t2t2 … tmtm t2t2 …

Rationales Rationales are explicitly highlighted ngrams in tweets that best justified why the annotators made their labeling decisions

Active Learning Labeled Unlabeled 1-Jan Feb Nov Dec-2011 Time … …  Active Without Oracle (AWOO)  Active With Rationale Filtering (AWR)  Active With Oracle (AWO)

Performance Metrics Accuracy over time: Find optimal models: – Data steam type (user, friend, user + friend) – Time (more correctly classified users faster) – Prediction quality (better accuracy over time)

Experimental Results: Data Stream and Prediction Time IBIBR AWOOAWR

Results: Iterative Batch Learning IB: higher recallIBR: higher precision Time: # correctly classified users increases over time IB faster, IBR slower Data stream selection: User + friend stream > user stream

Results: Active Learning AWOO: higher recallAWR: higher precision Time: Unlike IB/IBR models, AWOO/AWR models classify more users correctly faster (in Mar) but then plateaus

batch < active user + friend > user Results: Model Quality

Active with Oracle Annotations Oracle is 100% correct Thousands of tweets in training

Summary Active learning > iterative batch N, UN > U: “neighbors give you away” Higher confidence => higher precision, lower confidence => higher recall (as expected) Rationales significantly improve results

Practical Recommendations If you want to deliver ads fast but to be less confident in user attribute predictions: – use models with higher recall (AWOO, IB) – apply lower decision threshold e.g., 0.55 If you want to deliver ads to a true target crowd but latter in time: – use models with higher precision (AWR, IBR) – apply higher decision threshold e.g., 0.95 – models with rational filtering (IBR, AWR) require less computation (lower-dimensional feature vectors), are more accurate but annotations cost money (Mechanical Turk) For highly assortative attributes e.g., political preference use a joint user-neighbor stream

Applications Online targeted advertising Personalized marketing Large-scale real-time healthcare analytics Personalized recommendation systems and search Large-scale passive and real-time live polling Recruitment and human resource management Dating services

Applications (1) Online targeted advertising Targeting ads based on predicted user features Personalized marketing Detecting opinions users express about products or services within targeted populations Large-scale real-time healthcare analytics Identifying patterns of depression or mental illnesses within users of certain demographics Personalized recommendation systems and search

Applications (2) Recruitment and human resource management Estimating emotional stability and personality of the potential employees Measuring the overall well-being of the employees e.g., life satisfaction, happiness. Dating services Matching user profiles by comparing their personalities, interests and emotional tone in social media Large-scale passive and real-time live polling Mining political opinions, voting predictions for the groups of users with certain demographics

Thank you! Labeled Twitter network data for gender, age, political preference prediction: Interested in using our models for your research or collaboration: code and pre-trained models for inferring demographic attributes, personality and 6 Ekman’s emotions available on request: AAAI Technical Demo Inferring Latent User Properties from Texts Published in Social Media Wednesday, January 28 6:30 – 8:00 Zilker Ballroom I am on a job market. Hire me!

Cand-Centric Graph: Belief Updates ? … Time ? …

Cand-Centric: Prediction Time (1) User-Neighbor User Stream Dem Rep Prediction confidence: 0.95 vs Democrats are easier to predict than republicans Dem Rep Users classified correctly

Batch vs. Online Performance

Summary: Streaming (Online) Prediction Neighborhood content is useful * (homophily) Streaming models >> batch models Tweeting frequency matters a lot! Generalization of the classifiers depends on data sampling or annotation biases. *Pennacchiotti and Popescu, 2011a, 2001b; Conover et al., 2011a, 2001b; Golbeck et al., 2011; Zamal et al., 2012; Volkova et. al., 2014

Iterative Batch vs. Active w/o Oracle Predictions AWOO IB

Active with Oracle Learning