Download presentation
Presentation is loading. Please wait.
Published byHolly Fitzgerald Modified over 8 years ago
1
Inferring Perceived Demographics from User Emotional Tone and User-Environment Emotional Svitlana Volkova 1, Yoram Bachrach 2 1 Center for Language and Speech Processing Johns Hopkins University (now at Pacific Northwest National Laboratory) 2 Microsoft Research Cambridge
2
Social Media Obsession Personalized Diverse Timely Large volumes Billions of messages Millions of users Multilingual Automatically Inferring Demographics, Personality, Emotions, Opinions, Interests
3
What do they think and feel? Where do they go? What are their demographics and personality? What do they like? What do they buy? User Attribute Prediction Task
4
User Attribute Prediction Task Political Preference Rao et al., 2010; Conover et al., 2011, Pennacchiotti and Popescu, 2011; Zamal et al., 2012; Cohen and Ruths, 2013; Volkova et. al, 2014...... Communications Gender Garera and Yarowsky, 2009; Rao et al., 2010; Burger et al., 2011; Van Durme, 2012; Zamal et al., 2012; Bergsma and Van Durme, 2013 Age Rao et al., 2010; Zamal et al., 2012; Cohen and Ruth, 2013; Nguyen et al., 2011, 2013; Sap et al., 2014 … … … … AAAI 2015 Demo (joint work with Microsoft Research) Income, Education, Ethnicity, Life Satisfaction, Optimism, Personality (stress, excitement)
5
Research Question: Do emotional differences between users and their neighbours correlate with user demographics? What do they think and feel? What are their demographics? Emotional contagion on Facebook neighbours
6
Sampling Twitter Data English speaking users from the US and Canada Average Twitter users! No news or celebs! But not dead! –Tweet on average 4 – 10 tweets per day; > 20 friends 10K random users with 200 tweets = 2M tweets Randomly sample 15 of their neighbors U = 123K users, T = 25M tweets
7
GenderAge…IncomeTweetEmoSent Male≥25> $35KOmg I'm bored an annoyedAngerNeg ……………… Female<25≤ $35K@HarrysKiss ohhh!!thanksss!JoyPos Joy, Sad,..., Anger, Pos, Neg #emo + #syns Demographic Predictions Attribute Models Φ A (u) Crowdsourcing 10K Users + 113K Neighbours25M Tweets Gender, …, Age, Income Train EmoSent Predictions EmoSent Models Φ E (t) Train Demographic Classification EmoSent Classification 5K 75 K Approach 19 K How did we get annotations? How did we build the models? Sent Emo
8
Annotation Schemes Ways to get ≈“ground truth” annotations: Fun psychological tests (voluntarily): myPersonality project Profile info: Facebook, Youtube, Google+ e.g., relationship, gender, age but sparse for Twitter Self reports: “I am a republican…” (Volkova et al. 2013), “Happy ##th/st/nd/rd birthday to me” (Zamal et. al. 2012), “I have been diagnosed with …” (Coppersmith et. al. 2014), “I am a writer …” (Beller at. al., 2014) Distant supervision: following Obama vs. Romney (Zamal et. al. 2012), emotion hashtags (Mohammad et. al, 2014), links to blogs (Burger et. al., 2011) Crowdsourcing: subjective perceived annotations (Volkova et. al.2015), rationales (Bergsma et. al., 2013, Volkova et. al, 2014; 2015) A n n o t a t i o n B i a s e s
9
Perceived Attribute Annotation via Crowdsourcing 5K profiles Attribute Models Φ A (u) ULUL UPUP 5K 123K Trusted crowd $6/hour quality control
10
Tweets Revealing User Attributes ? ? ? ?
11
Predictive Models Supervised text classification: Lexical: binary word bigrams Socio-linguistic, syntactic and stylistic: smiles, hashtags, mixed punctuation, elongations, capitalization (POS tags, LIWC lexicon features, punctuation) Affect: emotion and sentiment proportions
12
Attribute Classification Quality
13
Predictive Features for Income
14
Comparison with Other Models Train \ TestUsersBurger at. al., 2011Gender graphPerceived Burger at. al., 201171,3120.71 0.83 Gender graph3830.470.790.53 Perceived4,9980.580.660.84 Train \ TestUsersGeo-centricCand-centricActivePerceived Geo-centric2700.660.470.530.31 Cand-centric1,0310.540.760.580.76 Active3710.600.630.870.76 Perceived2,4980.560.490.610.76
15
Emotion and Sentiment Prediction Quality Features: BOW + Negation, Stylistic +0.3F1 Elongations Yaay, woooow, Capitalization COOL, Mixed Punctuation ???!!! Hashtags and Emoticons 6 classes: joy, sadness, fear, surprise, disgust, anger 3 classes: positive, negative and neutral F1=0.78 (Roberts’12 0.67, Qadir’13 0.53, Mohammad’14 0.49) F1=0.66 SemEval-2013 (3K tweets) (1 st 0.69, 2 nd 0.65 out of 38 teams)
16
User-Environment Emotional Contrast Given Latent User Properties @modollar1. Happy bday bro FREE100 MISS YOU MY BOY Omg I'm bored an annoyed The Green Mile is phenomenal! @AmandaBWrites your new avi is adorable! @HarrysKiss ohhhhhh!!thanksss! User Emotional ToneEnvironment Emotional Tone Predicted Attributes
17
Methodology (1) Similarities between incoming and outgoing emotion and opinion distributions via Jensen- Shannon Divergence:
18
Methodology (2) Differences between every outgoing and incoming emotion and sentiment: Mann Whitney test on JSDs and emotion differences :
19
User-Environment Emotional Contrast Sadness Amplify + Dampen – Joy Fear SurpriseAnger Disgust Amplify + Dampen – Older Kids Dissatisfied Not Excited Older, High Income, Degree Male, Kids Stressed Pessimists
20
Attribute Prediction Improvement +0.04 +0.05 +0.11 +0.12 +0.11 +0.08 +0.17 +0.10 +0.08 +0.11 Gain over BOW
21
Predicting Demographics from User Outgoing Emotions and Opinions 0.76 0.58 0.65 0.66 0.72 0.76 0.75 0.76 0.69 0.73 AUC ROC Satisfied Optimist Dissatisfied Pessimist No Kids Below 25 y.o. Female Male 1/3 attributes AUC >=75%
22
Predicting Demographics from User- Environment Emotional Contrast 0.61 0.54 0.55 0.58 0.64 0.62 0.68 0.57 0.59 0.60 AUC ROC 1/2 attributes AUC >=60%
23
Applications Personalized recommendation and search Personalized marketing Online targeted advertising Recruitment and human resource management Real-time, large-scale polling Healthcare analytics
24
Summary I.Different demographics => varied reactions to the emotional tone in user environments II.Emotions and user-environment emotional contrasts improve attribute prediction
25
Questions? yobach@microsoft.com svitlanavolkova@pnnl.gov Data, code and models: http://www.cs.jhu.edu/~svitlana/
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.