Twitter Stance Detection with Bidirectional Conditional Encoding

Slides:



Advertisements
Similar presentations
Knowledge Transfer via Multiple Model Local Structure Mapping Jing Gao Wei Fan Jing JiangJiawei Han University of Illinois at Urbana-Champaign IBM T. J.
Advertisements

Knowledge Transfer via Multiple Model Local Structure Mapping Jing Gao, Wei Fan, Jing Jiang, Jiawei Han l Motivate Solution Framework Data Sets Synthetic.
Forward Semi-Supervised Feature Selection Jiangtao Ren, Zhengyuan Qiu, Wei Fan, Hong Cheng, and Philip S. Yu.
BEHAVIORAL PREDICTION OF TWITTER USERS BASED ON TEXTUAL INFORMATION Shiyao Wang.
Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.
Self Taught Learning : Transfer learning from unlabeled data Presented by: Shankar B S DMML Lab Rajat Raina et al, CS, Stanford ICML 2007.
Duyu Tang, Furu Wei, Nan Yang, Ming Zhou, Ting Liu, Bing Qin
Neural Networks in Data Mining “An Overview”
Learning from Multiple Outlooks Maayan Harel and Shie Mannor ICML 2011 Presented by Minhua Chen.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
Transfer Learning From Multiple Source Domains via Consensus Regularization Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, Qing He.
Predictive Modeling with Heterogeneous Sources Xiaoxiao Shi 1 Qi Liu 2 Wei Fan 3 Qiang Yang 4 Philip S. Yu 1 1 University of Illinois at Chicago 2 Tongji.
Authors : Ramon F. Astudillo, Silvio Amir, Wang Lin, Mario Silva, Isabel Trancoso Learning Word Representations from Scarce Data By: Aadil Hayat (13002)
Man vs. Machine: Adversarial Detection of Malicious Crowdsourcing Workers Gang Wang, Tianyi Wang, Haitao Zheng, Ben Y. Zhao, UC Santa Barbara, Usenix Security.
 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.
Page 1 Ming Ji Department of Computer Science University of Illinois at Urbana-Champaign.
Lecture 10: 8/6/1435 Machine Learning Lecturer/ Kawther Abas 363CS – Artificial Intelligence.
Transfer Learning Task. Problem Identification Dataset : A Year: 2000 Features: 48 Training Model ‘M’ Testing 98.6% Training Model ‘M’ Testing 97% Dataset.
Machine Learning.
Sentiment Analysis with Incremental Human-in-the-Loop Learning and Lexical Resource Customization Shubhanshu Mishra 1, Jana Diesner 1, Jason Byrne 2, Elizabeth.
Neural Net Language Models
Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:
Tweets Discrimination Analysis
Network Management Lecture 13. MACHINE LEARNING TECHNIQUES 2 Dr. Atiq Ahmed Université de Balouchistan.
A Hierarchical Deep Temporal Model for Group Activity Recognition
Three Facets of Online Political Networks: Communities, Antagonisms, and Controversial Issues presented by Mert Ozer.
Unsupervised Learning of Video Representations using LSTMs
Mark Cieliebak Jan Deriu Dominik Egger Fatih Uzdilli
IEEE BIBM 2016 Xu Min, Wanwen Zeng, Ning Chen, Ting Chen*, Rui Jiang*
End-To-End Memory Networks
Machine Learning overview Chapter 18, 21
Jingcheng Du, B.S., Jun Xu, Ph.D., Hsingyi Song, MPH, Cui Tao, Ph.D.
It’s All About Me From Big Data Models to Personalized Experience
Supervised Time Series Pattern Discovery through Local Importance
Matt Gormley Lecture 16 October 24, 2016
A Hierarchical Model of Reviews for Aspect-based Sentiment Analysis
Introductory Seminar on Research: Fall 2017
Giuseppe Attardi Dipartimento di Informatica Università di Pisa
Are End-to-end Systems the Ultimate Solutions for NLP?
Deep learning and applications to Natural language processing
Unsupervised Learning and Autoencoders
Image Question Answering
Proportion of Original Tweets
CellNetQL Image Segmentation without Feature Definition
On-going research on Object Detection *Some modification after seminar
Supervised vs. unsupervised Learning
Word embeddings based mapping
Word embeddings based mapping
SVM Based Learning System for F-term Patent Classification
Knowledge Transfer via Multiple Model Local Structure Mapping
Predicting Prevalence of Influenza-Like Illness From Geo-Tagged Tweets
Socialized Word Embeddings
Text Mining & Natural Language Processing
Unsupervised Pretraining for Semantic Parsing
Active Learning with Unbalanced Classes & Example-Generation Queries
Predicting Body Movement and Recognizing Actions: an Integrated Framework for Mutual Benefits Boyu Wang and Minh Hoai Stony Brook University Experiments:
Enriching Taxonomies With Functional Domain Knowledge
Using Uneven Margins SVM and Perceptron for IE
Hierarchical, Perceptron-like Learning for OBIE
Deep Learning Authors: Yann LeCun, Yoshua Bengio, Geoffrey Hinton
Speaker: Ming-Chieh, Chiang
Applied Machine Learning For Quant Finance
Unsupervised Networks Closely related to clustering
3. Adversarial Teacher-Student Learning (AT/S)
Keshav Balasubramanian
Week 3 Presentation Ngoc Ta Aidean Sharghi.
Bidirectional LSTM-CRF Models for Sequence Tagging
Da-Rong Liu, Kuan-Yu Chen, Hung-Yi Lee, Lin-shan Lee
Austin Karingada, Jacob Handy, Adviser : Dr
The experiment based on hier-attention
Presentation transcript:

Twitter Stance Detection with Bidirectional Conditional Encoding Isabelle Augenstein, Tim Rocktäschel, Andreas Vlachos, Kalina Bontcheva {i.augenstein | t.rocktaschel}@cs.ucl.ac.uk, {a.vlachos | k.bontcheva}@sheffield.ac.uk Stance Detection Classify attitude of tweet towards target as “favor”, “against”, “none” Tweet: “No more Hillary Clinton” Target: Donald Trump Stance: FAVOR Training targets: Climate Change is a Real Concern, Feminist Movement, Atheism, Legalization of Abortion, Hillary Clinton Testing target: Donald Trump Bidirectional Conditional Encoding Bidirectional encoding of tweet conditioned on bidirectional encoding of target The stance is predicted using the last forward and reversed output representations Setting Unseen Target Setting Train on Climate Change Is A Real Concern, Feminist Movement, Atheism, Legalization of Abortion, Hillary Clinton tweets Test on Donald Trump tweets Weakly Supervised Setting Weakly label Donald tweets using hashtags for training Challenges Tweet interpretation depends on target Solution: bidirectional conditional model Labelled data not available for the test target Solution: 1) domain adaptation; 2) weak labelling Results Unseen Target Models Weakly Supervised Models Comparison Against SOA Data SemEval 2016 Task 6 Twitter Stance Detection corpus 5 628 labelled training tweets (1 278 about Hillary Clinton, used for dev) 278 013 unlabelled Donald Trump tweets 395 212 additional unlabelled tweets about all targets 707 Donald Trump testing tweets Experiments Stance Detection Models Sequence-agnostic: BoWV Target-agnostic: Tweet-only LSTM encoding (TweetOnly) Target+Tweet, no conditioning: Concatenated target and tweet LSTM encodings (Concat) Target+Tweet, conditioning: Target conditioned on tweet (TarCondTweet); tweet conditioned on target (TweetCondTar); bidirectional conditioning (BiCond) Word Embedding Training Random initialisation Trained fixed word embeddings on tweets Pre-trained word embeddings on tweets, continue training with supervised objective Conclusions Learning conditional sequence representation of targets and tweets better than representations without conditioning State of the art performance with weakly supervised model Good target representations with unsupervised pre-training Successful sequence representation learning with small data Model Macro F1 SVM-ngrams-comb (official) 0.2843 BiCond (Unseen Target) 0.4901 pkudblab (Weakly Supervised*) 0.5628 BiCond (Weakly Supervised) 0.5803 References SemEval 2016 Stance Detection: http://alt.qcri.org/semeval2016/task6/ Code: https://github.com/sheffieldnlp/stance-conditional Acknowledgements This work was partially supported by the European Union, grant No. 611233 PHEME (http://www.pheme.eu) and by the Microsoft Research PhD Scholarship Programme.