On Dataless Hierarchical Text Classification

Slides:



Advertisements
Similar presentations
Knowledge Transfer via Multiple Model Local Structure Mapping Jing Gao Wei Fan Jing JiangJiawei Han University of Illinois at Urbana-Champaign IBM T. J.
Advertisements

Albert Gatt Corpora and Statistical Methods Lecture 13.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Textual Relations Task Definition Annotate input text with disambiguated Wikipedia titles: Motivation Current state-of-the-art Wikifiers, using purely.
Engeniy Gabrilovich and Shaul Markovitch American Association for Artificial Intelligence 2006 Prepared by Qi Li.
Deep Learning in NLP Word representation and how to use it for Parsing
A Survey on Text Categorization with Machine Learning Chikayama lab. Dai Saito.
Data Visualization STAT 890, STAT 442, CM 462
Global and Local Wikification (GLOW) in TAC KBP Entity Linking Shared Task 2011 Lev Ratinov, Dan Roth This research is supported by the Defense Advanced.
Text Classification With Support Vector Machines
Semi-Supervised Clustering Jieping Ye Department of Computer Science and Engineering Arizona State University
Linguistic Regularities in Sparse and Explicit Word Representations
Advanced Multimedia Text Retrieval/Classification Tamara Berg.
Introduction to Data Mining Engineering Group in ACL.
Longbiao Kang, Baotian Hu, Xiangping Wu, Qingcai Chen, and Yan He Intelligent Computing Research Center, School of Computer Science and Technology, Harbin.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Classifying Tags Using Open Content Resources Simon Overell, Borkur Sigurbjornsson & Roelof van Zwol WSDM ‘09.
Text mining.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
Eric H. Huang, Richard Socher, Christopher D. Manning, Andrew Y. Ng Computer Science Department, Stanford University, Stanford, CA 94305, USA ImprovingWord.
This work is supported by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center contract number.
Building high-level features using large-scale unsupervised learning Anh Nguyen, Bay-yuan Hsu CS290D – Data Mining (Spring 2014) University of California,
Constructing Knowledge Graph from Unstructured Text Image Source: Kundan Kumar Siddhant Manocha.
Semantic Embedding Space for Zero ­ Shot Action Recognition Xun XuTimothy HospedalesShaogang GongAuthors: Computer Vision Group Queen Mary University of.
Text mining. The Standard Data Mining process Text Mining Machine learning on text data Text Data mining Text analysis Part of Web mining Typical tasks.
Information Retrieval Lecture 6 Introduction to Information Retrieval (Manning et al. 2007) Chapter 16 For the MSc Computer Science Programme Dell Zhang.
Word representations: A simple and general method for semi-supervised learning Joseph Turian with Lev Ratinov and Yoshua Bengio Goodies:
Prepared by: Mahmoud Rafeek Al-Farra
A Clustering Method Based on Nonnegative Matrix Factorization for Text Mining Farial Shahnaz.
Evgeniy Gabrilovich and Shaul Markovitch
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
Iowa State University Department of Computer Science Center for Computational Intelligence, Learning, and Discovery Harris T. Lin, Sanghack Lee, Ngot Bui.
Iterative similarity based adaptation technique for Cross Domain text classification Under: Prof. Amitabha Mukherjee By: Narendra Roy Roll no: Group:
Advisor: Hsin-Hsi Chen Reporter: Chi-Hsin Yu Date: From Word Representations:... ACL2010, From Frequency... JAIR 2010 Representing Word... Psychological.
Second Language Learning From News Websites Word Sense Disambiguation using Word Embeddings.
Ganesh J, Soumyajit Ganguly, Manish Gupta, Vasudeva Varma, Vikram Pudi
Concept-Based Analysis of Scientific Literature Chen-Tse Tsai, Gourab Kundu, Dan Roth UIUC.
Vector Semantics Dense Vectors.
Data Mining and Text Mining. The Standard Data Mining process.
Hierarchical Semi-supervised Classification with Incomplete Class Hierarchies Bhavana Dalvi ¶*, Aditya Mishra †, and William W. Cohen * ¶ Allen Institute.
Unsupervised Sparse Vector Densification for Short Text Similarity
On Dataless Hierarchical Text Classification
Illinois CCG LoReHLT16 Situation Frame System
Cross-lingual Dataless Classification for Many Languages
Semi-Supervised Clustering
Korean version of GloVe Applying GloVe & word2vec model to Korean corpus speaker : 양희정 date :
Concept Grounding to Multiple Knowledge Bases via Indirect Supervision
Sofus A. Macskassy Fetch Technologies
School of Computer Science & Engineering
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Cross-lingual Dataless Classification for Many Languages
Relation Extraction CSCI-GA.2591
Introductory Seminar on Research: Fall 2017
Deep learning and applications to Natural language processing
Vector-Space (Distributional) Lexical Semantics
GLOW- Global and Local Algorithms for Disambiguation to Wikipedia
Margin-based Decomposed Amortized Inference
Unsupervised Learning and Autoencoders
Relational Inference for Wikification
Word Embeddings with Limited Memory
Prepared by: Mahmoud Rafeek Al-Farra
Automatic Information Fusion with Heterogeneous Information Networks
SVM Based Learning System for F-term Patent Classification
Knowledge Transfer via Multiple Model Local Structure Mapping
Socialized Word Embeddings
Ernest Valveny Computer Vision Center
Text Categorization Berlin Chen 2003 Reference:
Yangqiu Song Lane Department of CSEE West Virginia University
Presentation transcript:

On Dataless Hierarchical Text Classification Yangqiu Song and Dan Roth, Computer Science Department, University of Illinois at Urbana-Champaign Text Representation Classification Procedure Results Document Classification Label Representation 20newsgroups (Pure Dataless) Explicit Semantic Analysis (ESA) Label On Feb. 8, Dong Nguyen announced that he would be removing his hit game Flappy Bird from both the iOS and Android app stores, saying that the success of the game is something he never wanted. Some fans of the game took it personally, replying that they would either kill Nguyen or kill themselves if he followed through with his decision. Pick a label: Class1 or Class2 ? Gabrilovich and Markovitch. AAAI, 2006; IJCAI, 2007. Word Representation Similarity Barack Obama Timeline of the presidency of Barack Obama (2009) Family of Barack Obama Barack Obama citizenship conspiracy theories Barack Obama presidential primary campaign 2008 Document Representation Document Method Representation Feature Bag-of-words Sparse Vector Words’ TFIDF in non-zero elements ESA Wikipedia concept weights in non-zero elements Brown Cluster Cluster ID from root to path Word Embedding Dense Vector Compact real values Word Brown Clusters Brown et. al, Computational Linguistics, 1992. X-axis ESA: Number of concepts BC: Number of depth in word hierarchy WE: Number of dimensions of word embedding Mobile Game or Sports? Labels carry a lot of information! Unsupervised baseline OHLDA (Ha-Thuc, V., and Renders, J.-M. 2011. Large-scale hierarchical text classification without labeled data. In WSDM) Dataless Classification Hierarchical Classification Dataless definition No labeled data for training Depends on understanding the labels The importance of semantic representation A news article: Mobile Game or Sports? names of players, teams, activities of a match without mentioning the word sport New representation should be used Top-down classification Neural Network Word Embedding Supervised baseline Mikolov et. al. NIPS and HLTNAACL. 2013. Collobert et. al. JMLR 2011. (Senna) Turian et. al, ACL, 2010. Bottom-up classification (Flat classification) Mobile Game or Sports? Bootstrapping Conclusions Examples from: https://code.google.com/p/word2vec/ Dataless + semi-supervised learning It is possible to classify document into multiple (hierarchical) categories Systematic comparison Different representations Bottom-up vs. top-down Bootstrapping Possibility of classification in a larger labels space Chang, M.; Ratinov, L.; Roth, D.; and Srikumar, V. Importance of semantic represenation: Dataless classification. In AAAI. 2008. Elhoseiny, M.; Saleh, B.; and A.Elgammal. Write a classifier: Zero shot learning using purely textual descriptions. In ICCV, 2013. Initialize N documents for each label Pure dataless classifications Train a classifier to label N more documents for each label Continue until no unlabeled document This work is supported by the Army Research Laboratory (ARL) under agreement W911NF-09-2-0053, by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center contract number D11PC20155, and by DARPA under agreement number FA8750-13-2-0008.