WebChild: Harvesting and Organizing Commonsense Knowledge from Web

Slides:



Advertisements
Similar presentations
Date: 2014/05/06 Author: Michael Schuhmacher, Simon Paolo Ponzetto Source: WSDM’14 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Knowledge-based Graph Document.
Advertisements

YAGO: A Large Ontology from Wikipedia and WordNet Fabian M. Suchanek, Gjergji Kasneci, Gerhard Weikum Max-Planck-Institute for Computer Science, Saarbruecken,
Gerhard Weikum Max Planck Institute for Informatics & Saarland University Semantic Search: from Names and Phrases to.
Knowledge Base Completion via Search-Based Question Answering
Sequence Clustering and Labeling for Unsupervised Query Intent Discovery Speaker: Po-Hsien Shih Advisor: Jia-Ling Koh Source: WSDM’12 Date: 1 November,
Creating a Similarity Graph from WordNet
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Language-Independent Set Expansion of Named Entities using the Web Richard C. Wang & William W. Cohen Language Technologies Institute Carnegie Mellon University.
Designing clustering methods for ontology building: The Mo’K workbench Authors: Gilles Bisson, Claire Nédellec and Dolores Cañamero Presenter: Ovidiu Fortu.
Self-organizing Conceptual Map and Taxonomy of Adjectives Noriko Tomuro, DePaul University Kyoko Kanzaki, NICT Japan Hitoshi Isahara, NICT Japan April.
Saarbrucken / Germany ¨
Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.
Querying RDF Data with Text Annotated Graphs Lushan Han, Tim Finin, Anupam Joshi and Doreen Cheng SSDBM’15 
MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics – Bag of concepts – Semantic distance between two words.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Attribute Extraction and Scoring: A Probabilistic Approach Taesung Lee, Zhongyuan Wang, Haixun Wang, Seung-won Hwang Microsoft Research Asia Speaker: Bo.
Reyyan Yeniterzi Weakly-Supervised Discovery of Named Entities Using Web Search Queries Marius Pasca Google CIKM 2007.
Date : 2014/09/18 Author : Niket Tandon, Gerard de Melo, Fabian Suchanek, Gerhard Weikum Source : WSDM’14 Advisor : Jia-ling Koh Speaker : Shao-Chun Peng.
LOD for the Rest of Us Tim Finin, Anupam Joshi, Varish Mulwad and Lushan Han University of Maryland, Baltimore County 15 March 2012
Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.
Natural Language Questions for the Web of Data Mohamed Yahya, Klaus Berberich, Gerhard Weikum Max Planck Institute for Informatics, Germany Shady Elbassuoni.
Natural Language Questions for the Web of Data Mohamed Yahya 1, Klaus Berberich 1, Shady Elbassuoni 2 Maya Ramanath 3, Volker Tresp 4, Gerhard Weikum 1.
Wikipedia as Sense Inventory to Improve Diversity in Web Search Results Celina SantamariaJulio GonzaloJavier Artiles nlp.uned.es UNED,c/Juan del Rosal,
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
Natural Language Questions for the Web of Data 1 Mohamed Yahya, Klaus Berberich, Gerhard Weikum Max Planck Institute for Informatics, Germany 2 Shady Elbassuoni.
Authors: Marius Pasca and Benjamin Van Durme Presented by Bonan Min Weakly-Supervised Acquisition of Open- Domain Classes and Class Attributes from Web.
Natural Language Questions for the Web of Data Mohamed Yahya, Klaus Berberich, Gerhard Weikum Max Planck Institute for Informatics, Germany Shady Elbassuoni.
Tutorial: Knowledge Bases for Web Content Analytics
Acquisition of Categorized Named Entities for Web Search Marius Pasca Google Inc. from Conference on Information and Knowledge Management (CIKM) ’04.
Knowledge Structure Vijay Meena ( ) Gaurav Meena ( )
Gaby Nativ, SDBI  Motivation  Other Ontologies  System overview  YAGO Dive IN  LEILA  NAGA  Conclusion.
Semantic Grounding of Tag Relatedness in Social Bookmarking Systems Ciro Cattuto, Dominik Benz, Andreas Hotho, Gerd Stumme ISWC 2008 Hyewon Lim January.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Sentiment Analysis Using Common- Sense and Context Information Basant Agarwal 1,2, Namita Mittal 2, Pooja Bansal 2, and Sonal Garg 2 1 Department of Computer.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics Semantic distance between two words.
Question Classification Ling573 NLP Systems and Applications April 25, 2013.
1 Commonsense Reasoning in and over Natural Language Hugo Liu Push Singh Media Laboratory Massachusetts Institute of Technology Cambridge, MA 02139, USA.
Neural representation and decoding of the meanings of words
Einat Minkov University of Haifa, Israel CL course, U
Science Process Skills
An Introduction to Markov Logic Networks in Knowledge Bases
Information Organization: Overview
Science Process Skills
and Knowledge Graphs for Query Expansion Saeid Balaneshinkordan
Title {Calibri 36 Font Size}
HowToKB: Mining HowTo Knowledge from Online Communities
Science Process Skills
Commonsense Knowledge Acquisition and Applications
Semantic Network & Knowledge Graph
Enhanced Dependency Jiajie Yu Wentao Ding.
Knowlywood: Mining Activity Knowledge from Hollywood Narratives
Acquiring Comparative Commonsense Knowledge from the Web
Learning Emoji Embeddings Using Emoji Co-Occurrence Network Graph
Enriching Structured Knowledge with Open Information
Commonsense in Parts: Mining Part-Whole Relations from the Web and Image Tags Niket Tandon1, Charles Hariman1, Jacopo Urbani1,2, Anna.
DBpedia 2014 Liang Zheng 9.22.
A Graph-Based Approach to Learn Semantic Descriptions of Data Sources
Ying Dai Faculty of software and information science,
Effective Entity Recognition and Typing by Relation Phrase-Based Clustering
Information Networks: State of the Art
Semantic Similarity Methods in WordNet and their Application to Information Retrieval on the Web Yizhe Ge.
Semantic Nets and Frames
ProBase: common Sense Concept KB and Short Text Understanding
Information Organization: Overview
Aishwarya Singh 03/18/2019 Acquiring Comparative Commonsense Knowledge from the Web Niket Tandon, Gerard de Melo, Gerhard Weikum.
Tantan Liu, Fan Wang, Gagan Agrawal The Ohio State University
Presented by: Anurag Paul
Presentation transcript:

WebChild: Harvesting and Organizing Commonsense Knowledge from Web Niket Tandon Max Planck Institute for Informatics Saarbrücken, Germany Joint work with: Gerard de Melo, Fabian Suchanek, Gerhard Weikum

Why Computers Need Commonsense Knowledge Who looks hot ? pop-singer-n1 hasAppearance hot-a3 What tastes hot ? chili-n1 hasTaste hot-a9 What is hot ? volcano-n1 hasTemperature hot-a1

Why Knowledge Bases Are Not Sufficient Freebase (+ Dbpedia, Yago, …) Jay-Z bornOn 4-Dec-1969 Jay-Z bornIn Brooklyn Brooklyn locatedIn NewYorkCity Jay-Z marriedTo Beyonce ….. only facts about named entities ConceptNet (+ …) pop-singer isa musician pop-singer hasProperty hot volcano hasProperty hot action hasProperty hot ….. hot  hot  hot Start with: In the age of google graph and knowledge-bases, you might think that no other KB is required. only hasProperty or relatedTo

Key Novelties of WebChild Fine-grained relations for commonsense knowledge (derived from WordNet): hasAppearance, hasTaste, hasTemperature, hasShape, evokesEmotion, ….. Sense-disambiguated arguments of knowledge triples (mapped to WordNet): pop-singer-n1 hasAppearance hot-a3 chili-n1 hasTaste hot-a9 volcano-n1 hasTemperature hot-a1 Semantically refined (see next slide) = Fine grained and sense disambiguated. Remind people that WordNet is a large lexical db of English words containing fine grained senses of every word.

Semantically refined commonsense triples 1. Extract generic: salsa hasProperty hot beautiful rose salsa was really hot … Patterns <adj> <noun> <noun> linking_verb [adverb] <adj>

Semantically refined commonsense triples 1. Extract generic: salsa hasProperty hot 2. Refine: salsa-n1 hasTaste hot-a9 WordNet “salsa” 19 fine-grained relations hasEmotion hasSound hasTaste hasAppearance … WordNet “hot”

Semantically refined commonsense triples Refine: salsa-n1 hasTaste hot-a9 what has taste disambiguate, classify, rank how does it taste Domain Population Computing Assertion Range Population pizza-n1 sauce-n1 java-n2 … chocolate-n2 , sweet-a1 milk-n1, tasty-a1 … spicy-a1 hot-a9 sweet-a1 … We decompose our problem into three parts. S ; P ; O Noun sense ; frequent pairs ; Adj sense

Graph construction per relation (e.g. hasTaste) Edge weight: taxonomic (between senses) , co-occurrence statistics (between words), distributional (between word, senses). 0.4 salsa 0.3 0.8 sauce

Label Propagation on constructed graph for domain of hasTaste 0.4 0.4 salsa salsa 0.3 0.3 0.8 0.8 sauce sauce Seed label loss Similar node diff label loss Regularize Talk: about labels = valid or not valid. In ICMAD, only vertices containing initial labels are smoothed. $$\left [ \mu_1 \sum_{v}p_v^{inj}\sum_{l}{ (Y_{vl}-\hat{Y}_{vl})^2 + \mu_2 \sum_{v,u} p_v^{cont}W_{vu} \sum_{l}{ (Y_{vl}-\hat{Y}_{ul})^2 } + \mu_3 \sum_{vl}{ (\hat{Y}_{vl} -R_{vl})^2} +\mu_4 \sum_v \sum_{k\neq l} C_{in(lk)}Y_{vk}(\hat{Y_{vl}}-\hat{Y_{vk}})^2 \right ]}$$ \\ ++ \hspace{1cm} s.t. \hspace{1cm} \hat{Y_{vl}} \geq \hat{Y_{vl}} , v \in V, l' \in implied(l)

Similar node diff label loss WebChild : Model Domain (hasTaste) Range (hasTaste) Assertions (hasTaste) Seed label loss Similar node diff label loss Regularize

Experiments Accuracy: over manually sampled data. Statistics: Large, semantically refined commonsense knowledge. Hartung: (0.25 Million) assertions only #instances Precision Noun senses 221 K 0.80 Adj senses 7.7 K 0.90 Assertions 4.6 M 0.82

WebChild: Examples ... ... ... Set expansion for: keyboard-n1 Domain (hasShape) face-n1 leaf-n1 ... Range (hasShape) triangular-a1 tapered-a1 ... Assertions (hasSshape) lens-n1, spherical-a2 palace-n2, domed-a1 ... Set expansion for: keyboard-n1 Set expansion for: keyboard-n2 Top 10 adjectives ergonomic, foldable, sensitive, black, comfortable, compact, lightweight, comfy, pro, waterproof Top 5 expansions keyboard, usb keyboard, computer keyboard, qwerty keyboard, optical mouse, touch screen Top 10 adjectives universal, magnetic, small, ornamental, decorative, solid, heavy, white, light, cosmetic Top 5 expansions wall mount, mounting bracket, wooden frame, carry case, pouch

Conclusion Graph methods help overcome sparsity of commonsense in text. WebChild: First commonsense KB with fine-grained relations and disambiguated arguments ; 4.6 million assertions including domain and range for 19 relations. Publically available at: www.mpi-inf.mpg.de/yago-naga/webchild/ Say it: People usually say commonsense knowledge cannot be found in text. This paper shows that with graph-based methods you can still uncover it and even infer fine-grained disambiguated knowledge. In general, for deeper text understanding and for AI complete tasks. Give sweet house translation example..

Additional slides.

Use Case: Set Expansion Output: top ranked adjectives and similar nouns (cosine over attributes) . Input: chocolate-n2 Top 10 adjectives smooth, assorted, dark, fine, delectable, black, decadent, white, yummy, creamy Top 5 expansions chocolate bar, chocolate cake, milk chocolate, chocolate chip, chocolate fudge Input: keyboard-n1 Top 10 adjectives ergonomic, foldable, sensitive, black, comfortable, compact, lightweight, comfy, pro, waterproof Top 5 expansions keyboard, usb keyboard, computer keyboard, qwerty keyboard, optical mouse, touch screen Experiments: Baselines – Google sets Results Use case

Approach For range and domain population: Extract a large list of ambiguous (potentially noisy) candidates. Construct a weighted graph of ambiguous words and their senses. Mark few seed nodes in the graph. Use propagation concept: similar nodes (beautiful) (lovely) have similar labels For computing assertion: Use the range and domain to prune search space of assertions (for a relation) Use propagation concept: similar nodes (car, sweet) (car, lovely) similar labels.

Approach: Extract and refine red rose rose was very beautiful temperature was hot Y/adj X/noun X/noun linking_verb adverb Y/adj Google n-grams

Goal: Semantically refined commonsense properties Connect nouns with adjectives via fine-grained relations 1. Extract: suit hasProperty hot 2. Refine: suit-n2 quality . appearance hot-a3 state feeling emotion motion quality appearance color beauty attribute smell sound taste physical temperature weight WordNet “suit” Lawsuit Dress Playing card suit … WordNet “hot” Burning Violent Stylish …

Experiments Accuracy and coverage : manually sampled data. Statistics: Large, semantically refined commonsense knowledge. System Domain Range Assertions Controlled LDA MFS (Hartung et al. 2011) 0.71 0.30 0.35 WebChild 0.83 0.90 0.82 Hartung: (0.25 Million) assertions only #instances Precision Noun senses 221 K 0.80 Adj senses 7.7 K 0.90 Assertions 4.6 M 0.82

Related Work Commonsense Knowledge Automatically constructed Unambiguous arguments Fine-grained relations Linked Data   Cyc Concept Net WebChild "Linked Data, as discussed in the other talks in this session"

Goal: Semantically refined commonsense properties 1. Extract: mole hasProperty hot 2. Refine: mole-n3 taste hot-a4 WordNet “mole” Gram molecule Skin mark Sauce Animal … 19 fine-grained relations Emotion Sound Taste Appearance … WordNet “hot” Burning Violent Stylish Spicy …

Goal: Semantically refined commonsense properties Refine: mole-n3 taste hot-a4 in domain of taste disambiguate, classify, rank in range of taste Domain Population Computing Assertion Range Population domain (taste) pizza-n1 sauce-n1 java-n2 … assertion (taste) salsa-n1 , hot-a4 chocolate-n2 , sweet-a1 milk-n1, tasty-a1 … range (taste) spicy-a1 hot-a4 sweet-a1 …

Graph construction Edge weight: taxonomic (between senses) , co-occurrence statistics (between words), distributional (between word, senses). One graph per attr. (here, hasTaste)

Label Propagation on constructed graph Seed label loss Similar node diff label loss Regularize In ICMAD, only vertices containing initial labels are smoothed. $$\left [ \mu_1 \sum_{v}p_v^{inj}\sum_{l}{ (Y_{vl}-\hat{Y}_{vl})^2 + \mu_2 \sum_{v,u} p_v^{cont}W_{vu} \sum_{l}{ (Y_{vl}-\hat{Y}_{ul})^2 } + \mu_3 \sum_{vl}{ (\hat{Y}_{vl} -R_{vl})^2} +\mu_4 \sum_v \sum_{k\neq l} C_{in(lk)}Y_{vk}(\hat{Y_{vl}}-\hat{Y_{vk}})^2 \right ]}$$ \\ ++ \hspace{1cm} s.t. \hspace{1cm} \hat{Y_{vl}} \geq \hat{Y_{vl}} , v \in V, l' \in implied(l)

WebChild: Examples Set expansion for: keyboard-n1 Domain Range Assertions hasTaste strawberry-n1 sweet-a1 biscuit-n2, sweet-a1 java-n2 hot-a9 chilli-n1, hot-a9 hasShape face-n1 triangular-a1 lens-n1, spherical-a2 leaf-n1 tapered-a1 table-n2, domed-a1 Set expansion for: keyboard-n1 Top 10 adjectives ergonomic, foldable, sensitive, black, comfortable, compact, lightweight, comfy, pro, waterproof Top 5 expansions keyboard, usb keyboard, computer keyboard, qwerty keyboard, optical mouse, touch screen

Why Computers Need Commonsense Knowledge Who looks cool ? Who lives cool ? rapper hasLook cool, popSinger hasLook cool,

Commonsense Knowledge Image search query: “adventurous person” should also match an image of a man “rock climb­ing” (evokes emotion “thrilling”) What is red, edible, tasty and soft? What is similar to chocolate bar, but soft?

Why Computers Need Commonsense Knowledge Who looks cool ? Who lives cool ? rapper hasLook cool, popSinger hasLook cool,

Commonsense from the Web Image search query: “adventurous person” should also match an image of a man “rock climb­ing” (evokes emotion “thrilling”) What is red, edible, tasty and soft? What is similar to chocolate bar, but soft? Coarse-grained CKB Fine-grained CKB Comparative fine-grained CKB, Applications 2010-11 2012-13 2013 - MS PhD1 PhD2 - PhDN Niket Tandon Supervisor: Prof. Gerhard Weikum Collaborator: Prof. Gerard de Melo Max Planck Institute for Informatics

Commonsense from the Web Commonsense Knowledge Automatically constructed Unambiguous arguments Fine-grained relations Linked Data   Cyc Concept Net, Tandon AAAI’11 WebChild WSDM’14 "Linked Data, as discussed in the other talks in this session"