Learning Attributes and Relations

Slides:



Advertisements
Similar presentations
SWG Strategy (C) Copyright IBM Corp. 2006, All Rights Reserved. P4 Task 2 Fact Extraction using a CNL Current Status David Mott, Dave Braines, ETS,
Advertisements

CILC2011 A framework for structured knowledge extraction and representation from natural language via deep sentence analysis Stefania Costantini Niva Florio.
A Bilingual Corpus of Inter-linked Events Tommaso Caselli♠, Nancy Ide ♣, Roberto Bartolini ♠ ♠ Istituto di Linguistica Computazionale – ILC-CNR Pisa ♣
Toward Linguistically Grounded Ontologies by Paul Buitelaar, Philipp Cimiano, Peter Haase, and Michael Sintek (Ireland, Netherlands, Germany) presented.
INTRODUCTION TO ARTIFICIAL INTELLIGENCE Massimo Poesio LECTURE 6 The Cognitive Perspective: Generative Lexicon.
Building an Ontology-based Multilingual Lexicon for Word Sense Disambiguation in Machine Translation Lian-Tze Lim & Tang Enya Kong Unit Terjemahan Melalui.
Machine Learning and the Semantic Web
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
NLP and Speech Course Review. Morphological Analyzer Lexicon Part-of-Speech (POS) Tagging Grammar Rules Parser thethe – determiner Det NP → Det.
NLP and Speech 2004 Feature Structures Feature Structures and Unification.
NLDB 2004 ORAKEL: A Natural Language Interface to an F-Logic Knowledge Base Philipp Cimiano Institute AIFB University of Karlsruhe NLDB 2004.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Information Extraction and Ontology Learning Guided by Web Directory Authors:Martin Kavalec Vojtěch Svátek Presenter: Mark Vickers.
1 Noun Homograph Disambiguation Using Local Context in Large Text Corpora Marti A. Hearst Presented by: Heng Ji Mar. 29, 2004.
Predicting the Semantic Orientation of Adjectives
Designing clustering methods for ontology building: The Mo’K workbench Authors: Gilles Bisson, Claire Nédellec and Dolores Cañamero Presenter: Ovidiu Fortu.
XML on Semantic Web. Outline The Semantic Web Ontology XML Probabilistic DTD References.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
1 Reading time evidence for enriched composition McElree et al. (2001) Rianne Oostwoud-Wijdenes & Maartje Schulpen.
Type shifting and coercion Henriëtte de Swart November 2010.
Employing Two Question Answering Systems in TREC 2005 Harabagiu, Moldovan, et al 2005 Language Computer Corporation.
Concept Hierarchy Induction by Philipp Cimiano
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved Digital Enterprise Research Institute Ontologies & Natural Language.
Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters Presented by Sole.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
INTRODUCTION TO ARTIFICIAL INTELLIGENCE Massimo Poesio Unsupervised and Semi-Supervised Relation Extraction.
Ontology Learning from Text: A Survey of Methods Source: LDV Forum,Volume 20, Number 2, 2005 Authors: Chris Biemann Reporter:Yong-Xiang Chen.
Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification on Reviews Peter D. Turney Institute for Information Technology National.
Linguistic modeling of professional terminology Olga Klevtsova, Tyumen State University, Russia.
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
1 Statistical NLP: Lecture 10 Lexical Acquisition.
CROSSMARC Web Pages Collection: Crawling and Spidering Components Vangelis Karkaletsis Institute of Informatics & Telecommunications NCSR “Demokritos”
W ORD S ENSE D ISAMBIGUATION By Mahmood Soltani Tehran University 2009/12/24 1.
Presented by Rani Qumsiyeh & Andrew Zitzelberger.
A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources Author: Carmen Banea, Rada Mihalcea, Janyce Wiebe Source:
Modelling Human Thematic Fit Judgments IGK Colloquium 3/2/2005 Ulrike Padó.
인공지능 연구실 황명진 FSNLP Introduction. 2 The beginning Linguistic science 의 4 부분 –Cognitive side of how human acquire, produce, and understand.
1 Statistical NLP: Lecture 7 Collocations. 2 Introduction 4 Collocations are characterized by limited compositionality. 4 Large overlap between the concepts.
The interface between model-theoretic and corpus-based semantics
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Towards Linguistically Grounded Ontologies Paul Buitelaar, Philipp Cimiano, Peter Haase, and Michael Sintek Proceedings of the 6 th European Semantic Web.
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
Learning Taxonomic Relations from Heterogeneous Evidence Philipp Cimiano Aleksander Pivk Lars Schmidt-Thieme Steffen Staab (ECAI 2004)
Using Wikipedia for Hierarchical Finer Categorization of Named Entities Aasish Pappu Language Technologies Institute Carnegie Mellon University PACLIC.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
NTNU Speech Lab 1 Topic Themes for Multi-Document Summarization Sanda Harabagiu and Finley Lacatusu Language Computer Corporation Presented by Yi-Ting.
Chapter 3 Language Acquisition: A Linguistic Treatment Jang, HaYoung Biointelligence Laborotary Seoul National University.
Automatic Ontology Extraction Miloš Husák RASLAN 2010.
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Korean version of GloVe Applying GloVe & word2vec model to Korean corpus speaker : 양희정 date :
Statistical NLP: Lecture 7
Sentiment analysis algorithms and applications: A survey
Topic 2 (ii) Metadata concepts, standards, models and registries
Statistical NLP: Lecture 3
ece 627 intelligent web: ontology and beyond
Ontology From Wikipedia, the free encyclopedia
INAGO Project Automatic Knowledge Base Generation from Text for Interactive Question Answering.
Improving a Pipeline Architecture for Shallow Discourse Parsing
Machine Learning in Natural Language Processing
Statistical NLP: Lecture 9
A method for WSD on Unrestricted Text
Automatic Detection of Causal Relations for Question Answering
Towards Semantics Generation
Linguistic Essentials
Applied Linguistics Chapter Four: Corpus Linguistics
Extracting Why Text Segment from Web Based on Grammar-gram
Statistical NLP : Lecture 9 Word Sense Disambiguation
Statistical NLP: Lecture 10
Presentation transcript:

Learning Attributes and Relations Ontology Learning from Text Philipp Cimiano, Chapter 7 11/27/2012 Cimiano Chapter 7 - PL

7.1 Common Approaches Collocations Syntactic Dependencies Lexico-syntactic patterns (Hearst, 1992) Road Map: Learning Attributes Learning the Appropriate Generalization Level for Relations Learning Qualia Structures 11/27/2012 Cimiano Chapter 7 - PL

7.2 Learning Attributes Attribute = a relation with a datatype as range Learn name, domain, and range 11/27/2012 Cimiano Chapter 7 - PL

11/27/2012 Cimiano Chapter 7 - PL

11/27/2012 Cimiano Chapter 7 - PL

11/27/2012 Cimiano Chapter 7 - PL

11/27/2012 Cimiano Chapter 7 - PL

7.3 Learning Relations from Corpora Based on verbal expressions Generalization of argument wrt taxonomy Statistical measures for generalization Approach: Extract verb frames Generalize verb frames – statistical measures: Conditional probability Pointwise mutual information X2 measure 11/27/2012 Cimiano Chapter 7 - PL

11/27/2012 Cimiano Chapter 7 - PL

7.4 Learning Qualia Structures from the Web Aristotle’s four basic factors or causes: the material cause – made of the agentive cause – source of creation or change the formal cause – form or type the final cause – purpose, intention, or aim Pustejovsky (1991): four roles in qualia structure: constitutive – parts or components agentive – verb, action brings into existence formal – type information telic – purpose or function 11/27/2012 Cimiano Chapter 7 - PL

11/27/2012 Cimiano Chapter 7 - PL

11/27/2012 Cimiano Chapter 7 - PL

11/27/2012 Cimiano Chapter 7 - PL

11/27/2012 Cimiano Chapter 7 - PL

11/27/2012 Cimiano Chapter 7 - PL

7.5 Related Work Madche and Staab (2000) – generalized association rules algorithm Kavalec and Svátek (2005) – label verbs as “above expectations” Gamallo et al. (2002) – map syntactic dependencies to semantic relations Ciaramita et al. (2005) – learn conceptual relations from the GENIA corpus Heyer et al. (2001) – derive conceptual relations from corpus collocations and heuristics Ogata and Collier ( 2004) – relations based on patterns similar to Hearst + reasoning 11/27/2012 Cimiano Chapter 7 - PL

more Related Work Yamaguchi (2001) – WordNet pruning + Schuetze’s word space method with word 4-grams Buitelaar et al. (2004) – mapping with slots for domain and range, manual inspection Pesio and Almuhareb ( 2005) – six categories of attributes, train a classifier Yamada and Baldwin (2004) – learn telic and agentive relations from corpora, dependencies Claveau et al. (2003) – Inductive Logic Programming to find qualia verbs Pustejovsky et al. (1993) – acquire relations from corpora, statistics and theoretical lexicon principles 11/27/2012 Cimiano Chapter 7 - PL

7.6 Conclusion and Open Issues Use adjectives and WordNot to learn attributes Statistical learning of slots for verbs Learning qualia structures Learning from noun phrases? Meaning of qualia roles? Distinguish word senses? Gold standard for qualia structures? Calculus to reason on partial results? Constrain according to linguistic principles? 11/27/2012 Cimiano Chapter 7 - PL