July 2015 Cognitum An IJCAI-15 Workshop on Cognitive Knowledge Acquisition and Applications Illinois-Profiler: Knowledge Schemas at Scale Zhiye Fei, Daniel.

Slides:



Advertisements
Similar presentations
Latent Variables Naman Agarwal Michael Nute May 1, 2013.
Advertisements

Kindergarten Reading at PS 11
October 2014 Paul Kantor’s Fusion Fest Workshop Making Sense of Unstructured Data Dan Roth Department of Computer Science University of Illinois at Urbana-Champaign.
RDF Tutorial.
Page 1 SRL via Generalized Inference Vasin Punyakanok, Dan Roth, Wen-tau Yih, Dav Zimak, Yuancheng Tu Department of Computer Science University of Illinois.
Page 1 CS 546 Machine Learning in NLP Structured Prediction: Theories and Applications to Natural Language Processing Dan Roth Department of Computer Science.
Textual Relations Task Definition Annotate input text with disambiguated Wikipedia titles: Motivation Current state-of-the-art Wikifiers, using purely.
Relational Entity Linking with Cross Document Coreference Xiao Cheng, Bingling Chen, Rajhans Samdani, Kai-Wei Chang, Zhiye Fei and Dan Roth University.
Page 1 CS 546 Machine Learning in NLP Structured Prediction: Theories and Applications to Natural Language Processing Dan Roth Department of Computer Science.
A Linear Programming Formulation for Global Inference in Natural Language Tasks Dan RothWen-tau Yih Department of Computer Science University of Illinois.
Page 1 Learning and Global Inference for Information Access and Natural Language Understanding Dan Roth Department of Computer Science University of Illinois.
MCTest-160 [1] contains pairs of narratives and multiple choice questions. An example of narrative QA: Story: “Sally had a very exciting summer vacation.
Firat Batmaz, Chris Hinde Computer Science Loughborough University A Diagram Drawing Tool For Semi–Automatic Assessment Of Conceptual Database Diagrams.
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
DeepDive Deep Linguistic Processing with Condor Feng Niu, Christopher Ré, and Ce Zhang Hazy Research Group University of Wisconsin-Madison
Large-Scale Cost-sensitive Online Social Network Profile Linkage.
1 CS598 DNR FALL 2005 Machine Learning in Natural Language Dan Roth University of Illinois, Urbana-Champaign
Page 1 Global Inference and Learning Towards Natural Language Understanding Dan Roth Department of Computer Science University of Illinois at Urbana-Champaign.
Global Inference in Learning for Natural Language Processing Vasin Punyakanok Department of Computer Science University of Illinois at Urbana-Champaign.
Relational Inference for Wikification
1 Brainstorming Themes for 2006 Paul Tarau University of North Texas Dec 2005.
1 Introduction to Modeling Languages Striving for Engineering Precision in Information Systems Jim Carpenter Bureau of Labor Statistics, and President,
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
C OLLECTIVE ANNOTATION OF WIKIPEDIA ENTITIES IN WEB TEXT - Presented by Avinash S Bharadwaj ( )
Chapter 2 Modeling and Finding Abnormal Nodes. How to define abnormal nodes ? One plausible answer is : –A node is abnormal if there are no or very few.
Additional Unit 2 Lecture Notes New Instructional Design Focus School of Education Additional Unit 2 Lecture Notes New Instructional Design Focus School.
Sue Lewis The Western Vocational Progression Consortium.
Knowledge Management in Theory and Practice
Scott Duvall, Brett South, Stéphane Meystre A Hands-on Introduction to Natural Language Processing in Healthcare Annotation as a Central Task for Development.
Towards Natural Question-Guided Search Alexander Kotov ChengXiang Zhai University of Illinois at Urbana-Champaign.
Coreference Resolution with Knowledge Haoruo Peng March 20,
June 2013 Inferning Workshop, ICML, Atlanta GA Amortized Integer Linear Programming Inference Dan Roth Department of Computer Science University of Illinois.
INTRODUCTIONCS446 Fall ’15 CS 446: Machine Learning Dan Roth University of Illinois, Urbana-Champaign
1 Learning Sub-structures of Document Semantic Graphs for Document Summarization 1 Jure Leskovec, 1 Marko Grobelnik, 2 Natasa Milic-Frayling 1 Jozef Stefan.
LOD for the Rest of Us Tim Finin, Anupam Joshi, Varish Mulwad and Lushan Han University of Maryland, Baltimore County 15 March 2012
100 acre wood.
CS 6998 NLP for the Web Columbia University 04/22/2010 Analyzing Wikipedia and Gold-Standard Corpora for NER Training William Y. Wang Computer Science.
Indirect Supervision Protocols for Learning in Natural Language Processing II. Learning by Inventing Binary Labels This work is supported by DARPA funding.
Page 1 March 2011 Local and Global Algorithms for Disambiguation to Wikipedia Lev Ratinov 1, Dan Roth 1, Doug Downey 2, Mike Anderson 3 1 University of.
Part4 Methodology of Database Design Chapter 07- Overview of Conceptual Database Design Lu Wei College of Software and Microelectronics Northwestern Polytechnical.
C. Lawrence Zitnick Microsoft Research, Redmond Devi Parikh Virginia Tech Bringing Semantics Into Focus Using Visual.
Relation Alignment for Textual Entailment Recognition Cognitive Computation Group, University of Illinois Experimental ResultsTitle Mark Sammons, V.G.Vinod.
Carrying out a statistics investigation. A process.
Page 1 INARC Report Dan Roth, UIUC March 2011 Local and Global Algorithms for Disambiguation to Wikipedia Lev Ratinov & Dan Roth Department of Computer.
Automatic Question Answering  Introduction  Factoid Based Question Answering.
Inference Protocols for Coreference Resolution Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Nick Rizzolo, Mark Sammons, and Dan Roth This research.
Page 1© Crown copyright 2004 FLUME Marco Christoforou, Rupert Ford, Steve Mullerworth, Graham Riley, Allyn Treshansky, et. al. 19 October 2007.
Major Science Project Process A blueprint for experiment success.
Catalyst – January 7 1. When you see a graph what do you do to understand it? 2. What was the statistic that shocked you the most from the achievement.
Knowledge Management in Theory and Practice
Page 1 CS 546 Machine Learning in NLP Structured Prediction: Theories and Applications in Natural Language Processing Dan Roth Department of Computer Science.
Department of Computer Science The University of Texas at Austin USA Joint Entity and Relation Extraction using Card-Pyramid Parsing Rohit J. Kate Raymond.
Toward Entity Retrieval over Structured and Text Data Mayssam Sayyadian, Azadeh Shakery, AnHai Doan, ChengXiang Zhai Department of Computer Science University.
Solving Hard Coreference Problems Haoruo Peng, Daniel Khashabi and Dan Roth Problem Description  Problems with Existing Coref Systems Rely heavily on.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
NELL Knowledge Base of Verbs
Part 2 Applications of ILP Formulations in Natural Language Processing
Entity-Level Modelling for Coreference Resolution
Two Discourse Driven Language Models for Semantics
Improving a Pipeline Architecture for Shallow Discourse Parsing
X Ambiguity & Variability The Challenge The Wikifier Solution
Lecture 24: NER & Entity Linking
Relational Inference for Wikification
CSc4730/6730 Scientific Visualization
CS246: Information Retrieval
Using Uneven Margins SVM and Perceptron for IE
Discriminative Probabilistic Models for Relational Data
Unsupervised Learning of Narrative Schemas and their Participants
The Winograd Schema Challenge Hector J. Levesque AAAI, 2011
Dan Roth Department of Computer Science
Presentation transcript:

July 2015 Cognitum An IJCAI-15 Workshop on Cognitive Knowledge Acquisition and Applications Illinois-Profiler: Knowledge Schemas at Scale Zhiye Fei, Daniel Khashabi, Haoruo Peng, Hao Wu & Dan Roth Department of Computer Science University of Illinois at Urbana-Champaign Page 1

Comprehension 1. Christopher Robin was born in England. 2. Winnie the Pooh is a title of a book. 3. Christopher Robin’s dad was a magician. 4. Christopher Robin must be at least 65 now. (ENGLAND, June, 1989) - Christopher Robin is alive and well. He lives in England. He is the same person that you read about in the book, Winnie the Pooh. As a boy, Chris lived in a pretty home called Cotchfield Farm. When Chris was three years old, his father wrote a poem about him. The poem was printed in a magazine for others to read. Mr. Robin then wrote a book. He made up a fairy tale land where Chris lived. His friends were animals. There was a bear called Winnie the Pooh. There was also an owl and a young pig, called a piglet. All the animals were stuffed toys that Chris owned. Mr. Robin made them come to life with his words. The places in the story were all near Cotchfield Farm. Winnie the Pooh was written in Children still love to read about Christopher Robin and his animal friends. Most people don't know he is a real person who is grown now. He has written two books of his own. They tell what it is like to be famous. This is an Inference Problem Page 2

What is being Repaired? The ball hit the window and Bill repaired it. 3 PERSON repaired window PERSON repaired ball vs

Structured Knowledge Jimbo arrested Robert because he stole an elephant The subject of “stole” is more likely to be the object of “arrest” then the subject of “arrest”. 4 Jimbo stole an elephant vs. Robert stole an elephant * arrested SOMEONE because SOMEONE stole * vs. SOMEONE arrested * because SOMEONE stole *

Knowledge for Many Tasks [Larry Robbins], founder of Glenview Capital Management, bought shares of [Endo International Plc]...” [Seattle] fired John Doe after the team lost its 7 th game in a row. Organization is more likely than a location to be the subject of “fire”. 5 NER TAGS = {PERSON, LOCATION, ORGANIZIATION, …}

Knowledge is Essential Textual Inference requires additional knowledge.  More than just local features: need to know things. There is a need to think about how to make use of this knowledge in textual inference.  Not here; see my talk at the Neuro-Symbolic Workshop This work: Graph-based formulation for modelling knowledge schemas  The necessity of disambiguation  The acquisition process Profiler as a public resource  Contains pre-computed statistics  Many concepts/entities; many knowledge schemas 6

Terminology Schema: A way to define a structure and, consequently, semantics, specified by a template.  We use it Graphically to define pieces of knowledge Pivot: a key node in each schema Profile(pivot): a set of schemas with a common pivot  Instantiated schema, with statistics 7 Profile: Entity:Obama (President) … …

Disambiguation is Important 8 The airport is located south of Seattle. Seattle played well today! A pivot is a pair: (mention, Wikipedia URL)

Knowledge Schema as a Graph 9

Knowledge Schema Descriptions A Description:  A schema (template) defined in the FDL language, which corresponds to a set of grounded elements matching the definition. Description of a schema graph  The set of all instances matching the schema graph.  Descriptions are defined Recursively 10

Knowledge Schema Descriptions (2) 11

Knowledge Schema Descriptions (3) 12

Knowledge Schema Descriptions (4) 13

Knowledge Schemas The definition can be generalized for any graph.  See the general definition in the paper. Why important?  A way of formalizing general knowledge over relational structures  A systematic way to represent and acquire knowledge  Compatible with functional programming languages  See a talk on: Saul: Towards Declarative Learning Based Programming Parisa Kordjamshidi, Hao Wu, Dan Roth Presented on Tuesday, July 28; 9:40 Relational Learning Session 14

The Acquisition Procedure 15 Try our demo: 4 million 1,455 GB 200 mid-end EC2 nodes, 3 hours, 420$ 200GB ~3,5 M Wiki profiles ~300,000 verb sense profiles Illinois CloudNLP: a suite of state-of-the art NLP tools. Made available also on AWS.

Knowledge Schema as a Graph The annotations used in the currecnt system:  Attribute & values; Roles 16 WordRaw text LemmaRaw text POSlabels form Penn Treebank NER{ PER, ORG, LOC, MISC } WikifierWikipedia urls VerbsenseVerb sense from Verbnet Before After NearestBefore NearestAfter AdjacentToBefore AdjacentToAfter ExclusiveContainin g HasOverlap DependencyPath(l) Co-referred SubjectOf ObjectOf

Wikification: The Reference Problem Blumenthal (D) is a candidate for the U.S. Senate seat now held by Christopher Dodd (D), and he has held a commanding lead in the race since he entered it. But the Times report has the potential to fundamentally reshape the contest in the Nutmeg State. Page 17

Who is Alex Smith? Quarterback of the Kansas City Chief Tight End of the Cincinnati Bengals San Diego: The San Diego Chargers (A Football team) Ravens: The Baltimore Ravens (A Football team) Contextual Disambiguation Alex Smith Smith Alex Smith Smith

Middle Eastern Politics Quarterback of the Kansas City Chief Tight End of the Cincinnati Bengals Mahmoud Abbas Abu Mazen Mahmoud Abbas: Abu Mazen: Variability: Getting around multiple surface representations. Co-reference resolution within & across documents, with grounding

The Profiler DB Each entry corresponds to a disambiguated entity/Concept Mapping to Wikipedia grounds entities in the “world” and allow us to profile unique entities, rather than “mentions” in text. In particular, we have distinct entries for: Clinton (Bill), Clinton (Hilary), Clinton (lake), Clinton (Illinois),….. Page 20

The Acquisition Procedure 21 Try our demo: 4 million 1,455 GB 200 mid-end EC2 nodes, 3 hours, 420$ 200GB ~3,5 M Wiki profiles ~300,000 verb sense profiles Illinois CloudNLP: a suite of state-of-the art NLP tools. Made available also on AWS.

Experimental Evidence We are at early stages of experimental validation (and refinement) of the acquisition and inference with the profiler. Co-reference Resolution Identifying Attributes of Entities  Profession Page 22

Experiment 1: Co-reference Resolution We build upon our previous work (Peng et al, 2015). Extended the Winograd data (Rahman & Ng)  to general co-reference instances: WinoCoref data Schemas are converted automatically, given an instance, into constraints that are used in an Integer Linear Programming formulation. 23 Jack threw the bags of John into the water since he mistakenly asked him to carry his bags.

Experiment 1: Co-reference Resolution Metrics:  Precision for Winograd dataset  AntePre for WinoCored dataset: Consider all the binary decisions of connecting pronouns to nominal mentions AntePre is the ratio of correct binary decisions to the total decisions 24 DatasetWinogradWinoCoref MetricPrecisionAntePre (Rahman & Ng, 2012) (Peng et al, 2015) Our paper

Experiment 2: Classifying occupations Observations: Profiles of people contain information about their occupation. 25

Experiment 2: Classifying occupations Created a dataset of People-Profession based on Wikipedia Steps:  Pick a bunch of profiles (of people) with known jobs  Merge the profiles of the ones with the same job (occupation profiles)  Given a new profile, decide which occupation profile it is closest: % of the test cases, the correct answer is among the top-5 J. K. Rowling Quentin Tarantino Joe Biden Ernest Hemingway Peter Jackson Jimmy Carter Steven Spielberg John F. Kennedy

Future Directions Extensions of the Profiler  Richer set of schemas  Richer annotations  More data  Incorporating the profiler as a part of feature extraction system, within a learning framework Profiler, beyond a resource, but as a tool to engineer knowledge. Inference  How to best use the profiles Experiments  Different tasks need to be explored. 27 Thank You!

References (Peng et al, 2015): Solving Hard Coreference Problems. Haoruo Peng*, Daniel Khashabi* and Dan Roth. NAACL (Rahman &Ng, 2012): “Resolving Complex Cases of Definite Pronouns: The Winograd Schema Challenge”, Altaf Rahman, Vincent Ng, EMNLP, (Cumby&Roth, 2003) “Learning with feature description logics”. Cumby, Chad M., and Dan Roth, ILP. Springer,