Semantic Enrichment of Text with Background Knowledge Anselmo Peñas NLP & IR Group UNED nlp.uned.es Eduard Hovy USC / ISI isi.edu.

Slides:



Advertisements
Similar presentations
Machine Reading Anselmo Peñas UNED NLP & IR Group nlp.uned.es.
Advertisements

American Football: The Basics. The Field The football field is 100 yards long. On both ends there is an End Zone where all touchdowns are scored. In each.
FLAG FOOTBALL.
GYM FLAG FOOTBALL Equipment Used Belts and Flags Football
American Football. Goal: Win the Game Time for Game - 4 Quarters, Half Time Show 2 Teams, 11 people per Team Offense - has the ball Defense - stops Offense,
American Football. The Field, Time of Game, and Players.
Question Answering for Machine Reading Evaluation Evaluation Campaign at CLEF 2011 Anselmo Peñas (UNED, Spain) Eduard Hovy (USC-ISI, USA) Pamela Forner.
-Every year many draft eligible quarterbacks have to make the tough decision of either declaring for the NFL draft or staying in school. -Every April.
Fleetball.
Fleetball.
Football Van Buren Middle School Ms. Denlinger Van Buren Middle School Ms. Denlinger.
GATORBALL.
Unsupervised Acquisition of Axioms to Paraphrase Noun Compounds and Genitives CICLING 2012, New Delhi Anselmo Peñas NLP & IR Group, UNED, Spain Ekaterina.
CORE 9 PHYSICAL EDUCATION
Fantasy Football By: Carson Barnette. Fantasy Football Fantasy football is a game based on the performance of NFL players How do you play? What does this.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Basi di dati distribuite Prof. M.T. PAZIENZA a.a
Introduction to CL Session 1: 7/08/2011. What is computational linguistics? Processing natural language text by computers  for practical applications.
Toward a Reading Machine Eduard Hovy Hans Chalupsky Jerry Hobbs Andrew Philpot Dirk Hovy Stephen Tratz … USC - ISI Anselmo Peñas UNED NLP & IR Group nlp.uned.es.
Enhance legal retrieval applications with an automatically induced knowledge base Ka Kan Lo.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Tackle Football Overview
Creating Math Stories Mini-Project Student Name: Robert English September 26th.
By Skye Smith. End zone: the area between the end line and goal line bounded by the sidelines, which a team on offense tries to enter to score a touchdown.
GYM FLAG FOOTBALL Equipment Used Equipment Used Belts and Flags Belts and Flags Football Football Cones or lines to mark the field Cones or lines to mark.
Flag Football Springboro Junior High. Positions Quarterback- The player who throws the ball to other players on the team. Receivers- The players who go.
Defense Positions. Defensive Linemen Includes both tackles and both ends. D-Linemen line up in front of the offensive linemen, but the have to make sure.
Gerunds and Gerund Phrases 8 th Grade English Clauses and Verbals Unit.
Verbals, Gerunds and Gerund Phrases
Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.
Mining the Semantic Web: Requirements for Machine Learning Fabio Ciravegna, Sam Chapman Presented by Steve Hookway 10/20/05.
 Flag Football Rules and Game Play Los Alisos Intermediate
Automatic Detection of Tags for Political Blogs Khairun-nisa Hassanali Vasileios Hatzivassiloglou The University.
For Dummies. Safety Tackles CornerBack Linebackers Tackles Center Wide Receivers Quarterback RunningBack Line of Scrimmage.
Flag Football History Flag Football is believed to have begun in the U.S. military during World War 11 to prevent injury to military personnel playing.
Football Made Easy UNDERSTANDING THE GAME. Offensive Player Positions  Center (C): first person to touch the ball and he snaps to the Quarterback (QB);
The Game GENERAL RULES: GENERAL RULES: The team with possession (legal control of the ball) is the offensive team. Normally, five players are the interior.
Mark St. Jean Ryan Welsh Alex Clark Charles Angotto.
Do Now Have you ever played football before? What do you know about the sport? What do you think is the most challenging aspect of football?
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
Football* *american. defense offense The Quarterback passes the ball.
Welcome to the Third Annual SEBMS 6 th grade Fantasy Football Draft Night!
*Inventor/ Place of origin* To score by crossing the goal line (run/pass)
FOOTBALL. Football Gameplay Positions-Offense Positions-Defense Field Red is input.
Evaluating Answer Validation in multi- stream Question Answering Álvaro Rodrigo, Anselmo Peñas, Felisa Verdejo UNED NLP & IR group nlp.uned.es The Second.
FLAG FOOTBALL 6/10/ History 1.American football as we know it originated in the late 1800s developing from two English sports, soccer and rugby.
 Flag Football Rules and Game Play Los Alisos Intermediate
AUTONOMOUS REQUIREMENTS SPECIFICATION PROCESSING USING NATURAL LANGUAGE PROCESSING - Vivek Punjabi.
F OOTBALL. D ID T HAT J UST H APPEN ? &feature=related.
Gameplay Positions-Offense Positions-Defense Field.
A is for American Football by LukeM. A is for American Football.
Football Terminology.
FOOTBALL HPWB 7 th & 8 th Mr. Flores. Football American football was developed in the 1800’s. The concept of game is to advance ball to endzone either.
An Ontology-based Automatic Semantic Annotation Approach for Patent Document Retrieval in Product Innovation Design Feng Wang, Lanfen Lin, Zhou Yang College.
Click to Begin Take the Challenge!! STUDENTS CLICK HERE FOR REVIEW STUDENTS CLICK HERE FOR REVIEW TEACHERS CLICK HERE FOR TEACHER’S PAGE TEACHERS CLICK.
Football Please take notes and save them, as they will be counted as part of your test grade.
Harnessing the Semantic Web to Answer Scientific Questions:
Development of the Amphibian Anatomical Ontology
Football.
Region One Football Officials Association
Physical Education 2016.
Region One Football Officials Association
Football.
Automatic Detection of Causal Relations for Question Answering
The Game of Football.
Flag FootballJeopardy
CS246: Information Retrieval
Region One Football Officials Association
Region One Football Officials Association
Presentation transcript:

Semantic Enrichment of Text with Background Knowledge Anselmo Peñas NLP & IR Group UNED nlp.uned.es Eduard Hovy USC / ISI isi.edu

UNED nlp.uned.es Text omits information San Francisco's Eric Davis intercepted a Steve Walsh pass on the next series to set up a seven-yard Young touchdown pass to Brent Jones.

UNED nlp.uned.es Make explicit implicit information Implicit(More) explicit San Francisco’s Eric DavisEric Davis plays for San Francisco E.D. is a player, S.F. is a team Eric Davis intercepted pass 1 - Steve Walsh pass 1 Steve Walsh threw pass 1 Steve Walsh threw interception 1 … Young touchdown pass 2 Young completed pass 2 for touchdown… touchdown pass 2 to Brent JonesBrent Jones caught pass 2 for touchdown San Francisco's Eric Davis intercepted a Steve Walsh pass on the next series to set up a seven-yard Young touchdown pass to Brent Jones.

UNED nlp.uned.es Goals General Goal Automatic recovering of such omitted information Enrichment is the process of adding explicitly to a text’s representation the information that is either implicit or missing in the text

UNED nlp.uned.es The enrichment cycle Cycle: 1. Read text from collection 2. Ruminate in BKB 3. Enrich text representation 4. Repeat Domain Docs. Reading Background Knowledge Base Rumination Enrichment

UNED nlp.uned.es Goals Specific goals of this work Explore the idea of using “Proposition Stores” as Background Knowledge for enrichment Explore procedures for enrichment Determine the kinds of knowledge that Proposition Stores must include to enable enrichment

UNED nlp.uned.es Outline 1. Intro 2. BKB 3. Enrichment 4. Features of BKBs for Enrichment 5. Conclusion

UNED nlp.uned.es Elements in our BKB Entities Classes: not limited to a predefined set Instances: proper nouns (in this first approach) Class:has-instance:Instance relations Propositions: Predefined syntactic structures NV, NVPN NVN, NVNPN NPN, AN …

UNED nlp.uned.es Extraction of propositions Patterns over dependency trees prop( Type, Form : DependencyConstrains : NodeConstrains ). Examples: prop(nv, [N,V] : [V:N:nsubj, not(V:_:'dobj')] : [verb(V)]). prop(nvnpn, [N1,V,N2,P,N3]:[V:N2:'dobj', V:N3:Prep, subj(V,N1)]:[prep(Prep,P)]). prop(has_value, [N,Val]:[N:Val:_]:[nn(N), cd(Val), not(lemma(Val,'one'))]).

UNED nlp.uned.es Background Knowledge Base (NFL, US football) ?> NN NNP:’pass’ NN 24 'Marino’:'pass‘ NN 17 'Kelly':'pass' NN 15 'Elway’:'pass’ … ?>X:has-instance:’Marino’ 20 'quarterback':has-instance:'Marino' 6 'passer':has-instance:'Marino' 4 'leader':has-instance:'Marino' 3 'veteran':has-instance:'Marino' 2 'player':has-instance:'Marino' ?> NPN 'pass':X:'touchdown‘ NPN 712 'pass':'for':'touchdown' NPN 24 'pass':'include':'touchdown’ … ?> NVN 'quarterback':X:'pass' NVN 98 'quarterback':'throw':'pass' NVN 27 'quarterback':'complete':'pass‘ … ?> NVNPN 'NNP':X:'pass':Y:'touchdown' NVNPN 189 'NNP':'catch':'pass':'for':'touchdown' NVNPN 26 'NNP':'complete':'pass':'for':'touchdown‘ … ?> NVN 'end':X:'pass‘ NVN 28 'end':'catch':'pass' NVN 6 'end':'drop':'pass‘ …

UNED nlp.uned.es Outline 1. Intro 2. BKB 3. Enrichment 4. Features of BKBs for Enrichment 5. Conclusion

UNED nlp.uned.es Enrichment example (1) …to set up a 7-yard Young touchdown pass to Brent Jones pass Young touchdown Jones nn to Young pass ?> X:has-instance:Young X=quarterback ?> NVN:quarterback:X:pass X=throw X=complete pass to Jones ?> X:has-instance:Jones X=end ?> NVN:end:X:pass X=catch X=drop

UNED nlp.uned.es Enrichment example (2) pass Young touchdown Jones throw complete nn catch drop touchdown pass ?> NVN touchdown:X:pass False ?> NPN pass:X:touchdown X=for …to set up a 7-yard Young touchdown pass to Brent Jones

UNED nlp.uned.es Enrichment example (3) pass Young touchdown Jones throw complete for catch drop ?> NVNPN NAME:X:pass:for:touchdown X=complete X=catch …to set up a 7-yard Young touchdown pass to Brent Jones

UNED nlp.uned.es Enrichment example (4) pass Young touchdown Jones complete for catch  Young complete pass for touchdown  Jones catch pass for touchdown …to set up a 7-yard Young touchdown pass to Brent Jones

UNED nlp.uned.es Enrichment Build context for instances Build context for dependencies Finding prepositions Finding verbs Constrain interpretations

UNED nlp.uned.es Enrichment example (5) San Francisco's Eric Davis intercepted a Steve Walsh pass on the next series to set up a seven-yard Young touchdown pass to Brent Jones. Before enrichment for throw catch complete After enrichment

UNED nlp.uned.es Outline 1. Intro 2. BKB 3. Enrichment 4. Features of BKBs for Enrichment 5. Conclusion

UNED nlp.uned.es What BKBs need for enrichment? (1) Ability to answer about instances Not complete population But allow analogy Ability to constrain interpretations and accumulate evidence Several different queries over the same elements considering different syntactic structures Require normalization (and parsing)

UNED nlp.uned.es What BKBs need for enrichment? (1) Ability to discover entity classes with appropriate granularity level Quarterbacks throw passes Ends catch passes Tag an entity as person or even player is not specific enough for enrichment Text frequently introduces the relevant class (appropriate granularity level) for understanding

UNED nlp.uned.es What BKBs need for enrichment? (2) Ability to digest enough knowledge adapted to the domain Crucial Approaches Macro-reading (web scale) + domain adaptation Shallow NLP, lack of normalization Reading in context (suggested here) Domain partitioning Deeper NLP, specific domain NLP

UNED nlp.uned.es Digest enough knowledge DART: general domain propositions store TextRunner: general domain (web-scale) BKB: specific domain propositions store (only 30,000 docs) ?> quarterback:X:pass DARTTextRunnerBKB (US Football) (no results)(~200) threw (~100) completed (36) to throw (26) has thrown (19) makes (19) has (18) fires (99) throw (25) complete (7) have (5) attempt (5) not-throw (4) toss (3) release

UNED nlp.uned.es ?> X:intercept:pass DARTTextRunnerBKB (US Football) (13) person (6) person/place/organization (2) full-back (1) place (30) Early (26) Two plays (24) fumble (20) game (20) ball (17) Defensively (75) person (14) cornerback (11) defense (8) safety (7) group (5) linebacker Digest Knowledge in the domain (entity classes)

UNED nlp.uned.es Digest Knowledge in the domain (ambiguity problem) ?> person:X:pass DARTTextRunnerBKB (US Football) (47) make (45) take (36) complete (30) throw (25) let (23) catch (1) make (1) expect (22) gets (17) makes (10) has (10) receives (7) who has (7) must have (6) acting on (6) to catch (6) who buys (5) bought (5) admits (5) gives (824) catch (546) throw (256) complete (136) have (59) intercept (56) drop (39) not-catch (37) not-throw (36) snare (27) toss (23) pick off (20) run

UNED nlp.uned.es Domain issue ?> person:X:pass NFL Domain 905:nvn:[person:n, catch:v, pass:n]. 667:nvn:[person:n, throw:v, pass:n]. 286:nvn:[person:n, complete:v, pass:n]. 204:nvnpn:[person:n, catch:v, pass:n, for:in, yard:n]. 85:nvnpn:[person:n, catch:v, pass:n, for:in, touchdown:n]. IC Domain 6:nvn:[person:n, have:v, pass:n] 3:nvn:[person:n, see:v, pass:n] 1:nvnpn:[person:n, wear:v, pass:n, around:in, neck:n] BIO Domain

UNED nlp.uned.es Domain issue ?> X:receive:Y NFL Domain 55:nvn:[person:n, receive:v, call:n]. 34:nvn:[person:n, receive:v, offer:n]. 33:nvn:[person:n, receive:v, bonus:n]. 29:nvn:[team:class, receive:v, pick:n]. IC Domain 78 nvn:[person:n, receive:v, call:n] 44 nvn:[person:n, receive:v, letter:n] 35 nvn:[group:n, receive:v, information:n] 31 nvn:[person:n, receive:v, training:n] BIO Domain 24 nvn:[patients:n, receive:v, treatment:n] 14 nvn:[patients:n, receive:v, therapy:n] 13 nvn:[patients:n, receive:v, care:n]

UNED nlp.uned.es Outline 1. Intro 2. BKB 3. Enrichment 4. Features of BKBs for Enrichment 5. Conclusion

UNED nlp.uned.es Conclusions Limiting to a specific domain provides some powerful benefits Ambiguity is reduced Higher density of relevant propositions Different distribution of propositions across domains Amount of source text is reduced, allowing deeper processing such as parsing Specific tools for specific domains Proposition stores seem to be useful Improve parsing, corref, WSD,… We presented a new application: ENRICHMENT

UNED nlp.uned.es Current work Develop automatic procedures for Enrichment Need better Proposition Stores Selectional Preferences Lexical relatedness Structural /frame transformations …

UNED nlp.uned.es Future work Develop appropriate methodologies for evaluation Intrinsic? Extrinsic: QA over single documents? Reading comprehension tests?

Thanks!

UNED nlp.uned.es NVN 3 'quarterback':'find':'receiver‘ NVNPN 3 'quarterback':'throw':'pass':'to':'receiver' NVNPN 2 'quarterback':'complete':'pass':'to':'receiver' NVNPN 1 'receiver':'catch':'pass':'from':'quarterback‘ nvn:('NNP':'quarterback'):'hit':('NNP':'receiver'),177). nvnpn:('NNP':'quarterback'):'throw':'pass':'to':('NNP':'receiver'),143). nvnpn:('NNP':'quarterback'):'complete':'pass':'to':('NNP':'receiver'),79). nvn:('NNP':'quarterback'):'find':('NNP':'receiver'),69). nvnpn:('NNP':'receiver'):'catch':'pass':'from':('NNP':'quarterback'),43).