Lecture 13 Information Extraction

Slides:

Advertisements

Similar presentations

Automatic Timeline Generation from News Articles Josh Taylor and Jessica Jenkins.

Advertisements

A Machine Learning Approach to Coreference Resolution of Noun Phrases By W.M.Soon, H.T.Ng, D.C.Y.Lim Presented by Iman Sen.

Sequence Classification: Chunking Shallow Processing Techniques for NLP Ling570 November 28, 2011.

Chunk Parsing CS1573: AI Application Development, Spring 2003 (modified from Steven Bird’s notes)

Exploiting Dictionaries in Named Entity Extraction: Combining Semi-Markov Extraction Processes and Data Integration Methods William W. Cohen, Sunita Sarawagi.

A Self Learning Universal Concept Spotter By Tomek Strzalkowski and Jin Wang Original slides by Iman Sen Edited by Ralph Grishman.

1 A scheme for racquet sports video analysis with the combination of audio-visual information Visual Communication and Image Processing 2005 Liyuan Xing,

CSCI 5417 Information Retrieval Systems Jim Martin Lecture 21 11/8/2011.

Sunita Sarawagi.  Enables richer forms of queries  Facilitates source integration and queries spanning sources “Information Extraction refers to the.

Sequence Classification: Chunking & NER Shallow Processing Techniques for NLP Ling570 November 23, 2011.

Information Extraction Shallow Processing Techniques for NLP Ling570 December 5, 2011.

Named Entity Recognition LING 570 Fei Xia Week 10: 11/30/09.

CS4705.  Idea: ‘extract’ or tag particular types of information from arbitrary text or transcribed speech.

1 Natural Language Processing for the Web Prof. Kathleen McKeown 722 CEPSR, Office Hours: Wed, 1-2; Tues 4-5 TA: Yves Petinot 719 CEPSR,

Copyright © 2011, Oracle and/or its affiliates. All rights reserved. Oracle Enterprise Data Quality CDEP: Tailoring Parser Configuration.

Mining the Medical Literature Chirag Bhatt October 14 th, 2004.

Andreea Bodnari, 1 Peter Szolovits, 1 Ozlem Uzuner 2 1 MIT, CSAIL, Cambridge, MA, USA 2 Department of Information Studies, University at Albany SUNY, Albany,

Recognition of Multi-sentence n-ary Subcellular Localization Mentions in Biomedical Abstracts G. Melli, M. Ester, A. Sarkar Dec. 6, 2007

Empirical Methods in Information Extraction Claire Cardie Appeared in AI Magazine, 18:4, Summarized by Seong-Bae Park.

1 Statistical NLP: Lecture 10 Lexical Acquisition.

Researcher affiliation extraction from homepages I. Nagy, R. Farkas, M. Jelasity University of Szeged, Hungary.

A Survey for Interspeech Xavier Anguera Information Retrieval-based Dynamic TimeWarping.

Ling 570 Day 17: Named Entity Recognition Chunking.

Lecture 6 Hidden Markov Models Topics Smoothing again: Readings: Chapters January 16, 2013 CSCE 771 Natural Language Processing.

10/12/2015CPSC503 Winter CPSC 503 Computational Linguistics Lecture 10 Giuseppe Carenini.

Named Entity Tagging Thanks to Dan Jurafsky, Jim Martin, Ray Mooney, Tom Mitchell for slides.

CSA2050: Introduction to Computational Linguistics Part of Speech (POS) Tagging II Transformation Based Tagging Brill (1995)

Lecture 13 Information Extraction Topics Name Entity Recognition Relation detection Temporal and Event Processing Template Filling Readings: Chapter 22.

A Scalable Machine Learning Approach for Semi-Structured Named Entity Recognition Utku Irmak(Yahoo! Labs) Reiner Kraft(Yahoo! Inc.) WWW 2010(Information.

TimeML compliant text analysis for Temporal Reasoning Branimir Boguraev and Rie Kubota Ando.

February 2007CSA3050: Tagging III and Chunking 1 CSA2050: Natural Language Processing Tagging 3 and Chunking Transformation Based Tagging Chunking.

Acquisition of Categorized Named Entities for Web Search Marius Pasca Google Inc. from Conference on Information and Knowledge Management (CIKM) ’04.

Using Wikipedia for Hierarchical Finer Categorization of Named Entities Aasish Pappu Language Technologies Institute Carnegie Mellon University PACLIC.

Exploiting Named Entity Taggers in a Second Language Thamar Solorio Computer Science Department National Institute of Astrophysics, Optics and Electronics.

Department of Computer Science The University of Texas at Austin USA Joint Entity and Relation Extraction using Card-Pyramid Parsing Rohit J. Kate Raymond.

Overview of Statistical NLP IR Group Meeting March 7, 2006.

Feature Assignment LBSC 878 February 22, 1999 Douglas W. Oard and Dagobert Soergel.

Relation Extraction (RE) via Supervised Classification See: Jurafsky & Martin SLP book, Chapter 22 Exploring Various Knowledge in Relation Extraction.

That's What She Said: Double Entendre Identification Kiddon & Brun 2011.

Natural Language Processing Information Extraction Jim Martin (slightly modified by Jason Baldridge)

Automatically Labeled Data Generation for Large Scale Event Extraction

CSC 594 Topics in AI – Natural Language Processing

Sentiment analysis algorithms and applications: A survey

PRESENTED BY: PEAR A BHUIYAN

Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin

Named Entity Tagging Thanks to Dan Jurafsky, Jim Martin, Ray Mooney, Tom Mitchell for slides.

Robust Semantics, Information Extraction, and Information Retrieval

Distant supervision for relation extraction without labeled data

CSCE 590 Web Scraping – Information Retrieval

CSCI 5832 Natural Language Processing

Social Knowledge Mining

LING 388: Computers and Language

CS 388: Natural Language Processing: Syntactic Parsing

LING 388: Computers and Language

Statistical NLP: Lecture 9

CSCI 5832 Natural Language Processing

Introduction Task: extracting relational facts from text

iSRD Spam Review Detection with Imbalanced Data Distributions

Automatic Detection of Causal Relations for Question Answering

CSCI 5832 Natural Language Processing

Chunk Parsing CS1573: AI Application Development, Spring 2003

Text Mining & Natural Language Processing

Named Entity Tagging Thanks to Dan Jurafsky, Jim Martin, Ray Mooney, Tom Mitchell for slides.

Text Mining & Natural Language Processing

CS246: Information Retrieval

CSCI 5832 Natural Language Processing

Rachit Saluja 03/20/2019 Relation Extraction with Matrix Factorization and Universal Schemas Sebastian Riedel, Limin Yao, Andrew.

Statistical NLP : Lecture 9 Word Sense Disambiguation

Statistical NLP: Lecture 10

Presentation transcript:

Lecture 13 Information Extraction CSCE 771 Natural Language Processing Lecture 13 Information Extraction Topics Name Entity Recognition Relation detection Temporal and Event Processing Template Filling Readings: Chapter 22 February 27, 2013

Overview Last Time Today Readings Dialogues Human conversations Slides from Lecture24 Dialogue systems Dialogue Manager Design Finite State, Frame-based, Initiative: User, System, Mixed VoiceXML Information Extraction Readings Chapter 24, Chapter 22

Information extraction Information extraction – turns unstructured information buried in texts into structured data Extract proper nouns – “named entity recognition” Reference resolution – \ named entity mentions Pronoun references Relation Detection and classification Event detection and classification Temporal analysis Template filling

Template Filling Example template for “airfare raise”

Figure 22.1 List of Named Entity Types

Figure 22.2 Examples of Named Entity Types

Figure 22.3 Categorical Ambiguities

Figure 22.4 Categorical Ambiguity

Figure 22.5 Chunk Parser for Named Entities

Figure 22.6 Features used in Training NER Gazetteers – lists of place names www.geonames.com www.census.gov

Figure 22.7 Selected Shape Features

Figure 22.8 Feature encoding for NER

Figure 22.9 NER as sequence labeling

Figure 22.10 Statistical Seq. Labeling

Evaluation of Named Entity Rec. Sys. Recall terms from Information retreival Recall = #correctly labeled / total # that should be labeled Precision = # correctly labeled / total # labeled F- measure where β weights preferences β=1 balanced β>1 favors recall β<1 favors precision

NER Performance revisited Recall, Precision, F High performance systems F ~ .92 for PERSONS and LOCATIONS and ~.84 for ORG Practical NER Make several passes on text Start by using highest precision rules (maybe at expense of recall) make sure what you get is right Search for substring matches or previously detected names using probabilistic searches string matching metrics(Chap 19) Name lists focused on domain Probabilistic sequence labeling techniques using previous tags

Relation Detection and classification Consider Sample text: Citing high fuel prices, [ORG United Airlines] said [TIME Friday] it has increased fares by [MONEY $6] per round trip on flights to some cities also served by lower-cost carriers. [ORG American Airlines], a unit of [ORG AMR Corp.], immediately matched the move, spokesman [PERSON Tim Wagner] said. [ORG United Airlines] an unit of [ORG UAL Corp.], said the increase took effect [TIME Thursday] and applies to most routes where it competes against discount carriers, such as [LOC Chicago] to [LOC Dallas] and [LOC Denver] to [LOC San Francisco]. After identifying named entities what else can we extract? Relations

Fig 22.11 Example semantic relations

Figure 22.12 Example Extraction

Figure 22.13 Supervised Learning Approaches to Relation Analysis Algorithm two step process Identify whether pair of named entities are related Classifier is trained to label relations

Factors used in Classifying Features of the named entities Named entity types of the two arguments Concatenation of the two entity types Headwords of the arguments Bag-of-words from each of the arguments Words in text Bag-of-words and Bag-of-digrams Stemmed versions Distance between named entities (words / named entities) Syntactic structure Parse related structures

Figure 22.14 a-part-of relation

Figure 22.15 Sample features Extracted

Bootstrapping Example “Has a hub at” Consider the pattern / * has a hub at * / Google search 22.4 Milwaukee-based Midwest has a hub at KCI 22.5 Delta has a hub at LaGuardia … Two ways to fail False positive: e.g. a star topology has a hub at its center False negative? Just miss 22.11 No frill rival easyJet, which has established a hub at Liverpool

Figure 22.16 Bootstrapping Relation Extraction

Using Features to restrict patterns 22.13 Budget airline Ryanair, which uses Charleroi as a hub, scrapped all weekend flights / [ORG] , which uses a hub at [LOC] /

Semantic Drift Note it will be difficult (impossible) to get annotated materials for training Accuracy of process is heavily dependant on initial sees Semantic Drift – Occurs when erroneous patterns(seeds) leads to the introduction of erroneous tuples

Fig 22.17 Temporal and Durational Expressions Absolute temporal expressions Relative temporal expressions

Fig 22.18 Temporal lexical triggers

Fig 22.19 MITRE’s tempEx tagger-perl

Fig 22.20 Features used to train IOB

Figure 22.21 TimeML temporal markup

Temporal Normalization iSO 8601 - standard for encoding temporal values YYYY-MM-DD

Figure 22.22 Sample ISO Patterns

Event Detection and Analysis Event Detection and classification

Fig 22.23 Features for Event Detection Features used in rule-based and statistical techniques

Fig 22.24 Allen’s 13 temporal Relations

Figure 22.24 continued

Figure 22.25 Example from Timebank Corpus

Template Filling

Figure 22.26 Templates produced by Faustus 1997

Figure 22.27 Levels of processing in Faustus

Figure 22.28 Faustus Stage 2

Figure 22.29 The 5 Partial Templates of Faustus

Figure 22.30 Articles in PubMed

Figure 22.31 biomedical classes of named entities