1 Simple Algorithms for Complex Relation Extraction with Applications to Biomedical IE Ryan McDonald Fernando Pereira Seth Kulick CIS and IRCS, University.

Slides:



Advertisements
Similar presentations
Concept Learning and the General-to-Specific Ordering
Advertisements

A Probabilistic Representation of Systemic Functional Grammar Robert Munro Department of Linguistics, SOAS, University of London.
Copyright © 2002 Pearson Education, Inc. 1. Chapter 2 An Introduction to Functions.
Chapter 3 Introduction to Quantitative Research
1 A Statistical Analysis of the Precision-Recall Graph Ralf Herbrich Microsoft Research UK Joint work with Hugo Zaragoza and Simon Hill.
1 Unsupervised Ontology Induction From Text Hoifung Poon Dept. Computer Science & Eng. University of Washington (Joint work with Pedro Domingos)
eClassifier: Tool for Taxonomies
The creation of "Yaolan.com" A Site for Pre-natal and Parenting Education in Chinese by James Caldwell DAE Interactive Marketing a Web Connection Company.
1 OOA-HR Workshop, 11 October 2006 Semantic Metadata Extraction using GATE Diana Maynard Natural Language Processing Group University of Sheffield, UK.
0 - 0.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Limitations of the relational model 1. 2 Overview application areas for which the relational model is inadequate - reasons drawbacks of relational DBMSs.
Peer-to-peer and agent-based computing Basic Theory of Agency.
Bayesian network for gene regulatory network construction
LABELING TURKISH NEWS STORIES WITH CRF Prof. Dr. Eşref Adalı ISTANBUL TECHNICAL UNIVERSITY COMPUTER ENGINEERING 1.
On-line Index Selection for Physical Database Tuning
1 Carnegie Mellon Immune Cells Detection of the In Vivo Rejecting Heart in USPIO-Enhanced MRI Hsun-Hsien Chang 1, José M. F. Moura 1, Yijen L. Wu 2, and.
A Joint Model of Implicit Arguments for Nominal Predicates Matthew Gerber and Joyce Y. Chai Department of Computer Science Michigan State University East.
Lecture plan Outline of DB design process Entity-relationship model
A Prototype Implementation of a Framework for Organising Virtual Exhibitions over the Web Ali Elbekai, Nick Rossiter School of Computing, Engineering and.
Addition 1’s to 20.
Test B, 100 Subtraction Facts
1 Scheduling Crossbar Switches Who do we chose to traverse the switch in the next time slot? N N 11.
1/22 Worst and Best-Case Coverage in Sensor Networks Seapahn Meguerdichian, Farinaz Koushanfar, Miodrag Potkonjak, and Mani Srivastava IEEE TRANSACTIONS.
Introduction to Recursion and Recursive Algorithms
Trees Chapter 11.
Minimum Vertex Cover in Rectangle Graphs
Multi-Document Person Name Resolution Michael Ben Fleischman (MIT), Eduard Hovy (USC) From Proceedings of ACL-42 Reference Resolution workshop 2004.
Constrained Approximate Maximum Entropy Learning (CAMEL) Varun Ganapathi, David Vickrey, John Duchi, Daphne Koller Stanford University TexPoint fonts used.
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data John Lafferty Andrew McCallum Fernando Pereira.
Learning with Probabilistic Features for Improved Pipeline Models Razvan C. Bunescu Electrical Engineering and Computer Science Ohio University Athens,
A Graphical Model For Simultaneous Partitioning And Labeling Philip Cowans & Martin Szummer AISTATS, Jan 2005 Cambridge.
6/29/051 New Frontiers in Corpus Annotation Workshop, 6/29/05 Ann Bies – Linguistic Data Consortium* Seth Kulick – Institute for Research in Cognitive.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
Extracting Structured Data from Web Page Arvind Arasu, Hector Garcia-Molina ACM SIGMOD 2003.
 Text mining for biology and medicine: Glasgow, Feb , 2008 Biomedical information extraction at the University of Pennsylvania Mark Liberman
Finding Advertising Keywords on Web Pages Scott Wen-tau YihJoshua Goodman Microsoft Research Vitor R. Carvalho Carnegie Mellon University.
Information Extraction with Unlabeled Data Rayid Ghani Joint work with: Rosie Jones (CMU) Tom Mitchell (CMU & WhizBang! Labs) Ellen Riloff (University.
Learning Table Extraction from Examples Ashwin Tengli, Yiming Yang and Nian Li Ma School of Computer Science Carnegie Mellon University Coling 04.
CHAMELEON : A Hierarchical Clustering Algorithm Using Dynamic Modeling
Tutorial 19 Dina Said. Indexing Data 1. A data entry k* is an actual data record (with search key value k 2. A data entry is a (k, rid) pair, where rid.
Graphical models for part of speech tagging
A Markov Random Field Model for Term Dependencies Donald Metzler W. Bruce Croft Present by Chia-Hao Lee.
BioNLP related talks and demos at ACL and CONLL ‘05 Presented by Beatrice Alex BioNLP meeting 11 th of July 2005.
Incident Threading for News Passages (CIKM 09) Speaker: Yi-lin,Hsu Advisor: Dr. Koh, Jia-ling. Date:2010/06/14.
Kansas State University Department of Computing and Information Sciences CIS 730: Introduction to Artificial Intelligence Lecture 25 Wednesday, 20 October.
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
1 Learning Sub-structures of Document Semantic Graphs for Document Summarization 1 Jure Leskovec, 1 Marko Grobelnik, 2 Natasa Milic-Frayling 1 Jozef Stefan.
Formal Specification of Intrusion Signatures and Detection Rules By Jean-Philippe Pouzol and Mireille Ducassé 15 th IEEE Computer Security Foundations.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
Graph-based Text Classification: Learn from Your Neighbors Ralitsa Angelova , Gerhard Weikum : Max Planck Institute for Informatics Stuhlsatzenhausweg.
1/21 Automatic Discovery of Intentions in Text and its Application to Question Answering (ACL 2005 Student Research Workshop )
Powerpoint Templates Page 1 Powerpoint Templates Scalable Text Classification with Sparse Generative Modeling Antti PuurulaWaikato University.
2015/12/121 Extracting Key Terms From Noisy and Multi-theme Documents Maria Grineva, Maxim Grinev and Dmitry Lizorkin Proceeding of the 18th International.
Indexing Correlated Probabilistic Databases Bhargav Kanagal, Amol Deshpande University of Maryland, College Park, USA SIGMOD Presented.
Fixed parameter algorithms for protein similarity search under mRNA structure constrains A joint work by: G. Blin, G. Fertin, D. Hermelin, and S. Vialette.
Information Extraction from Single and Multiple Sentences Mark Stevenson Department of Computer Science University of Sheffield, UK.
John Lafferty Andrew McCallum Fernando Pereira
4. Relationship Extraction Part 4 of Information Extraction Sunita Sarawagi 9/7/2012CS 652, Peter Lindes1.
March, 2007RCO LLC, RCO Text Analysis Technologies for information extraction and business intelligence We can tell you everything about.
Finding Regular Simple Paths Sept. 2013Yangjun Chen ACS Finding Regular Simple Paths in Graph Databases Basic definitions Regular paths Regular simple.
Department of Computer Science The University of Texas at Austin USA Joint Entity and Relation Extraction using Card-Pyramid Parsing Rohit J. Kate Raymond.
Graphical Models for Segmenting and Labeling Sequence Data Manoj Kumar Chinnakotla NLP-AI Seminar.
A Brief Introduction to Distant Supervision
Automatic Detection of Causal Relations for Question Answering
Trees L Al-zaid Math1101.
Trees 11.1 Introduction to Trees Dr. Halimah Alshehri.
Rachit Saluja 03/20/2019 Relation Extraction with Matrix Factorization and Universal Schemas Sebastian Riedel, Limin Yao, Andrew.
Presentation transcript:

1 Simple Algorithms for Complex Relation Extraction with Applications to Biomedical IE Ryan McDonald Fernando Pereira Seth Kulick CIS and IRCS, University of Pennsylvania, Philadelphia, PA Scott Winters Yang Jin Pete White Division of Oncology, Childrens Hospital of Pennsylvania, Philadelphia, PA ACL 2005

2 Abstract Simple two-stage method for extracting complex relations between named entities in text. –n-ary relation –first stage: create a graph from pairs of entities –two stage: maximal cliques in the graph Experiment on biomedical text

3 Introduction - 1/2 n-ary relation –The relation is definded by the schema (t 1,…, t n ) t i is entity types –The tuple in the relations is a list of entities (e 1,...,e n ) Type(e 1 )=t 1 or e i = Example : –Type : {person, job, company} John Smith is the CEO at Inc. Corp. (John Smith, CEO, Inc. Corp.) Everyday John Smith goes to his office at Inc. Corp. (John Smith,, Inc. Corp.)

4 Introduction - 2/2 Application : –Question answer –Automatic database generation –Intelligent document searching and indexing Most relation extraction systems focus on: –Binary relation : Such as employee of relation protein-protein interaction relation –Extracting keyphrases to represent relation in social networks from Web. (Matsuo et al., IJCAI-07)

5 Previous Work Zelenko et al., 2003 –Binary relation in news text John Smith, not Jane Smith, works at IBM. (John Smith, IBM) : positive (Jane Smith, IBM) : negative Miller et al., 2000 –Identify all relations Relation extraction from probabilistic parsing tree Rosario and Hearst, 2004 –Extracting seven relationships between treatments and diseases

6 Definitions n-ary relation –The relation is definded by the schema (t 1,…, t n ) t i is entity types –The tuple in the relations is a list of entities (e1,...,en) Type(e 1 )=t 1 or e i = A maximal clique –An undirected graph G=(V,E) V: vertices, E: a set of edges –A clique C of G is a subgraph of G in which there is an edge between every pair of vertices. –A maximal clique of G is a clique C=(Vc, Ec) such that there is no other clique C=(Vc, Ec) such that Vc Vc.

7 Methods : Classifying Binary Relations-1/3 Example : {person, job, company} –John and Jane are CEOs at Inc. Corp. and Biz. Corp. respectively. –12 possible tuples Problems with building a classifier –Exponential run time –How to manage incomplete but correct instancesincomplete but correct instances (John,,Inc. Corp.) If it is marked as negative, –the model might incorrectly disfavor features that correlate John to Inc.Corp.. If it is labeled as positive, –the model may tend to prefer the shorter and more compact incomplete relations. If we ignore instances of this form, –the data would be heavily skewed towards negative instances.

8 Methods : Classifying Binary Relations-2/3 Solution : –The set of all possible pairs is much smaller then the set of all possible complex relation instances. –To train a classifier to identify pairs of related entities. Positive : –(John,CEO), (John, Inc. Corp.), (CEO, Inc. Corp.), (CEO, Biz. Corp.), (Jane,CEO) and (Jane, Biz. Corp.). Negative : –(John, Biz. Corp.) and (Jane, Inc. Corp.)

9 Methods : Classifying Binary Relations-3/3 Learning a binary relation classifier : –A standard maximum entropy classifier (Berger et al., 1996) implemented as part of MALLET (McCallum, 2002)

10 Methods : Reconstructing Complex Relations Example : According to binary classifier –(John,CEO), (John, Inc. Corp.), (John, Biz. Corp.), (CEO, Inc. Corp.), (CEO, Biz. Corp.) and (Jane,CEO). –Relation Graph : Figure 2a –Cliques : Figure 2b Algorithm for finding all maximal cliques : –Born and Kerbosch, 1973

11 Methods : Probabilistic Cliques The above approach has a major shortcoming in that it assumes the output of the binary classifier to be absolutely correct. Weight of a clique (C) –w(e) : weight (probabilistic) of edge e A vaild tuple : (C) 0.5

12 Experiments-1/2 Extracting genomic variation events from biomedical text (Mcdonal et al., 2004) (var-type, location, initial-state, altered-state) –At codons 12 and 61, the occurrence of point mutations from G/A to T/G were observed –(point mutation, codon 12, G, T) –(point mutation, codon 61, A, G) 447 abstracts selected from MEDLINE –4691 sentences –4773 entities and 1218 relations –Of the 1218 relations : 760 have two, 283 have one, 175 have no arguments 38% cannot be handled using binary relations 4% of the relations annotated are non-sentential Maximum recall : 96%

13 Experiments-2/2 MC: –Uses the maximum entropy binary classifier coupled with the maximal clique complex relation reconstructor. PC: –Same as above, except it uses the probabilistic clique complex relation reconstructor. NE: –A maximum entropy classifier that naively enumerates all possible relation instances as described in Page 7.Page 7

14 Experiments : Results-1/2

15 Experiments : Results-2/2

16 Conclusions and Future Work Complex relation extraction: –Binary relation learning: Maximum Entropy Classifier –Finding maximal cliques in graph –Genomic variation relations Future work –Parse trees –Learn how to cluster vertices into relational groups –A vertex/entity can participate in one or more relation

17 Learning Field Compatibilities to Extract Database Records from Unstructured Text –M Wick, A Culotta, A McCallum - (EMNLP 2006) Using Dependency Parsing and Probabilistic Inference to Extract Rela-tionships between Genes –B Goertzel, H Pinto, A Heljakka, IF Goertzel, M – (BioNLP 2006) Relation Extraction for Semantic Intranet Annotations –L Specia, C Baldassarre, E Motta - kmi.open.ac.uk –Relation Extraction for Semantic Intranet Annotations Technical Report