ComplQA: Complex Question Answering over Knowledge Base

Slides:



Advertisements
Similar presentations
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
Advertisements

Date: 2014/05/06 Author: Michael Schuhmacher, Simon Paolo Ponzetto Source: WSDM’14 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Knowledge-based Graph Document.
gSpan: Graph-based substructure pattern mining
O(N 1.5 ) divide-and-conquer technique for Minimum Spanning Tree problem Step 1: Divide the graph into  N sub-graph by clustering. Step 2: Solve each.
Fast Algorithms For Hierarchical Range Histogram Constructions
Large-Scale Entity-Based Online Social Network Profile Linkage.
Patch to the Future: Unsupervised Visual Prediction
Using Schema Matching to Simplify Heterogeneous Data Translation Tova Milo, Sagit Zohar Tel Aviv University.
Interactive Generation of Integrated Schemas Laura Chiticariu et al. Presented by: Meher Talat Shaikh.
Aki Hecht Seminar in Databases (236826) January 2009
A High Performance Application Representation for Reconfigurable Systems Wenrui GongGang WangRyan Kastner Department of Electrical and Computer Engineering.
Extracting Structured Data from Web Page Arvind Arasu, Hector Garcia-Molina ACM SIGMOD 2003.
Page 1 Multidatabase Querying by Context Ramon Lawrence, Ken Barker Multidatabase Querying by Context.
Unsupervised Category Modeling, Recognition and Segmentation Sinisa Todorovic and Narendra Ahuja.
ANHAI DOAN ALON HALEVY ZACHARY IVES Chapter 6: General Schema Manipulation Operators PRINCIPLES OF DATA INTEGRATION.
Data Structures Lecture-1:Introduction
Large-Scale Cost-sensitive Online Social Network Profile Linkage.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
OOSE 01/17 Institute of Computer Science and Information Engineering, National Cheng Kung University Member:Q 薛弘志 P 蔡文豪 F 周詩御.
Recognition of Multi-sentence n-ary Subcellular Localization Mentions in Biomedical Abstracts G. Melli, M. Ester, A. Sarkar Dec. 6, 2007
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
MINING RELATED QUERIES FROM SEARCH ENGINE QUERY LOGS Xiaodong Shi and Christopher C. Yang Definitions: Query Record: A query record represents the submission.
Graph Data Management Lab, School of Computer Science gdm.fudan.edu.cn XMLSnippet: A Coding Assistant for XML Configuration Snippet.
GLOSSARY COMPILATION Alex Kotov (akotov2) Hanna Zhong (hzhong) Hoa Nguyen (hnguyen4) Zhenyu Yang (zyang2)
Interoperability in Information Schemas Ruben Mendes Orientador: Prof. José Borbinha MEIC-Tagus Instituto Superior Técnico.
Extracting Semantic Constraint from Description Text for Semantic Web Service Discovery Dengping Wei, Ting Wang, Ji Wang, and Yaodong Chen Reporter: Ting.
AN IMPLEMENTATION OF A REGULAR EXPRESSION PARSER
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
P15 Lai Xiaoni (U077151L) Qiao Li (U077194E) Saw Woei Yuh (U077146X) Wang Yong (U077138Y)
RRXS Redundancy reducing XML storage in relations O. MERT ERKUŞ A. ONUR DOĞUÇ
Q2Semantic: A Lightweight Keyword Interface to Semantic Search Haofen Wang 1, Kang Zhang 1, Qiaoling Liu 1, Thanh Tran 2, and Yong Yu 1 1 Apex Lab, Shanghai.
A Systematic Exploration of the Feature Space for Relation Extraction Jing Jiang & ChengXiang Zhai Department of Computer Science University of Illinois,
Parallel XML Parsing Using Meta-DFAs Yinfei Pan 1, Ying Zhang 1, Kenneth Chiu 1, Wei Lu 2 1 State University of New York (SUNY) Binghamton 2 Indiana University.
Using a Named Entity Tagger to Generalise Surface Matching Text Patterns for Question Answering Mark A. Greenwood and Robert Gaizauskas Natural Language.
Mining Dependency Relations for Query Expansion in Passage Retrieval Renxu Sun, Chai-Huat Ong, Tat-Seng Chua National University of Singapore SIGIR2006.
DeepDive Introduction Dongfang Xu Ph.D student, School of Information, University of Arizona Sept 10, 2015.
The Database and Info. Systems Lab. University of Illinois at Urbana-Champaign Understanding Web Query Interfaces: Best-Efforts Parsing with Hidden Syntax.
Alex Stabile. Research Questions: Could a computer learn to distinguish between different composers? Why does music by different composers even sound.
Graphs Definition: a graph is an abstract representation of a set of objects where some pairs of the objects are connected by links. The interconnected.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation By: Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou.
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
Open question answering over curated and extracted knowledge bases
DATA MINING © Prentice Hall.
Rey-Long Liu Dept. of Medical Informatics Tzu Chi University Taiwan
Web News Sentence Searching Using Linguistic Graph Similarity
CS416 Compiler Design lec00-outline September 19, 2018
Associative Query Answering via Query Feature Similarity
Web Data Extraction Based on Partial Tree Alignment
Programming Language Syntax 7
Introduction CI612 Compiler Design CI612 Compiler Design.
Large Graph Mining: Power Tools and a Practitioner’s guide
Knowledge Base Completion
Lecture 19-Problem Solving 4 Incremental Method
Table Cell Search for Question Answering Huan Sun
MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks Jing Zhang+, Bo Chen+, Xianming Wang+, Fengmei Jin+, Hong Chen+, Cuiping.
[jws13] Evaluation of instance matching tools: The experience of OAEI
G-CORE: A Core for Future Graph Query Languages
Graph and Tensor Mining for fun and profit
Block Matching for Ontologies
CS416 Compiler Design lec00-outline February 23, 2019
Towards Exploratory Relationship Search: A Clustering-Based Approach
Danyun Xu, Gong Cheng*, Yuzhong Qu
Kbcqa ywsun, llzhang.
Related Work ywsun.
Enriching Taxonomies With Functional Domain Knowledge
Supporting High-Performance Data Processing on Flat-Files
Lec00-outline May 18, 2019 Compiler Design CS416 Compiler Design.
Heterogeneous Graph Attention Network
Presentation transcript:

ComplQA: Complex Question Answering over Knowledge Base Yawei Sun, Lingling Zhang, Gong Cheng, and Yuzhong Qu

Outline Introduction Approach Experiments Conclusion

Introduction Simple Question Complex Question Simple Relation e.g., Who is the wife of Barack Obama ? Complex Question Multiple Relations e.g., What type of music featured on the album Epica was composed by Mozart ?

Introduction - existing works and challenge Siva: transform dependency tree to logical form. (emnlp17) Abu: recognize sub-questions from dependency tree. (www17) Zou: construct query graph based on dependency tree. (tkde18) Challenge 1: parser error problem Question parser error leads to their ill-formed representations.

Introduction - existing works and challenge Cheng: structurally isomorphic.(acl17) Siva: not isomorphic, two operations: contract, and expand. (emnlp17) Zou: not isomorphic, spanning tree of super graph. (tkde18) Challenge 2: structure mismatch problem

Introduction - our solution To alleviate structure mismatch problem Expand existing work by ‘Insert Node’ operation. To alleviate parser error problem Propose Span Tree What type was composed ? by Mozart of music featured on the album Epica

Approach - pipeline (1) Parsing (2) Grounding Question 1.1 Span Tree Grounded Graph Question Ungrounded Graph 1.1 Span Tree 2.1 Node Linking 1.2 Graph Generation 2.2 Structure Mapping

Example: What type of music featured on the album Epica was composed by Mozart ? Class: what type of music ?music.genre 1 1 featured album composed music.album.genre music.artist.genre 2 3 2 3 Entity: Epica Entity: Mozart m.010rcgyt m.082db Ungrounded Graph Grounded Graph

1. Parsing p ungrounded graph question)= p ungrounded graph span tree)∗p span tree question) 1.1 Span Tree: p span tree question) 1.2 Graph Generation: p ungrounded graph span tree)

1.1 Span Tree - p span tree question) Definition: a skeleton with modifier spans (allowed nested) Algorithm 1. Seq2List: Input: question; Output: span tree tokens = [token1, token2,…tokenn] num_layer_list = [0…0], epoch=0 While not finished do epoch += 1 Predict redundancy_span Update(num_layer_list, redundancy_span, epoch) Shield redundancy_span in tokens Update(num_layer_list, skeleton, epoch+1) Return List2Tree(num_layer_list) Algorithm 2. List2Lree: Input: num_layer_list, tokens Output: span tree span_tree = tree() While i <= max(num_layer_list) do span_tree.add(current_span) span_tree.add(child_span) hd = find_headword (child_span, current_span) span_tree.addedge(span,child_span, hd) Return span_tree

1.1 Span Tree - example epoch What type of music featured on the album Epica was composed by Mozart 1 2 3 What type was composed ? by Mozart [4, 4, 2, 2, 1, 1, 1, 1, 1, 4, 4, 3, 3, 4] of music featured on the album Epica

1.2 Graph Generation p ungrounded graph span tree) 1.2.1 Node Recognition 1.2.2 Relation Extraction Class: what type of music 1 What type was composed ? by Mozart featured album composed 2 3 of music featured on the album Epica Entity: Epica Entity: Mozart Ungrounded Graph

2. Grounding p grounded graph ungrounded graph) 2.1 Node Linking 2.2 Structure Mapping

2.1 Node Linking Entity Linking Class Linking A dictionary-based method E.g., Mozart -- m.082db Class Linking A word-embedding similarity method E.g,, what type of music -- music.genre ?music.genre 1 2 3 m.010rcgyt m.082db

2.2 Structure Mapping 2.2.1 Maping 2.2.2 Path Match 2.2.3 Ranking Method 1: Instance-Level Method 2: Schema-Level 2.2.2 Path Match 2.2.3 Ranking Maximum-entropy model ?music.genre 1 music.album.genre music.artist.genre 2 3 m.010rcgyt m.082db

2.2.1 Maping Instance-Level Schema-Level ?music.genre ?music.genre Insert Node Mediator m.010rcgyt --{}-- ?x m.082db --{}-- ?x Schema-Level Insert Node m.082db --{}-- ?x 1 2 3 m.010rcgyt m.082db 2 3 m.010rcgyt m.082db

2.2.2 Path Match

Experiments - GraphQuestions Baselines F1 SEMPRE 13 10.80 PARASEMPRE 14 12.79 JACANA 14 5.08 Siva 17 17.6 Cheng 17 17.02 Dong 17 20.4 ComplQA 26.16

Experiments - ComplexWebQuestions Baselines Precision SIMPQA + PRETRAINED 19.9 SPLITQA + PRETRAINED 25.9 MHQA-GRN 30.1 SplitQA + data augmentation 34.2 Human 63 ComplQA

Conclusion Parsing Grounding Challenge 1: parser error problem Solution: Span Tree Grounding Challenge 2: structure mismatch problem Solution: Insert Node

Thank you Q & A