Summarizing Entity Descriptions for Effective and Efficient Human-centered Entity Linking. Gong Cheng, Danyun Xu, Yuzhong Qu
Motivation: Entity linking; manual efforts; gold-standard links for evaluation; crowdsourcing
Approaches: Characterizing power. Characterizing information: a unique feature has high characterizing power. Diversity: information overlap (logical inference, string/numerical similarity)
Approaches: Characterizing power. Combinatorial optimization problem: binary quadratic knapsack problem
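To make the binary quadratic knapsack formulation concrete, here is a minimal sketch in Python. It is not the authors' solver: it enumerates every subset, which is only practical for a handful of features. The names `solve_qkp`, `profit`, `pair_profit`, `weight` and `capacity` are hypothetical, and the numbers are invented for illustration. The reading of per-feature profit as characterizing power and of negative pairwise profit as an overlap penalty is an assumption.

```python
from itertools import combinations

def solve_qkp(profit, pair_profit, weight, capacity):
    """Exhaustively solve a small binary quadratic knapsack instance.

    profit[i]         : gain for selecting item i on its own
    pair_profit[i][j] : extra gain (negative = penalty) when both i and j
                        are selected, read for i < j only
    weight[i]         : size of item i
    capacity          : maximum total size of the selection
    """
    n = len(profit)
    best_value, best_subset = 0.0, ()
    for k in range(1, n + 1):
        for subset in combinations(range(n), k):
            if sum(weight[i] for i in subset) > capacity:
                continue  # selection does not fit in the knapsack
            value = sum(profit[i] for i in subset)
            # quadratic term: one contribution per selected pair
            value += sum(pair_profit[i][j] for i, j in combinations(subset, 2))
            if value > best_value:
                best_value, best_subset = value, subset
    return best_value, best_subset

# Illustrative numbers only (assumed, not from the slides):
# features 0 and 1 are individually strong but overlap heavily.
profit = [3.0, 2.5, 1.0]
pair_profit = [
    [0.0, -2.0, 0.0],
    [0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0],
]
weight = [1, 1, 1]
capacity = 2

value, chosen = solve_qkp(profit, pair_profit, weight, capacity)
print(value, chosen)
```

With these numbers the overlap penalty makes the pair (0, 2) preferable to the two individually strongest features (0, 1), which is the effect the quadratic term is meant to capture.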
Approaches: Differentiating power. Logical inference; string/numerical similarity
Approaches: Differentiating power. Combinatorial optimization problem: binary quadratic multidimensional knapsack problem
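The multidimensional variant differs from the plain quadratic knapsack in having several capacity constraints instead of one. The sketch below is a brute-force illustration, not the authors' algorithm. The names `solve_qmkp`, `weights` and `capacities` are hypothetical. Treating each dimension as the length budget of one candidate entity's summary is an assumption made here for the example and is not stated on the slide.

```python
from itertools import combinations

def solve_qmkp(profit, pair_profit, weights, capacities):
    """Exhaustively solve a small quadratic multidimensional knapsack.

    weights[d][i] : size of item i in dimension d
    capacities[d] : limit for dimension d
    pair_profit   : read for index pairs i < j only
    """
    n = len(profit)
    best_value, best_subset = 0.0, ()
    for k in range(1, n + 1):
        for subset in combinations(range(n), k):
            # every dimension must respect its own capacity
            feasible = all(
                sum(row[i] for i in subset) <= cap
                for row, cap in zip(weights, capacities)
            )
            if not feasible:
                continue
            value = sum(profit[i] for i in subset)
            value += sum(pair_profit[i][j] for i, j in combinations(subset, 2))
            if value > best_value:
                best_value, best_subset = value, subset
    return best_value, best_subset

# Assumed setup: items 0-1 are features of candidate A, items 2-3 of
# candidate B; each candidate's summary may hold one feature.
profit = [2.0, 1.5, 1.0, 2.0]
pair_profit = [[0.0] * 4 for _ in range(4)]
pair_profit[0][2] = 0.5   # mild gain when features 0 and 2 appear together
pair_profit[1][2] = 2.0   # features 1 and 2 set the candidates apart strongly
weights = [
    [1, 1, 0, 0],  # budget of candidate A's summary
    [0, 0, 1, 1],  # budget of candidate B's summary
]
capacities = [1, 1]

value, chosen = solve_qmkp(profit, pair_profit, weights, capacities)
print(value, chosen)
```

Here the pairwise term selects features 1 and 2 even though features 0 and 3 have the highest individual profits, because the pair's joint contribution outweighs the difference.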
Approaches: Relevance. Class Vector Model: CF-IIF (class frequency-inverse instance frequency), cosine similarity. MMR (maximal marginal relevance)
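As a rough illustration of how cosine similarity over weighted class vectors and MMR fit together, the sketch below uses the standard greedy form of maximal marginal relevance. It does not reproduce the formula from the slide, which is missing here. The weights stand in for CF-IIF values and are made up. The names `cosine`, `mmr_select`, `context` and the trade-off parameter `lam` are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(k, 0.0) for k, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

def mmr_select(candidates, context, k, lam=0.5):
    """Greedy MMR: trade relevance to `context` against redundancy
    with the items that were already selected."""
    selected = []
    remaining = dict(candidates)
    while remaining and len(selected) < k:
        def score(name):
            relevance = cosine(remaining[name], context)
            redundancy = max(
                (cosine(remaining[name], candidates[s]) for s in selected),
                default=0.0,
            )
            return lam * relevance - (1 - lam) * redundancy
        best = max(remaining, key=score)
        selected.append(best)
        del remaining[best]
    return selected

# Made-up class vectors; real weights would come from CF-IIF.
context = {"City": 1.0, "Place": 0.5}
features = {
    "f1": {"City": 1.0},
    "f2": {"City": 0.9, "Place": 0.1},
    "f3": {"Place": 1.0},
}
print(mmr_select(features, context, k=2))
```

In this example `f2` is picked first as the most relevant feature, and `f3` is picked second because `f1` is nearly identical to `f2` and is penalized for redundancy.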
Approaches: Combination. Binary quadratic multidimensional knapsack problem
Experiments. Dataset: KB: DBpedia; text corpora: AQUAINT, IITB. Approaches: DESC, CHR, DFF, CNT, COMB, RELIN. Task: choose the correct entity from three candidate entities according to the entity mention and its context; rate and comment
Evaluation: Extrinsic Evaluation. 30 students, each completing a total of 72 tasks, 36 from each corpus
Evaluation: Intrinsic Evaluation. Dominating factors: characterizing power and information overlap
Evaluation: Intrinsic Evaluation. Comments (CHR, DFF, COMB): 53% IITB: highly distinguishing features; 50% IITB: different types helped them filter out noise entities easily; 60% AQUAINT: apart from different types, almost no useful information; 80% IITB: some highly distinguishing features; 90% AQUAINT 53%: comprehensive information was provided
Thanks Q&A