Entity Description Pattern Extraction and Their Usage in Entity Query

Slides:



Advertisements
Similar presentations
CWS: A Comparative Web Search System Jian-Tao Sun, Xuanhui Wang, § Dou Shen Hua-Jun Zeng, Zheng Chen Microsoft Research Asia University of Illinois at.
Advertisements

Compose Workflow. Home page To compose a workflow navigate to the “Workflow Editor” page.
Query Chain Focused Summarization Tal Baumel, Rafi Cohen, Michael Elhadad Jan 2014.
A Phrase Mining Framework for Recursive Construction of a Topical Hierarchy Date : 2014/04/15 Source : KDD’13 Authors : Chi Wang, Marina Danilevsky, Nihit.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
Chase Repp.  knowledge discovery  searching, analyzing, and sifting through large data sets to find new patterns, trends, and relationships contained.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
COM (Co-Occurrence Miner): Graph Classification Based on Pattern Co-occurrence Ning Jin, Calvin Young, Wei Wang University of North Carolina at Chapel.
Towards Semantic Web Mining Bettina Berndt Andreas Hotho Gerd Stumme.
LinkSelector: A Web Mining Approach to Hyperlink Selection for Web Portals Xiao Fang University of Arizona 10/18/2002.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
NUITS: A Novel User Interface for Efficient Keyword Search over Databases The integration of DB and IR provides users with a wide range of high quality.
Web Usage Mining with Semantic Analysis Date: 2013/12/18 Author: Laura Hollink, Peter Mika, Roi Blanco Source: WWW’13 Advisor: Jia-Ling Koh Speaker: Pei-Hao.
Assembler Efficient Discovery of Spatial Co-evolving Patterns in Massive Geo-sensory Data Sheng QIAN SIGKDD 2015.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Mining Optimal Decision Trees from Itemset Lattices Dr, Siegfried Nijssen Dr. Elisa Fromont KDD 2007.
Entity Recognition via Querying DBpedia ElShaimaa Ali.
Building Search Portals With SP2013 Search. 2 SharePoint 2013 Search  Introduction  Changes in the Architecture  Result Sources  Query Rules/Result.
Extracting Semantic Constraint from Description Text for Semantic Web Service Discovery Dengping Wei, Ting Wang, Ji Wang, and Yaodong Chen Reporter: Ting.
1 Mining Test Oracles for Search Engines Wujie Zheng
Web Usage Mining for Semantic Web Personalization جینی شیره شعاعی زهرا.
Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA
Summarization of XML Documents K Sarath Kumar. Outline I.Motivation II.System for XML Summarization III.Ranking Model and Summary Generation IV.Example.
Improving Web Search Results Using Affinity Graph Benyu Zhang, Hua Li, Yi Liu, Lei Ji, Wensi Xi, Weiguo Fan, Zheng Chen, Wei-Ying Ma Microsoft Research.
1 Web-Page Summarization Using Clickthrough Data* JianTao Sun, Yuchang Lu Dept. of Computer Science TsingHua University Beijing , China Dou Shen,
Oracle Database 11g Semantics Overview Xavier Lopez, Ph.D., Dir. Of Product Mgt., Spatial & Semantic Technologies Souripriya Das, Ph.D., Consultant Member.
Wikipedia as Sense Inventory to Improve Diversity in Web Search Results Celina SantamariaJulio GonzaloJavier Artiles nlp.uned.es UNED,c/Juan del Rosal,
FlexTable: Using a Dynamic Relation Model to Store RDF Data IDS Lab. Seungseok Kang.
Shridhar Bhalerao CMSC 601 Finding Implicit Relations in the Semantic Web.
From Path Tree To Frequent Patterns: A Framework for Mining Frequent Patterns Yabo Xu, Jeffrey Xu Yu, Guimei Liu, Hongjun Lu, Proc. of the 2002 IEEE International.
DBpedia - A Crystallization Point
Characterizing Knowledge on the Semantic Web with Watson Mathieu d’Aquin, Claudio Baldassarre, Laurian Gridinoc, Sofia Angeletou, Marta Sabou, Enrico Motta.
On Frequent Chatters Mining Claudio Lucchese 1 st HPC Lab Workshop 6/15/12 1st HPC Workshp - Claudio Lucchese.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Finding similar items by leveraging social tag clouds Speaker: Po-Hsien Shih Advisor: Jia-Ling Koh Source: SAC 2012’ Date: October 4, 2012.
Web mining is the use of data mining techniques to automatically discover and extract information from Web documents/services
1 API Recommendation Wujie Zheng
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
1 Cross Market Modeling for Query- Entity Matching Manish Gupta, Prashant Borole, Praful Hebbar, Rupesh Mehta, Niranjan Nayak.
LRM-RDA Gordon Dunsire
Multiple approaches to archival description
A Research Oriented Study Report By :- Akash Saxena
Analyzing and Securing Social Networks
A Schema and Instance Based RDF Dataset Summarization Tool
Gong Cheng, Yanan Zhang, and Yuzhong Qu
LRM-RDA Gordon Dunsire.
Cross-library API Recommendation Using Web Search Engines
Mining Complex Data COMP Seminar Spring 2011.
Statistical Knowledge Patterns: Identifying Synonymous Relations in Large Linked Datasets Ziqi Zhang, Anna Lisa Gentile, Eva Blomqvist, Isabelle Augenstein,
COMP5331 FP-Tree Prepared by Raymond Wong Presented by Raymond Wong
Type-directed Topic Segmentation of Entity Descriptions
RDF graph summaries 金成 2014/11/3.
Qingxia Liu Interactive Hierarchical Tag Clouds for Summarizing Spatiotemporal Social Contents [ICDE 2014] Kang, Wei, Anthony KH Tung,
Web Mining Department of Computer Science and Engg.
FP-Growth Wenlong Zhang.
Towards Exploratory Relationship Search: A Clustering-Based Approach
Information Networks: State of the Art
Qingxia Liu A Generative Interpretation of RDF Dataset  and its Application in Summarization Qingxia Liu 2019/4/6.
Filtering Properties of Entities By Class
An Approach to Abstractive Multi-Entity Summarization
Faceted Filter Jidong Jiang
Metadata use in the Statistical Value Chain
Linked Data Reuse in the Language Services Industry
Tantan Liu, Fan Wang, Gagan Agrawal The Ohio State University
Semantic Navigation over Linked Data Using the Link Pattern Space
Text Mining Application Programming Chapter 9 Text Categorization
Links Liang Zheng
Requirements “Content Guide”
Embedding based entity summarization
Linked Data Ryan McAlister.
Presentation transcript:

Entity Description Pattern Extraction and Their Usage in Entity Query Websoft 蒋继东 2014/4/21

Content Goal Interface design Workflow Measure Experiment

Our goal Extract EDPs from a given RDF datasets Automatically generate a form-based query using EDPs

What is an EDP Entity Description Pattern (EDP) A dataset is composed of many entities. For any one of them, we extract its classes and properties to make a collection, which is called an Entity Description Pattern (EDP).

Entity query interface Input a class Person actor, actor_name, actor_actorid, performance, page (50603) Select ?x where { ?x a Person. } Query form EDP hierarchy director, director_directorid, director_name, label, page (…) editor, editor_name, editor_editorid, label, page (…)

Entity query interface Person EDP1 actor, actor_name, actor_actorid, performance, page (50603) Select ?x where { ?x a Person. ?x a actor. ?x actor_name ___. ?x actor_actorid ___. ?x performance ___. ?x page ___. } director, director_directorid, director_name, label, page (…) editor, editor_name, editor_editorid, label, page (…)

Entity query interface EDP path Person EDP1 →EDP1.1 actor, actor_name, actor_actorid, label, performance, page (32471) Select ?x where { ?x a Person. ?x a actor. ?x actor_name ___. ?x actor_actorid ___. ?x performance ___. ?x page ___. ?x label ___. } actor, actor_name, actor_actorid, performance, page, sameAs (…) actor, actor_name, actor_actorid, performance, page, hasPhotoCollection (…)

Workflow EDP extraction Divide a given dataset to entities and extract EDPs with frequency from it. {[Person, hasParent, name, shoesize], [Person, hasSister, name], [Woman, name, hasSister]}

Person, name, hasParent, shoesize Workflow EDP selection Top k or coverage ratio? EDP hierarchy EDPs Person, name Person, name, hasSister Women, name, hasSister Person, name, hasParent, shoesize

Measure Sub-EDP Maximum frequent itemsets mining Maximum coverage Included, subclass, sub-property… Maximum frequent itemsets mining Threshold? Maximum coverage EDP rank

Experiment Jamendo 1047950 triples. 335925 entities. 34 EDPs. The top 8 EDPs covered 90 percent of the entities and top 12 EDPs covered more than 99 percent of the entities. Few hierarchical relationships.

Experiment LinkedMDB 6148121 triples. 694399 entities. 8460 EDPs. The top 40 EDPs covered 83.3 percent of the entities. Abundant hierarchical relationships.

Thanks