Presentation is loading. Please wait.

Presentation is loading. Please wait.

Entity Description Pattern Extraction and Their Usage in Entity Query

Similar presentations


Presentation on theme: "Entity Description Pattern Extraction and Their Usage in Entity Query"— Presentation transcript:

1 Entity Description Pattern Extraction and Their Usage in Entity Query
Websoft 蒋继东 2014/4/21

2 Content Goal Interface design Workflow Measure Experiment

3 Our goal Extract EDPs from a given RDF datasets
Automatically generate a form-based query using EDPs

4 What is an EDP Entity Description Pattern (EDP)
A dataset is composed of many entities. For any one of them, we extract its classes and properties to make a collection, which is called an Entity Description Pattern (EDP).

5 Entity query interface
Input a class Person actor, actor_name, actor_actorid, performance, page (50603) Select ?x where { ?x a Person. } Query form EDP hierarchy director, director_directorid, director_name, label, page (…) editor, editor_name, editor_editorid, label, page (…)

6 Entity query interface
Person EDP1 actor, actor_name, actor_actorid, performance, page (50603) Select ?x where { ?x a Person. ?x a actor. ?x actor_name ___. ?x actor_actorid ___. ?x performance ___. ?x page ___. } director, director_directorid, director_name, label, page (…) editor, editor_name, editor_editorid, label, page (…)

7 Entity query interface
EDP path Person EDP1 →EDP1.1 actor, actor_name, actor_actorid, label, performance, page (32471) Select ?x where { ?x a Person. ?x a actor. ?x actor_name ___. ?x actor_actorid ___. ?x performance ___. ?x page ___. ?x label ___. } actor, actor_name, actor_actorid, performance, page, sameAs (…) actor, actor_name, actor_actorid, performance, page, hasPhotoCollection (…)

8 Workflow EDP extraction
Divide a given dataset to entities and extract EDPs with frequency from it. {[Person, hasParent, name, shoesize], [Person, hasSister, name], [Woman, name, hasSister]}

9 Person, name, hasParent, shoesize
Workflow EDP selection Top k or coverage ratio? EDP hierarchy EDPs Person, name Person, name, hasSister Women, name, hasSister Person, name, hasParent, shoesize

10 Measure Sub-EDP Maximum frequent itemsets mining Maximum coverage
Included, subclass, sub-property… Maximum frequent itemsets mining Threshold? Maximum coverage EDP rank

11 Experiment Jamendo 1047950 triples. 335925 entities. 34 EDPs.
The top 8 EDPs covered 90 percent of the entities and top 12 EDPs covered more than 99 percent of the entities. Few hierarchical relationships.

12 Experiment LinkedMDB 6148121 triples. 694399 entities. 8460 EDPs.
The top 40 EDPs covered 83.3 percent of the entities. Abundant hierarchical relationships.

13 Thanks


Download ppt "Entity Description Pattern Extraction and Their Usage in Entity Query"

Similar presentations


Ads by Google