Learning Co-reference Relations for FOAF Instances Jennifer Sleeman and Tim Finin, University of Maryland, Baltimore County Motivation Establishing co-reference.

Slides:



Advertisements
Similar presentations
Machine Learning: Intro and Supervised Classification
Advertisements

Semantic Web Thanks to folks at LAIT lab Sources include :
Combining Classification and Model Trees for Handling Ordinal Problems D. Anyfantis, M. Karagiannopoulos S. B. Kotsiantis, P. E. Pintelas Educational Software.
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
OWL TUTORIAL APT CSA 3003 OWL ANNOTATOR Charlie Abela CSAI Department.
Classification of the aesthetic value of images based on histogram features By Xavier Clements & Tristan Penman Supervisors: Vic Ciesielski, Xiadong Li.
Machine Learning in Practice Lecture 3 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Computing FOAF Co-reference Relations with Rules and Machine Learning Jennifer Sleeman and Tim Finin University of Maryland, Baltimore County The Third.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 The Enhanced Entity- Relationship (EER) Model.
Reliability and Information Gain Ida Sprinkhuizen-Kuyper Evgueni Smirnov Georgi Nalbantov (UM/EUR)
Information Extraction from HTML: General Machine Learning Approach Using SRV.
11/8/20051 Ontology Translation on the Semantic Web D. Dou, D. McDermott, P. Qi Computer Science, Yale University Presented by Z. Chen CIS 607 SII, Week.
Department of Computer Science, University of Maryland, College Park 1 Sharath Srinivas - CMSC 818Z, Spring 2007 Semantic Web and Knowledge Representation.
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
Describing Syntax and Semantics
Experimental Evaluation
For Better Accuracy Eick: Ensemble Learning
A Privacy Preserving Efficient Protocol for Semantic Similarity Join Using Long String Attributes Bilal Hawashin, Farshad Fotouhi Traian Marius Truta Department.
Attention Deficit Hyperactivity Disorder (ADHD) Student Classification Using Genetic Algorithm and Artificial Neural Network S. Yenaeng 1, S. Saelee 2.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
Semantic Analytics on Social Networks: Experiences in Addressing the Problem of Conflict of Interest Detection Boanerges Aleman-Meza, Meenakshi Nagarajan,
BILINGUAL CO-TRAINING FOR MONOLINGUAL HYPONYMY-RELATION ACQUISITION Jong-Hoon Oh, Kiyotaka Uchimoto, Kentaro Torisawa ACL 2009.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
1 Expert Finding for eCollaboration Using FOAF with RuleML Rules MCeTECH May 2006 Jie Li 1,2, Harold Boley 1,2, Virendrakumar C. Bhavsar 1, Jing.
1 Scaling multi-class Support Vector Machines using inter- class confusion Author:Shantanu Sunita Sarawagi Sunita Sarawagi Soumen Chakrabarti Soumen Chakrabarti.
Mathematical Modeling and Formal Specification Languages CIS 376 Bruce R. Maxim UM-Dearborn.
A Z Approach in Validating ORA-SS Data Models Scott Uk-Jin Lee Jing Sun Gillian Dobbie Yuan Fang Li.
Samad Paydar Web Technology Lab. Ferdowsi University of Mashhad 10 th August 2011.
1 Learning Sub-structures of Document Semantic Graphs for Document Summarization 1 Jure Leskovec, 1 Marko Grobelnik, 2 Natasa Milic-Frayling 1 Jozef Stefan.
Database Systems: Enhanced Entity-Relationship Modeling Dr. Taysir Hassan Abdel Hamid.
Today Ensemble Methods. Recap of the course. Classifier Fusion
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Stefan Mutter, Mark Hall, Eibe Frank University of Freiburg, Germany University of Waikato, New Zealand The 17th Australian Joint Conference on Artificial.
Multi-Relational Data Mining: An Introduction Joe Paulowskey.
Reputation Network Analysis for Filtering Ravi Emani Ramesh Ravindran.
Part4 Methodology of Database Design Chapter 07- Overview of Conceptual Database Design Lu Wei College of Software and Microelectronics Northwestern Polytechnical.
Event-Centric Summary Generation Lucy Vanderwende, Michele Banko and Arul Menezes One Microsoft Way, WA, USA DUC 2004.
Ensemble with Neighbor Rules Voting Itt Romneeyangkurn, Sukree Sinthupinyo Faculty of Computer Science Thammasat University.
Cooperative Classifiers Rozita Dara Supervisor: Prof. Kamel Pattern Analysis and Machine Intelligence Lab University of Waterloo.
Text Mining & NLP based Algorithm to populate ontology with A-Box individuals and object properties Alexandre Kouznetsov and Christopher J. O. Baker, University.
Using linked data to interpret tables Varish Mulwad September 14,
Inference Protocols for Coreference Resolution Kai-Wei Chang, Rajhans Samdani, Alla Rozovskaya, Nick Rizzolo, Mark Sammons, and Dan Roth This research.
Ontology Quality by Detection of Conflicts in Metadata Budak I. Arpinar Karthikeyan Giriloganathan Boanerges Aleman-Meza LSDIS lab Computer Science University.
ELIS – Multimedia Lab PREMIS OWL Sam Coppens Multimedia Lab Department of Electronics and Information Systems Faculty of Engineering Ghent University.
Intelligent Database Systems Lab Presenter: CHANG, SHIH-JIE Authors: Luca Cagliero, Paolo Garza 2013.DKE. Improving classification models with taxonomy.
CityStateMayorPopulation BaltimoreMDS.C.Rawlings-Blake637,418 SeattleWAM.McGinn617,334 BostonMAT.Menino645,169 RaleighNCC.Meeker405,791 We are laying a.
Semantic Web Knowledge Fusion Jennifer Sleeman University of Maryland, Baltimore County Motivation Definitions Methodology Evaluation Future Work Based.
Practical RDF Chapter 12. Ontologies: RDF Business Models Shelley Powers, O’Reilly SNU IDB Lab. Taikyoung Kim.
Text Based Similarity Metrics and Delta for Semantic Web Graphs Krishnamurthy Koduvayur Viswanathan Monday, June 28,
Support Vector Machines and Kernel Methods for Co-Reference Resolution 2007 Summer Workshop on Human Language Technology Center for Language and Speech.
Semantic Web for the Working Ontologist - RDFS-Plus TEAM C 현근수, 김영욱, 백상윤, 이용현.
An Ontology-based Approach to Context Modeling and Reasoning in Pervasive Computing Dejene Ejigu, Marian Scuturici, Lionel Brunie Laboratoire INSA de Lyon,
Virtual Examples for Text Classification with Support Vector Machines Manabu Sassano Proceedings of the 2003 Conference on Emprical Methods in Natural.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
1 An infrastructure for context-awareness based on first order logic 송지수 ISI LAB.
Semantic Web for the Working Ontologist Dean Allemang Jim Hendler SNU IDB laboratory.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Artificial Intelligence Logical Agents Chapter 7.
High resolution product by SVM. L’Aquila experience and prospects for the validation site R. Anniballe DIET- Sapienza University of Rome.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Ensemble methods with Data Streams
Simone Paolo Ponzetto University of Heidelberg Massimo Poesio
Saisai Gong, Wei Hu, Yuzhong Qu
Improving a Pipeline Architecture for Shallow Discourse Parsing
ece 720 intelligent web: ontology and beyond
Text Based Similarity Metrics and Delta for Semantic Web Graphs
An Interactive Approach to Collectively Resolving URI Coreference
Relations for functions.
Logics for Data and Knowledge Representation
Presentation transcript:

Learning Co-reference Relations for FOAF Instances Jennifer Sleeman and Tim Finin, University of Maryland, Baltimore County Motivation Establishing co-reference relations for entities is a common problem. Our goal is to establish co- reference relations among FOAF agents. FOAF co-referent issues: No global unique identifiers Inverse Functional Properties not always reliable Multiple versions of FOAF files for a single entity When two instances are thought to be co-referent, information can be combined providing a more complete representation of the entity. In Semantic Web this is termed as 'smushing'. Smushing issues: Outdated information Conflicting information Other alignment-based issues owl:sameAs dangers Co-Referent Predicate : coref a owl:TransitiveProperty. :coref a owl:SymmetricProperty. owl:sameAs rdfs:subPropertyOf :coref. :notCoref a owl:SymmetricProperty. owl:differentFrom rdfs:subPropertyOf :notCoref. {?a :notCoref ?b. ?b :coref ?c.} => {?a :notCoref ?c}. {?a foaf:knows ?b.} => {?a :notCoref ?b}. Methodology Results Future Work After co-reference is established among pairs we cluster our pairs and use these clusters for future co-reference evaluations. We use an ensemble approach with both the rules and a classifier to evaluate pairs. Predicting accurately co-referent/non-co-referent pairs Enhanced clustering algorithm Application to RDF documents non-FOAF specific For experiment one: 900 pairs designated non-match majority other rules returned undetermined state For experiment two we show in Table 1: only inverse functional property rule positive cases majority resulted in undetermined state knows rule resulted in non-coreferent state During E2 clustering, first phase resulted in 90% accuracy. Errors occurred in pairs that should have been clustered but were not. A second round of clustering yielded no new relationship pairs among instances but cluster to cluster pairing did occur. E2 # of PairsRuleRule Conclusion differentFromUndetermined 47184Inverse functionalUndetermined 2402Inverse functionalCo-referent Knows graphUndetermined sameAs Undetermined Knows graphNot Co-referent  1 st experiment resulting in 50,000 triples/500 entity mentions/600 training  2 nd experiment with 250,000 triples/3500 entity mentions/1800 training classes  10-fold validation with results shown in Table 2 Figure 2 & Figure 3 : Clustering Approaches Table 1: Rules-based Results ExperimentTP RateFP RatePrecisionRecallF-Measure E E Table 2: 10-Fold Cross Validation Test coreferent coref Two FOAF instances are determined to be co-referent. Instance 1 and 2 add an explicit coref property for each other and form cluster 1. It is determined that cluster 1 and FOAF instance 3 are co-referent. Instance 3 joins cluster 1 and instance 1 and 2 have an explicit coref property that joins each with instance 3. 4 coreferent coref FOAF instance 1 and 2 are determined to be co-referent. Instance 1 and 2 add an explicit coref property for each other and form cluster 1. Instance 3 and 4 add an explicit coref property for each other and form cluster 2. It is determined that cluster 1 and cluster 2 are co-referent. Each instance adds an explicit coref property for each other. 34 coreferent FOAF instance 3 and 4 are determined to be co-referent. 2 coref The following axioms in N3 are for the coref and notCoref properties. coref – transitive and symmetric, owl:sameAs as a sub-property notCoref – symmetric but not transitive, owl:differentFrom as sub-property Figure 1: System Architecture Ingestion Candidate Pair Generation Candidate Pair Generation Rule-based Reasoning Rule-based Reasoning Machine Learning Machine Learning Model Generation Abstract entity generation Potential pairs reduces workload for classifier Deductive Decisions Deductive Decisions Predictions clusters form new abstract entities Co-referent designation and clustering 1. Generate candidate pairs 2. Generate a rules-based model 3. Perform classification using SVMs 4. Designate pairs as co-referent 5. Cluster pairs