Download presentation
Presentation is loading. Please wait.
Published byElwin Brown Modified over 9 years ago
1
KAnOE: Research Centre for Knowledge Analytics and Ontological Engineering Managing Semantic Data NACLIN-2014, 10 Dec 2014 Dr. Kavi Mahesh Dean of Research, PES University 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 1
2
Managing Semantic Data in Research Data Services Trend: Publish research data along with paper Digital library of research data How do we manage this data? E.g., our research requires: Several Tera Bytes of data 5 billion data elements so far 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 2
3
Inverting the Publication Model Past: Description of research results in English Show samples of data “Results, Discussion, Conclusion” framework Present: Publish article and entire dataset No links between article and data 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 3
4
The Inverted Publication Model Future: Inverted model: Publish self-contained data Publish data analytics Annotate the data with English descriptions where needed Rich linkage between datasets Web of linked data… 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 4
5
Illustration of Publishing Data 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 5
6
1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 6
7
1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 7
8
Self-Contained Dataset Requirements: Have a proper and consistent structure; Define each element both syntactically and semantically; Specify all the semantic constraints on permissible data values, their types and cardinalities; and Specify data provenance, etc. 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 8
9
Ontology of Research Data In other words, an ontology of research data Where is the “Dublin Core” of research data? E.g., CERIF 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 9
10
Why Semantic Data Management? Epistemology of science: Verifying research results Making sense of someone else’s data Documenting the usage scenario of data 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 10
11
1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 11
12
Ontology-based multi-domain metadata for research data management using triple stores Full Text: PDF Buy this Article Authors: João Rocha da Silva Universidade do Porto/INESC TEC, Portugal Cristina Ribeiro DEI, Universidade do Porto/INESC TEC, Portugal João Correia Lopes DEI, Universidade do Porto/INESC TEC, PortugalBuy this ArticleJoão Rocha da SilvaUniversidade do Porto/INESC TEC, PortugalCristina RibeiroDEI, Universidade do Porto/INESC TEC, PortugalJoão Correia LopesDEI, Universidade do Porto/INESC TEC, Portugal 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 12
13
Data on the Web: 5-Star Rating System * Data on the Web: E.g., data published as a set of scanned images ** Machine-Readable Data: E.g., data published as a spreadsheet *** Non-Proprietary Format: E.g., data published as a CSV file **** RDF Data: E.g., a drug database published in RDF ***** Linked RDF Data: Links to other people’s data are included. E.g., the Dbpedia dataset extracted from wikipedia 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 13
14
Linked Open Data: Principles Use URIs as names of things: E.g, mention author by URI, not just name. Use HTTP URIs so that people can look up those names. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL). Include links to other URIs, so people can discover more things. Sir Tim Berners-Lee 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 14
15
Linked Open Research Data Services Requirements: Uniquely identify all entities used in datasets such as experiments, specimens, locations, organizations, etc.; Interlink parts of datasets with precise parts of an article in both directions; Classify datasets using a suitable universal classification scheme; Cite other datasets, i.e., refer to them through links; Manage multiple versions and revisions of datasets; and Incorporate a suitable controlled vocabulary or ontology. 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 15
16
Architecture of Digital Library of Data 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 16
17
An Ontology for Research Data 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 17
18
Concluding Remarks Publishing and citing research data will be a common practice Digital libraries need to manage research data Data needs to be self-contained, therefore semantic Linked open data is promising We need a proper ontology of research data Keyword search may be good enough for documents, but not for datasets 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 18
19
Questions? Thank you! http://www.kanoe.org http://ontology.org.in ontology@pes.edu 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 19
20
http://www.kanoe.org 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 20
21
How? By applying Natural Language Generation Techniques on structure and semantics of Linked Open Datasets and underlying Ontologies. 1/25/2016 21 (c) Dr. Kavi Mahesh; Do not copy or distribute
22
Input Triples SubjectPredicateobject 1/25/2016 22 (c) Dr. Kavi Mahesh; Do not copy or distribute
23
Ontology for Discourse Structuring 1/25/2016 23 (c) Dr. Kavi Mahesh; Do not copy or distribute
24
Classes 1/25/2016 24 (c) Dr. Kavi Mahesh; Do not copy or distribute
25
Subclasses 1/25/2016 25 (c) Dr. Kavi Mahesh; Do not copy or distribute
26
Individuals 1/25/2016 26 (c) Dr. Kavi Mahesh; Do not copy or distribute
27
Ontology as a Chart 1/25/2016 27 (c) Dr. Kavi Mahesh; Do not copy or distribute
28
A few snapshots of the “ MECHANISM ” ontology, in the protégé software, are shown: 1/25/2016 28 (c) Dr. Kavi Mahesh; Do not copy or distribute
29
The 12 subgroups: 1/25/2016 29 (c) Dr. Kavi Mahesh; Do not copy or distribute
30
The functions accommodated under every subgroup: 1/25/2016 30 (c) Dr. Kavi Mahesh; Do not copy or distribute
31
Hierarchy of mechanisms: 1/25/2016 31 (c) Dr. Kavi Mahesh; Do not copy or distribute
32
Object & data properties being added to each mechanism: 1/25/2016 32 (c) Dr. Kavi Mahesh; Do not copy or distribute
33
1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 33
34
Subclasses and their Descriptions 1/25/2016 34 (c) Dr. Kavi Mahesh; Do not copy or distribute
35
Object properties 1/25/2016 35 (c) Dr. Kavi Mahesh; Do not copy or distribute
36
Data properties added 1/25/2016 36 (c) Dr. Kavi Mahesh; Do not copy or distribute
37
Linked Open Data Tools Pallavi Karanth ©KAnOE, PES Institute of Technology 1/25/2016 37 (c) Dr. Kavi Mahesh; Do not copy or distribute
38
Data Data ©KAnOE, PES Institute of Technology 1/25/2016 38 (c) Dr. Kavi Mahesh; Do not copy or distribute
39
Web for Data Discovery ©KAnOE, PES Institute of Technology 1/25/2016 39 (c) Dr. Kavi Mahesh; Do not copy or distribute
40
Web for Data Discovery ©KAnOE, PES Institute of Technology 1/25/2016 40 (c) Dr. Kavi Mahesh; Do not copy or distribute
41
Machine Understandable Data ©KAnOE, PES Institute of Technology 1/25/2016 41 (c) Dr. Kavi Mahesh; Do not copy or distribute
42
Machine Understandable Data ©KAnOE, PES Institute of Technology Ram Nickname DOB 19-04-78 Location Bangalore 1/25/2016 42 (c) Dr. Kavi Mahesh; Do not copy or distribute
43
Open Data and Linked Data Open Data - open access Linked Data Semantic Machine Readable ©KAnOE, PES Institute of Technology 1/25/2016 43 (c) Dr. Kavi Mahesh; Do not copy or distribute
44
Linked Open Data Five Star Rating of Linked Data by Tim Berners Lee ★ make your stuff available on the Web (whatever format) under an open license ★★ make it available as structured data (e.g., Excel instead of image scan of a table) ★★★ use non-proprietary formats (e.g., CSV instead of Excel) ★★★★ use URIs to denote things, so that people can point at your stuff ★★★★★ link your data to other data to provide context About 50 billion triples as of 2013 ©KAnOE, PES Institute of Technology 1/25/2016 44 (c) Dr. Kavi Mahesh; Do not copy or distribute
45
LOD - IT (Kappa) For Software Developers Technical Helpdesk LOD-IT Video LOD-IT Video LOD-IT Demo LOD-IT Demo ©KAnOE, PES Institute of Technology 1/25/2016 45 (c) Dr. Kavi Mahesh; Do not copy or distribute
46
LODScape Ontology based Multiple LOD Object Browser DbPedia and Freebase datasets used LODScape Demo LODScape Demo ©KAnOE, PES Institute of Technology 1/25/2016 46 (c) Dr. Kavi Mahesh; Do not copy or distribute
47
Semantic Smart-Aleck Automatic Fact Generator Based on Interestingness Algorithm Uses Dbpedia and Yago datasets SemanticSmartAleck Demo SemanticSmartAleck Demo ©KAnOE, PES Institute of Technology 1/25/2016 47 (c) Dr. Kavi Mahesh; Do not copy or distribute
48
Acknowledgments 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 48
49
Suggestions? Thank you! http://www.kanoe.org http://ontology.org.in ontology@pes.edu 1/25/2016 (c) Dr. Kavi Mahesh; Do not copy or distribute 49
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.