Mapping Existing Data Sources into VIVO Pedro Szekely, Craig Knoblock, Maria Muslea and Shubham Gupta University of Southern California/ISI.

Slides:



Advertisements
Similar presentations
Easily retrieve data from the Baan database
Advertisements

Improving Human-Semantic Web Interaction: The Rhizomer Experience Roberto García and Rosa Gil GRIHO - Human Computer Interaction Research Group Universitat.
Database Management Using Microsoft Access Xinhua Chen, Ph.D. Chinese Association of Professionals in Science and Technology March 23, 2003.
Micro Control Solutions Stability System II rev. 6.4
CC SQL Utilities.
SupportCenter Plus Product Overview. Overview 1.What is SupportCenter Plus (SCP) 2.Benefits of SCP 3.Licensing & Pricing 4.Questions.
DATA ANALYTICS. NORMS Cell Phones on Vibrate Respect all opinions.
Connecting Knowledge Silos using Federated Text Mining Guy Singh Senior Manager, Product & Strategic Alliances ©2014 Linguamatics Ltd.
Software to Manage EEP Vegetation Plot Data A design proposal Michael Lee January 31, 2011.
Tutorial 11: Connecting to External Data
Accessing Your Data Primarius Users Group Ways to Access to Your Data Accessing Your Data Reports using the Report Interface Exporting Reports.
NODEM reporter Flexible Information Generating Tool for Asset Managers.
Access Tutorial 8 Sharing, Integrating, and Analyzing Data
Page 1 ISMT E-120 Introduction to Microsoft Access & Relational Databases The Influence of Software and Hardware Technologies on Business Productivity.
Ricerca Distribuita Semantica Protocolli opensource per la condivisione di risorse online.
Information Integration Intelligence with TopBraid Suite SemTech, San Jose, Holger Knublauch
Rajashree Deka Tetherless World Constellation Rensselaer Polytechnic Institute.
>> Building a PPT from the ActiveInterface web pages Chris Harrington Active Interface, Inc.
The Asset Inventory Management module assists with data collection and discovery management processes. Collected information is interpreted and automatically.
Improve the way you create, manage and distribute information INNOVATION INSPIRATION Relational database integration with RDF/OWL.
Exploring Microsoft Access Chapter 4 Relational Databases, External Data, Charts, and the Switchboard.
ONTOLOGY ENGINEERING Lab #1 - August 25, Lab Syllabus 2  Lab 1 – 8/25: Introduction and Overview of Protégé  Lab 2 – 9/8: Building an ontology.
Supported by EU projects 12/12/2013 Athens, Greece Open Data in Agriculture Hands-on with data infrastructures that can power your agricultural data products.
6 th Annual Focus Users’ Conference Manage Integrations Presented by: Mike Morris.
XP New Perspectives on Integrating Microsoft Office 2003 Tutorial 3 1 Integrating Microsoft Office 2003 Tutorial 3 – Integrating Word, Excel, Access, and.
DATA, SITE AND RESOURCE MANAGEMENT SOFTWARE. A Windows application software designed for use with Stylitis data loggers. EMMETRON consolidates resources,
1 Integrating Microsoft Office 2003 Tutorial 3 – Integrating Word, Excel, Access, and PowerPoint.
XP New Perspectives on Integrating Microsoft Office XP Tutorial 3 1 Integrating Microsoft Office XP Tutorial 3 – Integrating Word, Excel, Access, and PowerPoint.
National Levee Database Interactive Reports Instructions NLD Point of Contact 1 US Army Corps of Engineers.
DAY 14: MICROSOFT ACCESS – CHAPTER 1 Madhuri Siddula October 1, 2015.
26 Mar 04 1 Application Software Practical 5/6 MS Access.
Security: Log OnLog On Configuration Tabs: File File View Support Labels Help Easy Icons: MasterMaster Tools Check In/Out Transfer Reserve Reports Download.
DAY 15: ACCESS CHAPTER 1 Rahul Kavi October 6,
MapInfo Professional 11.0: getting started Xiaogang (Marshall) Ma School of Science Rensselaer Polytechnic Institute Friday, January 25, 2013 GIS in the.
1 Computing for Todays Lecture 21 Yumei Huo Spring 2006.
The material contained in this document is proprietary to Triniti Corporation (Triniti). This material may not be disclosed, duplicated or otherwise revealed,
1 EndNote X2 Your Bibliographic Management Tool 29 September 2009 Humanities and Social Sciences Resource Teams.
1 MS Access. 2 Database – collection of related data Relational Database Management System (RDBMS) – software that uses related data stored in different.
IN THE NAME OF GOD. Reference Citing Software.
Irwin/McGraw-Hill Copyright © 2000 The McGraw-Hill Companies. All Rights reserved Whitten Bentley DittmanSYSTEMS ANALYSIS AND DESIGN METHODS5th Edition.
Semantic Web Portal: A Platform for Better Browsing and Visualizing Semantic Data Ying Ding et al. Jin Guang Zheng, Tetherless World Constellation.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
Prizms for Data Publication and Management Katie Chastain May 9, 2014.
3-1 Modeling Basic Entities DBMS Create Sort Search Addition Deletion Modification Create Sort Search Addition Deletion Modification DBMS is a Software.
Semantic Water Quality Portal Jin Guang Zheng and Ping Wang Tetherless World Constellation.
Semantic Web unleashes your data! The Semantic Web will transform the use of content. Semantic Web – is an extension of the current web. Semantic Web.
VIVO architecture March 1, Major Components Vitro is a general-purpose Web-based application leveraging semantic standards VIVO is a customized.
Microsoft Access By Ritesh Sharma. Introduction Microsoft Access is a desktop database program that enables you to enter, store, analyze,and present data.For.
Data Exchange Framework
Lesson 17 Mail Merge. Overview Create a main document. Create a data source. Insert merge fields into a main document. Perform a mail merge. Use data.
GETTING STARTED WITH. EndNote allows you to: Access your references from any computer with internet Collect references from online sources Drop citations.
Mapping for the interwebs
Developer 2000 CSE 4504/6504 Lab.
Connect UNAVCO, a VIVO for a Scientific Community
DTIAtlasFiberAnalyzer Tutorial
Semantic Database Builder
Databases and Information Management
JDXpert Workday Integration
Computers Are Your Future
Access Tutorial 8 Sharing, Integrating, and Analyzing Data
Electronic Field Study Advanced User Training
Databases and Information Management
A Graph-Based Approach to Learn Semantic Descriptions of Data Sources
Manipulating and Sharing Data in a Database
Convert (flatten) IATI XML file to CSV file(s) using XQUERY
SupportCenter Plus Product Overview.
Tutorial 8 Sharing, Integrating, and Analyzing Data
Click2Export Export & Dynamics 365/CRM Reports/Word/Excel Templates in 1 Click
Metadata supported full-text search in a web archive
customer / university logo
Presentation transcript:

Mapping Existing Data Sources into VIVO Pedro Szekely, Craig Knoblock, Maria Muslea and Shubham Gupta University of Southern California/ISI

Outline Problem Current methods for importing data into VIVO Karma approach Demo Conclusions Pedro Szekely

Problem: Data Ingest Data ingest refers to any process of loading existing data into VIVO other than by direct interaction with VIVO's content editing interfaces. Typically this involves downloading or exporting data of interest from an online database or a local system of record. VIVO Data Ingest Guide: Pedro Szekely

Current Methods for Importing Data into VIVO Pedro Szekely

VIVO Provided Ingest Methods Writing SPARQL Queries Convert external data (e.g., CSV) into RDF Map data onto VIVO ontology Construct SPARQL query  VIVO RDF Harvester Data Ingest Option 1: Convert data into predefined CSV format Supports limited set of data fields Option 2: Edit existing XSL scripts for your data = Programming Pedro Szekely

Example Data People Organizations Positions Pedro Szekely

VIVO Data Ingest Guide Step #1: Create a Local Ontology Data Ingest Menu Step#2: Create Workspace Models Step#3: Pull External Data File into RDF Step# 4: Map Tabular Data onto Ontology Step#5: Construct the Ingested Entities Step#6: Load to Webapp Pedro Szekely

VIVO Data Ingest Guide Step #1: Create a Local Ontology Data Ingest Menu Step#2: Create Workspace Models Step#3: Pull External Data File into RDF Step# 4: Map Tabular Data onto Ontology Step#5: Construct the Ingested Entities Step#6: Load to Webapp Pedro Szekely

VIVO Ontology Pedro Szekely

VIVO Data Ingest Guide Step #1: Create a Local Ontology Data Ingest Menu Step#2: Create Workspace Models Step#3: Pull External Data File into RDF Step# 4: Map Tabular Data onto Ontology Step#5: Construct the Ingested Entities Step#6: Load to Webapp Pedro Szekely

Step#5: Construct the Ingested Entities Construct { ?person. ?person ?fullname. ?person ?first. ?person ?middle. ?person ?last. ?person ?title. ?person ?phone. ?person ?fax. ?person ? . ?person ?hrid. } Where { ?person ?fullname. ?person ?first. optional { ?person ?middle. } ?person ?last. ?person ?title. ?person ?phone. ?person ?fax. ?person ? . ?person ?hrid. } Write the following SPARQL query Constructs the people entities Pedro Szekely

SPARQL Ingest Is Difficult Construct { ?person. ?person ?fullname. ?person ?first. ?person ?middle. ?person ?last. ?person ?title. ?person ?phone. ?person ?fax. ?person ? . ?person ?hrid. } Where { ?person ?fullname. ?person ?first. optional { ?person ?middle. } ?person ?last. ?person ?title. ?person ?phone. ?person ?fax. ?person ? . ?person ?hrid. } Construct { ?org. ?org ?deptID. ?org ?name. } Where { ?org ?deptID. ?org ?name. } Construct { ?position. ?position ?year. ?position ?title. ?position ?person. ?person ?position. } Where { ?position ?orgID. ?position ?year. ?position ?title. ?position ?posthrid. ?person ?perhrid. FILTER((?posthrid)=(?perhrid)) } Construct { ?position. ?position ?year. ?position ?title. ?org ?position. ?position ?org. } Where { ?position ?year. ?position ?title. ?position ?postOrgID. ?org ?orgID. FILTER((?postOrgID)=(?orgID)) } Pedro Szekely

Harvester Data Ingest <xsl:if test="not( $this/db-CSV:DEPARTMENTNAME = '' or $this/db-CSV:DEPARTMENTNAME = 'null' )"> <core:organizationForPosition rdf:resource= "{$baseURI}position/positionFor{$personid}from{$this/db-CSV:STARTDATE}"/> Program in XSLT Pedro Szekely

Karma Approach KARMA SourcesRDF Pedro Szekely

Overall Karma Effort 15 KARMA Pedro Szekely

Using Karma to Ingest Data into VIVO KARMA Pedro Szekely

Karma Benefits Programming Interactive Easy Fast Pedro Szekely

Karma Workspace Pedro SzekelyModelWorksheets CommandHistory

Karma Models: Semantic Types Pedro Szekely Semantic Types Capture semantics of the values in each column in terms of classes and properties in the ontology the peopleID of a FacultyMemberthe label of an Organization Karma learns to recognize semantic types each time the user assigns one manually

Karma Models: Relationships Pedro Szekely Relationships Capture the relationships among columns in terms of classes and properties in the ontology the relationship between Position and FacultyMember is positionForPerson Karma automatically computes relationships based on the object properties defined in the ontology

Karma Demo Using Karma to ingest data samples from the “Data Ingest Guide” Pedro Szekely

Conclusions Pedro Szekely

Conclusions Generic data-to-ontology-to-RDF mapping tool Easy to use: interactive, no programming Used Karma to populate USC VIVO instance Open source: you can use it too Pedro Szekely

From Simon Gaeremynck, Sakai Foundation Pedro Szekely

More Information Using Karma to ingest VIVO data Publications and videos Software download (open source) Contacts: Pedro Szekely