Getting More from Your VIVO Mike Conlon, UF Melissa Haendel, OHSU Kristi Holmes, Northwestern.

Slides:



Advertisements
Similar presentations
Getting Your Web Site Found. Meta Tags Description Tag This allows you to influence the description of your page with the web crawlers.
Advertisements

AESuniversity Ad hoc Reporting. Ad hoc Reports What are ad hoc reports? Why would you use ad hoc reports? Creating an ad hoc report from a query Building.
What is a Database By: Cristian Dubon.
A complete citation, notecard, and outlining tool
Jeopardy Objects Navigation Buttons True/False Parts of a Report Vocabulary Q $100 Q $200 Q $300 Q $400 Q $500 Q $100 Q $200 Q $300 Q $400 Q $500 Final.
RDF Tutorial.
How to study for A&P Adapted from “get ready for A&P” By Lori K. Garrett.
How To Create A Web Page By: Mikeon Briddy & Darrius Jacobs April 10,2006 Mr. Barnett.
Preparing for Employment
Google Chrome & Search C Chapter 18. Objectives 1.Use Google Chrome to navigate the Word Wide Web. 2.Manage bookmarks for web pages. 3.Perform basic keyword.
Wincite Introduces Knowledge Notebooks A new approach to collecting, organizing and distributing internal and external information sources and analysis.
Word Lesson 8 Increasing Efficiency Using Word
Data Analysis in Excel. Importance of Data Analysis Tracking and analyzing data are increasingly important in business, medicine, sports, politics, and.
CSCI 572 Project Presentation Mohsen Taheriyan Semantic Search on FOAF profiles.
Learn how to search for information the smart way Choose your own adventure!
Experimental Psychology PSY 433
Chapter 12 Information Systems. 2 Chapter Goals Define the role of general information systems Explain how spreadsheets are organized Create spreadsheets.
Pet Fish and High Cholesterol in the WHI OS: An Analysis Example Joe Larson 5 / 6 / 09.
Tech Tools For Collaboration In Professional Learning Communities.
Academic Computing Services 2010 Microsoft ® Office Visio ® 2007 Training Get to know Visio.
Working with SharePoint Document Libraries. What are document libraries? Document libraries are collections of files that you can share with team members.
Basic HTML The Magic Of Web Pages. Create an HTML folder  Make a folder in your H drive and name it “HTML”. We will save EVERYTHING for this unit here.
Website Designing Using Ms FrontPage FrontPage 2003 Create a Web site with FrontPage.
Lecture 3 – Data Storage with XML+AJAX and MySQL+socket.io
“Giving Credit Where Credit is Due!”. What is EasyBib? Using someone else’s work without giving that person credit for their work Plagiarism Online tool.
Moodle (Course Management Systems). Assignments 1 Assignments are a refreshingly simple method for collecting student work. They are a simple and flexible.
FACEBOOK IS THE BEST THING THAT EVER HAPPENED TO FRIENDSHIP WHY I LIKE FACEBOOK! By Mike Matthews.
The Semantic Web Web Science Systems Development Spring 2015.
Mail merge I: Use mail merge for mass mailings Perform a complete mail merge Now you’ll walk through the process of performing a mail merge by using the.
ONTOLOGY ENGINEERING Lab #1 - August 25, Lab Syllabus 2  Lab 1 – 8/25: Introduction and Overview of Protégé  Lab 2 – 9/8: Building an ontology.
15.1 Fundamentals of HTML.
How to Set-up Your Local Listing. Welcome This tutorial will take you through the steps to set-up (or edit) your Local listing to ensure you get the most.
Colleague, Excel & Word Best of Friends Presented by: Joan Kaun & Yvonne Nelson College of the Rockies.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Key Applications Module Lesson 21 — Access Essentials
Collecting Things Together - Lists 1. We’ve seen that Python can store things in memory and retrieve, using names. Sometime we want to store a bunch of.
Downloading and Installing Autodesk Revit 2016
Chapter 17 Creating a Database.
Database Systems Microsoft Access Practical #3 Queries Nos 215.
ISU Basic SAS commands Laboratory No. 1 Computer Techniques for Biological Research Animal Science 500 Ken Stalder, Professor Department of Animal Science.
SESSION 3.1 This section covers using the query window in design view to create a query and sorting & filtering data while in a datasheet view. Microsoft.
National Levee Database Interactive Reports Instructions NLD Point of Contact 1 US Army Corps of Engineers.
Microsoft ® Office Excel 2003 Training Using XML in Excel SynAppSys Educational Services presents:
1 Data Manipulation (with SQL) HRP223 – 2010 October 13, 2010 Copyright © Leland Stanford Junior University. All rights reserved. Warning: This.
O FFICE M ANAGEMENT T OOL - II B BA -V I TH. Abdus Salam2 Week-7 Introduction to Query Introduction to Query Querying from Multiple Tables Querying from.
11/25/2015Slide 1 Scripts are short programs that repeat sequences of SPSS commands. SPSS includes a computer language called Sax Basic for the creation.
Files Tutor: You will need ….
An Introduction to Forms. The Major Steps of a MicroSoft Access Database  Tables  Queries  Forms  Macros  Reports  Modules On our road map, we are.
VIVO is funded by member institutions. Visit our web site to find out more about becoming a member. vivoweb.org “ VIVO Roadmap Jim Blake, Mike Conlon,
ONLINE SERVICES FOR ACCESSING PHC MICRODATA POPULATION AND HOUSEHOLD CENSUS QUERY SYSTEM APPLICATION (1992, 2002 and 2011 CENSUSES)
CPSC 203 Introduction to Computers T97 By Jie (Jeff) Gao.
Welcome to NU 304 Instructor: Kelley Edds, MSN, APN-BC.
CC L A W EB DE D ATOS P RIMAVERA 2015 Lecture 7: SPARQL (1.0) Aidan Hogan
+ Publishing Your First Post USING WORDPRESS. + A CMS (content management system) is an application that allows you to publish, edit, modify, organize,
Access Queries and Forms. Adding a New Field  To insert a field after you have saved your table, open Access, and open the table  It is easier to add.
How to Set-up Your Business Alliance Listing. Welcome  This tutorial will take you through the steps to set-up (or edit) your listing to ensure you get.
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
Jacynthe Touchette, MSI JGH Health Sciences Library
Online PD Basic HTML The Magic Of Web Pages
AP CSP: Cleaning Data & Creating Summary Tables
CC La Web de Datos Primavera 2017 Lecture 7: SPARQL [i]
Experimental Psychology
Writing for Academic Journals
How to Run a DataOnDemand Report
CC La Web de Datos Primavera 2016 Lecture 7: SPARQL (1.0)
Louisiana: Our History.
Spreadsheets, Modelling & Databases
Introduction to Access
In order to execute a search you can…
Advanced Searching Tips
Presentation transcript:

Getting More from Your VIVO Mike Conlon, UF Melissa Haendel, OHSU Kristi Holmes, Northwestern

VIVO Data VIVO represents data in “triples” – subject, predicate, object The ontology is a semantic model that describes the world of scholarship For example (not actual VIVO-ISF) – mike isA faculty-member – mike wrote paper – paper hasTitle “The Long and Winding Road” – paper publishedIn “Journal of Irreproducible Results”

All Things have URI All things in VIVO have a Uniform Resource Locator (URI) In VIVO, URI usually look like Mike Conlon’s URI at UF VIVO is: –

A real triple Mike is a faculty member We have seen that “Mike” has the URI – – We will put this in brackets to tell the computer “the stuff in brackets is a URI” So perhaps: is a faculty member < So far so good. We have specified the subject.

And now for the predicate We want to say “is a,” as in Mike is a faculty member On the semantic Web, we say “Mike hasType Faculty Member” And the way we say “has type” is to use the rdf ontology. The rdf ontology is fundamental to the semantic web. VIVO uses a few rdf predicates. The common is rdf:type.

More than one ontology rdf:type – Ontology is rdf – Predicate is type So we say < rdf:type Faculty Member You might expect that “Faculty Member” here is english, not a precise statement in an ontology

On to the predicate VIVO-ISF defines a class FacultyMember We write vivo:FacultyMember To specify that class So to say, “Mike is a Faculty Member” we write < rdf:type vivo:FacultyMember.

Two Addenda 1.“rdfs:type” can be abbreviated “a” So we write: < a vivo:FacultyMember 2.When we write triples, we always end them with a period: < a vivo:FacultyMember.

Voila! Everything that follows has to do with specifying the triples you want, and what you want to do with them

We want to use VIVO Data “Use” – Make lists – Count things – Make reports – Get data out of VIVO for use in Excel Or statistics software (R, SAS, SPSS, Stata, …) Or reporting software (Crystal Reports,...) Or visualization software (R, …)

We will write SPARQL Queries SPARQL is a “query language” for asking for data from a set of triples SPARQL is fun and easy. But like any computer language, it is precise, in other words, you will get what you ask for.

We will use University of Florida Data Why? – Because we can – Because alternatives are lacking (mid to large scale real-world data on scholarship, accessible for training purposes)

About UF

Get Signed In Each table will share an account and sign in to an account at UF VIVO Navigate to “Site Admin” “SPARQL Query”

Prefixes for the names of ontologies Comments describe your query An overly complex sample query Choices regarding output formats Beyond what we will cover today DON’T PUSH THIS BUTTON!

Our First Query SELECT ?x WHERE { ?x a vivo:FacultyMember. }

Running This Query Delete the query in the window. Keep the prefixes Type the query into the window Look at it carefully Select an output format (RS-TEXT is fine for now) Push Submit

Query #2 SELECT ?x ?label WHERE { ?x a vivo:FacultyMember. ?x rdfs:label ?label. }

What happened? The good: – We got faculty members and their names – We got the columns in the order on the SELECT statement Not so good: – Some people have more than one name – The names are not in order

Query #3 SELECT ?x ?label WHERE { ?x a vivo:FacultyMember. ?x rdfs:label ?label. } GROUP BY ?x

What happened? We did not get labels! When we use a GROUP BY, we need to say what we want to have happen to the potentially multiple values of label that might be in the data It’s going to look complex, but you’ll get used to it

Query #4 SELECT ?x (MIN(?label) AS ?label) WHERE { ?x a vivo:FacultyMember. ?x rdfs:label ?label. } GROUP BY ?x ORDER BY ?label

Query #5 SELECT ?x (MIN(?label) AS ?label) WHERE { ?x a vivo:FacultyMember. ?x rdfs:label ?label. ?x a ufVivo:UFCurrentEntity. } GROUP BY ?x ORDER BY ?label

Let’s get a spreadsheet Select CSV Rerun the query A CSV file downloads Open it in spreadsheet software (Excel or other)

We’ve got data! Time to rejoice! Everything is a triple We can write SPARQL queries to select triples and get data

A second look at Query #5 SELECT ?x (MIN(?label) AS ?label) WHERE { ?x a vivo:FacultyMember. ?x rdfs:label ?label. ?x a ufVivo:UFCurrentEntity. } GROUP BY ?x ORDER BY ?label

Variables in Query #5 SELECT ?x (MIN(?label) AS ?label) WHERE { ?x a vivo:FacultyMember. ?x rdfs:label ?label. ?x a ufVivo:UFCurrentEntity. } GROUP BY ?x ORDER BY ?label

Ontology prefixes Query #5 SELECT ?x (MIN(?label) AS ?label) WHERE { ?x a vivo:FacultyMember. ?x rdfs:label ?label. ?x a ufVivo:UFCurrentEntity. } GROUP BY ?x ORDER BY ?label

Predicates in Query #5 SELECT ?x (MIN(?label) AS ?label) WHERE { ?x a vivo:FacultyMember. ?x rdfs:label ?label. ?x a ufVivo:UFCurrentEntity. } GROUP BY ?x ORDER BY ?label

Object Classes in Query #5 SELECT ?x (MIN(?label) AS ?label) WHERE { ?x a vivo:FacultyMember. ?x rdfs:label ?label. ?x a ufVivo:UFCurrentEntity. } GROUP BY ?x ORDER BY ?label

Statement order in Query #5 SELECT ?x (MIN(?label) AS ?label) WHERE { ?x a vivo:FacultyMember. ?x rdfs:label ?label. ?x a ufVivo:UFCurrentEntity. } GROUP BY ?x ORDER BY ?label X must be a FacultyMember AND ?x must have at least one label AND ?x must have at least one label AND X must be a UFCurrentEntity AND X must be a UFCurrentEntity

How to count To count things in SPARQL, use the COUNT aggregation Let’s count the faculty

Query #6 # Count the UF faculty SELECT (COUNT(DISTINCT ?x) AS ?nfaculty) WHERE { ?x a vivo:FacultyMember. ?x rdfs:label ?label. ?x a ufVivo:UFCurrentEntity. }

Inspecting Query #6 # Count the UF faculty SELECT (COUNT(DISTINCT ?x) AS ?nfaculty) WHERE { ?x a vivo:FacultyMember. ?x rdfs:label ?label. ?x a ufVivo:UFCurrentEntity. } Comments begin with # Comments are a good idea Comments begin with # Comments are a good idea COUNT the URI Use DISTINCT to be sure you are counting values of “?x”, not rows in a query result Same triple selected as before No GROUP BY or ORDER BY

Break Time!

Where were we? Triples: Subject Predicate Object – URI, Ontology prefixes, predicates, class names SPARQL Queries – SELECT, WHERE, GROUP BY, ORDER BY – MIN, COUNT DISTINCT – Triple patterns with variables ending with periods – Comments Thinking about data – Unique URI – Multiple labels

How do we know what’s in the ontologies? Several methods: 1.Look at pictures 2.Read the files 3.Search the web 4.Ask friends 5.Reverse engineering This is kind of important. We need to pin this up somewhere

Relationship diagrams can be found in the VIVO wiki

The diagram tells us: What articles are called What journals are called and how articles and journals are connected What dates are called and how dates and articles are collected How articles are connected to people through Authorships How contact information is arranged And much more It’s a treasure map! Much of what we need to know to work with publications is on display Similar relationship diagrams for grants, positions, education background, advising, membership, courses

Query #7: Inspect a pub # Get all the triples for a specified subject URI SELECT ?p ?o WHERE { ?p ?o. }

The SPARQL output tell us: The article has nine type assertions UF adds data regarding date of harvest and harvester name, grants cited Also has a datetime, a doi, publication venue, start and end pages, pmid, pmcid, nihmsid, title, volume Has 2 subject area assertions Has 5 “relatedBy” assertions (why?) Has obo:ARG_ (google it) (why?)

Now we area ready for recipes We can inspect entities using “reverse engineering” to inspect the triples for any subject We can review the diagrams to understand the relationships between things

28 SPARQL queries tested with VIVO 1.6 on UF VIVO Each is marked with SPARQL level of difficulty Reports, Data Management, People, Papers, Grants, Organizations Real-world examples. Most are the result of questions asked by stakeholders

Alpha List of People who have Opted In to VIVO Easy UF has an extension to tag people who have opted in to VIVO. These people are always excluded from various removal processes Query looks like the list of Faculty Members

List the Triples with a specified subject URI We’ve done this one already Incredibly useful for discovering the ontology What type of entity is this example inspecting?

Count papers by Concept Takes a minute to run Processes 54,000 academic articles

Find all the papers with “guideline” in the title New things – We are using InformationResource rather than AcademicArticle. Guideline might be in a report or other type of information resource – We are returning the date of publication Publication has a dateTimeValue dateTimeValue has a dateTime – We are using a FILTER command to select resources that have “guideline” in the title – We are using the DESC modifier on ORDER BY to get the results in descending order by date

Find 2014 papers in top journals New features – Date selection – Article has journal, journal has name – FILTER on issn list

List papers in college by number of collaborating departments New features – Paper relatedBy authorship – authorShip relates person – UNION – acts as an “or” Person has home department in college of medicine “OR” Person is in a unit that is a part of the college of medicine Unit has name

What’s next Continue to explore recipes Can highlight new features Questions

Thank you!