Consuming JSON-LD: Experiments with Primo's Latest Linked Data

Slides:



Advertisements
Similar presentations
xID Web Services (xISBN, xOCLCnum, xISSN) FRBR grouping of editions and formats Tim McCormick Product Manager, Grid Services Xiaoming.
Advertisements

XID Web Services Roy Tennant OCLC Research OCLC API Mashathon Monday, 8 February 2010.
XID Web services Xiaoming Liu Senior Software Engineer OCLC.
Natural Language Processing for LODLAM Presented at IGeLU 2014 by Corey A Harper A brief intro to machine learning & data science for Libraries.
Metadata Working Group Forum Cornell Metadata Normalization A Case Study in Primo -- and -- Linked Open Data In Libraries.
Business Intelligence Last Update Copyright Kenneth M. Chipps Ph.D
Federated Searching Pre-Conference Workshop - The federated searching cookbook Qin Zhu HP Labs Research Library February 18, 2007.
1 Newspaper Digitisation Workflows Rose Holley- Manager ANDP Presentation to Cultural Heritage Digitisation professionals 26 November 2008.
Hive: A data warehouse on Hadoop Based on Facebook Team’s paperon Facebook Team’s paper 8/18/20151.
Exchange formats and APIs Questions – how and when to access metadata? – lifecycle/status – how to access? can things disappear? – is CSV enough? – is.
Project Report Presentation and Update October 10, 2014 Jeff Mixter - OCLC Research Patrick OBrien - Montana State Univeristy Kenning Arlitsch - Montana.
What difference a good tool? using Endeca for a faceted catalog Emily Lynema NCSU Libraries ACRL Delaware Valley Chapter Fall Program November 3, 2006.
Today’s Agenda 1.Where we’re at 2.Upcoming requests for help 3.Stats deep dive.
Electronic Resources and Libraries March 1, 2011 Austin, Texas Craig Harkema, Charlene Sorensen, Karim Tharani Transcendental Metadata: A Collaborative.
1 LINKED OPEN DATA – an introduction Elisabeth Robinson EXPANIA 2014.
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
Context and Prosopography: Putting the 'Archives' Into LOD-LAM Corey A Harper SAA MDOR
Quality Data: Fresno State's Analytics Strategy Rob Robinson Web Developer for Fresno
Bringing it All Together: NODC’s Geoportal Server as an Integration Tool for Interoperable Data Services Kenneth S. Casey, Ph.D. YuanJie Li NOAA National.
CWIC Developers Meeting January 29 th 2014 Calin Duma Service Level Agreements High-Availability, Reliability and Performance.
Interoperability through Library APIs Library Technology Services Open House 7/30/15.
Data Mining By Dave Maung.
WIRED Week 3 Syllabus Update (next week) Readings Overview - Quick Review of Last Week’s IR Models (if time) - Evaluating IR Systems - Understanding Queries.
Analyzing Data with Advanced Visualizations
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Migration of Physical to Electronic (P2E) Resources in Alma
Ex Libris, LOD and BIBFRAME
Combine_and_stir (Aleph data + RDF + Python + other things) IGeLU 2015 Developer’s Day Budapest, Hungary Laura Akerman.
Linked Library (+AM) Data Presented LITA Next-Generation Catalog IG Corey A Harper Publish, Enrich, Relate and Un-Silo.
Book web site:
4/19/ :02 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Getting started with Power BI
Business Directory REST API
Make Power BI Your Own with the Power BI APIs
Make Power BI Your Own with the Power BI APIs
Judith Nagata, Electronic Resources Librarian
The BARTOC story: from blog to basic to full terminology registry
Data Analytics → Demystified
Steering Group Member, Link Digital
Alma Link Resolver Usage
Michael Mast Senior Architect
Embedding Knowledge in HTML
Twitter & NoSQL Integration with MVC4 Web API
OpenAIRE Services for Funders
Mining Citation Data Using the Web of Science API
9/19/2018 7:06 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Make Power BI Your Own with the Power BI APIs
Gen-Tao Chiang Data and Analytic Engineer
November 8th, 2017 Matthew Davis and John Fink
SharePoint Essentials Toolkit
Data Exploration Of Wikipedia
Primo RESTful APIs Paul McBride Wei Dai.
Make Power BI Your Own with the Power BI APIs
Data Science introduction.
SQL Server Query Plans Journeyman and Beyond
Student name Student ID Degree program Area of specialization
Saravana Kumar CEO/Founder - Kovai Atomic Scope – Product Update.
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
The Search Engine Architecture
Academic & More Group 4 谢知晖 王逸雄 郭嘉宋 程若愚.
Student name Student ID Degree program
Combining ArcGIS for Server & ArcGIS Online
Web archives as a research subject
Gregg Kellogg – Spec Ops
Python for Data Analysis
Application Discovery,
Student name Student ID Degree program Area of specialization
Who is Using your webSite?
Microsoft Azure Data Catalog
Presentation transcript:

Consuming JSON-LD: Experiments with Primo's Latest Linked Data Corey Harper 2015-09-05 IGeLU 2015 – Developers Day 1

Curl | jq | less to Beta Search REST API curl "http://bobcatdev.library.nyu.edu/primo_library/libweb/webservices/rest/v1/pnxs?q=any,contains,monkeys&inst=NYU" | jq -C '.docs' | less -r

require 'rdf‘ ‘json/ld’ 'rdf/turtle‘ 'openssl' Ruby RDF & JSON-LD require 'rdf‘ ‘json/ld’ 'rdf/turtle‘ 'openssl' resource = RDF::Resource(RDF::URI.new("http://bob catdev.library.nyu.edu/primo_library/libwe b/webservices/rest/v1/pnxs/L/nyu_aleph0 01770007?inst=NYU")) graph = RDF::Graph.new << JSON::LD::API.toRdf(resource) graph.dump(:ttl, prefixes: {list prefixes})

This doesn’t actually work. Not _quite_ valid JSON-LD What just happened? This doesn’t actually work. Not _quite_ valid JSON-LD It needs an actual context Add the context yourself, and you get errors when validating / linting http://json-ld.org/playground/ https://github.com/ExLibrisGroup/Primo.PN X-context/blob/master/PNX-context.json

Still works in JQ (but maybe not JSON Tools?)

Consuming JSON (-LD?): Experiments in with (and Without) Primo's New RESTful Analytics Search API 7

Distribution of Titles

Understanding your collections Understanding queries and usage Why do this stuff? Understanding your collections Understanding queries and usage Identifying Strengths Topic Modeling Clustering Recommendation systems (Automatic Classification?)

On Github (Also, DPLA Examples) https://github.com/chrpr/dpla-analytics/blob/master/primo/REST-api.ipynb

Tree Map of Title Words

Data Science Venn Diagram http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram 2015-04-18 Harper - Can Metadata be Quantified? - DPLAFest 2015 14

APIs have much potential Limited in scope (for now) Analytics Writ Large APIs have much potential Limited in scope (for now) NYU External Data Warehousing Analyze Query Logs Analyze Dedup Merger Analyze URLs with (and outside) of Aleph

Tableau – Business Intelligence R – Stats Analysis Python Toolkit Tableau – Business Intelligence R – Stats Analysis Python SciPy, NumPy, Pandas, etc NLTK jq, awk, sed, grep, sort, uniq, tr, wc, etc. 2015-04-18 Harper - Can Metadata be Quantified? - DPLAFest 2015 16

Data Quality Control

Duplicate OCLC # Analysis

Collection Management Decisions Warehouse combines: Primo dedupmrg & frbr matches Ebook SUSHI/Counter Stats Aleph Circ Stats Offsite & De-accessioning Decisions Regression analysis to demo correlations 2015-04-18 Harper - Can Metadata be Quantified? - DPLAFest 2015 20

CRISP-DM "CRISP-DM Process Diagram" by Kenneth Jensen Own work. Licensed under CC BY-SA 3.0 via Wikimedia Commons http://commons.wikimedia.org/wiki/File:CRISP-DM_Process_Diagram.png 2015-04-18 Harper - Can Metadata be Quantified? - DPLAFest 2015 21

Possibilities & Next Steps Exploratory Data Analysis. More. Answering questions about data quality More Topic Maps Bi- and Tri-gram Tokenization, Hapaxes Data Cleanup and QA Processing income batch data Integrate with other data streams: Google Analytics, AppDynamics, Kibana Cross system logs, searches, etc. 2015-04-18 Harper - Can Metadata be Quantified? - DPLAFest 2015 22

corey.harper@nyu.edu 212.998.2479 @chrpr 23 Thanks! 2015-04-18 Harper - Can Metadata be Quantified? - DPLAFest 2015 23 23