Download presentation
Presentation is loading. Please wait.
Published byAlexandra Foster Modified over 6 years ago
1
Consuming JSON-LD: Experiments with Primo's Latest Linked Data
Corey Harper IGeLU 2015 – Developers Day 1
2
Curl | jq | less to Beta Search REST API
curl " | jq -C '.docs' | less -r
4
require 'rdf‘ ‘json/ld’ 'rdf/turtle‘ 'openssl'
Ruby RDF & JSON-LD require 'rdf‘ ‘json/ld’ 'rdf/turtle‘ 'openssl' resource = RDF::Resource(RDF::URI.new(" catdev.library.nyu.edu/primo_library/libwe b/webservices/rest/v1/pnxs/L/nyu_aleph ?inst=NYU")) graph = RDF::Graph.new << JSON::LD::API.toRdf(resource) graph.dump(:ttl, prefixes: {list prefixes})
5
This doesn’t actually work. Not _quite_ valid JSON-LD
What just happened? This doesn’t actually work. Not _quite_ valid JSON-LD It needs an actual context Add the context yourself, and you get errors when validating / linting X-context/blob/master/PNX-context.json
6
Still works in JQ (but maybe not JSON Tools?)
7
Consuming JSON (-LD?): Experiments in with (and Without) Primo's New RESTful Analytics Search API
7
10
Distribution of Titles
11
Understanding your collections Understanding queries and usage
Why do this stuff? Understanding your collections Understanding queries and usage Identifying Strengths Topic Modeling Clustering Recommendation systems (Automatic Classification?)
12
On Github (Also, DPLA Examples)
13
Tree Map of Title Words
14
Data Science Venn Diagram
Harper - Can Metadata be Quantified? - DPLAFest 2015 14
15
APIs have much potential Limited in scope (for now)
Analytics Writ Large APIs have much potential Limited in scope (for now) NYU External Data Warehousing Analyze Query Logs Analyze Dedup Merger Analyze URLs with (and outside) of Aleph
16
Tableau – Business Intelligence R – Stats Analysis Python
Toolkit Tableau – Business Intelligence R – Stats Analysis Python SciPy, NumPy, Pandas, etc NLTK jq, awk, sed, grep, sort, uniq, tr, wc, etc. Harper - Can Metadata be Quantified? - DPLAFest 2015 16
17
Data Quality Control
18
Duplicate OCLC # Analysis
20
Collection Management Decisions
Warehouse combines: Primo dedupmrg & frbr matches Ebook SUSHI/Counter Stats Aleph Circ Stats Offsite & De-accessioning Decisions Regression analysis to demo correlations Harper - Can Metadata be Quantified? - DPLAFest 2015 20
21
CRISP-DM "CRISP-DM Process Diagram" by Kenneth Jensen
Own work. Licensed under CC BY-SA 3.0 via Wikimedia Commons Harper - Can Metadata be Quantified? - DPLAFest 2015 21
22
Possibilities & Next Steps
Exploratory Data Analysis. More. Answering questions about data quality More Topic Maps Bi- and Tri-gram Tokenization, Hapaxes Data Cleanup and QA Processing income batch data Integrate with other data streams: Google Analytics, AppDynamics, Kibana Cross system logs, searches, etc. Harper - Can Metadata be Quantified? - DPLAFest 2015 22
23
corey.harper@nyu.edu 212.998.2479 @chrpr 23 Thanks! 2015-04-18
Harper - Can Metadata be Quantified? - DPLAFest 2015 23 23
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.