Presentation is loading. Please wait.

Presentation is loading. Please wait.

Consuming JSON-LD: Experiments with Primo's Latest Linked Data

Similar presentations


Presentation on theme: "Consuming JSON-LD: Experiments with Primo's Latest Linked Data"— Presentation transcript:

1 Consuming JSON-LD: Experiments with Primo's Latest Linked Data
Corey Harper IGeLU 2015 – Developers Day 1

2 Curl | jq | less to Beta Search REST API
curl " | jq -C '.docs' | less -r

3

4 require 'rdf‘ ‘json/ld’ 'rdf/turtle‘ 'openssl'
Ruby RDF & JSON-LD require 'rdf‘ ‘json/ld’ 'rdf/turtle‘ 'openssl' resource = RDF::Resource(RDF::URI.new(" catdev.library.nyu.edu/primo_library/libwe b/webservices/rest/v1/pnxs/L/nyu_aleph ?inst=NYU")) graph = RDF::Graph.new << JSON::LD::API.toRdf(resource) graph.dump(:ttl, prefixes: {list prefixes})

5 This doesn’t actually work. Not _quite_ valid JSON-LD
What just happened? This doesn’t actually work. Not _quite_ valid JSON-LD It needs an actual context Add the context yourself, and you get errors when validating / linting X-context/blob/master/PNX-context.json

6 Still works in JQ (but maybe not JSON Tools?)

7 Consuming JSON (-LD?): Experiments in with (and Without) Primo's New RESTful Analytics Search API
7

8

9

10 Distribution of Titles

11 Understanding your collections Understanding queries and usage
Why do this stuff? Understanding your collections Understanding queries and usage Identifying Strengths Topic Modeling Clustering Recommendation systems (Automatic Classification?)

12 On Github (Also, DPLA Examples)

13 Tree Map of Title Words

14 Data Science Venn Diagram
Harper - Can Metadata be Quantified? - DPLAFest 2015 14

15 APIs have much potential Limited in scope (for now)
Analytics Writ Large APIs have much potential Limited in scope (for now) NYU External Data Warehousing Analyze Query Logs Analyze Dedup Merger Analyze URLs with (and outside) of Aleph

16 Tableau – Business Intelligence R – Stats Analysis Python
Toolkit Tableau – Business Intelligence R – Stats Analysis Python SciPy, NumPy, Pandas, etc NLTK jq, awk, sed, grep, sort, uniq, tr, wc, etc. Harper - Can Metadata be Quantified? - DPLAFest 2015 16

17 Data Quality Control

18 Duplicate OCLC # Analysis

19

20 Collection Management Decisions
Warehouse combines: Primo dedupmrg & frbr matches Ebook SUSHI/Counter Stats Aleph Circ Stats Offsite & De-accessioning Decisions Regression analysis to demo correlations Harper - Can Metadata be Quantified? - DPLAFest 2015 20

21 CRISP-DM "CRISP-DM Process Diagram" by Kenneth Jensen
Own work. Licensed under CC BY-SA 3.0 via Wikimedia Commons Harper - Can Metadata be Quantified? - DPLAFest 2015 21

22 Possibilities & Next Steps
Exploratory Data Analysis. More. Answering questions about data quality More Topic Maps Bi- and Tri-gram Tokenization, Hapaxes Data Cleanup and QA Processing income batch data Integrate with other data streams: Google Analytics, AppDynamics, Kibana Cross system logs, searches, etc. Harper - Can Metadata be Quantified? - DPLAFest 2015 22

23 corey.harper@nyu.edu 212.998.2479 @chrpr 23 Thanks! 2015-04-18
Harper - Can Metadata be Quantified? - DPLAFest 2015 23 23


Download ppt "Consuming JSON-LD: Experiments with Primo's Latest Linked Data"

Similar presentations


Ads by Google