Consuming JSON-LD: Experiments with Primo's Latest Linked Data
- Slides: 23
Consuming JSON-LD: Experiments with Primo's Latest Linked Data. Corey Harper, 2015-09-05, IGeLU 2015 – Developers Day
Curl | jq | less to Beta Search REST API
curl "http://bobcatdev.library.nyu.edu/primo_library/libweb/webservices/rest/v1/pnxs?q=any,contains,monkeys&inst=NYU" | jq -C '.docs' | less -r
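The same request can be sketched in Python, which the toolkit slide lists. This is a minimal sketch, not the speaker's code: it only builds the URL and mimics `jq '.docs'` on a response body. The function names, the `field`/`precision` parameter names, and the sample response are assumptions made for illustration; the base URL and query values come from the slide.

```python
import json
from urllib.parse import urlencode

# Base URL copied from the slide; it is a development host and may not be reachable.
BASE_URL = "http://bobcatdev.library.nyu.edu/primo_library/libweb/webservices/rest/v1/pnxs"


def build_search_url(term, field="any", precision="contains", inst="NYU"):
    """Build a URL with the q=<field>,<precision>,<term> shape shown on the slide."""
    # safe="," keeps the commas literal, as in the curl command.
    query = urlencode({"q": f"{field},{precision},{term}", "inst": inst}, safe=",")
    return f"{BASE_URL}?{query}"


def extract_docs(response_text):
    """Return the top-level 'docs' array, like jq '.docs'."""
    return json.loads(response_text).get("docs", [])


# Hypothetical response body, only to show what '.docs' selects.
sample = '{"docs": [{"title": "Monkeys"}], "info": {"total": 1}}'
url = build_search_url("monkeys")
docs = extract_docs(sample)
```

Fetching `url` (for example with `urllib.request.urlopen`) and passing the body to `extract_docs` would complete the pipeline, assuming the host still answers.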
Ruby RDF & JSON-LD
require 'rdf'
require 'json/ld'
require 'rdf/turtle'
require 'openssl'

# Fetch a single PNX record from the REST API and treat it as JSON-LD
resource = RDF::Resource(RDF::URI.new("http://bobcatdev.library.nyu.edu/primo_library/libweb/webservices/rest/v1/pnxs/L/nyu_aleph001770007?inst=NYU"))
graph = RDF::Graph.new << JSON::LD::API.toRdf(resource)
# {list prefixes} is the slide's placeholder for a hash of namespace prefixes
graph.dump(:ttl, prefixes: {list prefixes})
What just happened?
• This doesn't actually work.
• Not _quite_ valid JSON-LD
• It needs an actual context
• Add the context yourself, and you get errors when validating / linting
  ▫ http://json-ld.org/playground/
  ▫ https://github.com/ExLibrisGroup/PrimoPNX-context/blob/master/PNX-context.json
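"Add the context yourself" amounts to giving the record an `@context` entry before handing it to a JSON-LD processor. A minimal sketch of that step, assuming the record and context are already parsed into Python dicts. `add_context`, `toy_context`, and `toy_record` are hypothetical names and data; a real run would load the contents of PNX-context.json instead of the toy mapping.

```python
import copy


def add_context(record, context):
    """Return a copy of record with an '@context' entry placed first."""
    with_context = {"@context": copy.deepcopy(context)}
    # Copy every other key; any existing '@context' is replaced, not merged.
    with_context.update({k: v for k, v in record.items() if k != "@context"})
    return with_context


# Hypothetical stand-ins: a one-term context and a one-field record.
toy_context = {"title": "http://purl.org/dc/terms/title"}
toy_record = {"title": "An example title"}

linked = add_context(toy_record, toy_context)
```

The result can then be pasted into the JSON-LD playground to see which terms the context maps and which it silently drops.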
Still works in jq (but maybe not JSON Tools?)
Consuming JSON(-LD?): Experiments with (and Without) Primo's New RESTful Analytics Search API
Distribution of Titles
Why do this stuff?
• Understanding your collections
• Understanding queries and usage
• Identifying Strengths
• Topic Modeling
  ▫ Clustering
  ▫ Recommendation systems
  ▫ (Automatic Classification?)
On GitHub (Also, DPLA Examples) https://github.com/chrpr/dplaanalytics/blob/master/primo/RESTapi.ipynb
Tree Map of Title Words
Data Science Venn Diagram http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram
2015-04-18 Harper - Can Metadata be Quantified? - DPLAFest 2015
Analytics Writ Large
• APIs have much potential
• Limited in scope (for now)
• NYU External Data Warehousing
• Analyze Query Logs
• Analyze Dedup Merger
• Analyze URLs within (and outside of) Aleph
Toolkit
• Tableau – Business Intelligence
• R – Stats Analysis
• Python
  • SciPy, NumPy, Pandas, etc.
  • NLTK
• jq, awk, sed, grep, sort, uniq, tr, wc, etc.
Data Quality Control
Duplicate OCLC # Analysis
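One way such a duplicate check could be set up is to group record identifiers by OCLC number and keep the groups with more than one member. This is a sketch under assumptions, not the analysis behind the slide: the `(record_id, oclc_number)` input shape, the function name, and the sample values are all hypothetical.

```python
from collections import defaultdict


def find_duplicate_oclc(records):
    """Map each OCLC number found on more than one record to those record ids."""
    by_oclc = defaultdict(list)
    for record_id, oclc_number in records:
        # Strip stray whitespace so "12345 " and "12345" group together.
        by_oclc[oclc_number.strip()].append(record_id)
    return {num: ids for num, ids in by_oclc.items() if len(ids) > 1}


# Hypothetical sample data.
sample_records = [("rec1", "12345"), ("rec2", "67890"), ("rec3", "12345 ")]
duplicates = find_duplicate_oclc(sample_records)
```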
Collection Management Decisions
Warehouse combines:
• Primo dedupmrg & frbr matches
• Ebook SUSHI/Counter Stats
• Aleph Circ Stats
Offsite & De-accessioning Decisions
Regression analysis to demo correlations
CRISP-DM. "CRISP-DM Process Diagram" by Kenneth Jensen. Own work. Licensed under CC BY-SA 3.0 via Wikimedia Commons. http://commons.wikimedia.org/wiki/File:CRISP-DM_Process_Diagram.png
Possibilities & Next Steps
• Exploratory Data Analysis. More.
• Answering questions about data quality
• More Topic Maps
• Bi- and Tri-gram Tokenization, Hapaxes
• Data Cleanup and QA
• Processing incoming batch data
• Integrate with other data streams:
  • Google Analytics, AppDynamics, Kibana
  • Cross-system logs, searches, etc.
Thanks! corey.harper@nyu.edu 212.998.2479 @chrpr
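The "Bi- and Tri-gram Tokenization, Hapaxes" item can be illustrated with the standard library alone. A minimal sketch, assuming titles are plain strings. The tokenizer regex, the function names, and the sample titles are illustrative assumptions; NLTK, listed on the toolkit slide, offers comparable helpers.

```python
import re
from collections import Counter


def tokenize(title):
    """Lowercase a title and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", title.lower())


def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def hapaxes(counts):
    """Return the items that occur exactly once, sorted."""
    return sorted(item for item, count in counts.items() if count == 1)


# Hypothetical titles, for illustration only.
titles = ["The Monkey Wrench Gang", "Monkey Business", "The Monkey Wrench"]
word_counts = Counter(tok for t in titles for tok in tokenize(t))
bigram_counts = Counter(bg for t in titles for bg in ngrams(tokenize(t), 2))
rare_words = hapaxes(word_counts)
```

Counting n-grams per title, rather than over one concatenated string, avoids creating bigrams that span two unrelated titles.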