http www flickr comphotosrogersmith313323541 MOODy Investigations into Massive

  • Slides: 31
Download presentation
http: //www. flickr. com/photos/rogersmith/313323541/ MOODy : ) Investigations into Massive Open Online Discovery at

http: //www. flickr. com/photos/rogersmith/313323541/ MOODy : ) Investigations into Massive Open Online Discovery at IU Juliet Hardesty (@jlhardes) Courtney Greene Mc. Donald (@xocg) Bryan J Brown (@bryjbrown) Digital Library Brown Bag | December 11, 2013 Tweet it! #dlbb

http: //www. flickr. com/photos/danielito 311/5847295876/ You’ve heard of MOOCs… We’d like to introduce you

http: //www. flickr. com/photos/danielito 311/5847295876/ You’ve heard of MOOCs… We’d like to introduce you to MOODs … Massive Open Online Discovery

Digital Collections Search (beta) : Blacklight discovery interface http: //webapp 1. dlib. indiana. edu/dcs/

Digital Collections Search (beta) : Blacklight discovery interface http: //webapp 1. dlib. indiana. edu/dcs/ IUCAT: Blacklight discovery interface http: //iucat. iu. edu IUB Library Web Site Search: Drupal + Solr Currently in development (wireframe)

One Big Index in the Sky (Solr) http: //www. flickr. com/photos/thukral/1983931186/

One Big Index in the Sky (Solr) http: //www. flickr. com/photos/thukral/1983931186/

Phase One • Enable indexing & combined search of Indiana University catalog, repository, digital

Phase One • Enable indexing & combined search of Indiana University catalog, repository, digital collections, website data – Combine feeds of catalog (IUCAT) & digital collections (DCS) data via MODS – Single index: author, date, format, language, location, & subject – Facets: author, date, format, & subject

IUC AT : M MO ARC DS to IU Collections Data (MODS) S D

IUC AT : M MO ARC DS to IU Collections Data (MODS) S D O M : S DC Website Content

Phase Two • Expose IU dataset for potential combination with other institutional datasets IU

Phase Two • Expose IU dataset for potential combination with other institutional datasets IU Dataset Pat Q Scholar I. M. Hacker

 • Goal: Identify best practices for UX around discovery in the context of

• Goal: Identify best practices for UX around discovery in the context of metadata • Surveyed institutions to identify approaches to Solr indexing and end-user options

View the survey at http: //bit. ly/13 qh. CD 7

View the survey at http: //bit. ly/13 qh. CD 7

Solr Schemas Used Solr Schemas Apache Solr - unmodified 3 Apache Solr - modified

Solr Schemas Used Solr Schemas Apache Solr - unmodified 3 Apache Solr - modified 15 Solr. Marc - unmodified 1 Solr. Marc - modified 8 Other 5 0 2 4 6 8 10 12 14 16 Other; 5; 16% Solr/Solr. Marc - unmodified; 4; 12% Solr/Solr. Marc - modified; 23; 72%

Solr Schema Fields Modified Types of Fields Modified 25 20 22 21 19 18

Solr Schema Fields Modified Types of Fields Modified 25 20 22 21 19 18 15 10 3 5 0 <copy. Field> <dynamic. Field> Other Solr. Marc. properties modified (10 responses) 8 7 7 7 6 5 4 4 3 3 3 2 1 r he Ot ap _m ng ua la en m ru st ge t_ m _m at m _m try un co ap ap 0 ap ap io po sit co m ca lln um n_ er be r_ a_ m m ap x de in 0 0 fo r 1 fig <fields> co n <types>

Reasons for Modifying Fields Connections to original metadata http: //www. flickr. com/photos/arielle_kristina/4095456119/ http: //www.

Reasons for Modifying Fields Connections to original metadata http: //www. flickr. com/photos/arielle_kristina/4095456119/ http: //www. flickr. com/photos/cobalt/4191469239/

nc e ds ai de on sp re or s d) ar bo nt

nc e ds ai de on sp re or s d) ar bo nt es ag s gs in Ev e m ng di or Fin eo id (v /c es ts ip cr us an M m Ga gi rd s ct s ph ra og s s ls ite je ob co re in ov M un d So ot 3 D s/ s na ur Jo ge pa Ph eb W tic le Ar ok Bo What is being indexed? 20 18 16 14 12 10 8 Non full-text / Derivative size / Information only 6 Full-text / Streaming / Original size 4 2 0

Metadata mapped to Solr/Solr. MARC 15 EAD 4 MODS 10 TEI 2 METS 1

Metadata mapped to Solr/Solr. MARC 15 EAD 4 MODS 10 TEI 2 METS 1 FRBR 0 PBCore 1 DC 4 Other 10 0 2 4 6 8 10 12 14 16

Discovery Layer Other; 8; 33% Blacklight; 10; 42% Vu. Find; 6; 25%

Discovery Layer Other; 8; 33% Blacklight; 10; 42% Vu. Find; 6; 25%

Shared File Sets Just what we asked for! http: //www. flickr. com/photos/annettepedrosian/2108145618/

Shared File Sets Just what we asked for! http: //www. flickr. com/photos/annettepedrosian/2108145618/

Proof of Concept Internship Goal: Explore possibilities of combining multiple metadata feeds into one

Proof of Concept Internship Goal: Explore possibilities of combining multiple metadata feeds into one central index Feed 1 Feed 2 ? Index

Questions What are our data sources? ? Index

Questions What are our data sources? ? Index

Questions What is Apache Solr, and how does it work? ? ? IUCAT Fedora

Questions What is Apache Solr, and how does it work? ? ? IUCAT Fedora ? ? Solr Librarian-friendly documentation is on the way! ?

Questions What’s the best way to get the data? IUCAT Fedora ? ? ?

Questions What’s the best way to get the data? IUCAT Fedora ? ? ? Solr

Questions What’s the “native” format? ? IUCAT Z 39. 50 ? Fedora OAI-PMH ?

Questions What’s the “native” format? ? IUCAT Z 39. 50 ? Fedora OAI-PMH ? Solr

Questions What data should we index? MARCXML IUCAT Z 39. 50 MODS Fedora OAI-PMH

Questions What data should we index? MARCXML IUCAT Z 39. 50 MODS Fedora OAI-PMH ? ? Solr

Our custom Solr schema

Our custom Solr schema

Questions How can we transform it? MARCXML IUCAT Z 39. 50 ? schema. x

Questions How can we transform it? MARCXML IUCAT Z 39. 50 ? schema. x ml Solr MODS Fedora OAI-PMH ?

Questions How can we automate the process? MARCXML IUCAT Z 39. 50 XSLT Batch

Questions How can we automate the process? MARCXML IUCAT Z 39. 50 XSLT Batch Ingest MODS Fedora OAI-PMH XSLT schema. x ml Solr

Future Goals ? ? ? XSLT Z 39. 50 XSLT MARCXML IUCAT schema. xml

Future Goals ? ? ? XSLT Z 39. 50 XSLT MARCXML IUCAT schema. xml MODS Fedora OAI-PMH XSLT ? ? Batch Ingest Solr

Future Goals ? ? ? XSLT Z 39. 50 XSLT MARCXML IUCAT schema. xml

Future Goals ? ? ? XSLT Z 39. 50 XSLT MARCXML IUCAT schema. xml MODS Fedora OAI-PMH XSLT ? ? Batch Ingest Solr

Future Goals ? ? ? XSLT Z 39. 50 XSLT MARCXML IUCAT schema. xml

Future Goals ? ? ? XSLT Z 39. 50 XSLT MARCXML IUCAT schema. xml MODS Fedora OAI-PMH XSLT ? ? Batch Ingest Solr

Plans Moving Forward http: //www. flickr. com/photos/usfws_alaska/7376551524/

Plans Moving Forward http: //www. flickr. com/photos/usfws_alaska/7376551524/

Questions? Comments! Kittens >^. . ^< http: //www. flickr. com/photos/notemily/5394289051/

Questions? Comments! Kittens >^. . ^< http: //www. flickr. com/photos/notemily/5394289051/

THANK YOU!!1!11!! Thank you! Julie (jlhardes@indiana. edu) Courtney (crgreene@indiana. edu) Bryan (bryjbrow@Indiana. edu) More

THANK YOU!!1!11!! Thank you! Julie (jlhardes@indiana. edu) Courtney (crgreene@indiana. edu) Bryan (bryjbrow@Indiana. edu) More info (posters, more data, &c) at http: //bit. ly/meta-lita-2013 These slides will shortly be available via IU Scholarworks