Endeca @ NCSU Libraries
Kristin Antelman
NCSU Libraries
June 24, 2006

Overview
• The problem
• Quick demo
• Technical overview
• Implementation process
• Use data
• Assessment data
• Next steps

Why did we do this?
• Existing catalogs are hard to use:
  - known item searching works pretty well, but …
  - users often do keyword searching on topics and get large result sets returned in system sort order
  - catalogs are unforgiving on spelling errors, stemming
[Slide callout: NO RELEVANCY!]

Catalog value is buried
• Subject headings are not leveraged in searching
  - they should be browsed or linked from, not searched
• Data from the item record is not leveraged
  - should be able to filter by item type, location, circulation status, popularity

What does the Endeca software do?
• Provides search software for ecommerce companies
• Faceted browse of structured metadata; goal is to expose the ontology
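To make "faceted browse of structured metadata" concrete, here is a minimal Python sketch of the general idea: count the values of one dimension across a result set, then narrow the set by selecting a value. It is not Endeca's API. The records and the `refine` and `facet_counts` helpers are hypothetical, and only the dimension names are borrowed from the slides.

```python
from collections import Counter

# Hypothetical toy records; only the dimension names come from the slides.
RECORDS = [
    {"title": "Marine Biology", "Format": "Book", "Language": "English",
     "Subject: Topic": ["Marine biology"]},
    {"title": "Ocean Life", "Format": "DVD", "Language": "English",
     "Subject: Topic": ["Marine biology", "Oceanography"]},
    {"title": "Biologie marine", "Format": "Book", "Language": "French",
     "Subject: Topic": ["Marine biology"]},
]


def _values(record, dimension):
    """Return a record's values for one dimension as a list (possibly empty)."""
    field = record.get(dimension)
    if field is None:
        return []
    return field if isinstance(field, list) else [field]


def refine(records, selections):
    """Keep only records that match every selected (dimension, value) pair."""
    return [r for r in records
            if all(value in _values(r, dim) for dim, value in selections.items())]


def facet_counts(records, dimension):
    """Count how many records carry each value of the given dimension."""
    counts = Counter()
    for record in records:
        counts.update(_values(record, dimension))
    return counts


# Narrow to books, then show which languages remain available as refinements.
books = refine(RECORDS, {"Format": "Book"})
print(facet_counts(books, "Language"))
```

Each refinement shrinks the result set, and the counts are recomputed over what remains, so the user only ever sees values that lead to at least one record.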

Endeca technical overview
[Architecture diagram. Labels: Raw MARC data; NCSU exports and reformats; Flat text files; Endeca Information Access Platform (Data Foundry: parse text files; Indices; MDEX Engine); HTTP; NCSU Web Application; Client browser]

Integrating Endeca - Enhancements
• MarcAdapter plugin for raw MARC data
  - Eliminate need for external MARC 21 translation and file merging
• Partial Updates
  - Update circulation data multiple times throughout the day
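The partial-update idea can be illustrated with a small Python sketch: merge a few changed fields (here, availability) into records that are already indexed, rather than re-exporting and re-indexing the whole catalog. This shows only the concept, not Endeca's update mechanism. The in-memory `index`, the record IDs, and the `apply_partial_update` helper are all hypothetical.

```python
def apply_partial_update(index, updates):
    """Merge changed fields into already-indexed records.

    index:   dict mapping record ID -> dict of fields (hypothetical in-memory index)
    updates: dict mapping record ID -> dict of only the fields that changed
    Returns the IDs that were not found, so they can wait for the next full rebuild.
    """
    missing = []
    for record_id, changed_fields in updates.items():
        if record_id in index:
            index[record_id].update(changed_fields)  # untouched fields are kept
        else:
            missing.append(record_id)
    return missing


# Hypothetical index built by a full (for example, nightly) export.
index = {
    "rec1": {"title": "Marine Biology", "Availability": "Available"},
    "rec2": {"title": "Ocean Life", "Availability": "Available"},
}

# Hypothetical circulation feed received during the day.
not_found = apply_partial_update(index, {
    "rec1": {"Availability": "Checked out"},
    "rec9": {"Availability": "Available"},
})
print(index["rec1"], not_found)
```

Only the circulation fields travel in the update, which is what makes running it several times a day cheap compared with a full rebuild.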

Implementation process
• Timeline
  - License / negotiation: Spring 2005
  - Acquire: Summer 2005
  - Implementation: August 2005 to January 12, 2006
• 7 representative team members
• Java-trained librarian (30-40 hrs/wk for 14 weeks)
• It doesn’t have to be perfect!
  - functional requirements, metadata, interface issues (total of 40-60 hours)
  - project manager: approximately 10 hours per week for 20 weeks

Key decision points
• Search interface

Main search page: Endeca and Web2

Advanced search

A few major issues
• Search interface
• Selecting dimensions and their order

Dimensions
1. Subject: Topic
2. Subject: Genre
3. Format
4. Library
5. Subject: Region
6. Subject: Era
7. Language
8. Author
9. Availability
10. Library of Congress Classification

A few major issues
• Search interface
• Selecting dimensions and their order
• Defining the relevance algorithm

Relevance defined
• Relevance ranking in Endeca
  - select from a variety of modules and order them based on importance
• At NCSU…
  1. Original query term(s) (no thesaurus, stemming, spell correction)
  2. Exact phrase match
  3. Field ranking (Title higher than Author higher than Table of Contents, etc.)
  4. Number of fields that contain term(s)
  …
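One way to picture "order modules by importance" is a lexicographic sort key, where each module contributes one component and later components only break ties left by earlier ones. The Python sketch below covers just the four modules listed on the slide, using naive whitespace tokenization. It is an illustration under those assumptions, not Endeca's implementation. The field names, the sample records, and the `relevance_key` and `rank` helpers are hypothetical.

```python
# Field ranking from the slide: Title higher than Author higher than Table of Contents.
FIELD_PRIORITY = ["title", "author", "toc"]


def relevance_key(record, query):
    """Build a sort key; smaller tuples sort first (more relevant)."""
    terms = query.lower().split()
    phrase = query.lower()
    texts = {f: record.get(f, "").lower() for f in FIELD_PRIORITY}
    words = {f: set(text.split()) for f, text in texts.items()}
    all_words = set().union(*words.values())

    # 1. Every original query term appears verbatim (no stemming or correction).
    original_match = all(t in all_words for t in terms)
    # 2. The query appears as an exact phrase in some field.
    phrase_match = any(phrase in text for text in texts.values())
    # 3. Best-ranked field containing any term (lower index = better field).
    matching = [i for i, f in enumerate(FIELD_PRIORITY)
                if any(t in words[f] for t in terms)]
    best_field = min(matching) if matching else len(FIELD_PRIORITY)
    # 4. Number of fields containing a term (more is better, hence negated).
    return (not original_match, not phrase_match, best_field, -len(matching))


def rank(records, query):
    """Return records ordered from most to least relevant."""
    return sorted(records, key=lambda r: relevance_key(r, query))


# Hypothetical records for illustration only.
RANK_SAMPLE = [
    {"id": "a", "title": "Marine biology", "author": "Smith", "toc": ""},
    {"id": "b", "title": "Biology of marine mammals", "author": "Jones",
     "toc": "marine biology basics"},
    {"id": "c", "title": "Oceans", "author": "Lee", "toc": "marine life; biology"},
]
print([r["id"] for r in rank(RANK_SAMPLE, "marine biology")])
```

Because tuples compare element by element, reordering the components of the key is all it takes to change which module dominates.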

Use data

Some search statistics (March - May 2006)

Sorting statistics (March – May 2006)

Some navigation statistics (March - May 2006)

Assessment

Some user reaction
“The new Endeca system is incredible. It would be difficult to exaggerate how much better it is than our old online card catalog (and therefore that of most other universities). I've found myself searching the catalog just for fun, whereas before it was a chore to find what I needed.”
- NCSU Undergrad, Statistics

“The new library catalog search features are a big improvement over the old system. Not only is the search extremely fast, but seemingly it's much more intelligent as well.”
- NCSU faculty, Psychology

Topical searching tasks

Average topical task duration

Testing relevance
• Are search results in Endeca more likely to be relevant to a user’s query than search results in Web2 OPAC?
• 100 topical user searches from 1 month in fall 2005
• How many of top 5 results relevant?
  - 40% relevant in Web2 OPAC
  - 68% relevant in Endeca catalog
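A figure like "how many of the top 5 results are relevant" is commonly computed as precision at 5, averaged over all searches. The slide does not say exactly how the study aggregated its judgments, so the Python sketch below is one plausible reading. The sample judgments are hypothetical and are not the study's data. Only the 40% and 68% figures come from the slide.

```python
def precision_at_k(judgments, k=5):
    """Fraction of the top-k results judged relevant for one search.

    judgments: list of booleans in rank order (True = judged relevant).
    Searches returning fewer than k results count the gaps as not relevant.
    """
    return sum(judgments[:k]) / k


def mean_precision_at_k(all_judgments, k=5):
    """Average precision-at-k over a set of searches."""
    return sum(precision_at_k(j, k) for j in all_judgments) / len(all_judgments)


# Hypothetical judgments for three searches, for illustration only.
SAMPLE = [
    [True, True, False, True, False],
    [False, True, False, False, False],
    [True, True, True, True, True],
]
print(mean_precision_at_k(SAMPLE))

# Ratio of the two reported figures (68% in Endeca against 40% in Web2).
relative_gain = 0.68 / 0.40
print(relative_gain)
```

On the reported numbers, the share of relevant top-5 results in the Endeca catalog is about 1.7 times that of the Web2 OPAC.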

Future plans
• FRBR-ized displays
• FAST (Faceted Access to Subject Terms) instead of LCSH
• Enrich records with supplemental content
• More integration with website search
• Use Endeca to index local collections

Thank you
Project page: www.lib.ncsu.edu/endeca
