Lab federated searching searching distributed data searching harvested
Lab : federated searching: searching distributed data & searching harvested data CS 502 Computing Methods for Digital Libraries Cornell University – Computer Science Herbert Van de Sompel herbertv@cs. cornell. edu Access to PC orpcuser orpcpw 1 herbert van de sompel
federated services e-print FTXT A&I OPAC image 2 herbert van de sompel
federated searching • Distributed search approach ~ Z 39. 50, SDLIP, . . . • today: Meta. Lib • commercial product by Ex Libris • searches repositories using “whichever” technique • normalizes results before presenting them to the user • can merge results after initial presentation • Harvesting approach ~ OAI • today: ARC, the first OAI service provider 3 herbert van de sompel
Meta. Lib Goal: Unique, consistent interface across library resources (think Google for library collection) Broadcast searching over a large collection of heterogeneous resources: • Different metadata syntax (MARC, EAD, Dublin Core, TEI, unspecified, . . . ) • Different protocols (Z 39. 50, HTTP, native ALEPH, screen scraping) • Linking via integrated SFX server • User personalization • Administration of resources 4 herbert van de sompel
Application Level: Technology Level: Information Gateway Universal Gateway User Admin Resources’ administering Accurate, target -sensitive Customized, personalized services searching 5 Context-sensitive linking herbert van de sompel
Information Gateway Universal Gateway User Admin Resources’ administering Accurate, target -sensitive Customized, personalized services searching 6 Context-sensitive linking herbert van de sompel
The Information Gateway: The Knowledge. Base includes all the library Resources n Cataloging Information (per collection): Collections Name of collection, owner, subject, services, language n Configuration Information (per Meta. Lib resource): Interfacing protocol, internal format, rules of conversion Resources 7 herbert van de sompel
Knowledge. Base Example of cataloged collection USMARC 8 245 Title: Queen Elizabeth II Library 270 Location: Memorial University of Newfoundland | St. John’s, Newfoundland | Canada | AC 15 S 7 307 Access Times: Monday-Thursday 8: 30 -20: 45 Closed Saturdays 520 Description: Main library covers humanities, science, computers, physical ed, social sciences, and engineering 546 Language: English 531 Access: Open to the public 901 Administrator: Iscott@mun. ca 650 Subject: Computer Science 650 Subject: Pure Science 650 Subject: Humanities herbert van de sompel
9 Information Gateway Universal Gateway User Admin Database of catalogs and databases Accurate, target -sensitive Customized, personalized services searching Context-sensitive linking herbert van de sompel
Search Processed to Conform to Information Resources Universal Gateway ALEPH OTHER Z 39. 50 Diverse 10 Information HTTP Resources herbert van de sompel
Search Command Adapted to Various Resources Author: Kryger, Meir Title: sleep WAU=Kryger, Meir AND WTI=sleep (%20 sleep[TITL]%20)%20 AND%20(%20 K 1=Kryger, WAU=Kryger, 1003=Kryger-M? Meir AND AND 4=sleep WTI=sleep ryger[AUTH]%20 Meir[AUTH]%20) 11 Z 39. 50 HTTP ALEPH Library of Congress Med. Line Pub. Med KOBV herbert van de sompel
The Universal Gateway enables the use of basic components via API Universal Gateway FIND PRESENT Universal 12 COMBINE SET Gateway FIND DUPLICATES Functions herbert van de sompel
URLs Distributed approach: Meta. Lib http: //metalib 01. exlibris-usa. com/V Harvesting approach: ARC http: //arc. cs. odu. edu/ 13 herbert van de sompel
Pop quiz: reference linking papers Go http: //63. 70. 76. 27: 8080/cs 502/ Logon • Box 1 : Firstname • Box 2 : Lastname • Box 3 : netid • Click take Take Quiz Submit all responses at once 14 herbert van de sompel
Make-up Pop quiz: SODA and FEDORA papers Go http: //63. 70. 76. 27: 8080/cs 502/ Logon • Box 1 : Firstname • Box 2 : Lastname • Box 3 : netid • Click take Take Quiz Submit all responses at once 15 herbert van de sompel
- Slides: 15