Alexandria Digital Library (ADL)
What is ADL? What is it supposed to do?
Masi-Carver UCDL 82002 LA Hilton Alexandria digital Library – Davidson Library, UCSB ADL

ADL Mission
To provide a distributed, spatially searchable digital library of geographically referenced materials.
• The library's components may be distributed (spread across the Internet) or coexist within a single network or desktop.
• "Geographically referenced" means that every information object in the library is associated with one or more regions ("footprints") on the surface of the Earth.

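As a rough illustration of what a spatially searchable footprint implies, the sketch below models a footprint as a latitude/longitude bounding box and tests two boxes for overlap. The class name `Footprint`, its fields, and the sample coordinates are assumptions made for this example; they are not taken from the ADL software, and real footprints may be more complex shapes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Footprint:
    """Hypothetical bounding-box footprint in decimal degrees."""
    west: float
    south: float
    east: float
    north: float

    def overlaps(self, other: "Footprint") -> bool:
        # Two boxes overlap unless one lies entirely to one side of the other.
        # Boxes that cross the 180-degree meridian are not handled here.
        return not (
            self.east < other.west
            or other.east < self.west
            or self.north < other.south
            or other.north < self.south
        )


# Illustrative values only.
query_region = Footprint(west=-118.0, south=32.0, east=-116.0, north=34.0)
quadrangle = Footprint(west=-117.125, south=32.5, east=-117.0, north=32.625)
far_away = Footprint(west=10.0, south=43.0, east=11.0, north=44.0)

print(query_region.overlaps(quadrangle))
print(query_region.overlaps(far_away))
```

A bounding box is the simplest footprint that supports this kind of test; the same interface could sit in front of polygons or point sets.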
Alexandria Digital Library (ADL)
• NSF-funded digital library project, 1994-98
  • New method to organize & search for information
  • Focused on geographical information
  • Internet searching and data delivery
• Operational library, 1999-present
  • 2.8 million bibliographic records
  • 5.5 million place name records
  • 7.5 terabytes of on-line data
  • Available to the public via the Internet

What Is Spatial Information?
• Museum Artifacts
• Art about …
• Books about …
• Zoological Habitat Study
• Geographical Data Archives
• Botanical Survey
• Earth Science Data
• Archeological Digs

What information do you have about here?
ADL: Library of Distributed Spatial Information Objects
• Museum Artifacts
• Earth Art
• Other Digital Archives
• Zoological Habitat Study
• Botanical Study
• Ocean Science Data
• Archeological Dig
If it has a latitude and longitude, then it can be in the ADL library.

ADL Organization
The ADL project has:
• An operational library run by the Davidson Library
• A research component (ADEPT) funded by NSF and others
• A gazetteer (place name index and geocoder) run by the Davidson Library

Operational Partners
Implementers
• AUT (Auckland University of Technology): software implementation and content builder
• DLESE (Digital Library for Earth Systems Education): software implementation and content builder
• CNR (Center for National Research, Pisa, Italy)
Content Builders
• ADEPT: educational classroom content
• CASS (Center for the Analysis of Sacred Sites): video, sound, imagery, text
• ESSW: MODIS real-time spacecraft imagery
• Scripps: SIOExplorer oceanographic data
UCSB Davidson Library 08.2002

Alexandria Digital Library (ADL) History

Prototypes
• Rapid Prototype (CD-ROM + ArcView)
• Java Application
• MARC & FGDC Union Catalog
• Web Version 1
• Search Optimized Fields, AKA "Search Buckets"
• Java Application
• CDL Web Client

MARC & FGDC Web Prototype (1995)

Java Application Prototype (1997)

“Webclient” Interface (2002) 1/2

“Webclient” Interface (2002) 2/2

ADL - Web Gazetteer Printed Report

Alexandria Digital Library (ADL)
Current ADL Architecture

Common Features of the Prototypes
• Map
• Place name search
• Search definition frame/panel/tab
• Vocabulary support where appropriate
• Standardized citation & metadata display/formatting

ADL Architecture Goals (1/2)
• Catalog separate from the data distribution
• Metadata-agnostic search methodology
• Data center reliability
• Collection-level metadata
• Search buckets
  • Strongly typed, aggregated search fields based on library concepts
  • Facilitate quick/easy ingest of collections
  • Abstract, searchable indexes

ADL Architecture Goals (2/2)
• Digital library for georeferenced information
  • distributed
  • heterogeneous
  • rich services
• Scalable
  • many providers
  • collections, large and small
• Standard components, interfaces

Components/services
• collection registry: collection-level search
• thesaurus: shared vocabularies
• library: item-level search, metadata management, data access
• gazetteer: maps placenames to locations
• map: background imagery, layering capability
(Diagram labels: content, collection, item)
*many interconnections between services*

Library Server Architecture
• Clients: user interface, metadata mapper, harvest loader, item tracker
• Client interface (XML / Java, HTTP, RMI)
• Middleware
  • access control
  • query fan-out
  • query result caching & ranking
  • collection referencing & registration
• Collection interface (XML / Java)
• Collections and drivers: internal collections, generic database driver, Z39.50 driver, proxy driver, collection aggregator

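The "query fan-out" role of the middleware can be pictured with a small sketch: one query is sent to every registered collection driver and the answers are gathered per collection. This is only an illustration of the idea; the `fan_out` function, the driver signature, and the two stand-in drivers are hypothetical and do not reproduce the ADL middleware's actual Java/XML interfaces.

```python
from typing import Callable, Dict, List

# A collection driver is modeled here as a plain function
# from a query string to a list of item identifiers (an assumption).
Driver = Callable[[str], List[str]]


def fan_out(query: str, drivers: Dict[str, Driver]) -> Dict[str, List[str]]:
    """Send one query to every registered collection; gather results by collection."""
    results: Dict[str, List[str]] = {}
    for name, driver in drivers.items():
        try:
            results[name] = driver(query)
        except Exception:
            # One unreachable collection should not fail the whole search.
            results[name] = []
    return results


def internal_collection(query: str) -> List[str]:
    # Stand-in for a driver backed by a local database.
    return [f"internal:{query}"]


def unreachable_proxy(query: str) -> List[str]:
    # Stand-in for a remote collection that is currently down.
    raise ConnectionError("remote collection unreachable")


results = fan_out(
    "imperial beach",
    {"internal": internal_collection, "proxy": unreachable_proxy},
)
print(results)
```

Keeping every driver behind the same call signature is what lets the client see one interface regardless of how each collection is stored.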
Architecture - Buckets

What is a bucket? (1/3)
• Strongly typed, aggregated search fields based on library concepts
• Similar to Dublin Core, but they define allowable content and search semantics and are optimized for geospatial searching
• Facilitate quick/easy ingest of collections
• Abstract, searchable indexes: Location, Time, Type, Format, Originator, Assigned terms, Subject-related text, and Identifiers

What is a bucket? (2/3)
A strongly typed, abstract metadata category with defined search semantics, to which source metadata is mapped.
Key properties (example):
• name: Coverage date
• semantic definition: the time period to which the item is relevant
• data type (strictly observed): calendar date or range of calendar dates
• syntactic representation (strictly observed): ISO 8601

What is a bucket? (3/3)
• Source metadata is mapped to buckets
  • buckets hold not just simple values ("2001-09-08") but explicit descriptions of those values: (FGDC, 1.3, "Time period of content", "2001-09-08")
  • multiple values may be mapped per bucket
• The bucket definition includes search semantics
  • defines query terms: ISO 8601 date range
  • defines query operators: contains, overlaps, is-contained-in
  • semantics are slightly fuzzy in certain cases to accommodate multiple implementations

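A minimal sketch of the Coverage date bucket described above, assuming closed calendar-date ranges written as "YYYY-MM-DD" or "YYYY-MM-DD/YYYY-MM-DD". The class names `DateRange` and `MappedValue`, their fields, and the convention that the operators read left to right ("item contains query") are assumptions for illustration, not the ADL definitions.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class DateRange:
    """Closed range of calendar dates (hypothetical bucket value type)."""
    start: date
    end: date

    @classmethod
    def parse(cls, text: str) -> "DateRange":
        # Accepts a single ISO 8601 date or a "start/end" interval.
        first, _, last = text.partition("/")
        start = date.fromisoformat(first)
        end = date.fromisoformat(last) if last else start
        if end < start:
            raise ValueError(f"range ends before it starts: {text}")
        return cls(start, end)

    def contains(self, other: "DateRange") -> bool:
        return self.start <= other.start and other.end <= self.end

    def overlaps(self, other: "DateRange") -> bool:
        return self.start <= other.end and other.start <= self.end

    def is_contained_in(self, other: "DateRange") -> bool:
        return other.contains(self)


@dataclass(frozen=True)
class MappedValue:
    """A bucket entry that remembers which source field it came from."""
    scheme: str      # e.g. "FGDC"
    field_id: str    # e.g. "1.3"
    field_name: str  # e.g. "Time period of content"
    value: DateRange


entry = MappedValue("FGDC", "1.3", "Time period of content",
                    DateRange.parse("2001-09-08"))
query = DateRange.parse("2001-01-01/2001-12-31")

print(entry.value.is_contained_in(query))
print(entry.value.overlaps(DateRange.parse("2002-01-01/2002-06-30")))
```

Storing the source scheme and field alongside the value is what allows a search to start at the bucket level and later drill down to the original metadata element.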
Standard buckets

ADL                     Dublin Core
Subject-related text    DC.Subject
Title                   DC.Title
Assigned term           DC.Subject (qualified)
Originator              DC.Creator + DC.Publisher
Geographic location     DC.Coverage.Spatial
Coverage date           DC.Coverage.Temporal
Object type             DC.Type
Format                  DC.Format
Identifier              DC.Identifier

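The correspondence between the standard buckets and Dublin Core can be written down as a lookup table. The sketch below does that and shows one possible way to gather a record's Dublin Core values under bucket names; the function `to_buckets`, the record layout (field name to list of values), and the sample values are assumptions for illustration only.

```python
# Bucket-to-Dublin-Core correspondence, transcribed from the slide.
ADL_TO_DC = {
    "Subject-related text": ["DC.Subject"],
    "Title": ["DC.Title"],
    "Assigned term": ["DC.Subject (qualified)"],
    "Originator": ["DC.Creator", "DC.Publisher"],
    "Geographic location": ["DC.Coverage.Spatial"],
    "Coverage date": ["DC.Coverage.Temporal"],
    "Object type": ["DC.Type"],
    "Format": ["DC.Format"],
    "Identifier": ["DC.Identifier"],
}


def to_buckets(record: dict) -> dict:
    """Collect a record's Dublin Core values under the ADL buckets they map to."""
    buckets = {}
    for bucket, dc_fields in ADL_TO_DC.items():
        values = [v for field in dc_fields for v in record.get(field, [])]
        if values:
            buckets[bucket] = values
    return buckets


# Hypothetical record: each field holds a list of values.
record = {
    "DC.Title": ["Imperial Beach, Ca."],
    "DC.Creator": ["Example creator"],
    "DC.Publisher": ["Example publisher"],
    "DC.Coverage.Temporal": ["2001-09-08"],
}

print(to_buckets(record))
```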
Bucket Motivation
• Heterogeneous metadata
• Uniform client services
• Spatial search requires
  • Strongly typed search fields
  • Optimized for geospatial searching

Summary
• A bucket is a strongly typed, abstract metadata category with defined search semantics, to which source metadata is mapped
• Supports discovery/search across distributed, heterogeneous collections that use metadata structures of their choosing
• Supports high-level searching across collections and "drill-down" searching to the item-level metadata elements

Benefits of the Architecture
• Standard, readily optimized search methodology
• Simplifies design
  • Provides a client with a standard API for searching different data sources
  • Provides a way to discover changed data locations
• Scalability
  • Scale by upgrading the database
  • Scale by distributing the databases

ADL Metadata Ingest

Collection Ingest Procedure

San Diego DRG Metadata Processing
Extract Metadata
• Query Geodex records: scale 1:24,000; longitude -118 to -116; latitude 34 to 32
• Total records: ~330,000
  • By scale: 40,000
  • Within San Diego boundary area: 700
  • Eliminate duplicate and dirty: 89
Clean Metadata ("Massage")
• Variant notations in the records: A_1, a 1, a.1, A-1
• Geodex record "TITLE": Imperial Beach, Ca.; 32117-E1
• ADL record, Sys. control num.: o32117e1
• ADL title: Digital Raster Graphic, DRG, of Imperial Beach, Ca. 7.5 minute topographic quadrangle.
• Programming is used to automate repetitive and time-consuming processes, to extract portions of metadata, and to change the format of metadata.
  • ACCESS, PERL, SQL, UNIX shell script

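The extract-and-clean steps above might be automated along the following lines. The slide names ACCESS, PERL, SQL and UNIX shell scripts as the actual tools; this Python sketch is a stand-in. The record fields, the sample coordinates, and the control-number rule (prefix "o", drop the hyphen, lowercase) are assumptions inferred from the single example "32117-E1" to "o32117e1".

```python
def in_study_area(rec: dict) -> bool:
    # Selection criteria from the slide: 1:24,000 scale, inside the
    # -118..-116 longitude and 32..34 latitude window.
    return (
        rec["scale"] == 24000
        and -118 <= rec["lon"] <= -116
        and 32 <= rec["lat"] <= 34
    )


def control_number(cell: str) -> str:
    # Assumed rule, inferred from one example: "32117-E1" -> "o32117e1".
    return "o" + cell.replace("-", "").replace(" ", "").lower()


def adl_title(place: str) -> str:
    return f"Digital Raster Graphic, DRG, of {place} 7.5 minute topographic quadrangle."


def select(records: list) -> list:
    """Keep records in the study area, dropping repeats of the same cell."""
    seen, kept = set(), []
    for rec in records:
        if not in_study_area(rec) or rec["cell"] in seen:
            continue
        seen.add(rec["cell"])
        kept.append(rec)
    return kept


# Hypothetical sample records; coordinates are illustrative only.
records = [
    {"title": "Imperial Beach, Ca.", "cell": "32117-E1", "scale": 24000, "lon": -117.06, "lat": 32.56},
    {"title": "Imperial Beach, Ca.", "cell": "32117-E1", "scale": 24000, "lon": -117.06, "lat": 32.56},
    {"title": "Elsewhere", "cell": "40100-A1", "scale": 24000, "lon": -100.0, "lat": 40.0},
    {"title": "Small-scale sheet", "cell": "32117-A1", "scale": 250000, "lon": -117.0, "lat": 32.1},
]

adl_records = [
    {"control_number": control_number(r["cell"]), "title": adl_title(r["title"])}
    for r in select(records)
]
print(adl_records)
```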
Processes
Organize Metadata
• Separate shared and unique values for every record
  • Shared (Parent), e.g. Publisher: Stephen P. Teale Data Center
  • Unique (Child), e.g. Title of particular DRG: Digital Raster Graphic, DRG of Otay Mesa, CA, 7.5 minute topographic quadrangle.
• Assign ADL control number
Create Metadata
• Creation of values for required fields for which we don't have info/metadata
  • Search (visit Teale web pages for DRG production information)
  • Original cataloging (access path)
  • Calculation (determining resolution and footprint)
Result: ADL Metadata

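Separating shared (parent) values from unique (child) values can be sketched as follows: any field that carries the same value in every record moves to the parent, and each child keeps only what differs. The function name `split_shared` and the record layout are hypothetical, and the second title is an illustrative value modeled on the slide's example.

```python
def split_shared(records: list) -> tuple:
    """Return (parent, children): fields identical in every record go to the parent."""
    if not records:
        return {}, []
    first = records[0]
    parent = {
        key: value
        for key, value in first.items()
        if all(key in rec and rec[key] == value for rec in records)
    }
    children = [
        {key: value for key, value in rec.items() if key not in parent}
        for rec in records
    ]
    return parent, children


drg_records = [
    {
        "publisher": "Stephen P. Teale Data Center",
        "title": "Digital Raster Graphic, DRG of Otay Mesa, CA, 7.5 minute topographic quadrangle.",
    },
    {
        "publisher": "Stephen P. Teale Data Center",
        "title": "Digital Raster Graphic, DRG of Imperial Beach, CA, 7.5 minute topographic quadrangle.",
    },
]

parent, children = split_shared(drg_records)
print(parent)
print(children)
```

Holding shared values once at the parent level means a correction such as a publisher name change is made in one place rather than in every item record.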
Collection-level metadata

Object Type                 Count
cartographic works          324,876
  maps                      324,876
images                      2,014,799
  photographs               484,083
    aerial photographs      484,083
...

Alexandria Digital Library
Future Directions

Core Service Directions
• Lowering the barrier
  • metadata management services
  • OAI harvest loader
  • improved packaging
• Service aggregation via harvesting
• Content-based searches, ranking
  • text IR, image texture
• Collection discovery
• Display results over the map (layering)
• Storage of user result sets on server

The Ideal ADL Entry Portal
The Portal will be:
• Easy to use: allows the patron to search the collection without knowing keywords or jargon
• Flexible: allows users of differing levels of geographic knowledge to find the data they seek in the minimal amount of time
• Help oriented: if the user does not find what s/he wants, we in MIL will find out and use that knowledge to develop the collection
• Dynamic: so that the user will want to return to see the latest features, collections and tools
• Educational: so that the user can learn to use the site more effectively
• Interesting: uncluttered, new data, featured events

New Interface: Functional Areas
BROWSE
• non-query collection access
• view collection metadata
• browse items sequentially
• feature-of-the-week, etc.
SEARCH
• search formulation
• collection-level, item-level, & combined
• advanced search: complete power
• simple search: subset of advanced
BUILD
• persistent storage
• create, manage, publish collections
• create, import, manage items
• turn result set into collection
RESULTS
• view search results
• organize, sort
• view results spatially
VIEW
• view/access items
• layer over map
• manipulate: tile, subset
tools
• gazetteer: modal, popup; query for & correct bad terms
• thesaurus: modal, popup; navigate, explore terms
• map: omnipresent, stateful; integrated with gazetteer; supports layering

Summary
• Distributed, service-based architecture
  • two search levels
  • heterogeneous, native metadata
  • rich, uniform services
• Status
  • basis of UCSB MIL operational library: http://webclient.alexandria.ucsb.edu
  • downloadable: http://www.alexandria.ucsb.edu/middleware
  • initial full version late 2002

Thanks and Stay Tuned

Contact Information
• Larry Carver, AUL Library Technologies, carver@library.ucsb.edu, 805-893-4433
• Catherine Masi, ADL Coordinator, masi@library.ucsb.edu, 805-893-7661
• David Valentine, Senior Systems Engineer, valentine@library.ucsb.edu, 805-893-4545