Visual Materials Metadata Standards and Best Practices for
Visual Materials: Metadata, Standards, and Best Practices for Digital Libraries Howard Besser UCLA School of Education & Information http: //www. gseis. ucla. edu/~howard Besser--IS 208 #1 4/3/00 1
Metadata for Digital Libraries Models for Digital Libraries Importance of Metadata Standards Types and Uses of Metadata Discovery Metadata: The Dublin Core Administrative and Structural Metadata: The Making of America II Project Longevity Metadata Identification/Provenance The 4/99 NISO/DLF Image Metadata Workshop Various other Metadata Besser--IS 208 #1 4/3/00 2
Key problems we’re facing Discovery Longevity Interoperability- Besser--IS 208 #1 4/3/00 3
Serious Longevity Problems What we know from prior widespread digital file formats Images separating from their metadata Inaccessibility of software needed to view an image Inability to even decode the file format of an image Besser--IS 208 #1 4/3/00 4
Traditional Digital Library Model DL DL DL search & presentation user Besser--IS 208 #1 4/3/00 DL search & presentation user 5
Ideal Digital Library Model DL DL search & presentation user Besser--IS 208 #1 4/3/00 user 6
For Interoperability Digital Libraries Need Standards Discovery Metadata for finding Administrative Metadata for viewing and maintaining Structural Metadata for navigation . . . IP Rights Management Metadata for controlling access. . . Besser--IS 208 #1 4/3/00 7
Why are Standards and Metadata consensus important? Managing digital files over time Longevity Interoperability Veracity Recording in a consistent manner Will give vendors incentive to create applications that support this Besser--IS 208 #1 4/3/00 8
Why Standards? Why – – – do we need a standards? To make information universally available to users facilitate sharing and interchange of information To preserve information (make it safe from changes in hardware and software) Standards are the work of communities They are necessary so that communities can Besser--IS 208 work. #1 4/3/00 9
Why are you Managing this Information? Organizational mission & type Users Uses Besser--IS 208 #1 4/3/00 10
Questions to Ask What communities is this standard designed for? What type of information is this standard designed to handle? What functions is this standard designed to serve? What previous standards is it built upon? Does the standard prescribe how to create new records (or parts of records), or how to map from existing records? How far does the standard go? Semantics: Does it define element sets? Rules? Syntax? Besser--IS 208 #1 4/3/00 11
What is Metadata Structured data describing other data used to find or help manage information resources Aids in interoperability Titles, dates, captions, cataloging and indexing data, file headers, rights info, provenance, code books, transaction logs, . . . One person’s metadata is another’s data Besser--IS 208 #1 4/3/00 12
Sorting through the Standards Morass _ Data Structures (DC, CDWA, MARC, VRA Core, TEI, EAD, MESL data dict) _ _ _ Data Interchange (Z 39. 50) Data Values/vocabularies (LCSH, AAT, ULAN, TGN) Data Content/syntax (AACR 2) Besser--IS 208 #1 4/3/00 13
Semantics/Syntax/Structure _ Semantics – _ Syntax – – _ meaning, as defined by a community to meet their particular needs (DC) a systematic arrangement of data elements for machine processing facilitates the exchange and use of metadata among various applications (HTML, XML, RDF) Structure – a formal arrangement of the syntax with the goal of consistent representation of the semantics (rules defining field contents like 1/11/99) Besser--IS 208 #1 4/3/00 14
What is Metadata Types & Uses lots of different ways of dividing the clusters Besser--IS 208 #1 4/3/00 15
Uses of Metadata Discovery & Retrieval Identification/Provenance Rights Management Viewing Integrity Longevity Content rating Besser--IS 208 #1 4/3/00 16
Types of Metadata Descriptive Discovery & Retrieval Structural Administrative Intellectual Other Metadata Besser--IS 208 #1 4/3/00 17
Metadata -- Detailed Types Identification metadata Instance or Fixation metadata Source image metadata Content metadata Subject metadata Form and format metadata Besser--IS 208 #1 4/3/00 Context metadata Structure metadata Relationships metadata Terms & Conditions metadata Use history metadata 18
Containers and Packages of Metadata Warwick, not MARC _ _ _ modular overlapping extensible community-based designed for a networked world to aid commonality btwn communities while still providing full functionality within each community Besser--IS 208 #1 4/3/00 19
Some different schemes where Metdata is kept _ _ embedded withing the object (HTML tags) in a separate related DB maintained by same organization (OPAC, MOA II) in a separate DB maintained by a separate organization (Books in Print, ratings systems) derived on-the-fly from a different scheme (MARC-to-DC) Besser--IS 208 #1 4/3/00 20
Some Standards/Metadata Efforts _ _ _ Dublin Core Visual Resources Association (VRA) Core Encoded Archival Description (EAD) Computerized Interchange of Museum Information (CIMI) Records Export for Art and Cultural Heritage (REACH) Besser--IS 208 #1 4/3/00 21
Dublin Core (3/95) _ _ _ improve resource discovery anticipate precision problems of Web Crawlerbased searching tools existing metadata could be “dumbed down” elements should be simple to understand use, so that any individual should be able to assign terms him/herself software might eventually automatically generate very base-level metadata Besser--IS 208 #1 4/3/00 22
Dublin Core Title Format Creator Identifier Subject Source Description Language Publisher Relation Contributors Coverage Date Rights Type Besser--IS 208 #1 4/3/00 23
Dublin Core every element is both optional and repeatable elements are cross-disciplinary elements are extensible by organized communities can employ a syntax such as html’s <META> tagset Besser--IS 208 #1 4/3/00 24
DC Qualifiers _ allows one community to express important nuances and qualifications, while still making the basic importance available to communities with simple needs – our community can reflect alternate title, transliterated title, and main title, yet they will all be found under a simple Web search under “title” Besser--IS 208 #1 4/3/00 25
Discovery Metadata: Recent History _ _ _ Dublin Core (3/95) Warwick Framework (4/96) Image Metadata Workshop (9/96) Canberra, Helsinki, . . . DC (98) Digital Library Collaboratory (97 -) DC-8, Frankfurt 10/99 Besser--IS 208 #1 4/3/00 26
Dublin Core--further work _ Warwick Framework – – _ Canberra Qualifiers – – _ metadata packages for extensible functions layed groundwork for RDF refining the semantics of the element set to provide more precise info SUBELEMENT, SCHEME, LANG Granularity – no hierarchical relationships w/i a given DC record; only one record per discrete object (collection or item-level), and relationship field plus qualifier links them Besser--IS 208 #1 4/3/00 27
_ _ _ The Research Process and Functional Categories of Metadata Discovery Retrieval Collation Analysis Re-presentation
Making of America II Background of the DLF Project Administrative Metadata Structural Metadata Besser--IS 208 #1 4/3/00 29
Other Types of Metadata Longevity Identification/Provenance Rights Management Besser--IS 208 #1 4/3/00 30
The Short Life of Digital Info: Digital Longevity Problems Disappearing Information The Viewing Problem The Scrambling Problem The Inter-relation Problem The Custodial Problem The Translation Problem Besser--IS 208 #1 4/3/00 31
Identification/Provenance (Images) The number of variant forms of a work can be enormous Image Families A digital image frequently has many layers of parentage Information about the parentage that can indicate the quality and veracity of the image (Dublin Core "Source" and "Relation") how to deal with different versions derived from the same scan or different encoding schemes Vocabulary Standards to express this Besser--IS 208 #1 4/3/00 32
NISO/DLF Image Metadata Workshop Possible Goals Metadata fields Rules for Field Contents (authority control) Core set of necessary fields Syntax for expressing fields and contents (headers) Besser--IS 208 #1 4/3/00 33
Other Metadata Description of depiction/surrogate (What VRA calls its "Surrogate Categories") Description of original object Rights and Reproduction Information Location Information Besser--IS 208 #1 4/3/00 34
Data Structures: The VRA Core 28 elements specifically for visual resource collections Work Description Categories Visual Document Description Categories http: //www. oberlin. edu/~art/vra/dsc. html Besser--IS 208 #1 4/3/00 35
Data Value Metadata (vocabularies) LCSH TGM AAT ULAN TGN VRA Core Besser--IS 208 #1 4/3/00 36
Metadata for Digital Commerce DOI <indecs>- Besser--IS 208 #1 4/3/00 37
Metadata Mapping Crosswalks Resource Description Framework (RDF) Besser--IS 208 #1 4/3/00 38
Metadata Philosphies _ _ Minimalists vs. Structuralists From Pidgeon to Creole (add structure and tenses) Besser--IS 208 #1 4/3/00 39
Collaborative Metadata Projects OCLC CORC Project Computerized Interchange of Museum Information (CIMI) Besser--IS 208 #1 4/3/00 40
Visual Materials: Metadata, Standards, and Best Practices for Digital Libraries Howard Besser UCLA School of Education & Information Baca, Murtha (ed). Introduction to Metadata, Los Angeles: Getty Information Institute, 1998 http: //www. gseis. ucla. edu/~howard/image-meta. html http: //www. gseis. ucla. edu/~howard http: //sunsite. Berkeley. EDU/Imaging/Databases/#standards http: //sunsite. Berkeley. EDU/moa 2/ http: //sunsite. Berkeley. EDU/Longevity/ http: //www. gii. getty. edu/timeandbits/ http: //www. nlc-bnc. ca/ifla/II/metadata. htm http: //purl. oclc. org/metadata/dublin_core/ http: //purl. oclc. org/corc// http: //lcweb. loc. gov/ead/ http: //www. cimi. org/ Besser--IS 208 #1 4/3/00 41
- Slides: 41