Introduction to Metadata for Digital Asset Management
Howard Besser
UCLA School of Education & Information
http://www.gseis.ucla.edu/~howard
Besser--Metadata (Brazil) 1/6/01

Metadata: A fancy word for something familiar
  • Cataloging
  • Indexing
  • Description
  • …
But also new elements of technical description (file format, compression schemes, file names, …)

Metadata for Digital Asset Management
  • Importance of Metadata Standards
  • Types and Uses of Metadata
  • Discovery Metadata: The Dublin Core
  • Administrative and Structural Metadata: The Making of America II Project
  • Longevity Metadata
  • Identification/Provenance
  • The 4/99 NISO/DLF Image Metadata Workshop
  • Various other Metadata

What is Metadata
  • Structured data describing other data, used to find or help manage information resources
  • Aids in interoperability
  • Titles, dates, captions, cataloging and indexing data, file headers, rights info, provenance, code books, transaction logs, ...
  • One person’s metadata is another’s data

Sorting through the Standards Morass
  • Data Structures (DC, CDWA, MARC, VRA Core, TEI, EAD, MESL data dict)
  • Data Interchange (Z39.50)
  • Data Values/vocabularies (LCSH, AAT, ULAN, TGN)
  • Data Content/syntax (AACR2)

Semantics/Syntax/Structure
  • Semantics
      – meaning, as defined by a community to meet their particular needs (DC)
  • Syntax
      – a systematic arrangement of data elements for machine processing
      – facilitates the exchange and use of metadata among various applications (HTML, XML, RDF)
  • Structure
      – a formal arrangement of the syntax with the goal of consistent representation of the semantics (rules defining field contents like 1/11/99)
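
The "1/11/99" example shows why rules for field contents matter: the same string reads as two different dates depending on the convention. The sketch below is only an illustration; the function name `parse_slash_date` and the `day_first` flag are hypothetical and not part of any standard named in the slides.

```python
from datetime import date, datetime


def parse_slash_date(text: str, day_first: bool) -> date:
    """Parse a slash-separated date under an explicitly stated convention."""
    # The content rule (day first or month first) must be agreed in advance;
    # the string alone does not say which reading is intended.
    fmt = "%d/%m/%y" if day_first else "%m/%d/%y"
    return datetime.strptime(text, fmt).date()


month_first = parse_slash_date("1/11/99", day_first=False)
day_first = parse_slash_date("1/11/99", day_first=True)
print(month_first.isoformat())  # 1999-01-11
print(day_first.isoformat())    # 1999-11-01
```

Without a shared content rule, two systems exchanging this field would silently disagree by almost ten months.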

What is Metadata: Types & Uses
  • lots of different ways of dividing the clusters

Uses of Metadata
  • Discovery & Retrieval
  • Identification/Provenance
  • Rights Management
  • Viewing
  • Integrity
  • Longevity
  • Content rating

Containers and Packages of Metadata
Warwick, not MARC
  • modular
  • overlapping
  • extensible
  • community-based
  • designed for a networked world
  • to aid commonality between communities while still providing full functionality within each community

Some different schemes where Metadata is kept
  • embedded within the object (HTML tags)
  • in a separate related DB maintained by the same organization (OPAC, MOA II)
  • in a separate DB maintained by a separate organization (Books in Print, ratings systems)
  • derived on-the-fly from a different scheme (MARC-to-DC)

Collaborative Metadata Projects
  • Dublin Core
  • NSF/ERCIM Digital Collaboratory
  • OCLC CORC Project
  • Visual Resources Association (VRA) Core
  • Encoded Archival Description (EAD)
  • Computerized Interchange of Museum Information (CIMI)
  • Records Export for Art and Cultural Heritage (REACH)

CORC--Cooperative Online Resource Catalog
  • both bib records & webliographies (pathfinders)
  • supports both AACR2/MARC and DC
  • began 1/99, scheduled availability 7/00
  • 100-200 participants
      – Academic libraries
      – OCLC networks, special libraries, public libraries, state & national libraries, consortia

Dublin Core (3/95)
  • improve resource discovery
  • anticipate precision problems of Web crawler-based searching tools
  • existing metadata could be “dumbed down”
  • elements should be simple to understand and use, so that any individual should be able to assign terms him/herself
  • software might eventually automatically generate very base-level metadata

Dublin Core
  • Title
  • Creator
  • Subject
  • Description
  • Publisher
  • Contributors
  • Date
  • Type
  • Format
  • Identifier
  • Source
  • Language
  • Relation
  • Coverage
  • Rights

Dublin Core
  • every element is both optional and repeatable
  • elements are cross-disciplinary
  • elements are extensible by organized communities
  • can employ a syntax such as HTML’s <META> tagset for use by Spiders and Harvesters
  • May 2000 DLF Metadata Harvesting Project
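
As a rough illustration of embedding Dublin Core in HTML's <META> tagset, the sketch below renders a record as one tag per value, which reflects that elements are optional and repeatable. The function name `dc_to_meta_tags` and the sample record are hypothetical, and the "DC." prefix in the NAME attribute is an assumed naming convention, not something stated in the slides.

```python
from html import escape


def dc_to_meta_tags(record: dict[str, list[str]]) -> list[str]:
    """Render each (element, value) pair as one HTML <META> tag."""
    tags = []
    for element, values in record.items():
        # Elements are repeatable: emit one tag per value.
        # Elements are optional: an absent or empty element emits nothing.
        for value in values:
            content = escape(value, quote=True)
            tags.append(f'<META NAME="DC.{element}" CONTENT="{content}">')
    return tags


# Hypothetical sample record, for illustration only.
sample = {
    "Title": ["Introduction to Metadata for Digital Asset Management"],
    "Creator": ["Howard Besser"],
    "Subject": ["Metadata", "Digital asset management"],
}

for tag in dc_to_meta_tags(sample):
    print(tag)
```

A spider or harvester can then collect these tags from the page head without needing a separate catalog record.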

DC Qualifiers
  • allows one community to express important nuances and qualifications, while still making the basic importance available to communities with simple needs
  • our community can reflect alternate title, transliterated title, and main title, yet they will all be found under a simple Web search under “title”
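
One way to picture how qualified titles can still be found under plain “title” is to strip the qualifier and merge the values under the base element. This is a minimal sketch under stated assumptions: the dotted names such as `title.alternative` and the function `dumb_down` are hypothetical conventions chosen for illustration, not a syntax defined in the slides.

```python
def dumb_down(qualified: dict[str, list[str]]) -> dict[str, list[str]]:
    """Collapse qualified elements (e.g. 'title.alternative') to base elements."""
    simple: dict[str, list[str]] = {}
    for name, values in qualified.items():
        # Everything before the first dot is treated as the base element.
        base = name.split(".", 1)[0]
        simple.setdefault(base, []).extend(values)
    return simple


# Hypothetical record with three kinds of title.
record = {
    "title.main": ["Main Title"],
    "title.alternative": ["Alternate Title"],
    "title.transliterated": ["Transliterated Title"],
    "creator": ["Howard Besser"],
}

print(dumb_down(record)["title"])
```

A community with rich needs keeps the qualified record, while a simple search tool indexes only the collapsed view.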

Discovery Metadata: Recent History
  • Dublin Core (3/95)
  • Warwick Framework (4/96)
  • Image Metadata Workshop (9/96)
  • Canberra, Helsinki, ... DC (98)
  • Digital Library Collaboratory (97-)
  • DC-8, Frankfurt 10/99

Dublin Core--further work
  • Warwick Framework
      – metadata packages for extensible functions
      – laid groundwork for RDF
  • Canberra Qualifiers
      – refining the semantics of the element set to provide more precise info
      – SUBELEMENT, SCHEME, LANG
  • Granularity
      – no hierarchical relationships within a given DC record; only one record per discrete object (collection or item-level), and relationship field plus qualifier links them

The Research Process and Functional Categories of Metadata
  • Discovery
  • Retrieval
  • Collation
  • Analysis
  • Re-presentation

Making of America II
  • Background of the DLF Project
  • Administrative Metadata
  • Structural Metadata

MOA2
  • Goal is Interoperability
  • Book example

DLF Metadata for Interoperability Testbed: the MOA II Project
  • R&D
  • Distributed Repositories
  • Transportation, 1869-1900
  • Testbed Project
  • Best Practices
  • Structural and administrative metadata

Previous Projects/Background
  • Library Standards Background
  • UC Berkeley Background
  • Finding Aids
  • EAD
  • SGML EAD
  • “Digital Archives”

MOA II Classes of Objects
  • Continuous Tone Photos
  • Photo Albums
  • Diaries, journals, letterpress books
  • Ledgers
  • Correspondence

MOA II Metadata
  • Administrative Metadata
      – for enhancing resource management
  • Structural Metadata
      – for reflecting internal hierarchies and relationships between parts
  • Raw/Seared/Cooked

Administrative Metadata
to uniquely identify a digital resource and manage it over time
  • Information about where the various pieces/versions of the object reside
  • Information to view the digital object
  • Information about the scanning process

Structural Metadata: that which is relevant to presentation of the digital object to the user
  • metadata defining the “object”: a book, a diary, a photo album
  • metadata defining the “sub-objects”: pages (physical) or chapters and subheads (intellectual)
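
The object and sub-object distinction can be pictured as a small tree. The sketch below is an assumption-laden illustration, not the MOA II encoding: the class `Node`, its `kind` labels, and the sample diary are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One unit of structure: the object itself or one of its sub-objects."""
    label: str
    kind: str  # hypothetical labels: "object", "physical", or "intellectual"
    children: list["Node"] = field(default_factory=list)


def outline(node: Node, depth: int = 0) -> list[str]:
    """Return an indented outline of the node and all of its descendants."""
    lines = ["  " * depth + f"{node.label} [{node.kind}]"]
    for child in node.children:
        lines.extend(outline(child, depth + 1))
    return lines


# Hypothetical diary: an intellectual division (an entry) containing physical pages.
diary = Node("Diary", "object", [
    Node("Entry 1", "intellectual", [
        Node("Page 1", "physical"),
        Node("Page 2", "physical"),
    ]),
])

print("\n".join(outline(diary)))
```

A viewer could walk such a tree to offer page turning (physical) and a table of contents (intellectual) over the same set of image files.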

SGML, XML, HTML
  • TEI for structured humanities text
  • EAD for Finding Aids

Other Types of Metadata
  • Longevity
  • Identification/Provenance
  • Rights Management

NISO/DLF Image Metadata Workshop
Possible Goals
  • Metadata fields
  • Rules for Field Contents (authority control)
  • Core set of necessary fields
  • Syntax for expressing fields and contents (headers)

Image Metadata
Focus on Metadata that may prove helpful for
  • management
  • use
  • preservation
  • ...

Image Metadata Break-out Groups: Work Done
  • Characteristics and Features of Images
  • Image Production and Reformatting Features
  • Image Identification and Integrity

Other Metadata
  • Description of depiction/surrogate (what VRA calls its "Surrogate Categories")
  • Description of original object
  • Rights and Reproduction Information
  • Location Information

Data Structures: The VRA Core
  • 28 elements specifically for visual resource collections
  • Work Description Categories
  • Visual Document Description Categories
  • http://www.oberlin.edu/~art/vra/dsc.html

VRA Core: Work Description Categories
  • Work type
  • Title
  • Measurements
  • Material
  • Technique
  • Creator
  • Role
  • Date
  • Repository name
  • Repository place
  • Repository number
  • Current site
  • Original site
  • Style/period/group/movement
  • Nationality/culture
  • Subject
  • Related work
  • Relationship type
  • Notes

VRA Core: Visual Document Description Categories
  • Visual document type
  • Visual document format
  • Visual document measurements
  • Visual document date
  • Visual document owner number
  • Visual document view description
  • Visual document subject
  • Visual document source

Data Value Metadata (vocabularies)
  • LCSH
  • TGM
  • AAT
  • ULAN
  • TGN
  • VRA Core

LCSH
  • very general

Thesaurus for Graphic Materials
  • designed for subject indexing of pictorial materials, particularly large general collections of historical images, for cataloging and retrieval
  • good for general audiences and broad approaches to the material
  • TGM-I: Subject Terms & TGM-II: Genre and Physical Characteristic Terms
  • http://lcweb.loc.gov/rr/print/tgm/toc.html

AAT
  • 120,000 terms for describing objects, textual materials, images, architecture, and material culture from antiquity to present
  • large and complex
  • http://www.getty.edu/gri/vocabularies/

ULAN
  • name authority
  • http://www.getty.edu/gri/vocabularies/

Thesaurus of Geographic Names
  • over 1 million records
  • hierarchical and global throughout history
  • most records include coordinates and descriptive notes

Metadata for Digital Commerce
  • DOI
  • <indecs>

<indecs>
  • formal structure for describing and uniquely identifying intellectual property itself, the people and businesses involved in its trading, and the agreements which they make about it (primarily for publishing, music, and visual arts)
  • will develop high-level specifications for the services that will be required to implement a global IP trading system based on this <indecs> generic data model
  • focus is on encoding rights at a high level, not on resource discovery
  • likely to involve metadata schema registration and directory to allow interoperation of personal identifiers for rightsholders and users
  • supported by EEC DG-13
  • First meeting July 1999
  • http://www.indecs.org/

Metadata Mapping
  • Crosswalks
  • Resource Description Framework (RDF)

Crosswalks
  • mapping between differing metadata structures
  • eliminate the need for monolithic, universally adopted standards
  • focus on flexibility and interoperability
  • RDF-based metadata registries

Crosswalk Example
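
A crosswalk can be sketched as a lookup table from one structure's fields to another's. The example below is only an illustration of the idea: the four MARC-tag-to-DC pairings are a small illustrative subset chosen for this sketch, and the function `crosswalk` and the sample fields are hypothetical. A real crosswalk would also handle subfields, indicators, and many more tags.

```python
# Illustrative subset of a MARC-to-DC mapping (an assumption for this sketch).
MARC_TO_DC = {
    "245": "Title",
    "100": "Creator",
    "650": "Subject",
    "520": "Description",
}


def crosswalk(marc_fields: list[tuple[str, str]]) -> dict[str, list[str]]:
    """Map (MARC tag, value) pairs onto repeatable Dublin Core elements."""
    dc: dict[str, list[str]] = {}
    for tag, value in marc_fields:
        element = MARC_TO_DC.get(tag)
        if element is None:
            # Unmapped fields are dropped: crosswalking to a simpler
            # scheme usually loses some information.
            continue
        dc.setdefault(element, []).append(value)
    return dc


# Hypothetical source record.
fields = [
    ("245", "Introduction to Metadata"),
    ("100", "Baca, Murtha"),
    ("650", "Metadata"),
    ("650", "Cataloging"),
    ("999", "local note"),
]

print(crosswalk(fields))
```

Because the mapping lives in a table rather than in either standard, each community keeps its own structure and only the table needs maintaining.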

Resource Description Framework (RDF, spec released 2/99)
  • W3C Metadata activity
  • designed to move the Web beyond simple links to semantically-rich relationships between resources
  • metadata application using XML as a common syntax for exchange and processing
  • flexible architecture for managing diverse application-specific metadata packets that can be processed by machines
  • associates resources, property types, and corresponding values
  • http://www.w3.org/RDF/

RDF
  • Resources (character strings, names, digital objects)
  • Property (“is the author of”)
  • Value
  • resources+properties=relationships
  • many different relationships can be reflected
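
The resource, property, and value model can be sketched as a list of three-part statements. This is an informal illustration rather than an RDF implementation: the class `Statement`, the helper `values_of`, and the identifier `urn:example:metadata-slides` are hypothetical.

```python
from typing import NamedTuple


class Statement(NamedTuple):
    """One relationship: a resource, a property type, and a value."""
    resource: str
    property_type: str
    value: str


def values_of(statements: list[Statement], resource: str, property_type: str) -> list[str]:
    """Return every value asserted for the given resource and property type."""
    return [
        s.value
        for s in statements
        if s.resource == resource and s.property_type == property_type
    ]


# Hypothetical statements about a hypothetical resource identifier.
statements = [
    Statement("urn:example:metadata-slides", "Creator", "Howard Besser"),
    Statement("urn:example:metadata-slides", "Subject", "Metadata"),
    Statement("urn:example:metadata-slides", "Subject", "Digital asset management"),
]

print(values_of(statements, "urn:example:metadata-slides", "Subject"))
```

Because every relationship has the same three-part shape, many different kinds of relationship can be stored and queried in one uniform way.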

XML-encoded RDF

    <?xml:namespace ns="http://www.w3.org/RDF" prefix="RDF" ?>
    <?xml:namespace ns="http://purl.oclc.org/DC/" prefix="DC" ?>
    <RDF:RDF>
      <RDF:Description>
        <DC:Creator>Howard Besser</DC:Creator>
      </RDF:Description>
    </RDF:RDF>
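
The slide's example declares namespaces with processing instructions, a draft-era syntax; later XML tooling generally expects `xmlns` attributes instead. As a hedged sketch of how the same creator statement might be produced with Python's standard library, the code below builds an equivalent document. The two namespace URIs, the `about` attribute, the function `build_rdf`, and the identifier `urn:example:metadata-slides` are assumptions that do not appear in the slide.

```python
import xml.etree.ElementTree as ET

# Assumed namespace URIs (not taken from the slide).
RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC_NS = "http://purl.org/dc/elements/1.1/"


def build_rdf(about: str, creator: str) -> str:
    """Serialize one creator statement as namespace-qualified XML."""
    # Register readable prefixes so the output uses rdf: and dc:.
    ET.register_namespace("rdf", RDF_NS)
    ET.register_namespace("dc", DC_NS)
    root = ET.Element(f"{{{RDF_NS}}}RDF")
    description = ET.SubElement(
        root, f"{{{RDF_NS}}}Description", {f"{{{RDF_NS}}}about": about}
    )
    ET.SubElement(description, f"{{{DC_NS}}}creator").text = creator
    return ET.tostring(root, encoding="unicode")


# Hypothetical resource identifier, for illustration only.
print(build_rdf("urn:example:metadata-slides", "Howard Besser"))
```

Building the tree programmatically guarantees matched opening and closing tags, which hand-written examples can easily get wrong.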

Should you start building with RDF today?
  • Tools are primitive
  • Standard still likely to evolve

Metadata for Digital Asset Mgmt
Howard Besser
UCLA School of Education & Information
  • Baca, Murtha (ed). Introduction to Metadata, Los Angeles: Getty Information Institute, 1998. http://www.getty.edu/gri/standard/intrometadata/
  • http://sunsite.berkeley.edu/Imaging/Databases/#standards
  • http://sunsite.berkeley.edu/moa2/
  • http://sunsite.berkeley.edu/Longevity/
  • http://www.ifla.org/II/metadata.htm
  • http://purl.oclc.org/metadata/dublin_core/
  • http://purl.oclc.org/corc/
  • http://lcweb.loc.gov/ead/
  • http://www.gseis.ucla.edu/~howard/image-meta.html
  • http://www.gseis.ucla.edu/~howard/Metadata/UC-May00/
  • http://sunsite.berkeley.edu/Metadata/sp2000.html