Provenance Context Workshop Guiding Documents Overview of Guiding

  • Slides: 24
Download presentation
Provenance & Context Workshop - Guiding Documents Overview of Guiding Documents - R. Duerr

Provenance & Context Workshop - Guiding Documents Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

Outline: • OAIS Reference Model • USGCRP Guidance • PREMIS Metadata Standard Overview of

Outline: • OAIS Reference Model • USGCRP Guidance • PREMIS Metadata Standard Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

The OAIS Reference Model • A CCSDS and ISO standard detailing: § The responsibilities

The OAIS Reference Model • A CCSDS and ISO standard detailing: § The responsibilities of an archive § A functional model describing how to preserve information and make it available to users § An information model describing what ancillary information is needed to ensure that future users understand can use the information preserved § A common set of terminology that can be used to describe the above Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

OAIS Archive Responsibilities • Negotiate with information providers to receive and obtain sufficient rights

OAIS Archive Responsibilities • Negotiate with information providers to receive and obtain sufficient rights to appropriate information to ensure long-term preservation • Designate a community which should be able to understand the information preserved • Ensure that the information is independently understandable to that community • Document procedures and policies regarding data preservation and access • Make the information available Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

OAIS Functional Model OAIS Archive Preservation Planning Data Mgmt Producer Ingest Access Archive Administration

OAIS Functional Model OAIS Archive Preservation Planning Data Mgmt Producer Ingest Access Archive Administration MANAGEMENT Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009. Consumer

OAIS Information Model Content Information Preservation Description Information Packaging Information Package 1 Descriptive Information

OAIS Information Model Content Information Preservation Description Information Packaging Information Package 1 Descriptive Information About Package 1 Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

OAIS Information Model - Content Info. • Data Object - the information to be

OAIS Information Model - Content Info. • Data Object - the information to be preserved • Representational Information - allows a user to understand the data § Structure (e. g. , flat binary file, ASCII table, net-CDF file, HDF, etc. ) § Content (e. g. , a table of station IDs, dates, latitude, longitude, incidence angle, brightness temperature) Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

OAIS Info. Model - Preservation Description • Provenance - documents the history of the

OAIS Info. Model - Preservation Description • Provenance - documents the history of the object • Reference - documents object identifiers and their generation mechanisms • Fixity - documents methods used to ensure there are no undocumented changes • Context - the relationship of the object to its environment Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

Provenance and Contextual Information • Instrument/sensor characteristics including pre-flight or pre-operational performance measurements (e.

Provenance and Contextual Information • Instrument/sensor characteristics including pre-flight or pre-operational performance measurements (e. g. , spectral response, noise characteristics, etc. ) • Instrument/sensor calibration data and method • Processing algorithms and their scientific basis, including complete description of any sampling or mapping algorithm used in creation of the product (e. g. , contained in peer-reviewed papers, in some cases supplemented by thematic information introducing the data set or derived product) • Complete information on any ancillary data or other data sets used in generation or calibration of the data set or derived product 10 Presented by R. Duerr at the Documents Summer Institute on Data Curation, June 2 -5, 2008 Overview of Guiding - R. Duerr Graduate School of Library and Information Science, University ESIP Federation Meeting, Santa Barbara, July 2009. of Illinois at Urbana-Champaign

Provenance and Contextual Information (cont. ): • Processing history including versions of processing source

Provenance and Contextual Information (cont. ): • Processing history including versions of processing source code corresponding to versions of the data set or derived product held in the archive • Quality assessment information • Validation record, including identification of validation data sets • In the case of earth based data, station location and any changes in location, instrumentation, controlling agency, surrounding land use and other factors which could influence the long-term record • A bibliography of pertinent Technical Notes and articles, including refereed publications reporting on research using the data set • Information received back from users of the data set or product 11 Presented by R. Duerr at the Documents Summer Institute on Data Curation, June 2 -5, 2008 Overview of Guiding - R. Duerr Graduate School of Library and Information Science, University ESIP Federation Meeting, Santa Barbara, July 2009. of Illinois at Urbana-Champaign

PREMIS - Where did it come from? • PREMIS = PREservation Metadata: Implementation Strategies

PREMIS - Where did it come from? • PREMIS = PREservation Metadata: Implementation Strategies • Developed by an OCLC and RLG sponsored international working group § Representatives from libraries, museums, archives, government, and the private sector. • Based on the OAIS reference model Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

PREMIS - What’s its purpose? • Provide a core preservation metadata set with broad

PREMIS - What’s its purpose? • Provide a core preservation metadata set with broad applicability across the digital preservation community § Supported by a data dictionary § XML schema § Examples are provided Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

PREMIS - What’s its status? • Maintained by the Library of Congress • Editorial

PREMIS - What’s its status? • Maintained by the Library of Congress • Editorial board with international membership • User community consulted on changes through the PREMIS Implementers Group • Version 1 was released in June 2005 • Version 2 was released in March 2008 Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

PREMIS - Entity-Relationship Diagram Intellectual Entities Objects “an action that involves at least one

PREMIS - Entity-Relationship Diagram Intellectual Entities Objects “an action that involves at least one object or agent Rights “a“aperson, organization, or coherent set of content to unit the associated preservation “aknown discrete of information software program that is reasonably with described preservation inrepository” digital form”in as aevents unit” e. g. , created, archived, For example, a data file the life of an object” For example, a web site, data migrated e. g. , Dr. Spock “assertions ofdonated one or it more set or collection of data sets rights or permissions pertaining to an object or an agent” e. g. , copywrite Eventsnotice, legal statute, deposit agreement Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009. Agents

PREMIS - Types of Objects • Representation - “the set of files needed for

PREMIS - Types of Objects • Representation - “the set of files needed for a complete and reasonable rendition of an Intellectual Entity” • File • Bitstream - “contiguous or noncontiguous data within a file that has meaningful common properties for preservation purposes” Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

PREMIS - Object Metadata • object. Identifier § format o § object. Identifier. Type

PREMIS - Object Metadata • object. Identifier § format o § object. Identifier. Type § object. Identifier. Value • preservation. Level • object. Category* • object. Characteristics § composition. Level § fixity format. Designation q q o format. Name format. Version format. Registry q q q format. Registry. Name format. Registry. Key format. Registry. Role § significant. Properties* § inhibitors message. Digest. Algorithm o message. Digest. Originator o § size Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009. inhibitor. Type o inhibitor. Target o inhibitor. Key o

PREMIS - Object Metadata (continued) • creating. Application § § creating. Application. Name creating.

PREMIS - Object Metadata (continued) • creating. Application § § creating. Application. Name creating. Application. Version date. Created. By. Application Creating. Application. Extension • original. Name • storage § content. Location. Type o content. Location. Value o • signature. Information § § § § signature. Information. Encoding signer signature. Method signature. Value signature. Validation. Rules signature. Properties key. Information • signature. Information. Extension § storage. Medium Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

PREMIS - Object Metadata (continued) • environment § § § software environment. Characteristic environment.

PREMIS - Object Metadata (continued) • environment § § § software environment. Characteristic environment. Purpose environment. Note dependency. Name o dependency. Identifier o o o q q dependency. Identifier. Type dependency. Identifier. Value o sw. Name sw. Version sw. Type sw. Other. Information sw. Dependency § hardware Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009. hw. Name o hw. Type o hw. Other. Information o

PREMIS - Object Metadata (continued) • relationship § § relationship. Type relationship. Subtype related.

PREMIS - Object Metadata (continued) • relationship § § relationship. Type relationship. Subtype related. Object. Identification* related. Event. Identification* • linking. Event. Identifier* • linking. Intellectual. Entity. Identifier* • linking. Rights. Statement. Identifier* Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

PREMIS - Object Metadata Notes • Not all fields are mandatory • Not all

PREMIS - Object Metadata Notes • Not all fields are mandatory • Not all fields apply to all types of objects • Some fields are repeatable Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

PREMIS - Event Metadata • event. Identifier • linking. Agent. Identifier § event. Identifier.

PREMIS - Event Metadata • event. Identifier • linking. Agent. Identifier § event. Identifier. Type § event. Identifier. Value • • event. Type event. Date. Time event. Detail event. Outcome. Information § event. Outcome. Detail § linking. Agent. Identifier. Type § linking. Agent. Identifier. Value § linking. Agent. Role • linking. Object. Identifier § linking. Object. Identifier. Type § linking. Object. Identifier. Value § linking. Object. Identifier. Role Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

PREMIS - Agent Metadata • agent. Identifier § agent. Identifier. Type § agent. Identifier.

PREMIS - Agent Metadata • agent. Identifier § agent. Identifier. Type § agent. Identifier. Value • agent. Name • agent. Type Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.

PREMIS - Rights Metadata • rights. Statement § rights. Granted § rights. Statement Identifier

PREMIS - Rights Metadata • rights. Statement § rights. Granted § rights. Statement Identifier act o restriction o term. Of. Grant o rights. Statement Identifier. Type o rights. Statement Identifier. Value o § § rights. Basis copyright. Information* license. Information* statute. Information* q q o start. Date end. Date rights. Granted. Note § linking. Object. Identifier § linking. Agent. Identifier • rights. Extension Overview of Guiding Documents - R. Duerr ESIP Federation Meeting, Santa Barbara, July 2009.