Pres QT Workshop Outcomes Richard Johnson Sandra Gesing

  • Slides: 12
Download presentation
Pres. QT: Workshop Outcomes Richard Johnson, Sandra Gesing, Natalie Meyers, John Wang, Miranda Van

Pres. QT: Workshop Outcomes Richard Johnson, Sandra Gesing, Natalie Meyers, John Wang, Miranda Van Nevel presqt-contact-list@nd. edu https: //presqt. crc. nd. edu/ January 28, 2019 Hesburgh Libraries

Pres. QT A collaborative design effort to enhance reproducibility and more open sharing of

Pres. QT A collaborative design effort to enhance reproducibility and more open sharing of research data through open source development (July 2018 -June 2020) of Tools and RESTful Services to Improve Preservation and Re-use of Research Data & Software. https: //www. imls. gov/grants/awarded/lg-72 -16 -0122 -16 https: //www. imls. gov/grants/awarded/lg-70 -18 -0082 -18 Hesburgh Libraries

All Resources avail online http: //presqt. crc. nd. edu/ Hesburgh Libraries

All Resources avail online http: //presqt. crc. nd. edu/ Hesburgh Libraries

Pres. QT OSF Project An open project with all stakeholder input, workshop materials, and

Pres. QT OSF Project An open project with all stakeholder input, workshop materials, and meeting info shared on Open Science Framework. Project Partner https: //osf. io/d 3 jx 7/ https: //cos. io/

Goals of this workshop • Group discussion on the technical design document https: //osf.

Goals of this workshop • Group discussion on the technical design document https: //osf. io/anbhu/ It’s a living document! Your input will influence how Pres. QT will be implemented! . Hesburgh Libraries

Goals of the workshop • • • . Group discussion on the technical design

Goals of the workshop • • • . Group discussion on the technical design document https: //osf. io/m 7 hp 4/ It’s a living document! Your input will influence how Pres. QT will be implemented! Selected demos Breakout groups on – Preservation quality – Fixity – Keyword assignment User stories! Deep dives into user stories Expectations from you on the project and vice versa Hesburgh Libraries

Workshop Notes https: //osf. io/ytazk Hesburgh Libraries

Workshop Notes https: //osf. io/ytazk Hesburgh Libraries

Fixity Use Cases #1 - (Use case 3 from discussion) Check file has not

Fixity Use Cases #1 - (Use case 3 from discussion) Check file has not changed at rest over time, generate a checksum that can then be stored with the file at rest ● Then either integrated into datasource to check against checksum, or trigger check manually later? ● Have it be a web interface that is locally running javascript (open question on browser memory limits) ● For example, something like: md 5 sum #2 - (Use Case 2 from discussion): Verify a file has not changed after network file transfer ● Potentially a client based utility (CLI) that can generate a checksum. ● After file transfer on client supply expected checksum (transferred in separate transaction from file? ), and then client generates new checksum and checks if a match Hesburgh Libraries

Fixity Use Cases #3 - (Use Case 4 from discussion): As aggregating files from

Fixity Use Cases #3 - (Use Case 4 from discussion): As aggregating files from different sources together, check to see if may already exist, and if so can combine/extract already existing metadata to add. If so may not need to grab file again. ● May need to consider what level of granularity to validate (individual file, bag, etc. ) ● Store knowledge how to canonicalize files, and have common manifests to compare #4 - (Use Case 1 from discussion): Capturing Chain of Custody and make it public and verifiable, verifying object at each change We ruled this use case out of scope for Pres. QT - (Use Case 5 from discussion): Accidental deletions, prevent, detect and fix this somehow ● Perhaps check for changes to bags ● Have associate action logs to verify a trusted action? We will check whether Meta. DIG also has Fixity checks! Hesburgh Libraries

Keyword Use Cases #1 - Metadata Mapping Between Systems ● Pres. QT must support

Keyword Use Cases #1 - Metadata Mapping Between Systems ● Pres. QT must support direct metadata translations between various target systems. #2 - Keyword Suggestion and Generation ● “Expansion through ontologies… ● Employ topic modelling related techniques Hesburgh Libraries

Preservation Quality Use Cases #1 - Transfer File(s) from one Repository or Workspace to

Preservation Quality Use Cases #1 - Transfer File(s) from one Repository or Workspace to another Repository ● Implementing OSF to Curate. ND (Notre Dame IR) first ● Will then implement with other source/target pairs with partners #2 - Metadata Completeness Check / Score using Meta. DIG ● Use Case 5 from notes: Generate FAIR measurement to determine level of “FAIRness” #3 - Use Case 2: Check file if a desired file format for preservation quality, and if not recommend new format ● Meta. DIG in #2 be able to meet this use case ● Use Case 6: Deprecation warnings about particular formats #4 - Use Case 4: Detect if available for emulation of dependent software in emulation (e. g. , Eaa. Si) and then connect the two together ● Use Case 7: Check for computational reproducibility (see use case 4) Hesburgh Libraries

Preservation Quality Use Case Priorities Other Use Cases Use Case 1: When transfer of

Preservation Quality Use Case Priorities Other Use Cases Use Case 1: When transfer of ownership/stewardship of a resource, would like to already have metadata in place that gives provenance, usage, or generation information ● Define minimum set of metadata to capture Use Case 3: Ask for some kind of template or blueprint for an item to preserve, examples include: ● The code itself if software (or some kind of pseudo-code of the algorithms) ● Some kind of simpler example (like stick figure drawings of choreography) ● Jisc software depositor guidelines Hesburgh Libraries