NOAA Documentation Improvement Ted Habermann Completeness Rubric Scores
Completeness (Rubric Scores) How do we measure and visualize improvements in NOAA documentation? [Figure labels: Record Count; More Complete; Less Complete; More Metadata; Less Metadata; Not So Good]
NGDC Solar Metadata History Late in 2011 Bill Denig, the Chief of the Solar and Terrestrial Physics Division at NGDC, decided that, in order to understand this “metadata thing”, he had to actually work on some metadata. He started working with one record on his desktop using Oxygen (the dreaded XML editor) and the NGDC rubric. He got a very high score and was able to extend his experience to several other records. He achieved high scores with a small number of records. Bill’s experience and confidence increased, and two months later he extended his collection to include records translated from FGDC to ISO. This increased the number of records and decreased the scores (FGDC translations generally yield scores between 16 and 20). Since then Bill has improved existing records and added new, high-quality records to steadily increase his average score.
Geophysics Components are reusable pieces of documentation that allow “normalization” of information in metadata collections. [Figure label: One Month]
NOAA Documentation Dashboard [Figure panels: Documentation Accessibility (% Records with Data Access Service Link): NOAA 46%, Line Ofc 1 25%, Line Ofc 2 80%, Line Ofc 3 50%. Data Access Service Types Offered: WMS, WCS, DAP, Esri, WFS, None. Metadata Dialects Used: ISO, FGDC, OBIS, DC, Free Text, None. Metadata Completeness Scores (% of Records): Mean, σ, Min, Max. Historic Trend: NOAA, Line Ofc 1, Line Ofc 2. Audiences: Managers; Scientists and Data Managers]
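The dashboard summarizes completeness scores with a mean, σ, minimum, and maximum, and reports the percentage of records with a data access service link. A minimal Python sketch of those summary numbers follows. The function names, the `has_service_link` key, and the example scores are hypothetical and are not taken from the NOAA system.

```python
from statistics import mean, pstdev


def summarize_scores(scores):
    """Return mean, population standard deviation, min, and max of rubric scores."""
    if not scores:
        raise ValueError("no scores to summarize")
    return {
        "mean": mean(scores),
        "sigma": pstdev(scores),
        "min": min(scores),
        "max": max(scores),
    }


def percent_with_access_link(records):
    """Percentage of records whose (hypothetical) 'has_service_link' flag is true."""
    if not records:
        return 0.0
    linked = sum(1 for record in records if record.get("has_service_link"))
    return 100.0 * linked / len(records)


# Hypothetical scores for one collection, for illustration only.
example_scores = [16, 18, 20, 30]
summary = summarize_scores(example_scores)
```

Computing the same four numbers per Line Office and for NOAA as a whole would give the values a dashboard of this kind could plot side by side.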
Documentation Metric Services [Diagram labels: Collector; Web Accessible Folders; Calculate Metrics; Validation; Bad XML; Bad Links; Unique Contacts; Bad xLinks; Component Report; Rubric Scores; Database]
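Two of the validation checks named on this slide, bad XML and bad xLinks, can be illustrated with a short Python sketch. This is not the NOAA implementation: the function names are hypothetical, "bad XML" is taken to mean not well-formed, and an xLink is treated as bad only when its `xlink:href` is empty, which is an assumption. Checking that links actually resolve is left out.

```python
import xml.etree.ElementTree as ET

# Standard XLink namespace, written in ElementTree's {namespace}name form.
XLINK_HREF = "{http://www.w3.org/1999/xlink}href"


def check_record(xml_text):
    """Return (is_well_formed, xlink_targets) for one metadata record."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return False, []
    targets = [
        element.get(XLINK_HREF)
        for element in root.iter()
        if element.get(XLINK_HREF) is not None
    ]
    return True, targets


def validation_report(records):
    """Summarize problems for a mapping of record name -> XML text."""
    report = {"bad_xml": [], "empty_xlinks": []}
    for name, text in records.items():
        well_formed, targets = check_record(text)
        if not well_formed:
            report["bad_xml"].append(name)
        elif any(not target.strip() for target in targets):
            # Assumption: an empty xlink:href counts as a bad xLink.
            report["empty_xlinks"].append(name)
    return report
```

In practice the record texts would be read from the files in a Web Accessible Folder, and the resulting report would be stored alongside the rubric scores.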
Collection Characteristics Goal: Improve metadata when you revise it. [Figure labels: Completeness; Understanding; Discovery; FGDC; Most Metadata Are Old; Legacy; Sporadic Work Periods; Metadata Revision Date; Recent]
Line Office / Program Process Data Collectors/Providers Standards Experts Data Stewards Data Users 1. Identify Expertise Data Stewardship Teams 2. Assess Discovery Collections Use Datasets Understanding Services Initial Evaluation 3. Create and Improve Spirals Wiki Rubrics On-Going Evaluation 4. Publish/Preserve Consultation / Guidance Use Cases / Needs Catalogs WAFs On-Going Input
Documentation Capabilities
WAF: Unified access to metadata with multiple views. Automated custom processing and metadata quality checks. Translation from many dialects to a consistent international standard. Harvest targets for Geospatial One Stop and data.gov.
Spiral Tracker: Consistent rubric score calculation. Score distributions help identify improvement steps. Database provides history and access to scores / records. Supports stewardship teams and managers.
Wiki: Provides community guidance, examples, successes. Information available and shared with national and international partners.
Required NOAA-Wide Support
NOAA Line/Staff Offices and Programs must build on a strong foundation of support across all of NOAA:
1. Develop and implement common metadata management tools
2. Use rubrics to establish a baseline and monitor progress
3. Promote and highlight good examples
4. Support training specifically targeted at improving NOAA’s data documentation
5. Initiate teams to work on “special documentation problems” that cross Line and Staff Offices
6. Encourage and support participation in the ISO and Open Geospatial Consortium (OGC)
Questions?
NOAA Documentation Improvement Components (Homogeneous to Heterogeneous)
Directive: Vision and goals, identify standards, describe common nomenclature, identify responsibilities
Tools: Efficiently provide consistent guidance implementations, evaluations, and metadata management capabilities for all Line Offices and Programs
Plans: Community guidance, training, examples, best practices, broad input, Line Office and Program plans
Metric Calculation System [Diagram labels: THREDDS; Rubric XSLT (TBD); FGDC; Collector; Record Scores; Existing XSLT (NCDDC); ISO; Rubric XSLT (TBD); Record Scores; Web Accessible Folders] Desktop Editors: METAVIST, CatMDEdit, ArcCatalog, XMLSpy, Oxygen. Web Tools: MERMaid, InPort, NMMR, Geonetwork
Current Test Cases CoRIS 1642 Records NMFS ~400 Records Collector InPort NOS 737 Records FGDC NESDIS/OAR 1521 Records ISO GOSIC ~400 Records UAF 552 Records NcML DIF
Spiral Development / Training Check Back With Data Collectors/Providers Check Back With Users Spiral 2-N: Scientific Questions New Requirements New Use Cases Standard Guidance / Implementation Metadata Content Independent of standard Spiral 1: Initial Content
Spiral Development / Training: Potential Spirals
Discovery
Identification: Id, Title, Abstract, Resource Date, Topic Category, Theme Keyword, Metadata Contact, Science Contact
Extent: Geospatial Bounding Box, Temporal Start/End, Vertical Min/Max, Place Keywords
Connection: Online Resource: Linkage (URL), Name, Description, Function
Understanding
Text Searches: Purpose, Extent Description, Lineage Statement, Project Keywords
Distribution: Distributor Contact, Online Resource, Distribution Format, Data Center Keywords, Browse Graphic
Content Information: Attribute Type, Attribute Names, Attribute Definitions, Attribute Units
Acquisition Information: Instrument, Platform, Instrument Keywords, Platform Keywords
Quality/Lineage: Sources, Process Steps, Quality Reports / Coverages
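As an illustration of how a spiral's field list could drive a completeness check, here is a minimal Python sketch that counts which Discovery fields a record fills in. It is not the NGDC rubric, which the deck describes as XSLT-based. The grouping of fields is one reading of the slide, and the dictionary representation of a record and the function names are hypothetical.

```python
# Field groups as read from the Discovery spiral on the slide (grouping is an assumption).
DISCOVERY_FIELDS = {
    "Identification": [
        "Id", "Title", "Abstract", "Resource Date", "Topic Category",
        "Theme Keyword", "Metadata Contact", "Science Contact",
    ],
    "Extent": [
        "Geospatial Bounding Box", "Temporal Start/End",
        "Vertical Min/Max", "Place Keywords",
    ],
    "Connection": ["Linkage (URL)", "Name", "Description", "Function"],
}


def is_present(record, field):
    """A field counts as present when it has a non-blank value."""
    value = record.get(field)
    return value is not None and str(value).strip() != ""


def score_record(record, groups=DISCOVERY_FIELDS):
    """Return {group: (fields_present, fields_total)} for one record."""
    return {
        group: (sum(is_present(record, f) for f in fields), len(fields))
        for group, fields in groups.items()
    }


def missing_fields(record, groups=DISCOVERY_FIELDS):
    """List the fields a steward would still need to add."""
    return [f for fields in groups.values() for f in fields if not is_present(record, f)]
```

Listing the missing fields per record gives a steward a concrete to-do list for the next spiral, which is the kind of guidance the score distributions are meant to support.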
Spiral Development / Training: Rubrics
Spiral Tracker Specific Records WAF Score History Score Distribution http://www.ngdc.noaa.gov/idb/struts/results?t=103068&s=20&d=25