Astromat Data Transforming Access to Astromaterial Data NASAs

  • Slides: 60
Download presentation
@Astromat. Data Transforming Access to Astromaterial Data: NASA's Astromaterial Data System Lunar & Planetary

@Astromat. Data Transforming Access to Astromaterial Data: NASA's Astromaterial Data System Lunar & Planetary Science Conference 2019 Town Hall Astromaterials Data System contact: astromatdata@gmail. com

Agenda • Overview of the Astromaterials Data System • • System Components Why build

Agenda • Overview of the Astromaterials Data System • • System Components Why build the Astromaterials Data System? How can you benefit from the Astromaterials Data System? Development Plan • Available now: Astro. Mat’s Bibliography • What is to come and when • Astro. Mat & Moon. DB • How can you participate? • Questions/Discussion Astromaterials Data System contact: astromatdata@gmail. com 2

 • > 250, 000 astromaterials samples at JSC • average of 1, 500

• > 250, 000 astromaterials samples at JSC • average of 1, 500 samples allocated to >350 scientists world-wide every year Recording of Sample Order Lunar Sample Lab Astromaterials Data System contact: astromatdata@gmail. com Archive of Sampling History

Thousands of studies have been conducted on astromaterials. Erik Hauri and Alberto Saal with

Thousands of studies have been conducted on astromaterials. Erik Hauri and Alberto Saal with an ion microprobe at the Carnegie Institution for Science used to detect water in lunar samples. Credit: Steven Jacobsen/Northwestern University Astromaterials Data System contact: astromatdata@gmail. com These studies generated vast amounts of laboratory data. chemical, mineralogical, geochronological, experimental. . .

What is the Astromaterials Data System? Astromaterials Data System contact: astromatdata@gmail. com www. astromat.

What is the Astromaterials Data System? Astromaterials Data System contact: astromatdata@gmail. com www. astromat. org

Goals of the Astromaterials Data System • Preserve past, present, & future astromaterials data

Goals of the Astromaterials Data System • Preserve past, present, & future astromaterials data • Easy access to astromaterials data • Support innovative reuse & advanced data mining Astromaterials Data System contact: astromatdata@gmail. com 6

Astro. Mat Team Cindy Evans Ryan Zeigler Not pictured: Sruti Devendran (Data Curator) Ed

Astro. Mat Team Cindy Evans Ryan Zeigler Not pictured: Sruti Devendran (Data Curator) Ed Bohl (System Administrator) Astromaterials Data System contact: astromatdata@gmail. com

Astro. Mat System Components Repository Synthesis DB Data Curation Astromaterials Data System contact: astromatdata@gmail.

Astro. Mat System Components Repository Synthesis DB Data Curation Astromaterials Data System contact: astromatdata@gmail. com

Astromaterials Data System contact: astromatdata@gmail. com

Astromaterials Data System contact: astromatdata@gmail. com

 • • Cataloguing DOI assignment Versioning Long-term archiving Astro. Mat Repository Metadata Catalog

• • Cataloguing DOI assignment Versioning Long-term archiving Astro. Mat Repository Metadata Catalog Astromaterials Data System contact: astromatdata@gmail. com Files Astro. DB Synthesis Data & Metadata • Data & metadata integration & harmonization

 • • • Cataloguing DOI assignment Versioning Long-term archiving Astro. Mat Repository Metadata

• • • Cataloguing DOI assignment Versioning Long-term archiving Astro. Mat Repository Metadata Catalog Author(s) Title Abstract Keywords Related publication Funding award Etc. Astromaterials Data System contact: astromatdata@gmail. com Astro. DB Synthesis Files • • • Data Sample description Method description Uncertainties Etc. Data & Metadata • • • Data & metadata integration & harmonization Data Sample description Method description Uncertainties Author(s) Title Publication Etc.

Data Ingest • • • Cataloguing DOI assignment Versioning Long-term archiving New & unpublished

Data Ingest • • • Cataloguing DOI assignment Versioning Long-term archiving New & unpublished data (User contributions) Published data (entered by curators) Astro. Mat Repository Astro. DB Synthesis Metadata Catalog Author(s) Title Abstract Keywords Related publication Funding award Etc. Astromaterials Data System contact: astromatdata@gmail. com Files • • • Data Sample description Method description Uncertainties Etc. Data & Metadata • • • Data & metadata integration & harmonization Data Sample description Method description Uncertainties Author(s) Title Publication Etc.

Data Ingest • • Cataloguing DOI assignment Versioning Long-term archiving New & unpublished data

Data Ingest • • Cataloguing DOI assignment Versioning Long-term archiving New & unpublished data (User contributions) Published data (entered by curators) Astro. Mat Repository Astro. DB Synthesis Metadata Catalog Data & Metadata Files File download Customized data subset download Data Access Astromaterials Data System contact: astromatdata@gmail. com • Data & metadata integration & harmonization

Data Publication & Preservation Data Mining & Analysis Earth. Chem Library Pet. DB Database

Data Publication & Preservation Data Mining & Analysis Earth. Chem Library Pet. DB Database GUI (search, browse, download) GUI (search, extract, download) Metadata Catalog Postgre. SQL Database with data & metadata Data & Metadata File System User submission Ingestion by data curators Data Astromaterials Data System contact: astromatdata@gmail. com

Earth. Chem Library Astromaterials Data System contact: astromatdata@gmail. com 15

Earth. Chem Library Astromaterials Data System contact: astromatdata@gmail. com 15

Earth. Chem Library Astromaterials Data System contact: astromatdata@gmail. com 16

Earth. Chem Library Astromaterials Data System contact: astromatdata@gmail. com 16

Astromaterials Data System contact: astromatdata@gmail. com 17

Astromaterials Data System contact: astromatdata@gmail. com 17

Astromaterials Data System contact: astromatdata@gmail. com 18

Astromaterials Data System contact: astromatdata@gmail. com 18

Data Submission Tool Astromaterials Data System contact: astromatdata@gmail. com 19

Data Submission Tool Astromaterials Data System contact: astromatdata@gmail. com 19

Astromaterials Data System contact: astromatdata@gmail. com 1. Search for all ‘basalts’ that have data

Astromaterials Data System contact: astromatdata@gmail. com 1. Search for all ‘basalts’ that have data for ‘Ir’. 2. Give me all trace element data for these samples.

Astromaterials Data System contact: astromatdata@gmail. com 1. Search for all ‘basalts’ that have data

Astromaterials Data System contact: astromatdata@gmail. com 1. Search for all ‘basalts’ that have data for ‘Ir’. 2. Give me all trace element data for these samples.

Astromaterials Data System contact: astromatdata@gmail. com 1. Search for all ‘basalts’ that have data

Astromaterials Data System contact: astromatdata@gmail. com 1. Search for all ‘basalts’ that have data for ‘Ir’. 2. Give me all trace element data for these samples.

Why build the Astromaterials Data System? Astromaterials Data System contact: astromatdata@gmail. com

Why build the Astromaterials Data System? Astromaterials Data System contact: astromatdata@gmail. com

Reason #1: Make Astromaterials Data Open & FAIR Astromaterials Data System contact: astromatdata@gmail. com

Reason #1: Make Astromaterials Data Open & FAIR Astromaterials Data System contact: astromatdata@gmail. com

Recognizing the Value of Open Data Publicly funded research should be publicly available for

Recognizing the Value of Open Data Publicly funded research should be publicly available for public good. Transparency in research is essential to sustain the public trust. The validation of research data by the peer community is an essential function of the responsible conduct of research. Sharing of data makes research more efficient. Astromaterials Data System contact: astromatdata@gmail. com

NASA’s Current Focus on Open Data Astromaterials Data System contact: astromatdata@gmail. com

NASA’s Current Focus on Open Data Astromaterials Data System contact: astromatdata@gmail. com

FAIR Data Movement • Data should have sufficiently rich metadata that are accessible and

FAIR Data Movement • Data should have sufficiently rich metadata that are accessible and understandable to humans & machines. • Data should be deposited in certified trusted repositories, preferably with domain expertise. • Ensure persistent access & preservation. • Data should have a Persistent Identifier (e. g. DOI) • Ensure proper citation & attribution. • Ensure registration of metadata • Data should have clear usage licenses. Astromaterials Data System contact: astromatdata@gmail. com 27

Committing to FAIR Data in the Earth, Space & Environmental Sciences From Commitment Statement

Committing to FAIR Data in the Earth, Space & Environmental Sciences From Commitment Statement of the ‘Enabling FAIR Data’ project*: • Direct all core research outputs (data, software, samples and sample metadata) to trusted repositories. • Supplements will no longer be primary “archive” for data. • Data are cited via persistent identifier. • Adopt a shared set of author instructions (common set of expectations for authors in the ESES). • Provide common expectations for publication peer review when evaluating science and determining if the data, metadata, and software adequate. Astromaterials Data System contact: astromatdata@gmail. com *see: www. copdess. org 28

Reason #2: Get Data Ready for Data Science Astromaterials Data System contact: astromatdata@gmail. com

Reason #2: Get Data Ready for Data Science Astromaterials Data System contact: astromatdata@gmail. com

Reusability Problem: Data Wrangling Surveys in recent years show that data scientists still spend

Reusability Problem: Data Wrangling Surveys in recent years show that data scientists still spend 75 -80% of their time ‘data wrangling’. § RDA EU survey 2013 (75%) § Brodie 2015 (80%) § Crowd. Flower 2017 (80%) Source: Crowdflower Astromaterials Data System contact: astromatdata@gmail. com 30

A Typical Search Result. . . Astromaterials Data System contact: astromatdata@gmail. com 31

A Typical Search Result. . . Astromaterials Data System contact: astromatdata@gmail. com 31

“While each observation made on a sample has its own purpose and value, the

“While each observation made on a sample has its own purpose and value, the full potential of sample-based data can only be realized when the vast numbers of individual observations are combined like pieces of a puzzle to reveal large scale patterns in space, time, and property dimensions. ” K. Lehnert, R. Walls (2018): “Web of Samples: Realizing the Scientific Potential of Material Samples” Astromaterials Data System contact: astromatdata@gmail. com

Impact of Geochemical Databases Goldschmidt Conference 2008 Astromaterials Data System contact: astromatdata@gmail. com

Impact of Geochemical Databases Goldschmidt Conference 2008 Astromaterials Data System contact: astromatdata@gmail. com

Impact of Geochemical Databases Pet. DB just reached 800 citations in the literature! (which

Impact of Geochemical Databases Pet. DB just reached 800 citations in the literature! (which is just the tip of the iceberg) Goldschmidt Conference 2008 Astromaterials Data System contact: astromatdata@gmail. com

Data Science in Geochemistry Study shows a pervasive geochemical discontinuity approximately 2. 5 billion

Data Science in Geochemistry Study shows a pervasive geochemical discontinuity approximately 2. 5 billion years ago indicative of dramatic decreases in mantle melt fraction in basalts and in deep crustal melting/fractionation indicators. The Archean/Proterozoic geochemical transition coincides with sudden atmospheric oxygenation at the end of the Archean aeon, providing a temporal link between deep Earth geochemistry and the rise of atmospheric oxygen. Astromaterials Data System contact: astromatdata@gmail. com

Data Science in Geochemistry Study shows a pervasive geochemical discontinuity approximately 2. 5 billion

Data Science in Geochemistry Study shows a pervasive geochemical discontinuity approximately 2. 5 billion years ago indicative of dramatic decreases in mantle melt fraction in basalts and in deep crustal melting/fractionation indicators. The Archean/Proterozoic geochemical transition coincides with sudden atmospheric oxygenation at the end of the Archean aeon, providing a temporal link between deep Earth geochemistry and the rise of atmospheric oxygen. “Our analysis illustrates the opportunity presented by recent efforts to make large amounts of published geochemical data readily available and amenable to statistical analysis. ” Astromaterials Data System contact: astromatdata@gmail. com

Data Science in Geochemistry e c n dva A o t ata Study shows

Data Science in Geochemistry e c n dva A o t ata Study shows a pervasive geochemical D f o er discontinuity approximately 2. 5 billion w o ” ! P y e g h years ago indicative of dramatic t lo a g r n i e t n sdecreases in mantle melt fraction in i e M v r d a n H a “ basalts and in deep crustal , : y ) f g 2 o l 0 etro melting/fractionation indicators. ion ( P s s , e y s r t t s i d i m m e h ch The Archean/Proterozoic geochemical c s o d l e o G G. transition coincides with sudden n t i c e a c r “Our analysis illustrates the opportunity t n s e i b c atmospheric oxygenation at the end of a S r u o y t the Archean aeon, providing a temporal i presented by recent efforts to make large m b u s link between deep Earth geochemistry e s a e l Pamounts of published geochemical data readily and the rise of atmospheric oxygen. available and amenable to statistical analysis. ” Astromaterials Data System contact: astromatdata@gmail. com

Reason #3: Astromaterials laboratory data are hard to find, access, & reuse. Astromaterials Data

Reason #3: Astromaterials laboratory data are hard to find, access, & reuse. Astromaterials Data System contact: astromatdata@gmail. com

Astromaterials data are: disp erse d of sc ient across and ific p deca

Astromaterials data are: disp erse d of sc ient across and ific p deca conf d u eren blicatio es ce a bstr ns acts Astromaterials Data System contact: astromatdata@gmail. com as y t l a n o m r Never p line. pdf fo n o ublishe e l n i b d a s l i e l a i f v l a a u d i indiv

Development of Astro. Mat: • Compile and restore 40+ years of published data for

Development of Astro. Mat: • Compile and restore 40+ years of published data for all JSC collections into the Astro. DB database. * • Develop APIs and User Interfaces to find, explore, extract, and analyze the content of the Astro. DB database. • Develop the Astro. Mat repository for investigators to publish & archive their astromaterials laboratory data. • Develop community best practices for making astromaterials data FAIR. • Foster a community around Open & FAIR astromaterials data. * Lunar data already compiled by the PDART-funded Moon. DB project. Astromaterials Data System contact: astromatdata@gmail. com 40

Data Restoration Effort Astromaterials Data System contact: astromatdata@gmail. com Data entry for approx. 700

Data Restoration Effort Astromaterials Data System contact: astromatdata@gmail. com Data entry for approx. 700 references with lunar sample data completed 41

Data Restoration Effort • 2 data curators at Lamont • Internship program at Carnegie-GL

Data Restoration Effort • 2 data curators at Lamont • Internship program at Carnegie-GL led by Shaunna Morrison, starting in summer 2020 • Online software for data entry & management: Astro. Admin Completed by Astromaterials Data System contact: astromatdata@gmail. com 42

Data Restoration Effort • Focus initially on geo/cosmochemical, geochronological, & mineralogical data • Addition

Data Restoration Effort • Focus initially on geo/cosmochemical, geochronological, & mineralogical data • Addition of spectrographic data & images in Year 4 & 5 • Rich metadata that describe samples, analytical procedures, lab, & data quality (reference materials, uncertainties, blanks, etc. ) • Sample “geneology” • Links to related information (JSC sample databases, Met. Bull, images, publications, awards, etc. ) Astromaterials Data System contact: astromatdata@gmail. com 43

Development Timeline 4/2019 - 3/2020 Data Restoration 4/2020 – 3/2021 4/2021 – 3/2022 4/2022

Development Timeline 4/2019 - 3/2020 Data Restoration 4/2020 – 3/2021 4/2021 – 3/2022 4/2022 – 3/2023 4/2023 – 3/2024 Lunar Meteorites Other collections Astro. Repo Astro. Search V 1 V 2 Astro. Plot Astro. Desk Astro. API Astromaterials Data System contact: astromatdata@gmail. com V 4 V 1 V 2 V 5 (in-situ maps) Store queries Dashboard V 1 V 3 (spectra) V 2 V 3 V 4 V 5

Check out Astro. Mat’s bibliography! Astromaterials Data System contact: astromatdata@gmail. com www. astromat. org

Check out Astro. Mat’s bibliography! Astromaterials Data System contact: astromatdata@gmail. com www. astromat. org

Astromaterials Data System contact: astromatdata@gmail. com

Astromaterials Data System contact: astromatdata@gmail. com

h c r a e Se S ar ch Astromaterials Data System contact: astromatdata@gmail.

h c r a e Se S ar ch Astromaterials Data System contact: astromatdata@gmail. com

Fil te r So rt Astromaterials Data System contact: astromatdata@gmail. com

Fil te r So rt Astromaterials Data System contact: astromatdata@gmail. com

Su gg e st! s u t a t S Astromaterials Data System contact:

Su gg e st! s u t a t S Astromaterials Data System contact: astromatdata@gmail. com

Filtered by “Lunar Collection” Status “in db” Astromaterials Data System contact: astromatdata@gmail. com

Filtered by “Lunar Collection” Status “in db” Astromaterials Data System contact: astromatdata@gmail. com

Reference “Landing Page” Astromaterials Data System contact: astromatdata@gmail. com

Reference “Landing Page” Astromaterials Data System contact: astromatdata@gmail. com

Astromaterials Data System contact: astromatdata@gmail. com

Astromaterials Data System contact: astromatdata@gmail. com

Astromaterials Data System contact: astromatdata@gmail. com

Astromaterials Data System contact: astromatdata@gmail. com

Astromaterials Data System contact: astromatdata@gmail. com

Astromaterials Data System contact: astromatdata@gmail. com

Moon. DB Update • Restoration & synthesis of lunar sample data • Funded by

Moon. DB Update • Restoration & synthesis of lunar sample data • Funded by the NASA PDART program • Phase 1 (2015 -2017): Geochemistry of Apollo samples • Phase 2 (2018 -2020): Geochronology of lunar samples, geochemistry of lunar meteorites • Database accessible • Graphical User Interface: http: //search. moondb. org • APIs: http: //api. moondb. org • All Moon. DB data will be included in Astro. DB. Astromaterials Data System contact: astromatdata@gmail. com 55

Moon. DB: Geochronological Data & Tools • Interpreted ages will be included in the

Moon. DB: Geochronological Data & Tools • Interpreted ages will be included in the synthesis. • Detailed information about analytical method and data reduction in Geochron. • Initial focus on Ar/Ar ages • Integrating Ar. AR tools Astromaterials Data System contact: astromatdata@gmail. com 56

How You Will Benefit from Astro. Mat • Find, access, explore, and analyze 40+

How You Will Benefit from Astro. Mat • Find, access, explore, and analyze 40+ years of data acquired on astromaterials samples for your research. • “Statistical cosmochemistry” • Find samples for your research (e. g. , look for samples of specific composition) • Find out what analyses have already been performed on a sample. • Share your own data and get credit for it. • • Comply with funding agencies’ policies. Follow new publisher guidelines for FAIR data. Keep your data safe in a long-term archive. Get help for properly documenting your data so they remain useful. Astromaterials Data System contact: astromatdata@gmail. com 57

Participate! • Help us identify missing references. • Contribute missing metadata. • Contribute unpublished

Participate! • Help us identify missing references. • Contribute missing metadata. • Contribute unpublished data. • Encourage your colleagues to contribute. • Let us know about existing data collections. • Participate in usability tests, provide feedback. • Stay informed (Astro. Mat mailing list, twitter). • Twitter: @astromatdata • Mailing list sign up: http: //eepurl. com/gjx. CAP Astromaterials Data System contact: astromatdata@gmail. com 58

Questions: Data • Are there important data types that we are missing? • Would

Questions: Data • Are there important data types that we are missing? • Would you be willing to help with data & metadata review? • What type of support would you need to compile and contribute unpublished data? • Guidance? Templates? Workshops? Salary? Student support? • Do you know of large unpublished datasets that are at risk of being lost? • Analog data or data on obsolete media? Colleagues retiring? Astromaterials Data System contact: astromatdata@gmail. com 59

Questions • How relevant are plotting tools and what should be they be? •

Questions • How relevant are plotting tools and what should be they be? • Would you be willing to participate in Usability Tests? • Would you be willing to participate in an ‘Expert User Group’ to help guide requirement gathering? • Are ‘Data-to-Go’ (downloadable, pre-compiled datasets) desired? Astromaterials Data System contact: astromatdata@gmail. com 60