Smart Data in Motion for the Deep Carbon Observatory (Implementation of Open-World, Integrative, Transparent, Collaborative Science Data Platforms)
Prof. Peter Fox (pfox@cs.rpi.edu, @taswegian, #twcrpi)
Tetherless World Constellation Chair (Earth and Environmental Science / Computer Science / Cognitive Science / IT and Web Science), Rensselaer Polytechnic Institute, Troy, NY, USA, and the Deep Carbon Observatory Data Science Team
RDA/P4 Science Track, Sept. 22, 2014
http://tw.rpi.edu/web/doc/DCOData.In.Motion_RDA_Fox20140922.ppt
Deep Carbon Observatory (DCO) …
• “We are dedicated to achieving transformational understanding of carbon’s chemical and biological roles in Earth.” www.deepcarbon.net
Data Science is …
• Doing science with someone else’s data …
  – across datasets
  – with models
  – multi-dimensional, multi-scale, multi-mode complex data-types needing new analytic and visual approaches
• Especially in multiple “dimensions” (functional)
  – E.g. detection/attribution methods/algorithms
  – Visual exploration
• Today – is collaborative, performed in a network, and web-based …
Collaboration and Integration needs …
• “Enable DCO team leaders to create new groups and associate a number of content types -- documents, discussions, blog posts, tasks, links, and bibliographic entries -- with the group, as well as simple event management (a private event calendar for the group) and embedding of external services (e.g. and esp. Google Calendar)”
… more … (data, publications, projects) … a Knowledge Network … and a Virtual Organization (> 1000 people)
-> DCO Data Science Platform: CKAN, VIVO, GHS – Handle.net
Data in Motion
• deepcarbon.net
• info.deepcarbon.net
• data.deepcarbon.net
• dx.deepcarbon.net
Collaboration tools: Group Based Collaboration
• Group data deposit and reporting
• Listings of group content
• Listings of group documents
• Group management and messaging
• Group bibliography
• Group shared calendar
• Group task management
• Group membership
• Group event management
All information is linked and traceable!
Research activities report dashboard: results are updated in real time as new members register, new activities are reported, and new publications are uploaded …
TW-SPARQL Application: Parameterized Report Generation (using Drupal host)
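As a rough illustration of what parameterized report generation can look like, the sketch below fills a SPARQL query template from a few validated parameters. It is not the TW-SPARQL implementation: the `ex:` vocabulary, the `build_report_query` function, and the parameter names are all hypothetical, and the sketch only builds the query text without contacting an endpoint.

```python
import re
from string import Template

# Hypothetical query template. The ex: vocabulary is invented for this
# sketch and is not the DCO or VIVO ontology.
REPORT_QUERY = Template("""\
PREFIX ex: <http://example.org/dco#>
SELECT ?item ?label ?date
WHERE {
  ?item a ex:$item_type ;
        ex:reportedBy <$group_uri> ;
        ex:label ?label ;
        ex:date ?date .
}
ORDER BY DESC(?date)
LIMIT $limit
""")

_LOCAL_NAME = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")
_UNSAFE_IRI = re.compile(r'[<>"{}|^`\\\s]')


def build_report_query(item_type, group_uri, limit=10):
    """Return SPARQL text for one report, rejecting unsafe parameters."""
    if not _LOCAL_NAME.match(item_type):
        raise ValueError("item_type must be a simple local name")
    if _UNSAFE_IRI.search(group_uri):
        raise ValueError("group_uri contains characters not allowed in an IRI")
    if not isinstance(limit, int) or limit < 1:
        raise ValueError("limit must be a positive integer")
    return REPORT_QUERY.substitute(
        item_type=item_type, group_uri=group_uri, limit=limit
    )


if __name__ == "__main__":
    # Example page parameters (made up for illustration).
    print(build_report_query("Publication", "http://example.org/group/1", 5))
```

Because the report is regenerated from the query each time the page is rendered, newly registered content would appear without editing the page, which is one plausible way to get the "updated in real time" behavior described above.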
S2S Faceted Browser Application: DCO People Browser
• Includes map facet for selecting people in region
• Facets in any order (open or closed)
• Uses VIVO ‘data’
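The idea of facets that can be applied in any order can be sketched in a few lines. This is a minimal illustration of faceted filtering in general, not the S2S code: the records, the facet names (`group`, `region`), and the helper functions are made up for the example.

```python
from collections import Counter

# Made-up people records; a real browser would draw these from VIVO.
PEOPLE = [
    {"name": "Person A", "group": "G1", "region": "North America"},
    {"name": "Person B", "group": "G1", "region": "Europe"},
    {"name": "Person C", "group": "G2", "region": "Europe"},
    {"name": "Person D", "group": "G2", "region": "Asia"},
]


def apply_facets(records, selections):
    """Keep records whose value for every selected facet is accepted.

    selections maps a facet name to a set of accepted values. Each facet
    is an independent filter, so the order of selection does not matter.
    """
    return [
        r for r in records
        if all(r.get(facet) in accepted for facet, accepted in selections.items())
    ]


def facet_counts(records, facet):
    """Count how many of the given records carry each value of a facet."""
    return Counter(r[facet] for r in records if facet in r)


if __name__ == "__main__":
    in_europe = apply_facets(PEOPLE, {"region": {"Europe"}})
    print([p["name"] for p in in_europe])
    print(facet_counts(in_europe, "group"))
```

Recomputing `facet_counts` on the already filtered records is what lets a browser show, for each remaining facet, only the values that still lead to results.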
TW-SPARQL Application: Dynamic, Stylized Menu Generation (using Drupal host)
• Menus based on parameterization of page
• See “Recent Findings” and “Projects” below
• Note also expanded view “>”
State to date …
• Knowledge network – implements both the collaboration and the integration; reporting implements the transparency
  – It’s being USED
• Many means of population
  – User generation
  – Machine generation
• Contributing these enhancements back to open-source communities (CKAN, VIVO)
Thus … progress …
• Integrative – semantics
• Transparent – semantics
• Collaborative – semantics
• Application integration – Yep – semantics
• So … smart data in motion
Data Science Team + Anusha, Jun, Mengyu, Chengcong, Harsha, Dan, …
Thank you
• pfox@cs.rpi.edu and the DCO Data Science Team
• @taswegian #twcrpi
• http://tw.rpi.edu/web/project/DCO-DS