- Slides: 12
CODATA-RDA Research Data Science Short Courses
Summary of the Problem
§ Recognised need for basic and advanced data skills for all researchers and research support roles. Barend Mons: 500 K people with advanced data skills needed in Europe in the next decade.
§ Requirement is shared by all researchers and support staff.
§ Without more widespread data handling, data management and data science skills, the benefits of the data revolution and open data will not be fully realised.
Summary of the Problem
§ Contemporary research – particularly when addressing the most significant, transdisciplinary research challenges – increasingly depends on a range of skills relating to data. These skills include the principles and practice of Open Science, research data management and curation, the development of data platforms and infrastructures, the techniques of large-scale analysis, statistics, visualisation and modelling, software development and data annotation. The ensemble of these skills, relating to data in research, can usefully be called ‘Research Data Science’.
Highlights of the Deliverable: Working Group
§ Address a recognised need: provide additional training in core data skills for PG and Early Career researchers, research support.
§ Be Scalable and Global: work with partners internationally, open materials, training new teachers to create a network of schools: annual hub at Trieste, feeds the skills to put on schools at institutions worldwide.
§ Provide a Pathway for Specialisation: introductory school in core data skills, then an advanced school or specialised school for specific disciplines.
§ Be Sustainable: developing a business model with colleagues at the Cologne Business School, based on hub and node model with distributed funding.
§ Accredited and Quality Course: working with EDISON and others to ensure that the course meets recognised objectives and can be mapped to credit systems.
Deliverables from the Working Group
§ A curriculum for a broad introductory course in ‘research data science’.
§ Reusable materials available for online delivery (as trialled for the champions and used in Trieste).
§ School in Trieste and a first cohort of ‘champions’ to run schools internationally.
§ A model for expansion and creating a sustainable network of schools for a broad research data science curriculum.
§ A prototype business model and plans for sustainability.
Introductory Research Data Science Curriculum
§ Open Science – reflection on ethos and requirements of sharing/openness
§ Open Research Data – Data Publishing, Life-Cycle, Metadata and annotation
§ Data Carpentry – Introduction to SQL databases
§ Software Carpentry – Introduction to programming in R, the Unix shell and Git (sharing software and data)
§ Visualisation – Tools, Critical Analysis of Visualisation
§ Analysis – Statistics and Machine Learning (Clustering, supervised and unsupervised learning)
§ Computational Infrastructures – Introduction to cloud computing, launching a Virtual Machine on an IaaS cloud
Impact of the Deliverable
§ First Introductory Course, 1-12 August, ICTP, Trieste.
§ Subsidised accommodation and meals for up to 120 students.
§ So far: 55 K euros funding for students and tutors committed by ICTP, TWAS, CODATA, GEO, ACU and RDA Europe.
§ Strong emphasis on training new teachers for courses in 2017 (online preparation and ‘champion’ role).
§ Deadline for applications is 18 April 2016: http://indico.ictp.it/event/7658/
Impact of the Deliverable
§ Beijing Introductory Research Data Science School, 4-15 July, CNIC, Beijing, China.
§ CODATA China in collaboration with the WG, CNIC and RADI.
§ Scholarships available for c. 20 students from LMICs.
§ Introductory course will follow the basic curriculum designed by the CODATA-RDA Working Group.
§ Application process to be announced very shortly on the CODATA website: http://www.codata.org
Impact of the Deliverable: Next Steps
§ Online course for ‘champions’ from May in preparation for the first introductory ICTP Trieste School in August 2016.
§ Materials from the first school will be openly available for reuse and online study.
§ Repeat of introductory and a new advanced school planned for summer 2017.
§ Plans for regional introductory schools in Brazil, India and South Africa + Indonesia, Kenya?
§ Plans for regional specialised schools in South Africa (Social Sciences and BioInformatics)…
§ Create a sustainable network of schools, training teachers, reusing materials, hub and node funding…
Research Data Science Champions Teach-New-Teachers
Highlights of the Deliverable: Working Group
§ CODATA-RDA Co-Chairs: Andrew Harrison, Simon Hodson, Hugh Shanahan.
§ ICTP, Trieste: Ivan Girotto, Clement Onime.
§ Course Delivery: Joy Davidson, DCC; Tracy Teal, Data Carpentry; Jonah Duckles, Software Carpentry; Ekpe Okorafor, Roger Barlow, Tim Head, Mitch Stanton-Cook, Jan Aerts, Colin Gillespie.
§ Curriculum Standardisation and Accreditation: Laura Molloy and Christopher Jung (RDA IG), Steve Brewer and Yuri Demchenko (EDISON).
§ Advanced Data Course: Jamie Shiers (CERN).
§ Bio-Informatics: Celia van Gelder, Terri Attwood (Elixir)
§ Social Sciences: Louise Corti, Nathan Cunningham (UKDA)
§ Train the Trainers: Anelda van der Walt (Talarify, SA)
§ Impact: Sabina Leonelli, Louise Bezuidenhout, Brian Rappert (Exeter)
§ Business Models: Julia Maintz and students from Cologne Business School.
Highlights of the Deliverable: Partner Organisations
Next Steps and Contact Information
§ Information on the Research Data Science Short Courses/Summer Schools: http://www.codata.org/working-groups/research-data-science-summer-schools
§ Comment on the curriculum and approach! Use the online materials from August!
§ Apply for the Beijing Pilot!
§ Apply for the first Introductory Course at ICTP, Trieste: http://indico.ictp.it/event/7658/
§ Contact Harry, Hugh and Simon about collaboration to build the network: [email protected] ac. uk, [email protected] rhul. ac. uk, [email protected] org