To architect or engineer Lessons from Data Pool

  • Slides: 26
Download presentation
To architect or engineer? Lessons from Data. Pool on building RDM repositories Steve Hitchcock,

To architect or engineer? Lessons from Data. Pool on building RDM repositories Steve Hitchcock, JISC Data. Pool Project 9 th DCC Research Data Management Forum (RDMF 9) Cambridge, 14 -15 November 2012

Why architecting? http: //datapool. soton. ac. uk

Why architecting? http: //datapool. soton. ac. uk

Data. Pool architecture (Sharepoint) Peter Hancock, i. Solutions, University of Southampton

Data. Pool architecture (Sharepoint) Peter Hancock, i. Solutions, University of Southampton

Data. Pool Building Capacity, Developing Skills, Supporting Researchers October 2011 Policy and guidance Progress

Data. Pool Building Capacity, Developing Skills, Supporting Researchers October 2011 Policy and guidance Progress Informed by Training Data repository Doctoral Training Centres Graduate & staff training services Case studies + • Imaging, 3 D • Geodata • ++ IDMB Surveys of data practices among academics Share. Point EPrints 3. 3 University Strategic Research Groups EPrints data apps 3 -layer metadata March 2013 Developing/ working with Support for Data Management Plans e. g. Capture/share with external sources, e. g. SWORD-ARM JISCMRD Progress Workshop 24 -25 October 2012 Nottingham Large-scale data storage Byatt, D. (D. R. [email protected] ac. uk) Hitchcock, S. (sh 94 [email protected] soton. ac. uk ) White, W. ([email protected] ac. uk ) http: /datapool. soton. ac. uk / Assign Data. Cite DOIs

Data repository platforms Architected • Data. Flow • MS Sharepoint • EPrints Engineered Other

Data repository platforms Architected • Data. Flow • MS Sharepoint • EPrints Engineered Other platforms available • DSpace, CKAN, data. bris, etc. From a data repository perspective

Implementations of Data. Flow Model Data. Stage SWORD Curated repository/ar chive Two-stage architecture Data.

Implementations of Data. Flow Model Data. Stage SWORD Curated repository/ar chive Two-stage architecture Data. Bank Addresses Dropbox effect for data producers EPrints DSpace Data. Flow: two data deposit motivations for creators: want to (practice), need to (policy) QMUL

Data. Stage: Upload file Data. Stage was developed at the University of Oxford Data.

Data. Stage: Upload file Data. Stage was developed at the University of Oxford Data. Stage screenshots courtesy JISC Kaptur project http: //www. vads. ac. uk/kaptur/ Thanks to Carlos Silva

Data. Stage: Submit as data package

Data. Stage: Submit as data package

3 -layer metadata model Takeda et al. , 6 th IDCC, Dec. 2010 available

3 -layer metadata model Takeda et al. , 6 th IDCC, Dec. 2010 available from http: //eprints. soton. ac. uk/169533/ JISC Institutional Data Management Blueprint (IDMB) Project, University of Southampton

Share. Point user interface 1: project

Share. Point user interface 1: project

Share. Point user interface 2: data + fields format, keywords

Share. Point user interface 2: data + fields format, keywords

Prof. Simon Cox (engng) on Sharepoint “The concept that formed part of SP thinking

Prof. Simon Cox (engng) on Sharepoint “The concept that formed part of SP thinking (at Southampton) from the very inception … that ability to use SP as a way to manage or at least collaborate as part of a 5 -10 year programme of work. “The other side is what we’re doing with intellectual property and what we’re offering for students. I chair a group design project, and every single student has said ‘I just do it all on Dropbox’. The same is happening with our research. So I think we have at least to provide a level of service and a level of integration between our research experience and our teaching experience. Would these people go to Southampton rather than University of Nowhereshire on the Web or the University of Google or the University of Dropbox? These are deep questions for us. ”

e. Prints Soton: Item type: Dataset Currently EPrints v 3. 2, customised to e.

e. Prints Soton: Item type: Dataset Currently EPrints v 3. 2, customised to e. Prints Soton Dataset Item Type from 2007

e. Prints Soton: start to deposit Dataset

e. Prints Soton: start to deposit Dataset

EPrints data apps Apps available from EPrints Bazaar http: //bazaar. eprints. org/ Apps work

EPrints data apps Apps available from EPrints Bazaar http: //bazaar. eprints. org/ Apps work with EPrints v 3. 3 or later

EPrints (test repo) Data. Share enabled App by Tim Brody, EPrints + Data. Pool

EPrints (test repo) Data. Share enabled App by Tim Brody, EPrints + Data. Pool

EPrints (test repo) Data Core enabled Data Core “adds a few fields and doesn’t

EPrints (test repo) Data Core enabled Data Core “adds a few fields and doesn’t remove any fields from the eprint object. It creates an alternate workflow for datasets which is much smaller than a normal eprints workflow. ” App by Patrick Mc. Sweeney

EPrints (test repo) Data Core enabled 2 App by Patrick Mc. Sweeney

EPrints (test repo) Data Core enabled 2 App by Patrick Mc. Sweeney

Essex Research Data metadata profile aims “Using metadata schema relevant to UK HE and

Essex Research Data metadata profile aims “Using metadata schema relevant to UK HE and research data (Data. Cite, INSPIRE and DDI 2. 1), we have developed a basic metadata profile suited to describing research data generated at institutions with disciplinary diversity. The inclusion of fields like Funder and Grant number will ensure future harvesting and linking opportunities (like RCUK Research Outcome Systems). The metadata also suits the EPSRC data registry requirements. ” http: //researchdataessex. posterous. com/reposito ry-beta-metadata-profile-released

EPrints: Essex Research Data repository Screenshots courtesy JISC Research Data @Essex project Thanks to

EPrints: Essex Research Data repository Screenshots courtesy JISC Research Data @Essex project Thanks to Louise Corti, Tom Ensom, Alexis Wolton EPrints v 3. 3. 10, customised to Essex Research Data http: //researchdata. essex. ac. uk/

Essex Research Data record

Essex Research Data record

Essex Research Data: observations • Assumes data deposit, so no selection of EPrints Item

Essex Research Data: observations • Assumes data deposit, so no selection of EPrints Item Type • No selection of e. g. Creative Commons licence, just copyright • Requirement for Time Period suggests particular type of data expected • Fields for Geographic info (not required) suggests particular type of data expected

Architects and surroundings “On Nine Elms, London usembassylondon one plot aggressively crystalline blocks by

Architects and surroundings “On Nine Elms, London usembassylondon one plot aggressively crystalline blocks by Rogers Stirk Harbour are going up, their diamond shapes having nothing in particular to do with anything around them. On another Foster and Partners have designed a series of curving, stepped, blobby things, of the kind usually designed to take advantage of views on the Med or the Gulf, but are here facing each other like rows of daleks. Again, it shows little interest in anything around it. ” R. Moore, Utopia on Thames, Observer, 11 Nov 2012

Open access repository interoperability Confederation of Open Access Repositories (COAR) Dublin Core, CRIS-CERIF Open.

Open access repository interoperability Confederation of Open Access Repositories (COAR) Dublin Core, CRIS-CERIF Open. AIRE, Repository. Net+, Rioxx RCUK: Research Outcomes System, Gateway to Research, REF Is there the same current debate about interoperability of data repositories?

COAR on OA interoperability Specific initiatives designed to support interoperability: Author. Claim, CRIS-OAR, Data.

COAR on OA interoperability Specific initiatives designed to support interoperability: Author. Claim, CRIS-OAR, Data. Cite, DINI Certificate for Document and Publication Services, DOI, DRIVER, Handle System, KE Usage Statistics Guidelines, OAIORE, OAI-PMH, OA-Statistik, OA Repository Junction, Open. AIRE, ORCID, Pers. ID, PIRUS, SURE, SWORD, and UK Repository. Net+. COAR, The Current State of Open Access Repository Interoperability (2012), 26 Oct. 2012 v. 02 MT @gknight 2000 (Gareth Knight) Lincoln's CKan instance impressive bit. ly/QQd 1 au Doesn't appear to support OAIPMH or preservation function #jiscmrd

What next for Data. Pool repositories? Sharepoint • User test and feedback sessions scheduled,

What next for Data. Pool repositories? Sharepoint • User test and feedback sessions scheduled, will direct further development Eprints apps (1 or 2 0 f following, initially) • Develop app based on Essex data repository, providing other repositories with a 1 -click install of this profile • Build interoperability (I/O) apps: e. g. Data Management Plans, Dropbox • Automate record capture for producers of largescale, regular data outputs