The ANDS Data Connections Strategy Adrian Burton and
The ANDS “Data Connections” Strategy Adrian Burton and Andrew Treloar International Digital Curation Conference Chicago, USA 8 th December 2011
WARNING: Australian Content – some listeners may be offended
The Australian equivalent of the English “dah-tah” and American “day-tah” is: “Dah-da” We’re not very good at Latin in the antipodes, so we just use it as an Australian singular group noun, “the data is good”, “the rice is good”
The Data Connections Strategy 1. The Data Commons (Datorum Commons? ) 2. Discovery and Publishing 3. Enabling Connections 4. Data Connections Projects 5. Data Connections Partners 6. Data Aggregation/Integration 7. Limitations
Australian Commonwealth Budget April 2009 “Data Commons - $48 million § An Australian research data commons is required to support the discovery of, and access to, research data held in Australian universities, publicly funded research agencies and government organisations for the use of research. § This investment will enable the construction of a range of ICT utilities to capitalise on and ensure greater use and reuse of existing data resources, as well as better management of new data generated in Australian research. ” https: //www. pfc. org. au/bin/view/Main/Super. Science
A vision, but not a strate or a plan https: //www. pfc. org. au/pub/Main/Data/Towardsthe. Australian. Data. Commons. pdf
The Data Commons Freefoto. com http: //www. freefoto. com/images/1016/05/1016_05_3_prev. jpg
The Data Commons Freefoto. com http: //www. freefoto. com/images/1016/05/1016_05_3_prev. jpg
Cultural Collections Domain Facilities Govt Agency Data Research Data
ANDS provides… § Data Commons Infrastructure § Core infrastructure at ANDS § Community operated at § Research facilities § Research organisations § Government agencies § Policy advocacy § Training, skills, resources, consultancy
Stage One: Publication and Discovery • Surfacing Content & Descriptions • Establish Harvesting Infrastructure • Publish, Register and Find my data
Stage Two: Enabling Connections
Data Connections
Data Connections “enable data to be more easily connected with other data and with the broader research enterprise” § Better “discovery in context” § More “linkable data”
Data Connections - Projects The things that potentially connect research and data: § § § Places Funded research projects People and organisations Fields of Research Datasets and journals Standard terminology
Data Connections: “Informatics” infrastructure Information systems for publishing and referring to: § § Definitive-source information reference values identifiers URIs
Data Connections - Partners Whenever possible, recurrently funded organisations with mandates: § § § Office of Spatial Data Management (location) Research Councils – ARC, NHMRC (research activity) National Library of Australia (parties) Australian Bureau of Statistics (research fields) Scholarly Societies (terminologies)
Location Infrastructure § Gazetteer of Australia 2. 0 § Office of Spatial Data Management § Combined data sets of all States and Territories now available freely and without charge
Location Infrastructure § WFS-G (Open Geospatial Consortium Gazetteer Profile of Web Feature Service) § WFS 1. 1 protocol and GML 3. 1 binding § Stage 2: boundaries, marine, historical, indigenous, crowd sourcing
Party Infrastructure National Library of Australia People Australia and Bibliographic Authority file Public identifier for the public persona https: //wiki. nla. gov. au/display/ARDCPIP/ARDC+Pa rty+Infrastructure+Project+Home § Brokerage to other researcher id systems § EAC-CPF; VIAF; ORCID etc § §
Activity Infrastructure § § § Research Councils ARC, NHMRC Web service information systems for grants Identifiers, definitive information, URIs Concept design phase…(CRIS, VIVO) In principle agreement
Field of Research Infrastructure § Australian Bureau of Statistics § Web service information publication of the official Australia New Zealand Standard Research Classifications § Identifiers, definitive information, URIs § Potential for other classifiers § In principle agreement
Data Set Identifier Infrastructure § Data. Cite Consortium § Community service allowing Australian research organisations to allocate DOIs to reference data objects § Enable connections between referring literature and the dataset § Enable acknowledgement, reward and incentive § Also Handle service
ANDS Terminology Support Infrastructure A set of online services to support the creation, management, and publication of human and machine-readable terminologies for use by the Australian research and higher education sector
ANDS Terminology Support Infrastructure Promoting the use of standardised terminology in data to enable data integration within and across disciplines
ANDS Terminology Support Infrastructure MANAGERS ANDS Vocabulary Service PRODUCERS CONSUMERS
Data Connections: § § Broader Agenda Infrastructure (web service information) Common approaches (information models…) Hub and spoke capacity Outreach, promotion, training
Approaches to Data Aggregation § File and Metadata § Database § Linked Data
Limitations - homogeneity Homogeneity Of Urban Spaces Drawing by Jacqueline Dubbins
Limitations – good enough
The road ahead… • Vocabularies • Annotation • Preservation • Equivalence • Metadata Schema • Ontology
http: //ands. org. au/dataconnections. pdf adrian. burton@ands. org. au services@ands. org. au ANDS is supported by the Australian Government through the National Collaborative Research Infrastructure Strategy Program and the Education Investment Fund (EIF) Super Science Initiative
- Slides: 35