RDSI: changing the face of research data storage in Australia Viviani Paz, Project Manager 16 June 2015
Project Update Landscape pre-RDSI RDSI Project Objectives Programmes Outcomes Transitioning Future
Research infrastructure Landscape (1) Federal government Initiatives A decade of Strategic Investment From 1997 : High Performance Computing Committee Report………. To 2006: eResearch Coordinating Committee Report 2007-2011 - $540M Platforms for Collaboration Research infrastructure
Landscape (2) Digital Certificates
Landscape (3) Research data Backup? Share? Accessible? Secure? Grants? Lost? Reusable? 7 April 2019
2011 100PB = 102,400TB = 104,857,600GB = 107,374,182,400MB
RDSI Project Objectives (from ABP) Develop a national network of data stores Create and develop a data storage infrastructure accessed through an infrastructure provided by agencies within the sector Connect the data storage infrastructure to the Australian Research and Education Network (AREN) by a high bandwidth connection, funded and constructed under the National Research Networks (NRN) Project. Dedicated high speed connections between major Nodes. Support meritorious data collections through the Research Data Service Programme (ReDS). Encourage economies of scale through the Vendor Panel Programme (VePa)
RDSI Programmes
ReDS Research Data Service NoDe Node Development Identify, strengthen and develop research data centres able to hold and process high data volumes ReDS Research Data Service Identify research data holdings of lasting value and importance and contribute funding to their development at the most appropriate Node(s) DaSh Data Sharing Build capability to support the sharing and re-use of research data VePa Vendor Panel Establish a panel of vendors for RDSI Nodes and sector RDSI Programmes
Node Development (NoDe) Funded research data centres able to hold and process high data volumes 6 Primary Nodes 2 Additional Nodes Townsville Brisbane Sydney Canberra Perth Adelaide Melbourne Hobart
Research Data Services Programme (ReDS) Identified research data holdings of lasting value and importance Funded storage at Nodes (MAC) Funded staff at nodes to facilitate data ingest of collections, and the development of Node infrastructure
Data Sharing Programme (DaSh) DaSh Collaboration Network: High performance network between RDSI Nodes DaSh Technical Architecture SPIN ARMS Science DMZ
DaShNet Redundant AARNet4 connections at each node to provide connectivity to the node from AARNet customers and Internet. Slide courtesy of AARNet
DaSh Programme Outcomes Mediaflux Monitoring
VePa Programme RDSI/CAUDIT Vendor Panel Current vendors on the panel
RDSI Project Outcome Facilitated the creation of a robust, innovative and collaborative network of Nodes to support better managed and more accessible world-class research data in Australia.
Node Statuses (1) Updated 9 June 2015
Node Statuses (2) Updated 9 June 2015
22 Fields of Research at RDSI funded storage Updated 19 May 2015 Updated 9 June 2015
RDSI Project Transitioning RDSI Project outcomes will be efficiently and successfully preserved moving into the future RDSI Project Outcomes Nodes DaShNet AARNet Vendor Panel CAUDIT Research Federated Identity Nodes/AAF 7 April 2019
Pathway to the future Previously my thinking was limited by the small amount of storage and computing that was available to me. I always had to summarise down and minimise the data. I don’t have to do that now. I don’t have to worry about the live disk limitation or the compute resources. Now I can keep doing the research as I’d like to see it done. – Dr Jeremy VanDerWal Centre for Tropical Biodiversity and Climate Change, James Cook University
www.rdsi.edu.au 7 April 2019
RDSI
National Collaborative Research Infrastructure Strategy (NCRIS) 2015-2017 - $300M Australian Animal Health Laboratory (AAHL) Astronomy Australia (AAL) Atlas of Living Australia (ALA) Australian Microscopy and Microanalysis Facility (AMMRF) Australian National Data Service (ANDS) Australian National Fabrication Facility (ANFF) ANSTO Nuclear Science Facilities Australian Phenomics Network (APN) Australian Plant Phenomics Network (APPF) Australian Plasma Fusion Research Facility (APFRF) Australian Urban Research Infrastructure Network (AURIN) AuScope Biofuels Bioplatforms Australia (BPA) EMBL Australia Groundwater (GIROS) Heavy Ion Accelerators (HIA) Integrated Marine Observing System (IMOS) National Computational Infrastructure (NCI) National Deuteration Facility (NDF) National eResearch Collaboration Tools and Resources (NeCTAR) National Imaging Facility (NIF) Pawsey High Performance Computing Centre Population Health Research Network (PHRN) Research Data Service (RDS) Terrestrial Ecosystem Research Network (TERN) Translating Health Discovery into Clinical Applications (THD)
Infrastructure & Foundation www.rds.edu.au Services www.rdsi.edu.au
Q & A Viviani Paz Project Manager, RDSI v.paz@uq.edu.au 7 April 2019