Some thoughts on data, software and tools and who knows how to do what Rhys Francis Executive Director The Australian eResearch Infrastructure Council
eResearch capability in 2014… Dozens of virtualised research tools and applications - In use and shared by many researchers A peta scale national research cloud Labs using digital work flows Increased connectivity and bandwidth Single Sign On, High reliance services, High reliability servers New data centres Peta scale systems and software New data products Five fold growth in computing power above Moores law Data systems automated Data repositories 100s of petabytes of storage capacity Data publication made possible A corpus of Australian data
Infrastructure Elements – Push & Pull Infrastructure Co-ordination Better data management, description and access Extended Bandwidth Better HPC modelling Larger data collections Shared Access Methods Improved research tools, environments and workflows Research Integration + $144.5M National Capabilities + $246M NeCTAR ANDS
Infrastructure – Major Challenges Infrastructure Co-ordination Better data management, description and access Extended Bandwidth Better HPC modelling Larger data collections Shared Access Methods Improved research tools, environments and workflows Research Outputs - skills & expertise - methods & techniques - findings & results Software -Encode new research methods -Create 'shoulders' for others to stand on Data -Allows us to research the real world -Connects reality to experiment & theory Expertise -Operate world class infrastructure -Provide skills and tools to researchers
To west coast United States To Singapore The Australian national e-infrastructure platform (2013) Primary Node 15 participants Additional Node 4 participants Primary Node 11 participants Primary Node 4 participants Primary Node 8 participants Primary Node 6 participants Additional Node 7 participants Primary Node 6 participants Data10-20 PB fast growing Data1-2 PB more focused NetworkNx100 Gb/s layer-1 Network10 Gb/s layer-3
To west coast United States To Singapore The Australian national e-infrastructure platform (2013) Primary Node 15 participants Additional Node 4 participants Primary Node 11 participants Primary Node 4 participants Primary Node 8 participants Primary Node 6 participants Additional Node 7 participants Primary Node 6 participants Cloud50,000 concurrent tasks Data10-20 PB fast growing Data1-2 PB more focused HPC> 1 Pf/s capability HPC~100 Tf/s specialised NetworkNx100 Gb/s layer-1 Network10 Gb/s layer-3
To west coast United States To Singapore Primary Node 15 participants Additional Node 4 participants Primary Node 11 participants Primary Node 4 participants Primary Node 8 participants Primary Node 6 participants Additional Node 7 participants Primary Node 6 participants 40 Software tools and laboratory integration projects 250 Data improvement projects engaging all institutions Single sign-on for all researchers across the entire infrastructure 40 Software tools and laboratory integration projects 250 Data improvement projects engaging all institutions Single sign-on for all researchers across the entire infrastructure Astronomy Geoscience Astronomy Geoscience Life Science Life Science Climate Science Climate Science Cloud50,000 concurrent tasks Data10-20 PB fast growing Data1-2 PB more focused HPC> 1 Pf/s capability HPC~100 Tf/s specialised NetworkNx100 Gb/s layer-1 Network10 Gb/s layer-3 A national Research Data Commons of published and accessible research data A national Research Data Commons of published and accessible research data
Some messages… MIND the GAPs visions of data & tools | | reality Data and software as infrastructure data => interpretation => people Four things to Send to the Future (results + expertise) + (data + software)
eResearch Infrastructure We are here to help Thank you