Ryan Fraser (CSIRO), Lesley Wyborn (GA), Richard Chopping (GA), Terry Rankine (CSIRO), Robert Woodcock (CSIRO) MINERALS DOWN UNDER Virtual Geophysics Laboratory.

Presentation transcript:

Ryan Fraser (CSIRO), Lesley Wyborn (GA), Richard Chopping (GA), Terry Rankine (CSIRO), Robert Woodcock (CSIRO) MINERALS DOWN UNDER Virtual Geophysics Laboratory (VGL): Exploiting the Cloud and HPC

Scientific workflow – Virtual Geophysics Laboratory (VGL)
- Scientific Workflow Engine (or Virtual Laboratory)
- Automates and massively expands (GA) Geophysicists computational capacity via the Cloud
  - Amazon – EC2 / S3 (and others using this interface)
  - OpenStack
- Collaboration between CSIRO, GA, NCI, Monash, UQ, and ANU
- VGL is just a pretty face
  - User Driven GUI
  - Leverages data providers and cloud technologies to do all the heavy lifting
- Non-Restrictive
- Open-source

AuScope Grid and SISS
- AuScope Grid: Charged with delivering software infrastructure to enable Geoscience Community
  - Establishing governance and sustainability of services
  - Delivering solutions to research and government organisations
  - Interoperability
- Spatial Information Services Stack (SISS)
  - An open-source, open-standards SDI
  - Achieves Interoperable data exchange
  - Standardises on 3 components – Format, Content, Tools

V(what)GL
- VEGL – Virtual Exploration Geophysics Laboratory
  - One primary science collaboration
  - One primary workflow
  - One collection of geophysical data sets
- VGL - Virtual Geophysics Laboratory
  - NeCTAR funded activity
  - Collaboration with multiple partners (CSIRO, NCI, GA, UQ, Monash, ANU)
  - Supporting multiple workflows
  - New data sets
  - New data types
  - New Use – Not just exploration.
Done

Geophysics (as seen by a software dev)
- Geophysics is taking physical measurements…
  - Magnetism over an area
  - Acceleration due to gravity over an area
- …Applying lots of mathematics…
- …to infer the structure of what is under the surface…
- It is not geology
  - Samples are never taken, only measurements
[Diagram labels: Raw data; Apply Mathematics; But -- Where is all the ????]

Our Geophysics Problem
- Measurements coming from the field are ‘raw’
  - Varying spatial reference systems
  - Noisy
  - Artefacts from collection process
- This data needs processing
  - From raw data to a data product
- Data products are valuable
  - They will be re-used and referenced repeatedly
- Processing is a time consuming process
  - Made worse by a purely manual workflow

The Past
- Compile raw data using proprietary FORTRAN
  - Also use software – Intrepid
- Transform to a regular grid using more software
  - MATLAB, Intrepid, ER Mapper, ESRI ArcGIS, QGIS
- Crop data spatially to suit final data product
  - e.g. everything in Victoria
- Transform data into a file format that can be read by proprietary scientific code.
  - This is usually done with some handwritten Python or C
  - There is no version control; code is often rewritten / redone
- Upload data to HPC
  - Manually enter input parameters / start job
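The gridding and cropping steps above can be sketched in a few lines. This is a minimal illustration only, not the code used in the workflow described: the function names, the sample values, and the choice of simple cell averaging are all assumptions, and the crop is applied before gridding purely to keep the example short.

```python
from collections import defaultdict


def crop_to_bounds(points, west, south, east, north):
    """Keep only (lon, lat, value) samples inside the bounding box."""
    return [
        (lon, lat, value)
        for lon, lat, value in points
        if west <= lon <= east and south <= lat <= north
    ]


def bin_to_grid(points, west, south, cell_size):
    """Average the samples that fall in each cell of a regular grid.

    Returns {(col, row): mean value}; cells with no samples are absent.
    """
    cells = defaultdict(list)
    for lon, lat, value in points:
        col = int((lon - west) // cell_size)
        row = int((lat - south) // cell_size)
        cells[(col, row)].append(value)
    return {cell: sum(vals) / len(vals) for cell, vals in cells.items()}


# Made-up samples (lon, lat, measurement); the last one lies outside the box.
samples = [
    (144.1, -37.9, 10.0),
    (144.2, -37.8, 20.0),
    (145.6, -37.4, 30.0),
    (150.0, -33.0, 99.0),
]

cropped = crop_to_bounds(samples, west=144.0, south=-38.0, east=146.0, north=-37.0)
grid = bin_to_grid(cropped, west=144.0, south=-38.0, cell_size=0.5)
```

Real survey data would also need reprojection between spatial reference systems and a proper interpolation method, which this sketch leaves out.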

Let’s map it out…
Tools: Hardcopy of data, SSH Client, MATLAB, Intrepid
Steps:
- Get handed field data
- Transform to a regular grid
- Crop data to area of interest
- Reformat data for processing
- Upload data to NCI
- Configure job and start processing
- Download results
- Visualise data

There seems to be a problem…
- Reproducibility – there is none
  - What was the input of your model?
  - What transformations occurred?
- It’s a manual process
  - Time consuming
  - Error prone
- Expensive
  - Licensing costs

The Recent Past - Our solution
- Virtual Exploration Geophysics Laboratory - Portal: http://siss1.anu.edu.au/VEGL-Portal
- Provenance (“I hate this word”)
  - All input data is saved and then published with the final data product
- VEGL automates portions of the workflow
  - Allowing scientists to focus on science
- Built entirely on open source tools
  - No licensing costs
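One way to picture "all input data is saved and then published with the final data product" is a small provenance record that fingerprints each input. The sketch below is an assumption for illustration, not VEGL's actual record format: the field names, the use of SHA-256, and the sample file contents are all hypothetical.

```python
import hashlib
import json


def sha256_of(data: bytes) -> str:
    """Return a hex digest that identifies the exact bytes used."""
    return hashlib.sha256(data).hexdigest()


def build_provenance_record(inputs, script, parameters):
    """Describe what went into a job: inputs, processing script, parameters.

    `inputs` maps a (hypothetical) file name to its raw bytes.
    """
    return {
        "inputs": [
            {"name": name, "sha256": sha256_of(data)}
            for name, data in sorted(inputs.items())
        ],
        "script_sha256": sha256_of(script.encode("utf-8")),
        "parameters": parameters,
    }


record = build_provenance_record(
    inputs={"gravity.csv": b"lon,lat,mgal\n144.1,-37.9,10.0\n"},
    script="print('invert')",
    parameters={"cell_size": 0.5},
)

# Serialise the record so it can be stored next to the data product.
record_json = json.dumps(record, indent=2, sort_keys=True)
```

Because the record is derived only from the inputs, script, and parameters, rebuilding it from the same material gives the same result, which is what makes a later "what was the input of your model?" question answerable.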

CLOUD

Data Discovery

Data Selection

Script Builder

Job Monitoring

Published provenance records

From this…
Tools: Hardcopy of data, SSH Client, MATLAB, Intrepid
Steps:
- Get handed field data
- Transform to a regular grid
- Crop data to area of interest
- Reformat data for processing
- Upload data to NCI
- Configure job and start processing
- Download results
- Visualise data

…to this
Virtual Geophysics Laboratory
- Discover raw data
- Select spatial bounds
- Build “science” from existing libraries
- Run job
- Collect and publish results
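The five portal steps on this slide (discover, select bounds, build, run, collect and publish) can be modelled as a small pipeline. This is a toy sketch under stated assumptions: the `Job` class, the function names, the catalogue entries, and the stand-in executor are all hypothetical and do not reflect VGL's internal design.

```python
from dataclasses import dataclass, field


@dataclass
class Job:
    """Hypothetical description of one workflow run."""
    datasets: list
    bounds: tuple  # (west, south, east, north)
    script: str
    status: str = "draft"
    results: dict = field(default_factory=dict)


def discover(catalogue, keyword):
    """Step 1: find data sets whose name mentions the keyword."""
    return [name for name in catalogue if keyword.lower() in name.lower()]


def run(job, executor):
    """Step 4: hand the job to some compute back end (here, a plain callable)."""
    job.status = "running"
    job.results = executor(job)
    job.status = "done"
    return job


def publish(job):
    """Step 5: bundle inputs and outputs so the run can be repeated."""
    return {
        "datasets": job.datasets,
        "bounds": job.bounds,
        "script": job.script,
        "results": job.results,
    }


catalogue = ["National gravity grid", "National magnetics grid", "Radiometrics"]
found = discover(catalogue, "gravity")                     # step 1
job = Job(datasets=found,
          bounds=(144.0, -38.0, 146.0, -37.0),             # step 2
          script="inversion.py")                           # step 3
job = run(job, executor=lambda j: {"output": f"model for {len(j.datasets)} dataset(s)"})
published = publish(job)
```

Passing the executor in as a callable mirrors the idea that the portal itself does little work and delegates the heavy lifting to whichever compute service is configured.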

VEGL – Point-of-View benefits
User:
- Data all accessible in one place
- Same science but MUCH more efficient
- Bigger scale, quicker
- Repeatability
Developer/Tech:
- Its concepts can be re-used for other scientific workflows
- It can produce actual scientific data products
- Is capable of integrating with any SISS (OGC) data provider
- The power lies with the underlying services, accessed using standardised protocols
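To illustrate what "accessed using standardised protocols" can look like in practice, the sketch below builds an OGC Web Coverage Service (WCS) 1.0.0 GetCoverage request for a bounding box. The slides do not say which OGC services are used, so the choice of WCS is an assumption, and the endpoint URL and coverage name are hypothetical placeholders.

```python
from urllib.parse import urlencode


def wcs_get_coverage_url(endpoint, coverage, bbox, width, height,
                         crs="EPSG:4326", fmt="GeoTIFF"):
    """Build a WCS 1.0.0 GetCoverage URL using key-value-pair encoding.

    bbox is (west, south, east, north) in the units of `crs`.
    """
    params = {
        "service": "WCS",
        "version": "1.0.0",
        "request": "GetCoverage",
        "coverage": coverage,
        "crs": crs,
        "bbox": ",".join(str(v) for v in bbox),
        "width": width,
        "height": height,
        "format": fmt,
    }
    return endpoint + "?" + urlencode(params)


# Hypothetical endpoint and coverage name, for illustration only.
url = wcs_get_coverage_url(
    endpoint="https://example.org/wcs",
    coverage="gravity_grid",
    bbox=(144.0, -38.0, 146.0, -37.0),
    width=400,
    height=200,
)
```

Because the request is just a URL with agreed parameter names, the same client code can be pointed at any provider that implements the standard, which is the re-use benefit this slide describes.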

NOW - Opportunities: Virtual Geophysics Laboratory
- Collaboration with multiple partners – CSIRO, GA, NCI, Monash, UQ, ANU
- Supporting multiple workflows
- Model Registry (3D)
- New Scientific Codes – Underworld, eScript, UBC, Airborne EM inversion codes
- New data sets from GA: National Airborne Geophysical DB including – Gravity, Radiometric, AEM, Magnetics
- New Use – Broad application.

Opportunities
- Exploiting the generic
  - Modularising the workflow for general scientific usage
  - Repurposing for other use cases – natural hazards, climate prediction, etc.
  - Commercial uptake
  - Integration with other VLs to achieve ultimate aim
- Supercomputing – Pawsey Centre, NCI
- Cloud – NeCTAR, NCI and commercial providers
- NeCTAR VGL funded – opportunity to use and/or repurpose the platform for other uses

Thank you and for more information:
CSIRO Earth Science & Resource Engineering
Ryan Fraser, Project Lead
Phone:
Web: siss.auscope.org