Tuning GENIE Earth System Model Components using a Grid Enabled Data Management System Andrew Price University of Southampton UK e-Science Programme, Booth 2140 SC2004, Pittsburgh, USA 11 th November 2004
Outline Introduction Scientific aims of GENIE e-Science tools –Data Management System –Geodise Toolboxes –OPTIONS Design Search and Optimisation Results Future work Conclusions
Introduction The GENIE project is developing a Grid-based system to: Flexibly couple together state-of- the-art components to form a unified Earth system model Execute the resulting model on the Grid Share the distributed data produced in simulations Provide high-level open access to the system, creating and supporting virtual organisations of Earth system modellers
Scientific Aims ice-age Orbital parameters affect incident radiation and climate Biological and geological processes interact with, and feedback upon, the climate (via, for instance, CO 2 )
The target GENIE Model 3D atmosphere 3D ocean 2D sea ice Atmospheric CO 2 2D land surface Land vegetation Ocean biogeochemistry Ocean sediments 3D ice sheets Atmosphere – Bristol’s IGCM3 Ocean – SOC’s GOLDSTEIN Land – Met. Office’s TRIFFID Land ice – Bristol’s GLIMMER Ocean biogeochemistry and sediments – UEA’s BioGEM
Initial GENIE experiments Initial studies in GENIE performed parameter sweeps to investigate the properties of the model
e-Science Tools Data Management System (augmented version of the Geodise Database System) Matlab scripting environment Geodise Toolboxes XML Toolbox OPTIONS Design Search and Optimisation package Template and Example scripts
Data Management System Java Client Code Apache Axis CoG Jython Functions Globus Server Geodise Database Toolbox Metadata Database ClientGrid SOAP Matlab Functions Database Web Services Authorisation Service Location Service Metadata Archive & Query Services Jython XML Schema GridFTP Portal
Data Management Client
Geodise Toolboxes Geodise Compute Toolbox gd_createproxy.m Creates a Globus proxy certificate for the user's credentials gd_destroyproxy.m Destroys the local copy of the user's Globus proxy certificate gd_jobsubmit.m Submits a compute job to a Globus GRAM job manager gd_jobstatus.m Gets the status of a Globus GRAM job gd_putfile.m Puts a remote file using GridFTP gd_getfile.m Retrieves a remote file using GridFTP gd_rmfile.m Deletes a remote file using GridFTP gd_makedir.m Creates a remote directory using GridFTP gd_rmdir.m Deletes a remote directory using GridFTP Geodise Database Toolbox gd_archive.m Archives a file or data structure to the database gd_query.m Query the database for data matching specified criteria. gd_retrieve.m Retrieves a file or data structure from the database
Grid Computation OptionsMatlab UK National Grid Service (GT2) OxfordLeeds RALManchester Local Resource (GT2) GENIE Database objfun.mobjfun_parse.m optjobparallel.m OptionsMatlab invokes the wrapped binary using parameters provided by the selected optimisation algorithm. Soton Condor Pool (1700 Nodes)
IGCM3 Atmosphere Model The objective function is a weighted sum of the RMS differences between the model state and NCEP data. Compare annual means for a number model fields.
IGCM3 Tuning Results 40% reduction in error statistic compared to default parameters Similar result to a parallel study performed using the Ensemble Kalman Filter Model physics insufficient to perfectly match observational data.
e-Science Summary
Conclusions Provided the environmental scientist with a toolset for tuning GENIE models: –Scripting environment –Database repository –Computational Grid interface –Suite of generic optimisation algorithms A Global minimum can reliably be found in low dimensional problem space. For higher dimensional problems, the tools are appropriate for locating local minima in the state space.
The GENIE Team Coordinator: Tim Lenton– CEH Edinburgh Principal investigator: Paul Valdes – Bristol Research Team and Collaborators: James Annan – FRSGC, Japan Chris Brockwell – UEA Norwich David Cameron – CEH Edinburgh Peter Cox – Hadley Centre (UKMO) Neil Edwards – Bern, Switzerland Murtaza Gulamali – London e-Science Centre Julia Hargreaves – FRSGC, Japan Phil Harris – CEH Wallingford Dan Lunt – Bristol Bob Marsh – SOC Andrew Price – Southampton e-Science Centre Andy Ridgwell – UBC, Canada Ian Rutt – Bristol Gang Xue – Southampton e-Science Centre Andrew Yool – SOC Management Team: Melvin Cannell – CEH Edinburgh Trevor Cooper-Chadwick – Southampton e-Sci. Centre Simon Cox – Southampton e-Sci. Centre John Darlington – London e-Science Centre Richard Harding – CEH Wallingford Tony Payne – Bristol John Shepherd – SOC Andrew Watson – UEA Norwich Thanks to Steven Newhouse