Download presentation
Presentation is loading. Please wait.
Published byNigel Baker Modified over 9 years ago
1
GGF16 Athens, February 13-16 2006. DEISA Perspectives Towards cooperative extreme computing in Europe Victor Alessandrini IDRIS - CNRS va@idris.fr
2
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 2 DEISA objectives To enable Europe’s terascale science by the integration of Europe’s most powerful supercomputing systems. Enabling scientific discovery across a broad spectrum of science and technology is the only criterion for success DEISA is an European Supercomputing Service built on top of existing national services. This service is based on the deployment and operation of a persistent, production quality, distributed supercomputing environment with continental scope. The integration of national facilities and services, together with innovative operational models, is expected to add substantial value to existing infrastructures. Main focus is High Performance Computing (HPC) and Extreme Computing applications that cannot by supported by the isolated national services. Service providing model is the transnational extension of national HPC centers: –Operations, –User Support and Applications Enabling, –Network Deployment and Operation, –Middleware services.
3
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 3 The DEISA Supercomputing Environment (21.900 processors and 145 Tf in 2006, more than 190 Tf in 2007) IBM AIX Super-cluster –FZJ-Julich, 1312 processors, 8,9 teraflops peak –RZG – Garching, 748 processors, 3,8 teraflops peak –IDRIS, 1024 processors, 6.7 teraflops peak –CINECA, 512 processors, 2,6 teraflops peak –CSC, 512 processors, 2,6 teraflops peak –ECMWF, 2 systems of 2276 processors each, 33 teraflops peak –HPCx, 1600 processors, 12 teraflops peak BSC, IBM PowerPC Linux system (MareNostrum) 4864 processeurs, 40 teraflops peak SARA, SGI ALTIX Linux system, 1024 processors, 7 teraflops peak LRZ, Linux cluster (2.7 teraflops) moving to SGI ALTIX system (5120 processors and 33 teraflops peak in 2006, 70 teraflops peak in 2007) HLRS, NEC SX8 vector system, 646 processors, 12,7 teraflops peak.
4
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 4 HPC and Grid computing Grid computing is not always HPC. Message Passing latencies are boosted in WANs from a few microseconds to millisecond, because the speed of light is not big enough. Deploying tightly coupled parallel applications in large scale grids may not be compatible with high performance requirements Direct Grid computing works best for (almost) embarassingly parallel applications, or coupled software modules with limited real time communications. It is more efficient to run large, tightly coupled parallel applications in a single platform. DEISA implements this requirement by rerouting jobs and balancing the computational workload at a European scale. A co-scheduling service will enable deployment of weakly coupled parallel applications on several platforms
5
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 5 OPERATION OF AIX SUPER-CLUSTER Full production status of dedicated (reserved bandwidth) 1 Gb/s network Upgrade to 10 Gb/s reserved bandwidth network planned for 2006 GPFS : Full production at FZJ, RZG, IDRIS, CINECA CSC, ECMWF and HPCx to follow. LL – Multicluster (job migration) production status at four core sites. CSC ECMWF Services Services: High performance datagrid via GPFS Remote I/O: access to remote files uses the full available network bandwidth. High performance access to shared data repositories. Remote submission and job migration across sites Mainly used to load balance the global workflow when a huge partition is allocated to a DEISA application in one site Common Production Environment HPCx
6
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 6 Workflow simulations using UNICORE Complex simulations that are pipelined over several heterogeneous platforms. UNICORE handles them as a unique job and transparently moves the output – input data along the pipeline. The UNICORE client that monitors hhe application can run in a laptop. UNICORE has a user friendly Graphical interface. DEISA has developped a command line interface for UNICORE. UNICORE has a self-contained, robust security model. UNICORE can be a useful tool to enable secure access to the DEISA supercomputing resources by external corporate users. This is being evaluated by some external corporations (like CNES in France).
7
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 7 Services being deployed Extension of GPFS to generic Linux systems. Ongoing collaboration with IBM. GPFS clients have been ported to SGI ALTIX systems. This enables the heterogeneous extension of the GPFS data grid. Co-scheduling service. Ongoing collaboration with Platform Computing. Deployment of a DEISA co-scheduling service based on LSF Univers. Needed to support Grid applications running on the heterogeneous environment, and for global data management. Global data management. Implementing fast data transfers across sites using stripped GridFTP, database access and hierarchical data management at a continental scale. GridFTP Co-scheduled, parallel data mover tasks
8
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 8 Portals and Science Gateways Extending the outreach of the supercomputing infrastructure by reaching new users communities that have already structured their applications strategies around small discipline oriented grid infrastructures with disciline specific tools. Connecting supercomputer environments as « backend » resources to existing discipline oriented eInfrastructures. Community allocations will enable the access of external anonymous users. To move in this direction, DEISA plans to deploy in 2006-2007a portal to a European supercomputing service for bio-informatics and Life Sciences. This is a discipline in which the need of HPC is strongly emerging in some domains. –Critical domain applications are ported to the most adapted supercomputer of the DEISA environment –Shared data repositories are hosted by GPFS services –A DEISA portal wll be deployed, but interoperabilty with existing portals will be searched This ASP (Application Service Provider) model is potentally well adapted to external corporate users.
9
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 9 Enabling science Initially, DEISA had an « early users » program: a number of Joint Research Activities integrated in the project from the start As some services in the infrastructure reached production quality, we moved towards « exceptional users » The DEISA Extreme Computing Initiative: identification, deployment and operation of a number of « flagsjip » applications in selected areas of science and technology. Applications are selected on the basis of scientific excellence, innovation potential and relevance criteria (the application must require the extended infrastructure services) European call for proposals: May-June every year (first one took place in 2005) Evaluation Juin -> September.
10
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 10 Adapting applications to the infrastructure: the ATASKF Creation, in April 2005, of the Applications Task Force (ATASKF), to support the Extreme Computing Initiative. The ATASKF carries out a prospective action with the European Scientific community. It provides guidance to find the best fit betweed the users requirements and the DEISA supercomputing environment. For accepted projects, the ATASKF takes all the actions needed to adapt and optimize the aplications for efficient operation in the DEISA environment Most demanded actions are: hyperscaling of parallel applications, data management and improved I/O, workflows. We had in 2005 53 Extreme Computing proposals. 29 projects were retained for operation in 2005-2006. Full information on DEISA Web server (www.deisa.org) after November 8, 2005.www.deisa.org
11
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 11 Extreme Computing proposals Bioinformatics 4 Biophysics 3 Astrophysics 11 Fluid Dynamics 6 Materials Sciences 11 Cosmology 3 Climate, Environment 5 Quantum Chemistry 5 Plasma Physics 2 QCD, Quantum computing 3 Profiles of applications in operation in 2005 – 2006 Huge parallel applications running in single remote nodes (dominant) Data Intensive applications of different kinds. Workflows (about 10%)
12
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 12 Hyperscaling in Plasma Physics Extreme Gyrokinetic Turbulence Simulations (related to ITER project) The nonlinear particle-in-cell code TORB uses a Monte Carlo particle approach to simulate the time evolution of turbulent field structures in fusion plasmas (J. Nuehrenberg, IPP, Greifswald & L. Villard, CRPP, Lausanne) Within DEISA, TORB has been improved for extreme scalability at IBM system at ECMWF: On 2048 procs: Speedup = 1680 Parallel efficiency = 82% Sustained performance = 1.3 TF 64 nodes = 2048 processors
13
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 13 DEISA and industrial users DEISA sites have today the possibility of selling resources to external industrial users. However, a number of other options are being examined: –The possibility of having priviledged access – and priviledged prices – to leading industrial research projects –The possibility of deploying a « compute on demand » service for occassional and exceptional requests coming from the industrial sector. Given the impressive amount of computational resources integrated in the DEISA environment, we have the capability of deploying this service withouth disrupting the national services. This is part of the current discussion about a model for DEISA as a sustained and persistent research infrastructure. Defining these policies require the participation of the national organizations that are funding the national HPC services integrated by DEISA DEISA has just created a DEISA Policy Committee, with representatives from the national funding bodies of the seven nations participating to the Consortium. The DEISA Policy Committee wil consider these issues in 2006.
14
Fourth EGEE Conference Pise, October 23 - 28, 2005 V. Alessandrini, IDRIS-CNRS 14 Conclusions DEISA adopts Grid technologies to integrate national supercomputing infrastructures, and to provide an European Supercomputing Service. Service activities are supported by the coordinated action of the national center's staffs. DEISA operates as a virtual European supercomputing centre. DEISA aims at deploying a persistent, basic European infrastructure for general purpose high performance computing. All the details of this infrastructure are designed with future shared European supercomputers in mind. Sooner or later, European nations will have to share investments in supercomputers to compete with the USA and Japan. DEISA expects to provide the infrastructure on which these shared computing platforms will operate. Interfaced with other grid-enabled complementary infrastructures, DEISA expects to contribute to a global European eInfrastructure for science and technology Integrating leading supercomputing platforms with Grid technologies may open the way to a new research dimension in Europe.
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.