TAT CrossGrid Yearly Review, Brussels, March 12, 2003 CrossGrid After the First Year: A Technical Overview Marian Bubak, Maciej Malawski, and Katarzyna Zając X# TAT Institute of Computer Science & ACC CYFRONET AGH, Kraków, Poland
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Main Objectives A new category of Grid-enabled applications Compute- and data-intensive distributed near real-time response (person in a loop) layered New programming tools Grid more user-friendly, secure and efficient Interoperability with other Grids Implementation of standards
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 CrossGrid in a Nutshell Interactive, Compute and Data Intensive Applications I nteractive simulation and visualization of a biomedical system Flooding crisis team support Distributed data analysis in HEP Weather forecasting and air pollution modeling Tool Environment MPI code debugging and verification Metrics and benchmarks Interactive and semiautomatic performance evaluation tools New Generic Grid Services Globus Middleware Fabric DataGrid Services Portals and roaming access Scheduling agents Application and Grid monitoring Optimization of data access Application Specific Services User Interactive Services Grid Visualization Kernel
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Key Features of CG Applications Data Data generators and databases geographically distributed Selected on demand Processing Interactive Requires large processing capacity; both HPC & HTC Presentation Complex data requires versatile 3D visualisation Support interaction and feedback to other components
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Biomedical Application Adding small modifications to the proposed structure results in immediate changes in the blood flow. Online presentation of simulation results via a 3D environment. The progress of the simulation and the estimated time of convergence should be available for inspection. LB flow simulation VE WD PC PDA Visualization Interaction
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Basic Characteristics of Flood Simulation Meteorological Intensive simulation (HPC), large input/output data sets, high availability of resources Hydrological Parametric simulations (HTC) may require different models (heterogeneous simulations) Hydraulic Many 1-D simulations HTC, 2-D hydraulic simulations require HPC Data sources Meteorological simulations Hydraulic simulations Hydrological simulations Users Output visualization
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Distributed Data Analysis in HEP Objectives Distributed data access Distributed data mining techniques with neural networks Issues Typical interactive requests will run on o(TB) of distributed data Transfer/replication times for the whole data on the order of one hour Data transfers once and in advance of the interactive session. Allocation, installation and setup the corresponding database servers before the interactive session starts Replica Manager Interactive Session Resource Broker DISTRIBUTED PROCESSING DBInstallation Interactive Session Database server Interactive Session Manager Interactive Session Worker Interactive Session Worker Interactive Session Worker Interactive Session Worker Interactive Session Worker Portal XML in/out On-line output
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Weather Forecasting and Air Pollution Modeling Distributed/parallel code on Grid Coupled Ocean/Atmosphere Mesoscale Prediction System STEM-II Air Pollution Code Integration of distributed databases Data mining applied to downscaling weather forecasts
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Supporting Tools 1.4 Meteo Pollution 1.4 Meteo Pollution 3.1 Portal & Migrating Desktop Applications Development Support 2.4 Performance Analysis 2.4 Performance Analysis 2.2 MPI Verification 2.3 Metrics and Benchmarks 2.3 Metrics and Benchmarks App. Spec Services 1.1 Grid Visualisation Kernel 1.3 Data Mining on Grid (NN) 1.3 Data Mining on Grid (NN) 1.3 Interactive Distributed Data Access 3.1 Roaming Access 3.1 Roaming Access 3.2 Scheduling Agents 3.2 Scheduling Agents 3.3 Grid Monitoring 3.3 Grid Monitoring MPICH-G Fabric 1.1, 1.2 HLA and others 3.4 Optimization of Grid Data Access 3.4 Optimization of Grid Data Access 1.2 Flooding 1.2 Flooding 1.1 BioMed 1.1 BioMed Applications Generic Services 1.3 Interactive Session Services 1.3 Interactive Session Services GRAM GSI Replica Catalog GIS / MDS GridFTP Globus-IO DataGrid Replica Manager DataGrid Replica Manager DataGrid Job Submission Service Resource Manager (CE) Resource Manager (CE) CPU Resource Manager Resource Manager Resource Manager (SE) Resource Manager (SE) Secondary Storage Resource Manager Resource Manager Instruments ( Satelites, Radars) Instruments ( Satelites, Radars) 3.4 Optimization of Local Data Access 3.4 Optimization of Local Data Access Tertiary Storage Replica Catalog Globus Replica Manager Globus Replica Manager 1.1 User Interaction Services 1.1 User Interaction Services Initial version of X# architecture
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Project Phases M 1 - 3: requirements definition and merging M : first development phase: design, 1st prototypes, refinement of requirements M : second development phase: integration of components, 2nd prototypes M : third development phase: complete integration, final code versions M : final phase: demonstration and documentation
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Tools MPI code debugging and verification Metrics and benchmarks for the Grid environment Grid-enabled Performance Measurement Performance Prediction Component Grid Monitoring Performance Prediction Component High Level Analysis Component User Interface and Visualization Component Performance Measurement Component Applications executing on Grid testbed Application source code G-PM RMD PMD MPI Verification MARMOT Benchmarks
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 MPI Verification verifies the correctness of parallel, distributed Grid applications (MPI) technical basis: MPI profiling interface which allows a detailed analysis of the MPI application Core Tool Application or Test Tool MPI Additional Process (Debug Server) Client Side Profiling Interface Server Side
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Benchmark Categories Micro-benchmarks For identifying basic performance properties of Grid services, sites, and constellations Micro-kernels Generic HPC/HTC kernels, including general and often- used kernels in Grid environments Application kernels Characteristic of representative CG applications Portal gbView gbARC gbControl gbRMP Grid Bench suite SE storage Embedding Retrieval Invocation Invocation/ Collection through GPM Direct Invocation Storage/ Retrieval
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Performance Measurement Tool G-PM Components: performance measurement component (PMC), component for high-level analysis (HLAC), component for performance prediction (PPC) based on analytical performance models of application kernels, user interface and visualization component UIVC. Interface Measurement OCM-G Interface UIVC HLAC OCM-G PMC
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 User Interactive Service Interaction GidService RTIExec GridService Simulation GridService Registry OGSA WSDL RTI Tuple Space functionality description +Dynamic discovery of OGSA Services Large On-line Data transferShort Messages and Events GridFTPSOAP/IIOP TCP or UDP/IP Visualisation GridService enables end users to run distributed simulations in the Grid environment and to steer those simulations in near real time uses OGSA mechanisms to call external resource brokers, job submission services (efficient and transparent execution of the simulation on the Grid).
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Grid Visualization Kernel addresses the problems of distributed visualization on heterogeneous devices allows easily and transparently interconnect Grid applications with existing visualisation tools (AVS, OpenDX, VTK,...) handles multiple concurrent input data streams multiplexes compressed data and images efficiently across long- distance networks GVK Portal Server GVK Visualization Planner Simulation Init Visualization Update Visualization GRAMGASSMDS GVK Visualization pipeline Simulation Data
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 New Grid Services Portals and roaming access Grid resource management Grid monitoring Optimization of data access
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Roaming Access – Current Design Web Browser Line Roaming Access Server LDAP DataBase Manager Application Portal Server Desktop Portal Server Web Browser Benchmarks Portal - easier access and use of the Grid by applications Migrating Desktop - a transparent, independent user environment Roaming Access Server - responsible for managing user profiles, job submission, file transfers and Grid monitoring Command Replica Agent Scheduling
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Scheduling Agents - Current Design JSS commands Job monitoring Scheduling Agent Web Portal JSS / CondorG Resource list Logging & Bookkeping Resource Broker CE scheduling user jobs over the CrossGrid testbed infrastructure, submition based on Condor-G, support for sequential and MPI parallel jobs, batch jobs and interactive jobs, priorities and preferences determined by the user for each job
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Application Monitoring OCM-G Components Service Managers Local Monitors Application processes Tool(s) External name service Component discovery ServiceManager LocalMonitor Tool SharedMemory OMIS ExternalLocalization ApplicationProcess
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Jiro Services Information DB System Non-invasive Monitoring MDS Globus MDS info Jiro info Instruments Infrastructure Static info Performance Information Post-processing Infrastructure monitoring Invasive monitoring (based on Jiro technology) Non-invasive monitoring (Santa-G) Infrastructure Monitoring
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Selection of specialized components best suited for data access operations Estimation of data access latency and bandwidth inside the storage elements Faster access to large tape-resident through fragmentation Data Access Design
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Generic Services Application Specific Services Supporting Tools Applications Current status of CG Architecture
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Application-centric view
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 The Current Testbed The current CrossGrid testbed is based on: EDG distribution release and (production) EDG distribution release (validation) The current infrastructure permits: installation of initial prototypes of CrossGrid software releases (described in M12 Deliverables) testing applications using: Globus and EDG middleware MPI achieving compatibility with DataGrid and therefore extending Grid coverage in Europe
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Grid Service Transient, stateful Web Service (created dynamically) Described by WSDL Identified by Grid Service Handle (GSH) in the form of URI Can be queried for configuration and state in standard way – Service Data mechanism
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Why use OGSA Standards „to be part of the Grid = to implement OGSA Grid protocols” Interoperability in heterogeneous environments Possible contribution to future Grid activities
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Grid Services – where? Dynamic service creation and lifetime management to control the state of some process, e.g.: user session in a portal data transfer running simulation. Service data model can be applied to monitoring systems that can be used as information providers for other services. Service discovery – to solve the bootstrap problem: to connect the modules of a distributed simulation to connect the application to a monitoring system
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Steps towards OGSA Using Web Service interfaces and XML where possible Experimenting with prototyping services using OGSA alpha releases Applying Grid Service extensions to services Solving GT2 - GT3 transition and compatibility issues
TAT CrossGrid Yearly Review, Brussels, March 12, 2003 Summary Achievements of the first project year : Software Requirements Specifications together with use cases written CrossGrid Architecture defined Detailed Design documents for tools and new Grid services (OO approach, UML) written First prototype of software running and documented Detailed description of the test and integration procedures created Testbed set up