Download presentation
Presentation is loading. Please wait.
Published byIra Houston Modified over 9 years ago
1
Sergey Belov, Tatiana Goloskokova, Vladimir Korenkov, Nikolay Kutovskiy, Danila Oleynik, Artem Petrosyan, Roman Semenov, Alexander Uzhinskiy LIT JINR The 5 th International Conference "Distributed Computing and Grid- technologies in Science and Education“ (GRID’2012) 20 July 2012, JINR, Dubna, Russia
2
RGN project Russian National Grid Network Development of grid-based networks for high performance computing To create a network infrastructure and connection to it of largest supercomputer centers, enterprises, high-tech industries and research organizations Computing power for high-priority industrial tasks and scientific projects 2010 – start of network backbone’s development 2011 – core grid services development, application’s interfaces, grid infrastructure prototype 2012 – further system development, more supercomputers, more applications 2/14
3
Grid and network infrastructure Main aim Integration of unique, medium and small supercomputers into a unified distributed computing environment Development of high-speed backbone network for the infrastructure Provide secure and clear remote access for users to these resources Heterogeneous grid environment (hardware, software) Oriented to parallel tasks rather than single batch tasks Workflow management Jobs consist of tasks Technical points is a set of grid services with unified interfaces partially based on Globus Toolkit 4, VOMS GSI based security model, mainly RESTful grid services 3/14
4
Core grid services Web User Interface Resource Broker/metascheduler + Workflow management (RESTful) Grid gateway to the resources (WS) Information Service (RESTful) Monitoring & accounting Registration service (RESTful) GSI services: CA, MyProxy, VOMS GridFTP servers Non-grid: applications licensing service, billing and audit, application solvers images management 4/14
5
Goals of monitoring Tracking state of sites and services Availability Real operational state Monitoring of user's jobs and tasks Keeping history on different system's parameters Information representation General infrastructure state in whole Running jobs and tasks Separate sites and services (real-time and history) Visualization of job events 5/14
6
Monitoring of resources and services State of computational resources by site (based on data from information indexes) Slots available for tasks Structure and properties of clusters Subclusters, nodes, slots, operation system, architecture Application software Supported VOs (with ACLs, Access Control Lists) Monitoring of jobs running on sites (by information from Job management servers) 6/14
7
Accounting and job monitoring Goal: to get information, both real-time and historical, on resources utilization and jobs running on RGN infrastructure (by users, VOs, sites) Information sources: JMS servers, Grid Gateways and local resources managers Collecting data on jobs and tasks in the system All jobs events timestamps, real consumed CPU time Accounting information reports in different views: by sites, VOs and single users Aggregation of actual job's execution time from all sites 7/14
8
Monitoring and accounting information flows 8/14 Monitoring and accounting data storage Information collector JMS Job management services Monitoring website Monitoring data provisioning (Web Services) Accounting Information publisher Infosys central Information index Registration service
9
Accounting and jobs monitoring screenshots 9/14
10
Resource centers on the map Main sites of the infrastructure prototype (2011) Other supercomputers are in process of connection and testing Will be more supercomputers from different Russian regions by the end of 2012 10 resource centers are already connected to high-speed VPN 10/14 http://grid.voskhod.ru
11
Infrastructure operation visualization with Google Earth 11/14 Visualization package from the ARDA Dashboard project was used
12
Registration service Service for registration of resources and grid services Main task is to store and provide information about resources and grid services in RGN Site contacts and other formal information Services’ endpoints, type, DN, etc. Resources operational state Registration and information modification procedures support Web-interface for users and RESTful interface for system services Authorization and authentication along with rights and roles of access to add, modify or access information
13
Registration service’s web interface
14
Conclusion and results Were developed architecture and software solution for monitoring, accounting and registration service for the Russian National Grid Network Experience of developing of the similar services within GridNNN project was very helpful and was heavily used Created tools were successfully deployed and are in production Russian National Grid Network project is currently developing, so further work on the services will be continued 14/14
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.