Production Grid Challenges in Hungary Péter Stefán Ferenc Szalai Gábor Vitéz NIIF/HUNGARNET.

Slides:



Advertisements
Similar presentations
Challenges for Interactive Grids a point of view from Int.Eu.Grid project Remote Instrumentation Services in Grid Environment RISGE BoF Manchester 8th.
Advertisements

Welcome to Middleware Joseph Amrithraj
11 DICOM Image Communication in Globus-Based Medical Grids Michal Vossberg, Thomas Tolxdorff, Associate Member, IEEE, and Dagmar Krefting Ting-Wei, Chen.
Computational Steering on the GRID Using a 3D model to Interact with a Large Scale Distributed Simulation in Real-Time Michael.
Workload Management Massimo Sgaravatto INFN Padova.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Peter Stefan, NIIF 29 June, 2007, Amsterdam, The Netherlands NIIF Storage Services Collaboration on Storage Services.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
Sergey Belov, Tatiana Goloskokova, Vladimir Korenkov, Nikolay Kutovskiy, Danila Oleynik, Artem Petrosyan, Roman Semenov, Alexander Uzhinskiy LIT JINR The.
Terminal Services in Windows Server ® 2008 Infrastructure Planning and Design.
Chapter 1: Hierarchical Network Design
A.V. Bogdanov Private cloud vs personal supercomputer.
1 18 April 2002 Recent developments of the Hungarian Academic and Research Network István Tétényi dr., Hungarnet/NIIF, Hungary
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
1 Copyright © 2004, Oracle. All rights reserved. Introduction to Oracle Forms Developer and Oracle Forms Services.
Grid Data Management A network of computers forming prototype grids currently operate across Britain and the rest of the world, working on the data challenges.
INFSO-RI Enabling Grids for E-sciencE EGEODE VO « Expanding GEosciences On DEmand » Geocluster©: Generic Seismic Processing Platform.
Computer and Automation Research Institute Hungarian Academy of Sciences Automatic checkpoint of CONDOR-PVM applications by P-GRADE Jozsef Kovacs, Peter.
A short introduction to the Worldwide LHC Computing Grid Maarten Litmaath (CERN)
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
Connect. Communicate. Collaborate BANDWIDTH-ON-DEMAND SYSTEM CASE-STUDY BASED ON GN2 PROJECT EXPERIENCES Radosław Krzywania (speaker) PSNC Mauro Campanella.
Grid Middleware Tutorial / Grid Technologies IntroSlide 1 /14 Grid Technologies Intro Ivan Degtyarenko ivan.degtyarenko dog csc dot fi CSC – The Finnish.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Enabling the Future Service-Oriented Internet (EFSOI 2008) Supporting end-to-end resource virtualization for Web 2.0 applications using Service Oriented.
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
NOVA A Networked Object-Based EnVironment for Analysis “Framework Components for Distributed Computing” Pavel Nevski, Sasha Vanyashin, Torre Wenaus US.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
Easy Access to Grid infrastructures Dr. Harald Kornmayer (NEC Laboratories Europe) Dr. Mathias Stuempert (KIT-SCC, Karlsruhe) EGEE User Forum 2008 Clermont-Ferrand,
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
How can Computer Networks Support National Grid Initiatives? Tamás Máray Péter Stefán NIIF/HUNGARNET.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Peer-to-Peer Systems: An Overview Hongyu Li. Outline  Introduction  Characteristics of P2P  Algorithms  P2P Applications  Conclusion.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
The Hungarian ClusterGRID Project Péter Stefán research associate NIIF/HUNGARNET
Distributed Physics Analysis Past, Present, and Future Kaushik De University of Texas at Arlington (ATLAS & D0 Collaborations) ICHEP’06, Moscow July 29,
EGEE is a project funded by the European Union under contract IST JRA4 Overview Javier Orellana JRA4 Coordinator EGEE Kick Off Meeting SA2.
The Globus Toolkit The Globus project was started by Ian Foster and Carl Kesselman from Argonne National Labs and USC respectively. The Globus toolkit.
The European KnowARC Project Péter Stefán, NIIF/HUNGARNET/KnowARC TNC2009, June 2009, Malaga.
Seminar On Rain Technology
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Presented by Edith Ngai MPhil Term 3 Presentation
Accessing the VI-SEEM infrastructure
New Paradigms: Clouds, Virtualization and Co.
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING CLOUD COMPUTING
(Prague, March 2009) Andrey Y Shevel
Grid Optical Burst Switched Networks
The Open Grid Service Architecture (OGSA) Standard for Grid Computing
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Globus —— Toolkits for Grid Computing
Introduction to Data Management in EGI
Similarities between Grid-enabled Medical and Engineering Applications
Grid Computing.
University of Technology
The Anatomy and The Physiology of the Grid
The Anatomy and The Physiology of the Grid
Introduction To Distributed Systems
Presentation transcript:

Production Grid Challenges in Hungary Péter Stefán Ferenc Szalai Gábor Vitéz NIIF/HUNGARNET

Agenda Brief introduction Grid initiatives - ClusterGrid Challenges in a production environment Generic ClusterGrid operation model Management issues User support Monitoring ClusterGrid future challenges Conclusions

computing infrastructure networking infrastructure middleware infrastructure collaborative infrastructure Brief NIIF Introduction Hungarian NREN. GRID supercomputing IP, IPv6, MPLS, lambda etc Videoconference, central HA cluster VPNs, VoIP, directory service 10G backbone, ~ users, ~750 institutions

Supercomputers Consists of 2 SUN E15Ks and 2 SUN 10Ks located at two universities, including 276 CPUs, 300 GB of memory. Used to be in the top 500. In production since Serves more than 200 users, and 100 scientific projects.

Hungarian grid initiatives, MGKK Hungarian grid initiatives can be classified into grid infrastructure and grid system development projects. Key role-players formulate grid collaboration: Hungarian Grid Competence Center (MGKK) involving BUTE, ELUB, MTA-SZTAKI, NIIF/HUNGARNET, KFKI, University of Veszprém. Intensive participation in many national and European grid initiatives: EGEE, NorduGrid, SEE- GRID, etc.

ClusterGrid initiative It is a pool of 1400 PC nodes throughout the country involving more than 26 clusters. Production infrastructure since July Supercomputer clusters are planned to be involved too. A rough measurement on the total compute capacity is about 600 Gflops. Even though it is much smaller than regional, continental grids, in complexity it is at the same range.

Challenges in production environment Grid definition - set clear objectives what to build Simplicity - keep the system transparent, usable Completeness - cover not only application level Security - using computer networking methods (MPLS, VLAN technologies) Compatibility - other grids (X509, LDAP) Manageability - easy-to-maintain Robustness - fault tolerant behavior Usability - cover many job classes, user support Platform independency - to be able to execute on MS

ClusterGrid architecture

Some new ideas… MPLS, VLAN connected resources Web-transaction based resource broker Dynamic, separated run-time environment

Generic production service model

Challenges in production … cont’d … Management  physical compute resources (supercomputers, clusters),  virtual resources (virtual clusters),  storage nodes,  users,  services User support Grid architecture monitoring

Compute-cluster management

Virtual compute-cluster management

Storage management Low level management of disks and volumes, file systems (cost efficient storage solutions by using ATA over Ethernet - AoE). Medium level access management (gridFTP, FTPS). High level data brokering (extended SRM model).

User management User personal data is kept in an LDAP based directory service separately from authentication data. Aided by a web registration interface. Authentication:  X509 certificates,  LDAP based authentication. No authorization yet.

Service management (experimental) Relatively new direction. It is a special service. It is based on well- established authorization. Basically helps to start, stop, (re)configure grid services.

User support Grid service provider gives user support covering:  consultation about the benefits of grid usage,  code porting and optimization,  partial aid in code implementation,  job formation and execution,  generic grid usage. Not yet covered:  model creation,  formal description,  algorithm creation.

ClusterGrid monitoring Fluctuation of grid cluster resources between the day-shift and night-shift operation. Blue line – total; Green area – occupied. 2-layer hierarchical monitoring system.

ClusterGrid monitoring

Future ClusterGrid (?) challenges Continuously growing demands for reliable compute and data storage infrastructure. Grid systems should conform to international standards and MUST interoperate with one another. Platform-independency is not an issue yet, but will be. LEGO-based principles are of increasing importance. Threats: solutions that prevent development; erosion of the belief in the power of “grid”.

Conclusions One of the first production-level grids have been shown in a nutshell. With special emphasis on operation, management and user support issues. Management generally covers grid resource, grid user management and monitoring. Some remarks regarding future development were also done.

Thanks for your attention!