EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Towards scalable, semantic-based virtualized storage.

Slides:



Advertisements
Similar presentations
SLA-Oriented Resource Provisioning for Cloud Computing
Advertisements

Scientific Workflow Support in the PL-Grid Infrastructure with HyperFlow Bartosz Baliś, Tomasz Bartyński, Kamil Figiela, Maciej Malawski, Piotr Nowakowski,
CLOUD COMPUTING AN OVERVIEW & QUALITY OF SERVICE Hamzeh Khazaei University of Manitoba Department of Computer Science Jan 28, 2010.
Polish Infrastructure for Supporting Computational Science in the European Research Space EUROPEAN UNION Services and Operations in Polish NGI M. Radecki,
Polish Infrastructure for Supporting Computational Science in the European Research Space GridSpace Based Virtual Laboratory for PL-Grid Users Maciej Malawski,
Optimizing of data access using replication technique Renata Słota 1, Darin Nikolow 1,Łukasz Skitał 2, Jacek Kitowski 1,2 1 Institute of Computer Science.
CoreGRID Workpackage 5 Virtual Institute on Grid Information and Monitoring Services Authorizing Grid Resource Access and Consumption Erik Elmroth, Michał.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
The CrossGrid project Juha Alatalo Timo Koivusalo.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space User Oriented Provisioning of Secure Virtualized.
Next Generation Domain-Services in PL-Grid Infrastructure for Polish Science. Numerical Simulations of Metal Forming Production Processes and Cycles by.
Lecture Nine Database Planning, Design, and Administration
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
SYNAT - the Polish National Research Content Infrastructure Wojtek Sylwestrzak, ICM Tomasz Rosiek, ICM Tomasz Krassowski, ICM Tartu, Estonia June 27, 2012.
VAP What is a Virtual Application ? A virtual application is an application that has been optimized to run on virtual infrastructure. The application software.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Cracow Grid Workshop’10 Kraków, October 11-13,
WP6: Grid Authorization Service Review meeting in Berlin, March 8 th 2004 Marcin Adamski Michał Chmielewski Sergiusz Fonrobert Jarek Nabrzyski Tomasz Nowocień.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Ab initio grid chemical software ports – transferring.
Seamless Medical Image Processing on the Grid on the Example of Segmentation and Partition of the Airspaces Andrzej Rutkowski 1, Michał Chlebiej 1, Marcelina.
Polish Infrastructure for Supporting Computational Science in the European Research Space Policy Driven Data Management in PL-Grid Virtual Organizations.
PhD course - Milan, March /09/ Some additional words about cloud computing Lionel Brunie National Institute of Applied Science (INSA) LIRIS.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space The Capabilities of the GridSpace2 Experiment.
CGW 2003 Institute of Computer Science AGH Proposal of Adaptation of Legacy C/C++ Software to Grid Services Bartosz Baliś, Marian Bubak, Michał Węgiel,
Cracow Grid Workshop, October 15-17, 2007 Polish Grid Polish Grid: National Grid Initiative in Poland Jacek Kitowski Institute of Computer Science AGH-UST.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Polish Infrastructure for Supporting Computational Science in the European Research Space QoS provisioning for data-oriented applications in PL-Grid D.
CONTENTS Arrival Characters Definition Merits Chararterstics Workflows Wfms Workflow engine Workflows levels & categories.
Next Generation Domain-Services in PL-Grid Infrastructure for Polish Science Daniel Bachniak 1, Jakub Liput 2, Łukasz Rauch 1, Renata Słota 2,3, Jacek.
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
Trace Generation to Simulate Large Scale Distributed Application Olivier Dalle, Emiio P. ManciniMar. 8th, 2012.
Y. Kotani · F. Ino · K. Hagihara Springer Science + Business Media B.V Reporter: 李長霖.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
In each iteration macro model creates several micro modules, sends data to them and waits for the results. Using Akka Actors for Managing Iterations in.
A Proposal of Application Failure Detection and Recovery in the Grid Marian Bubak 1,2, Tomasz Szepieniec 2, Marcin Radecki 2 1 Institute of Computer Science,
GLOBAL GRID FORUM 10 Workflows in PROGRESS and GridLab environments Michał Kosiedowski.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
DataNet – Flexible Metadata Overlay over File Resources Daniel Harężlak 1, Marek Kasztelnik 1, Maciej Pawlik 1, Bartosz Wilk 1, Marian Bubak 1,2 1 ACC.
Laboratoire LIP6 The Gedeon Project: Data, Metadata and Databases Yves DENNEULIN LIG laboratory, Grenoble ACI MD.
Polish Infrastructure for Supporting Computational Science in the European Research Space FiVO/QStorMan: toolkit for supporting data-oriented applications.
AKOGRIMO Integration of Grid services with mobile technologies; validation in e-health, e-learning and disaster management areas CoreGRID European Grid.
LHCb Software Week November 2003 Gennady Kuznetsov Production Manager Tools (New Architecture)
Scalarm: Scalable Platform for Data Farming D. Król, Ł. Dutka, M. Wrzeszcz, B. Kryza, R. Słota and J. Kitowski ACC Cyfronet AGH KU KDM, Zakopane, 2013.
KUKDM’2011, Zakopane Semantic Based Storage QoS Management Methodology Renata Słota, Darin Nikolow, Jacek Kitowski Institute of Computer Science AGH-UST,
AN SLA-BASED RESOURCE VIRTUALIZATION APPROACH FOR ON-DEMAND SERVICE PROVISION Gabor Kecskemeti MTA SZTAKI International Workshop on Virtualization Technologies.
GridLab Resource Management System (GRMS) Jarek Nabrzyski GridLab Project Coordinator Poznań Supercomputing and.
Rafał Słota, Michał Wrzeszcz, Renata G. Słota, Łukasz Dutka, Jacek Kitowski ACC Cyfronet AGH Department of Computer Science, AGH - UST CGW 2015 Kraków,
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Operational Architecture of PL-Grid project M.Radecki,
Interactive European Grid Environment for HEP Application with Real Time Requirements Lukasz Dutka 1, Krzysztof Korcyl 2, Krzysztof Zielinski 1,3, Jacek.
Chapter 8 System Management Semester 2. Objectives  Evaluating an operating system  Cooperation among components  The role of memory, processor,
Parameter Sweep and Resources Scaling Automation in Scalarm Data Farming Platform J. Liput, M. Paciorek, M. Wrona, M. Orzechowski, R. Slota, and J. Kitowski.
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space The Capabilities of the GridSpace2 Experiment.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
Metadata Organization and Management for Globalization of Data Access with Michał Wrzeszcz, Krzysztof Trzepla, Rafał Słota, Konrad Zemek, Tomasz Lichoń,
Mobile Analyzer A Distributed Computing Platform Juho Karppinen Helsinki Institute of Physics Technology Program May 23th, 2002 Mobile.
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
ONEDATA Way to access to your Data at the global scale Lukasz Dutka, R. Slota, M. Wrzeszcz, D. Krol, L. Opiola, R. Slota, J. Kitowski ACK Cyfronet AGH.
Introduction to Performance Tuning Chia-heng Tu PAS Lab Summer Workshop 2009 June 30,
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
PL-Grid: Polish Infrastructure for Supporting Computational Science in the European Research Space 1 ESIF - The PLGrid Experience ACK Cyfronet AGH PL-Grid.
Piotr Bała, Marcin Radecki, Krzysztof Benedyczak
Introduction to Load Balancing:
DI4R Conference, September, 28-30, 2016, Krakow
University of Technology
Final Review 27th March Final Review 27th March 2019.
Presentation transcript:

EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Towards scalable, semantic-based virtualized storage resources provisioning Kornel Skałkowski, Renata Słota, Dariusz Król, Michał Orzechowski, Bartosz Kryza, Jacek Kitowski ACC Cyfronet AGH, Krakow, Poland KU KDM 2012 : fifth ACC Cyfronet AGH users' conference : Zakopane, March 07–09, 2012

2 Outline  Introduction  The QStorMan toolkit overview  The QStorMan toolkit architecture  QStorMan usage  Recent improvements  Current status of QStorMan  Test results  Future Work

Introduction  Data intensive applications and the 4 th science paradigm  Resources virtualization becomes ubiquitous  Storage resources virtualization is often provided by cluster file systems like Lustre  IT infrastructure users expect more and more computing and storage power as well as an appropriate QoS level 3

The QStorMan toolkit  Main goal is to provide virtualized storage resources with QoS warrianties for data intensive applications  Users can define QoS requirements concerning storage resources on three levels: application, user, virtual organization  Currently we support the following non-functional requirements:  Average Read/Write transfer rate,  Current Read/Write transfer rate,  Free capacity,  Result cachability – dedicated for application, which generates a large number of small files.  The toolkit consists of three components:  Knowledge base (GOM) which stores semantic descriptions concerning storage resources and synchronizes the descriptions with a grid middleware  Dedicated monitoring service (SMED) which performs continuous, real-time monitoring of virtualized storage resources with semantic support  Intelligent resources matching service (SES) which combines information obtained from the GOM and SMED services as well as advanced semantic support in order to perfectly match a virtualized resource from the resources mesh 4

The QStorMan toolkit architecture 5

QStorMan usage  Using system C library (libses-wrapper):  declare your non-functional requirements in the GOM knowledge base  export LD_PRELOAD= 2. Using C++ programming library (libses): #include using namespace lustre_api_library; LustreManager manager; StoragePolicy policy; policy.setAverageReadTransferRate(50); policy.setCapacity(100); int descriptor = manager.createFile(„nazwa_pliku.dat”, &policy); 6

Recent improvements  General purpose of the improvements is to provide a scalable, fully semantic-based solution for efficient provisioning of virtualized storage resources  SMED improvements:  Utilization of the enhanced C2MS storage resources semantic model for description of high-level QoS parameters  Application of semanatic reasoners on the monitoring level  SES improvements  Cache mechanism on demand – supporting large number of files generation  Automatic registration of users in knowledge base – decrease required administration effort  GOM improvements  Security enhancements  Scripts for administration 7

The QStorMan toolkit current status  Test installation is running at ACC Cyfronet AGH from over 1 year now  A lot of tests were performed and no major bugs were found  We have passed operational and security audits in PL-Grid succesfully  We now waiting for official deployment in ACC Cyfronet, PCSS Poznan, TASK Gdansk, and ICM Warsaw  Official tutorials, workshops and other material are on the way  Integrated with QoSCosGrid middleware from PCSS  We are willing to cooperate with anyone, who would like to test QStorMan in practice with an exisiting data intensive application 8

Test description Synthetic test  The toolkit evaluation was performed by simulation of 8 users which were executing their applications on the Grid infrastructure  3 users used the QStorMan toolkit during the applications execution, the others used plain Lustre file system  Every user periodically saved and read a 60 GB file with random sleep periods between the succeeding operations (10 reads and 10 writes)  Users started their applications with random delays in order to simulate real conditions in a Grid environment Test with real user’s application  Simulation of sound wave propagation inside human head  Out-of-core computations  No source code modifications  5 instances of application running in parallel in order to generate enough load for storage system 9

Synthethic test results 10 12% speedup between two fastest applications 26% speedup on average (~7:20 h vs ~10 h) No source code modification

Real user’s application test result 11 15% speedup on average Running on production infrastructure No source code modification

Future work  Support for domain-oriented virtualized computing environments  Implementation of new storage resources selection strategies  Orientation toward Cloud computing environments  Dissemination and exploitation among possible users 12

Questions? 13