FutureGrid Image Repository: A Generic Catalog and Storage System for Heterogeneous Virtual Machine Images Javier Diaz, Gregor von Laszewski, Fugang Wang,

Slides:



Advertisements
Similar presentations
Overview of the FutureGrid Software
Advertisements

Creating HIPAA-Compliant Medical Data Applications with Amazon Web Services Presented by, Tulika Srivastava Purdue University.
FutureGrid related presentations at TG and OGF Sun. 17th: Introduction to FutireGrid (OGF) Mon. 18th: Introducing to FutureGrid (TG) Tue. 19th –Educational.
Amazon Web Services and Eucalyptus
Management Framework for Amazon EC2 Speaker: Frank Bitzer
Network Management Overview IACT 918 July 2004 Gene Awyzio SITACS University of Wollongong.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
What is Cloud Computing? o Cloud computing:- is a style of computing in which dynamically scalable and often virtualized resources are provided as a service.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
Chapter 8: Network Operating Systems and Windows Server 2003-Based Networking Network+ Guide to Networks Third Edition.
Jefferson Ridgeway 2, Ifeanyi Rowland Onyenweaku 3, Gregor von Laszewski 1*, Fugang Wang 1 1* Indiana University, Bloomington, IN 47408, U.S.A.,
Presented by Sujit Tilak. Evolution of Client/Server Architecture Clients & Server on different computer systems Local Area Network for Server and Client.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 7 Configuring File Services in Windows Server 2008.
An Introduction to DuraCloud Carissa Smith, Partner Specialist Michele Kimpton, Project Director Bill Branan, Lead Software Developer Andrew Woods, Lead.
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
Design Discussion Rain: Dynamically Provisioning Clouds within FutureGrid Geoffrey Fox, Andrew J. Younge, Gregor von Laszewski, Archit Kulshrestha, Fugang.
Introduction to Cloud Computing
Opensource for Cloud Deployments – Risk – Reward – Reality
Eucalyptus on FutureGrid: A case for Eucalyptus 3 Sharif Islam, Javier Diaz, Geoffrey Fox Gregor von Laszewski Indiana University.
Cyberaide Virtual Appliance: On-demand Deploying Middleware for Cyberinfrastructure Tobias Kurze, Lizhe Wang, Gregor von Laszewski, Jie Tao, Marcel Kunze,
By Mihir Joshi Nikhil Dixit Limaye Pallavi Bhide Payal Godse.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Database Technical Session By: Prof. Adarsh Patel.
Science Clouds and FutureGrid’s Perspective June Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox
Hands-On Microsoft Windows Server 2008 Chapter 5 Configuring, Managing, and Troubleshooting Resource Access.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
Engr. M. Fahad Khan Lecturer Software Engineering Department University Of Engineering & Technology Taxila.
FutureGrid Dynamic Provisioning Experiments including Hadoop Fugang Wang, Archit Kulshrestha, Gregory G. Pike, Gregor von Laszewski, Geoffrey C. Fox.
By: Ashish Gohel 8 th sem ISE.. Why Cloud Computing ? Cloud Computing platforms provides easy access to a company’s high-performance computing and storage.
Image Management and Rain on FutureGrid: A practical Example Presented by Javier Diaz, Fugang Wang, Gregor von Laszewski.
FutureGrid Connection to Comet Testbed and On Ramp as a Service Geoffrey Fox Indiana University Infra structure.
Image Generation and Management on FutureGrid CTS Conference 2011 Philadelphia May Geoffrey Fox
Image Management and Rain on FutureGrid Javier Diaz - Fugang Wang – Gregor von.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Magellan: Experiences from a Science Cloud Lavanya Ramakrishnan.
FutureGrid Cyberinfrastructure for Computational Research.
DataNet – Flexible Metadata Overlay over File Resources Daniel Harężlak 1, Marek Kasztelnik 1, Maciej Pawlik 1, Bartosz Wilk 1, Marian Bubak 1,2 1 ACC.
RAIN: A system to Dynamically Generate & Provision Images on Bare Metal by Application Users Presented by Gregor von Laszewski Authors: Javier Diaz, Gregor.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
Amit Warke Jerry Philip Lateef Yusuf Supraja Narasimhan Back2Cloud: Remote Backup Service.
Design Discussion Rain: Dynamically Provisioning Clouds within FutureGrid PI: Geoffrey Fox*, CoPIs: Kate Keahey +, Warren Smith -, Jose Fortes #, Andrew.
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
Computing Research Testbeds as a Service: Supporting large scale Experiments and Testing SC12 Birds of a Feather November.
Recipes for Success with Big Data using FutureGrid Cloudmesh SDSC Exhibit Booth New Orleans Convention Center November Geoffrey Fox, Gregor von.
Aneka Cloud ApplicationPlatform. Introduction Aneka consists of a scalable cloud middleware that can be deployed on top of heterogeneous computing resources.
Vignesh Ravindran Sankarbala Manoharan. Infrastructure As A Service (IAAS) is a model that is used to deliver a platform virtualization environment with.
Directions in eScience Interoperability and Science Clouds June Interoperability in Action – Standards Implementation.
Grappling Cloud Infrastructure Services with a Generic Image Repository Javier Diaz Andrew J. Younge, Gregor von Laszewski, Fugang.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
PLATFORM TO EASE THE DEPLOYMENT AND IMPROVE THE AVAILABILITY OF TRENCADIS INFRASTRUCTURE IberGrid 2013 Miguel Caballer GRyCAP – I3M - UPV.
IT 5433 LM1. Learning Objectives Understand key terms in database Explain file processing systems List parts of a database environment Explain types of.
WP5 – Infrastructure Operations Test and Production Infrastructures StratusLab kick-off meeting June 2010, Orsay, France GRNET.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
PaaS services for Computing and Storage
Course: Cluster, grid and cloud computing systems Course author: Prof
Platform as a Service (PaaS)
Self Healing and Dynamic Construction Framework:
Bridges and Clouds Sergiu Sanielevici, PSC Director of User Support for Scientific Applications October 12, 2017 © 2017 Pittsburgh Supercomputing Center.
Introduction to Data Management in EGI
Javier Diaz, Gregor von Laszewski, Fugang Wang and Geoffrey Fox
University of Technology
FutureGrid Computing Testbed as a Service
Versatile HPC: Comet Virtual Clusters for the Long Tail of Science SC17 Denver Colorado Comet Virtualization Team: Trevor Cooper, Dmitry Mishin, Christopher.
1Indiana University, 2now Rutgers University
Gregor von Laszewski Indiana University
Harrison Howell CSCE 824 Dr. Farkas
Presentation transcript:

FutureGrid Image Repository: A Generic Catalog and Storage System for Heterogeneous Virtual Machine Images Javier Diaz, Gregor von Laszewski, Fugang Wang, Andrew Younge, Geoffrey Fox Apply at

Index Motivation Requirements, Design, Implementation Methodology Results Conclusions Outgoing Work Apply at

Motivation FutureGrid is an experimental cloud and grid testbed – We support HPC, Grid, and Cloud frameworks and services Much interest by the community is in the offered frameworks and services are based on virtualization technologies or make use of them Apply: Image management becomes a key issue Generic catalog and repository of images that will be able to interact with other FG subsystems and potentially with other infrastructures Create and maintain platforms within custom FG images that can be retrieved, deployed and provisioned on demand Apply at

FutureGrid Offerings (currently) IaaS – Nimbus – OpenStack – Eucalyptus PaaS – Hadoop – SAGA – … HPC – MPI – OpenMP – ScaleMP – Vampir Grid – Genesis II – Unicore – (Globus) – … Apply at 4 RAIN (ACL) – Repository – Initialization – Provisioning – (Management)

Index Motivation Requirements, Design, Implementation Methodology Results Conclusions Outgoing Work Apply at

Requirements Four group of users considered – A single user. Users create images that are part of experiments they conduct on FG – A group of users that work together in the same project and share the images – System administrators. They maintain the image repository ensuring backups and preserving space. They also may use it for the distribution of the HPC image that is accessible by default. – FG services and subsystems Requirements include: – A simple, intuitive and user friendly environment – A unified, extensible and integrated system design to manage various types of images for different systems – Built in fault tolerance with proper accounting and information tools – The ability to be integrated with the FG security. Apply at

Design Integrated service that enables storing and organizing images from multiple cloud efforts in the same repository Images are augmented with metadata to describe their properties like the software stack installed or the OS Access to the images can be restricted to single users, groups of users or system administrators Maintains data related with the usage to assist performance monitoring and accounting Apply at

Design (II) Quota management to avoid space restrictions Pedigree to recreate image on demand Repository’s interfaces: API's, a command line, an interactive shell, and a REST service Other cloud frameworks could integrate with this image repository by accessing it through an standard API Apply at

Design (III) Apply at

Implementation Client-Server architecture Support up to four different storage: – MySQL where the image files are stored directly in the POSIX file system – MongoDB where both data and files are stored in the NoSQL database – OpenStack Object Store (Swift) – Cumulus (Nimbus project) Increase interoperability and provide templates to help community to create their own storage plugins Apply at

User Management and Authentication Users have to authenticate to get access Access based on roles and project/group memberships FG account management services can provide needed metadata on project memberships and roles Authentication based on LDAP Apply at

Image Metadata Apply at 12 Fields with Asterisks (*) can be modified by users

Image Management Users upload the image and specify the metadata Modifications to the metadata can be accomplished by the owner of an image Repository can be queried by using a simplified SQL- style syntax Support accounting services – Number of times that an image has been requested – Last time that an image was accessed – Number of images registered by each user – Disk space used by each user Triggers that react upon certain conditions associated with the metadata Apply at

Index Motivation Requirements, Design, Implementation Methodology Results Conclusions Outgoing Work Apply at

Experiment Methodology Evaluate all these storage back-ends for the image repository Seven configurations: – Cumulus+MongoDB (Cumu+Mo) – Cumulus+MySQL (Cumu+My) – Filesystem+MySQL (Fs+My) – MongoDB with Replication (Mo+Mo) – MongoDB with No Replication (MoNR+MoNR) – Swift+MongoDB (Swi+Mo) – Swift+MySQL (Swi+My) Five different image sizes: 50MB, 300MB, 500MB, 1GB and 2GB Test read and write performance using a single client Test 16 clients retrieving images concurrently Apply at

Index Motivation Requirements, Design, Implementation Methodology Results Conclusions Outgoing Work Apply at

Upload Images Apply at 17 * done using the command line tool instead of the Python API

Retrieve Images Apply at

Retrieve Images using 16 client concurrently Apply at

Index Motivation Requirements, Design, Implementation Methodology Results Conclusions Outgoing Work Apply at

Conclusions We have introduced the FutureGrid Image Repository and presented a functional prototype that implements most of the designed features Key aspect of this image repository is the ability to provide a unique and common interface to manage any kind of image Flexible design to be integrated with FG and other frameworks Apply at

Conclusions (Storage Backend) MongoDB performance problems and high memory usage Swift too many errors Cumulus missing fault tolerance/scalability Candidates to be our default storage system: – Cumulus because is still quite fast and reliable – Swift because has a good architecture to provide fault tolerance and scalability Apply at

Index Motivation Requirements, Design, Implementation Methodology Results Conclusions Outgoing Work Apply at

Ongoing Work Development of Rest API Integration with the rest of the image management components Compatibility with the Open Virtualization Format (OVF) to describe the images. Apply at

Thank for your attention! Contact info: Gregor Laszewski: Javier Diaz: Fugang Wang: Apply at