Deploying BiobankCloud with Karamel/Chef and Federated Authentication in BiobankCloud
Jim Dowling, KTH – Royal Institute of Technology

Presentation transcript:


Bringing together Biobankers & Bioinformaticians
Biobankers
-NGS data producers: collections, samples
-Non-programmers
Bioinformaticians
-NGS data analysts
-Programmers: Python, R, Matlab, scripts
IT Admins
Diagram labels: Hops-HDFS, Hops-YARN, Web Application, CharonFS, Cuneiform/HiWAY, Karamel

BBC platforms have lots of different stacks
Configured stacks include apps, dependencies, and firewalls.
-ResourceMgr, NN: ssh, agent, chef, collectd
-NodeMgr, DN: ssh, agent, chef, collectd
-MYSQLD, MGMD: ssh, agent, chef, collectd
-NDBD: ssh, agent, chef, collectd

Plus the Frontend Stack
Diagram labels: REST APIs, Glassfish, collectd-server, AngularJS Frontend, Web Application

How do we deploy our PaaS?
Diagram labels: NDBD, MGMD, Works, NN, DN, Data Center

Deployment on AWS from Karamel using Ssh/Chef
Diagram labels: NDBD, MGMD, NN, DN, DashB, NN, Public Cloud (AWS)

Deployment when few public IPs available
Diagram labels: NDBD, MGMD, NN, DN, Karamel, NN, Private Cloud (OpenStack)

How do we install the software on the VMs?

Karamel Layers
-Google Compute Engine Virtual Machines
-Software is defined in Chef
-Chef Cookbooks are publicly available on GitHub

Ssh and Chef Solo to Install software
-Karamel runs Recipes over ssh
-Chef Recipes are infrastructure as code: idempotent & composable
-Dependency Injection of Attributes (No Server or agents)
Stacks shown:
-ResourceMgr, NN: ssh, agent, chef, collectd
-MYSQLD, MGMD: ssh, agent, chef, collectd
-NodeMgr, DN: ssh, agent, chef, collectd
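The "idempotent & composable" property on this slide can be illustrated with a small Python sketch. It is not Chef or Karamel code: the `ensure`, `install_java`, `install_hadoop`, and `converge` names are hypothetical, node state is modeled as a plain dictionary, and the version strings are placeholders. Attributes are passed into each recipe from outside, which loosely mirrors the injection of attributes mentioned above.

```python
from typing import Callable, Dict, List

State = Dict[str, str]
Attrs = Dict[str, str]
Recipe = Callable[[State, Attrs], int]


def ensure(state: State, key: str, value: str) -> int:
    """Converge one setting; return 1 if it changed, 0 if already correct."""
    if state.get(key) == value:
        return 0
    state[key] = value
    return 1


# Hypothetical recipes: each one only declares the desired end state.
def install_java(state: State, attrs: Attrs) -> int:
    return ensure(state, "java", attrs["java_version"])


def install_hadoop(state: State, attrs: Attrs) -> int:
    return ensure(state, "hadoop", attrs["hadoop_version"])


def converge(state: State, run_list: List[Recipe], attrs: Attrs) -> int:
    """Apply recipes in order (composition) and count the changes made."""
    return sum(recipe(state, attrs) for recipe in run_list)


node: State = {}
# Placeholder attribute values, injected from outside the recipes.
attrs: Attrs = {"java_version": "example-java", "hadoop_version": "example-hadoop"}
run_list: List[Recipe] = [install_java, install_hadoop]

first = converge(node, run_list, attrs)   # changes the node
second = converge(node, run_list, attrs)  # nothing left to change
print(first, second)
```

Running the same run list a second time makes no further changes, which is what makes retrying a failed or interrupted install safe.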

Karamel Stack
Diagram labels: Karamel API (YAML); Chef; JClouds, ssh; AWS, OpenStack, GCE, EGI, Bare Metal; Create VMs; VMIs, BitTorrent (Reduce Install Times)

Cluster Definition in Karamel

    name: ApacheHadoopV2
    ec2:
      type: m3.medium
      region: eu-west-1
    cookbooks:
      hadoop:
        github: "hopshadoop/apache-hadoop-chef"
        version: "v0.1"
    attrs:
      hadoop:
        version:
    groups:
      metadata:
        size: 1
        recipes:
          - hadoop::nn
          - hadoop::rm
          - hadoop::jhs
      datanodes:
        size: 2
        recipes:
          - hadoop::dn
          - hadoop::nm
    egi:
      XX: ??
      YY: ??
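To show how such a definition maps onto machines, the sketch below mirrors the groups of the cluster definition above as a Python dictionary and expands them into per-VM recipe runs. It is only an illustration: the `total_vms` and `recipe_runs` helpers and the `group-index` VM naming are assumptions, not part of Karamel.

```python
from typing import Dict, List, Tuple

# The groups from the cluster definition, written as a plain dictionary.
cluster: Dict = {
    "name": "ApacheHadoopV2",
    "ec2": {"type": "m3.medium", "region": "eu-west-1"},
    "groups": {
        "metadata": {
            "size": 1,
            "recipes": ["hadoop::nn", "hadoop::rm", "hadoop::jhs"],
        },
        "datanodes": {
            "size": 2,
            "recipes": ["hadoop::dn", "hadoop::nm"],
        },
    },
}


def total_vms(definition: Dict) -> int:
    """Number of VMs to create: the sum of all group sizes."""
    return sum(group["size"] for group in definition["groups"].values())


def recipe_runs(definition: Dict) -> List[Tuple[str, str]]:
    """Expand groups into (vm_name, recipe) pairs, one per recipe per VM."""
    runs = []
    for name, group in definition["groups"].items():
        for index in range(group["size"]):
            for recipe in group["recipes"]:
                runs.append((f"{name}-{index}", recipe))  # hypothetical VM naming
    return runs


vm_count = total_vms(cluster)
runs = recipe_runs(cluster)
print(vm_count, len(runs))
```

With one metadata node running three recipes and two datanodes running two recipes each, the definition expands to three VMs and seven recipe runs.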

Orchestration of Deployments
Diagram labels: DSL, Service, Cloud Clients, Karamel Engine, Physical Mapping, Orchestrator

Karamel FrontEnd

JClouds/Chef in the Wild
-Node provisioning can fail for a variety of reasons.
-Stragglers will appear as clusters grow in size.
-Clouds such as AWS and OpenStack rate-limit the number of concurrent operations that can be issued to them, often with low defaults.
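One common way to cope with failures and low rate limits is to cap the number of in-flight cloud calls and retry failed ones with a growing delay. The sketch below shows that pattern in Python; it does not use JClouds or any real cloud API. `fake_cloud_call`, the `MAX_CONCURRENT` limit of 2, and the simulated failure of `vm-1` are all assumptions made for the example.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Dict

MAX_CONCURRENT = 2  # assumed provider limit on concurrent operations
_slots = threading.Semaphore(MAX_CONCURRENT)
_lock = threading.Lock()
_active = 0
peak_active = 0
attempts: Dict[str, int] = {}


def fake_cloud_call(node: str) -> str:
    """Stand-in for a cloud API call; fails the first attempt for 'vm-1'."""
    global _active, peak_active
    with _lock:
        _active += 1
        peak_active = max(peak_active, _active)
        attempts[node] = attempts.get(node, 0) + 1
        attempt = attempts[node]
    try:
        time.sleep(0.01)  # pretend the call takes some time
        if node == "vm-1" and attempt == 1:
            raise RuntimeError("simulated provisioning failure")
        return f"{node}: ready"
    finally:
        with _lock:
            _active -= 1


def provision(node: str, retries: int = 3, base_delay: float = 0.01) -> str:
    """Provision one node, holding a slot per attempt and backing off on failure."""
    for attempt in range(retries):
        with _slots:  # never exceed MAX_CONCURRENT calls in flight
            try:
                return fake_cloud_call(node)
            except RuntimeError:
                pass
        time.sleep(base_delay * 2 ** attempt)  # exponential backoff between tries
    raise RuntimeError(f"{node}: giving up after {retries} attempts")


nodes = [f"vm-{i}" for i in range(5)]
with ThreadPoolExecutor(max_workers=len(nodes)) as pool:
    results = list(pool.map(provision, nodes))
print(results)
```

The semaphore is released before the backoff sleep, so a failing node does not hold a slot while it waits to retry.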

Karamel Scheduler
Builds a Directed Acyclic Graph of Chef Recipes
Scheduler executes Chef recipes as a series of phases
-Chef recipes are decomposed into the following phases: install,
Respawn & provision failed and slow VMs.
Strategies for handling rate-limiting clouds.
-Automatic retry for failed Chef Recipes
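The idea of running recipes in dependency order with automatic retry can be sketched with Python's standard `graphlib` module (Python 3.9+). This is not Karamel's scheduler: the dependency edges between the Hadoop recipes, the `run_dag` helper, and the simulated one-time failure of `hadoop::dn` are assumptions made for illustration.

```python
from graphlib import TopologicalSorter
from typing import Callable, Dict, List, Set

# Assumed dependencies: each recipe maps to the recipes that must finish first.
dependencies: Dict[str, Set[str]] = {
    "hadoop::nn": set(),
    "hadoop::rm": set(),
    "hadoop::dn": {"hadoop::nn"},
    "hadoop::nm": {"hadoop::rm"},
    "hadoop::jhs": {"hadoop::nn"},
}

attempts: Dict[str, int] = {}


def run_recipe(recipe: str) -> None:
    """Stand-in for running a Chef recipe; 'hadoop::dn' fails once."""
    attempts[recipe] = attempts.get(recipe, 0) + 1
    if recipe == "hadoop::dn" and attempts[recipe] == 1:
        raise RuntimeError("simulated recipe failure")


def run_dag(
    deps: Dict[str, Set[str]],
    runner: Callable[[str], None],
    max_retries: int = 2,
) -> List[str]:
    """Run recipes in dependency order, retrying each failed recipe."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()  # also raises CycleError if the graph is not acyclic
    order: List[str] = []
    while sorter.is_active():
        for recipe in sorted(sorter.get_ready()):
            for attempt in range(max_retries + 1):
                try:
                    runner(recipe)
                    break
                except RuntimeError:
                    if attempt == max_retries:
                        raise  # retries exhausted: surface the failure
            order.append(recipe)
            sorter.done(recipe)  # unblocks recipes that depend on this one
    return order


order = run_dag(dependencies, run_recipe)
print(order)
```

Recipes returned together by `get_ready()` have no dependencies on each other, so a real scheduler could run them in parallel; this sketch runs them one at a time to keep the retry logic easy to follow.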

Related Work
Virtualization and Hadoop
-Project Serengeti (VMware)
-Project Savanna (Hortonworks & OpenStack)
-Elastic MapReduce (Amazon Web Services)
Administration of Hadoop Clusters
-Cloudera Manager with Puppet
-Hortonworks Ambari with Puppet
AWS OpsWorks
OpenStack Heat
Docker Swarm

BiobankCloud Federation

Connecting Biobanks
Diagram labels: Biobank, Meta-Data, Sample Availability Data, Omics Data (Big Data), Biobank

Authentication
Authentication Provider
-JDBC Realm
-2-Factor Authentication
-LDAP
EGI as Federated Authentication Provider