Bringing your favorite analysis applications to iPlant using Docker containers Nirav Merchant


Topic Coverage
- Which app can you bring to CyVerse?
- Where can you run your app?
- Choosing the right platform to run your app at CyVerse
- What is container technology?
- Benefits of running your container at CyVerse
- Taking your container from laptop to CyVerse
- Sharing your app with the world (using DE + Docker)
- Hands-on walkthrough

Simple Formula for Success

The Reality
- Excel, R, Perl, Python, ArcGIS, Java, Ruby, Fortran, C, C#, C++, Matlab, etc.
- Amazon, Azure, Rackspace, campus HPC, XSEDE, etc.
- and lots of glue...

Simple Formula

Where can you run your apps?
- Look at how much CPU and RAM your application can use and how much run time it needs
- What happens when you run a job in:
  - DE (regular and HPC)
  - Agave
  - Atmosphere
  - Bisque
- Pains of bringing your app to CyVerse
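One way to get the CPU, RAM, and run time numbers mentioned above is to profile a trial run of the application on a local machine. The sketch below is an assumption, not part of the original slides: it uses only the Python standard library, works on Unix-like systems only (the `resource` module is not available on Windows), and `profile_command` is a hypothetical helper name.

```python
import resource
import subprocess
import sys
import time


def profile_command(argv):
    """Run a command and report its wall time, CPU time, and peak memory."""
    before = resource.getrusage(resource.RUSAGE_CHILDREN)
    start = time.perf_counter()
    completed = subprocess.run(argv, check=False)
    wall = time.perf_counter() - start
    after = resource.getrusage(resource.RUSAGE_CHILDREN)
    # CPU time (user + system) consumed by child processes during this run.
    cpu = (after.ru_utime - before.ru_utime) + (after.ru_stime - before.ru_stime)
    return {
        "exit_code": completed.returncode,
        "wall_seconds": wall,
        "cpu_seconds": cpu,
        # Largest peak resident set size among all children waited for so far.
        # Units differ by platform: kilobytes on Linux, bytes on macOS.
        "peak_rss": after.ru_maxrss,
    }


# Stand-in workload; replace with the command line of your own application.
report = profile_command([sys.executable, "-c", "sum(range(10**6))"])
print(report)
```

Numbers from a small test input only give a lower bound; scale them to your real data before choosing a platform.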

Container technology: What is it about?
- Allows you to create a self-contained package that contains:
  - The specific operating system version (say, Ubuntu)
  - Your application
  - All of the parts your application needs (such as libraries and other dependencies)
- Ability to share this with other users
- This single package can now be run on any computing system that supports container technology (regardless of its own operating system version)

Container technology: Docker
nding-docker/
- Has many interesting features and capabilities
- Parts of Docker you need to know about:
  - Docker client/command line (CLI)
  - Dockerfile
  - Docker image
  - Docker registry
  - Docker container
- Most important concept for working with large amounts of data in Docker: the union file system
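The union file system idea can be illustrated with a small analogy: read-only image layers are stacked, a fresh writable layer sits on top, reads fall through to the first layer that has the file, and writes only touch the top layer. The sketch below models this with `collections.ChainMap`. It is an analogy under stated assumptions, not how Docker implements its storage drivers, and the file names and contents are hypothetical.

```python
from collections import ChainMap

# Read-only image layers (hypothetical paths and contents).
base_layer = {"/etc/os-release": "Ubuntu", "/usr/bin/python": "interpreter"}
app_layer = {"/opt/app/run.sh": "v1", "/etc/os-release": "Ubuntu (patched)"}


def start_container(*image_layers):
    """Stack a fresh writable layer on top of the image layers (topmost first)."""
    return ChainMap({}, *image_layers)


container = start_container(app_layer, base_layer)

# Writes land in the container's own top layer; the image layers are untouched.
container["/data/result.txt"] = "output"

# Reads return the first match from the top down, so upper layers shadow lower ones.
print(container["/etc/os-release"])
print("/data/result.txt" in app_layer)

# A second container from the same layers does not see the first one's writes.
other = start_container(app_layer, base_layer)
print("/data/result.txt" in other)
```

Because the image layers are shared and never modified, many containers can start from the same image without copying it.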

How does it all work together?

What happens when you run a job in DE:
- Condor looks for a machine that matches your criteria (RAM, CPU, disk space)
- Once it finds a suitable match:
  - A data placement container runs and brings the data you want to operate on from the data store to that node
  - Your app (Docker container) runs (with the data visible to it through the union file system)
  - A data placement container runs again to return the data to the data store
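The job flow above can be sketched as a toy model: pick a machine that satisfies the request, then run the three container stages in order. This is a simplified illustration, not Condor's actual matchmaking or the DE's real code. The first-fit rule and the names `Machine`, `JobRequest`, `find_match`, and `run_job` are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class Machine:
    name: str
    cpus: int
    ram_gb: int
    disk_gb: int


@dataclass
class JobRequest:
    cpus: int
    ram_gb: int
    disk_gb: int


def find_match(job, machines):
    """Return the first machine that meets every requirement, or None."""
    for m in machines:
        if m.cpus >= job.cpus and m.ram_gb >= job.ram_gb and m.disk_gb >= job.disk_gb:
            return m
    return None


def run_job(job, machines):
    """Return the ordered stages a job goes through on the matched node."""
    node = find_match(job, machines)
    if node is None:
        return ["no matching machine; job stays queued"]
    return [
        f"{node.name}: data placement container stages input from the data store",
        f"{node.name}: app container runs with the staged data visible to it",
        f"{node.name}: data placement container returns results to the data store",
    ]


pool = [Machine("small", 2, 4, 50), Machine("big", 16, 64, 500)]
for stage in run_job(JobRequest(cpus=8, ram_gb=32, disk_gb=100), pool):
    print(stage)
```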

How do you get started?
- Check the step-by-step instructions in the wiki at:
- Get Docker set up on your local machine (Windows, Mac, Linux) or use Atmosphere
- Plan your steps, i.e. what you want to do
- Carry out those steps and verify that things work
- Create a Dockerfile from those steps
- Submit the request for a "new tool"
- Once you hear back, design your interface (and profit)
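Turning verified steps into a Dockerfile is mostly mechanical: one `FROM` line for the base image, one `RUN` line per installation step, and an `ENTRYPOINT` for the application. The sketch below renders that text from a list of steps. The helper `render_dockerfile`, the `ubuntu:14.04` tag, and the example install commands and script path are illustrative assumptions, not the steps from the wiki.

```python
import json


def render_dockerfile(base_image, steps, entrypoint):
    """Render Dockerfile text from a base image, shell steps, and an entrypoint."""
    lines = [f"FROM {base_image}"]
    # One RUN instruction per step that was verified by hand.
    lines += [f"RUN {step}" for step in steps]
    # The exec form of ENTRYPOINT is written as a JSON array.
    lines.append("ENTRYPOINT " + json.dumps(entrypoint))
    return "\n".join(lines) + "\n"


# Hypothetical steps for a small Python-based tool.
dockerfile_text = render_dockerfile(
    "ubuntu:14.04",
    ["apt-get update", "apt-get install -y python"],
    ["python", "/opt/app/run.py"],
)
print(dockerfile_text)
```

Writing the result to a file named `Dockerfile` and building it locally is a reasonable check before submitting the "new tool" request.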

Future Directions
- Ability to bring any Docker image from a private repository, Docker Hub, files, etc.
- Share your app/container within your group
- Ability to bring your own compute for containers, attach it to the CyVerse pool, and manage who can send jobs to it

Word of Caution
- Containers are very powerful and have many bells and whistles (only choose the parts that you really need!)
- Avoid storing data inside containers
- Keep containers light and nimble; build on provided base images from a trusted source (iPlant prefers Ubuntu 14.X and CentOS 7.X from Docker Hub)
- Do not trust an app without a Dockerfile (it is a black box that is not easy to recreate, which is bad for reproducibility)
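A common way to keep data out of a container is to bind-mount a host directory into it at run time, so inputs and outputs live on the host while the image stays small. The sketch below only builds the `docker run` command line and does not execute it. The image name `example/wc-tool`, the host path, and the helper `docker_run_argv` are hypothetical.

```python
import shlex


def docker_run_argv(image, host_dir, container_dir="/data", app_args=()):
    """Build a `docker run` command that mounts host data instead of baking it in."""
    return [
        "docker", "run",
        "--rm",                               # remove the container when it exits
        "-v", f"{host_dir}:{container_dir}",  # bind-mount host data (absolute path)
        "-w", container_dir,                  # start in the mounted data directory
        image,
        *app_args,
    ]


argv = docker_run_argv("example/wc-tool", "/home/user/project", app_args=["input.txt"])
print(shlex.join(argv))
```

Anything the app writes under the mounted directory remains on the host after the container is removed.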

Thanks
- Linux community for containers
- Docker for making things more usable
- CyVerse/iPlant Core SW team for integrating Docker
- Eric Lyons for writing the first tutorial
- Many users for building Dockerfiles and submitting them!
- Whole iPlant and CyVerse team and community