Using Biological Cyberinfrastructure Scaling Science and People: Applications in Data Storage, HPC, Cloud Analysis, and Bioinformatics Training Scaling.

Slides:



Advertisements
Similar presentations
Cloud Computing Mick Watson Director of ARK-Genomics The Roslin Institute.
Advertisements

Cloud Computing at Amazon’s EC2 Joe Steele
Creating an AMI at Amazon’s EC2 Joe Steele
Creating a Biolinux AMI at Amazon’s EC2
Matt Bertrand Building GIS Apps in the Cloud. Infrastructure - Provides computer infrastructure, typically a platform virtualization environment, as a.
 Contents 1.Introduction about operating system. 2. What is 32 bit and 64 bit operating system. 3. File systems. 4. Minimum requirement for Windows 7.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Amazon EC2 Quick Start adapted from EC2_GetStarted.html.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
“Grandpa’s up there somewhere.”. Making your IT skills virtual What it takes to move your services to the cloud Erik Mitchell | Kevin Gilbertson | Jean-Paul.
OM. Brad Gall Senior Consultant
IPlant Collaborative Powering a New Plant Biology iPlant Collaborative Powering a New Plant Biology.
Chromium OS is an open-source project that aims to build an operating system that provides a fast, simple, and more secure computing experience for people.
Customized cloud platform for computing on your terms !
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Accessing the Amazon Elastic Compute Cloud (EC2) Angadh Singh Jerome Braun.
Atmosphere: iPlant’s Cloud Computing Service. Overview How to connect to Atmosphere & start a virtual machine Ways to log in to a VM A simple R command.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
| nectar.org.au NECTAR TRAINING Module 5 The Research Cloud Lifecycle.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop iCommands and Other Data Store Resources.
Customized cloud platform for computing on your terms ! Nirav Merchant
| nectar.org.au NECTAR TRAINING Module 10 Beyond the Dashboard.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Objectives.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
Presented by: Mostafa Magdi. Contents Introduction. Cloud Computing Definition. Cloud Computing Characteristics. Cloud Computing Key features. Cost Virtualization.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Network for Integrating Bioinformatics into Life Sciences Education April, 2014.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
1 Applied CyberInfrastructure Concepts ISTA 420/520 Fall Nirav Merchant Bio Computing & iPlant Collaborative Eric Lyons.
Using Biological Cyberinfrastructure Scaling Science and People: Applications in Data Storage, HPC, Cloud Analysis, and Bioinformatics Training Scaling.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop iPlant Data Store.
The iPlant Collaborative Using iPlant for sharing, managing, and analyzing ecological data Ramona Walls Presented at ESA 2014 – Ignite session August 12,
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Atmosphere.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of the iPlant Data Store.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Biomedical Big Data Training Collaborative biobigdata.ucsd.edu BBDTC UPDATES Biomedical Big Data Training Collaborative biobigdata.ucsd.edu.
Overview of Atmosphere
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop GWAS/QTL Apps Overview.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop iPlant Data Store – Managing Your ‘Big’ Data.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop RNA-Seq visualization with cummeRbund.
Data Science Background and Course Software setup Week 1.
| nectar.org.au NECTAR TRAINING Module 5 The Research Cloud Lifecycle.
AWS Usage Tips SCS APAC MAR Agenda About Amazon Web Service Sign up the AWS account AWS Management Oracle Apps AMI – Siebel CRM – EBS R
Architecture & Cybersecurity – Module 3 ELO-100Identify the features of virtualization. (Figure 3) ELO-060Identify the different components of a cloud.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop BISQUE.
IT Applications Theory Slideshows By Mark Kelly Vceit.com Version 2 – updated for 2016 CLOUD COMPUTING CLOUD COMPUTING.
Bringing your favorite analysis applications to iPlant using Docker containers Nirav Merchant
Launch Amazon Instance. Amazon EC2 Amazon Elastic Compute Cloud (Amazon EC2) provides resizable computing capacity in the Amazon Web Services (AWS) cloud.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Transforming Science Through Data-driven Discovery Tools and Services Workshop Atmosphere Joslynn Lee – Data Science Educator Cold Spring Harbor Laboratory,
Canadian Bioinformatics Workshops
Transforming Science Through Data-driven Discovery Tools and Services Workshop Data Store Overview.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
CyVerse Workshop Discovery Environment Overview. Welcome to the Discovery Environment A Simple Interface to Hundreds of Bioinformatics Apps, Powerful.
Transforming Science Through Data-driven Discovery Workshop Overview Ohio State University MCIC Jason Williams – Lead, CyVerse – Education, Outreach, Training.
CyVerse Data Store Managing Your ‘Big’ Data. Welcome to the Data Store Manage and share your data across all CyVerse platforms.
Transforming Science Through Data-driven Discovery Using CyVerse Cyberinfrastructure to Enable Data Intensive Research, Collaboration, and Education Joslynn.
Transforming Science Through Data-driven Discovery Using CyVerse Cyberinfrastructure to Enable Data Intensive Research, Collaboration, and Education Atmosphere.
Transforming Science Through Data-driven Discovery Bringing your Bioinformatics tools to CyVerse’s Discovery Environment using Docker Upendra Kumar Devisetty.
Scaling Compute with R in CyVerse
CyVerse Tools and Services
Tools and Services Workshop
Customized cloud platform for computing on your terms !
Joslynn Lee – Data Science Educator
CyVerse Discovery Environment
Tools and Services Workshop Overview of Atmosphere
Bioinformatic analysis using Jetstream, a cloud computing environment
Cyberinfrastructure for the Life Sciences
Presentation transcript:

Using Biological Cyberinfrastructure Scaling Science and People: Applications in Data Storage, HPC, Cloud Analysis, and Bioinformatics Training Scaling Compute – Working in the Cloud

Rapid discussion questions How do you already use the cloud? When should you be using (buying) a server and when do you think you should use a cloud resource – how do you choose? What does ‘biology in the cloud’ mean for developers and educators?

Yet another round of jargon What is Cloud Computing?

Important concepts: Image What is Cloud Computing? Image (file) Document(s) (file) Original system Complete clone (files/data) Copied Document(s) (file)

Important concepts: Instance What is Cloud Computing? iPlant Cloud + (Disk + CPU + Memory) + (Image) Atmosphere Instance (virtual machine)

Yet another round of jargon What is Cloud Computing?

Largest, easiest to use open cloud for Life Science Atmosphere Overview Choose an existing image or customize Instances up to 16-Core/128 GB RAM Access via shell or VNC Share your image with selected users, or make them public

Connecting to your instance Atmosphere Overview Windows MacLinux VNC Viewer Shell/terminal PuTTY VNC Viewer: PuTTy:

Benefits Get Science Done Reproducibility Productivity Work in an on-demand Linux environment (most bioinformatics) Collaborate with students and colleagues on the same instance Make data, workflows, and analyses available in a public image Access previous software version and images Multicore high memory images to run multithreading applications Move your analyses from your laptop to the cloud Atmosphere Overview

Select and launch an instance Connect your instance to the iPlant Data Store Use an application in Atmosphere (RNA-Seq visualization) Understand how to pause, stop, and terminate instances By the end of this demo you should be able to: Atmosphere Overview

Images do not have automatic access to your iPlant Data Store o Can iDrop to access the Data Store o Can also use iCommands Users have monthly allocation limits o Terminate, or stop instances not in use o If a larger allocation is needed contact support All data on terminated instances will be destroyed. o Use iDrop or iCommands to transfer data off the instance. o You may also create an EBS volume (see documentation) Atmosphere Overview Key things to remember when you try this yourself

User perspectives and possible applications Bench Scientist Bioinformatician Learned how to use the shell and how to work with Linux Mastered using R to develop plots for his manuscript Launches an image and has full SUDO access to customize it Developed a software suit with numerous R and Python library dependencies. She updates it regularly by making a new image. Linked several atmosphere instances with Apache Hadoop Worked with iPlant support to import existing Amazon images Core Facilities Images from personas based on: Bioinformatics Curriculum Guidelines: Toward a Definition of Core Competencies PLOS Biology DOI: /journal.pcbi Atmosphere Overview

We don’t have to be afraid of the cloud… we already using it! We just need to use it wisely to manage the data and experiments we care about. Choosing a solution always has case-by-case features, but passing up on-demand computing solutions is a great way to waste funds! Developing datasets and software in the cloud (VMs/Docker, etc.) is gold standard reproducibility. These same traits are very helpful for classroom instruction. Key take home knowledge

The iPlant Collaborative is funded by a grant from the National Science Foundation Plant Cyberinfrastructure Program (#DBI ).