Globus.org/genomics Globus Galaxies Science Gateways as a Service Ravi K Madduri, University of Chicago and Argonne National Laboratory

Slides:



Advertisements
Similar presentations
SAICG Science Applications In Clouds and Grids March 2012.
Advertisements

DTI Image Processing Pipeline and Cloud Computing Environment Kyle Chard Computation Institute University of Chicago.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Natasha Pavlovikj, Kevin Begcy, Sairam Behera, Malachy Campbell, Harkamal Walia, Jitender S.Deogun University of Nebraska-Lincoln Evaluating Distributed.
Industry and economics Ian Foster Computation Institute Department of Computer Science Mathematics and Computer Science Division Institute for Genomic.
High Performance Computing Course Notes Grid Computing.
Cloud Computing Brandon Hixon Jonathan Moore. Cloud Computing Brandon Hixon What is Cloud Computing? How does it work? Jonathan Moore What are the key.
Collaboration on Large Datasets using Globus Rachana Ananthakrishnan University of Chicago.
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS Ravi K Madduri University of Chicago and ANL.
 Amazon Web Services announced the launch of Cluster Compute Instances for Amazon EC2.  Which aims to provide high-bandwidth, low- latency instances.
Globus.org/genomics Optimizations in running large-scale Genomics workloads in Globus Genomics using HTCondor Ravi Madduri Joint work with.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
What is Cloud Computing? o Cloud computing:- is a style of computing in which dynamically scalable and often virtualized resources are provided as a service.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
© 2009 IBM Corporation ® IBM Software Group Introduction to Cloud Computing Vivek C Agarwal IBM India Software Labs.
Swift: A Scientist’s Gateway to Campus Clusters, Grids and Supercomputers Swift project: Presenter contact:
Is Your IT Out of Alignment? Chargeback and Billing with Parallels Automation Brian Shellabarger, Chief Architect - SaaS.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
An Introduction to Cloud Computing. The challenge Add new services for your users quickly and cost effectively.
Plan Introduction What is Cloud Computing?
Building Data-intensive Pipelines Ravi K Madduri Argonne National Lab University of Chicago.
“Grandpa’s up there somewhere.”. Making your IT skills virtual What it takes to move your services to the cloud Erik Mitchell | Kevin Gilbertson | Jean-Paul.
Cloud Computing Cloud Computing Class-1. Introduction to Cloud Computing In cloud computing, the word cloud (also phrased as "the cloud") is used as a.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
PhD course - Milan, March /09/ Some additional words about cloud computing Lionel Brunie National Institute of Applied Science (INSA) LIRIS.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
SCI-BUS is supported by the FP7 Capacities Programme under contract nr RI CloudBroker Platform integration into WS-PGRADE/gUSE Zoltán Farkas MTA.
Globus Genomics – Science as a Service for large scale NGS analysis
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Objectives.
Niagara Framework in the Clouds Scott Boehm. … what the heck does that mean??
The DPubS Development Project: Building an Open Source Electronic Publishing System David Ruddy Cornell University Library.
Plan  Introduction  What is Cloud Computing?  Why is it called ‘’Cloud Computing’’?  Characteristics of Cloud Computing  Advantages of Cloud Computing.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Using Biological Cyberinfrastructure Scaling Science and People: Applications in Data Storage, HPC, Cloud Analysis, and Bioinformatics Training Scaling.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
GRIDS Center Middleware Overview Sandra Redman Information Technology and Systems Center and Information Technology Research Center National Space Science.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Globus online Software-as-a-Service for Research Data Management Steve Tuecke Deputy Director, Computation Institute University of Chicago & Argonne National.
Bio-IT World Conference and Expo ‘12, April 25, 2012 A Nation-Wide Area Networked File System for Very Large Scientific Data William K. Barnett, Ph.D.
Research data management using Globus ESIP Summer Meeting 2015 Rachana Ananthakrishnan University of Chicago
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
Web Technologies Lecture 13 Introduction to cloud computing.
Galaxy Community Conference July 27, 2012 The National Center for Genome Analysis Support and Galaxy William K. Barnett, Ph.D. (Director) Richard LeDuc,
Globus and ESGF Rachana Ananthakrishnan University of Chicago
Globus Publish Lighting Talk Ben Blaiszik, Kyle Chard
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
Ian Foster Computation Institute Argonne National Lab & University of Chicago Application Hosting Services — Enabling Science 2.0 —
Globus online Delivering a scalable service Steve Tuecke Computation Institute University of Chicago and Argonne National Laboratory.
European Life Sciences Infrastructure for Biological Information ELIXIR Cloud Roadmap Chairs: Steven Newhouse, EMBL-EBI & Mirek Ruda,
VYTAUTAS SIMANAITIS Cloud computing © Kaunas 2013, KTU.
TeraGrid Capability Discovery John-Paul “JP” Navarro TeraGrid Area Co-Director for Software Integration University of Chicago/Argonne National Laboratory.
An Introduction to SaaS and Cloud Computing Ross Cooney.
SCI-BUS is supported by the FP7 Capacities Programme under contract nr RI Accessing cloud resources through the WS-PGRADE/gUSE and CloudBroker integrated.
Simplifying Large-Scale Data Movement with Globus Steve Tuecke Deputy Director, Computation Institute University of Chicago & Argonne National Laboratory.
CyVerse Workshop Discovery Environment Overview. Welcome to the Discovery Environment A Simple Interface to Hundreds of Bioinformatics Apps, Powerful.
Globus online Cloud Based Services for Science Steve Tuecke Computation Institute University of Chicago and Argonne National Laboratory.
TOWARDS AN ARCHITECTURE FOR NATIONAL DATA SERVICES Ian Foster Director, Computation Institute Argonne National Laboratory & The University of
Enhancements to Galaxy for delivering on NIH Commons
Computing Clusters, Grids and Clouds Globus data service
Tools and Services Workshop
University of Chicago and ANL
Joslynn Lee – Data Science Educator
Infrastructure Orchestration to Optimize Testing
An Introduction to Cloud Computing
Software infrastructure for a National Research Platform
Study course: “Computing clusters, grids and clouds” Andrey Y. Shevel
Cloud Computing Dr. Sharad Saxena.
Brandon Hixon Jonathan Moore
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
Presentation transcript:

globus.org/genomics Globus Galaxies Science Gateways as a Service Ravi K Madduri, University of Chicago and Argonne National

globus.org/genomics Provide more capability for more people at lower cost by delivering “Science as a service” Our vision for a 21st century discovery infrastructure

globus.org/genomics An Old Idea: Service Oriented Science

globus.org/genomics Two Broader Themes Productivity of Researchers –Time spent performing administrative tasks Vs time spent doing science –Reproducibility Sustainability of scientific software –Reduction in funding for science

globus.org/genomics Our Science Stack Galaxy –Interactive execution –Creation, Execution, Sharing, Discovering Workflows Globus –Data management –Identity Management AWS, NERSc, Magellan –Chef, EC2, EBS, S3, SNS, NEWT –Spot, Route 53, Cloud Formation SaaS PaaS IaaS

globus.org/genomics Data Source Data Source Data Destinatio n Data Destinatio n User initiates transfer request 1 1 Globus moves and syncs files 2 2 Globus notifies user 3 3 Globus: Fast, reliable data transfer

globus.org/genomics Globus: Federated identity

globus.org/genomics Metadata Access Control License Storage Curation Workflow Curation Workflow Policies Collection Globus: Data publication service Metadata Data Metadata Data Metadata Data Dataset Community

globus.org/genomics Flexible, scalable, affordable genomics analysis for all biologists

globus.org/genomics Globus Genomics Sequencing Centers Public Data Public Data Storage Local Cluster/ Cloud Seq Center Research Lab Globus Provides a High-performance Fault-tolerant Secure file transfer Service between all data-endpoints Data ManagementData Analysis Picard GATK Fastq Ref Genome Alignment Variant Calling Galaxy Data Libraries Globus Genomics on Amazon EC2 Analytical tools are automatically run on the scalable compute resources when possible Globus Integrated within Galaxy Web-based UI Drag-Drop workflow creations Easily modify Workflows with new tools Galaxy Based Workflow Management System FTP, SCP, others FTP, SCP SCP Globus Genomics FTP, SCP, HTTP

globus.org/genomics Core Capabilities Computational profiles for various analysis tools to provide optimal performance Resources can be provisioned on-demand with Amazon Web Services cloud based infrastructure High performance, Reliable Data movement is streamlined with integrated Globus file-transfer functionality Integrated Globus endpoints and Campus login

globus.org/genomics Globus Galaxies - FACE-IT

globus.org/genomics Globus Galaxies - eMatter

globus.org/genomics Cardio Vascular Research Grid – Dr. Hopkins

globus.org/genomics Cosmology - PDACS

globus.org/genomics Sustainability Our goal is to build services that live beyond a funded proposals Two pricing options and multiple usage tiers. –Targeted users include individual research groups and bioinformatics cores –Platform pricing (includes only subscription to the Globus Genomics platform) –Bundled pricing (includes Globus Genomics platform subscription and AWS usage costs)

globus.org/genomics Our work is supported by: U.S. DEPARTMENT OF ENERGY 17

globus.org/genomics Thank