New Ventures in Research, Engineering, and Educational Computing

Slides:



Advertisements
Similar presentations
Creating HIPAA-Compliant Medical Data Applications with Amazon Web Services Presented by, Tulika Srivastava Purdue University.
Advertisements

April 19, 2015 CASC Meeting 7 Sep 2011 Campus Bridging Presentation.
Win8 on Intel Programming Course Win8 and Intel Paul Guermonprez Intel Software
Bill Barnett, Bob Flynn & Anurag Shankar Pervasive Technology Institute and University Information Technology Services, Indiana University CASC. September.
Data Gateways for Scientific Communities Birds of a Feather (BoF) Tuesday, June 10, 2008 Craig Stewart (Indiana University) Chris Jordan.
Pti.iu.edu /jetstream Award # A national science & engineering cloud funded by the National Science Foundation Award #ACI
1 Supplemental line if need be (example: Supported by the National Science Foundation) Delete if not needed. Supporting Polar Research with National Cyberinfrastructure.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Pti.iu.edu /jetstream Award # A national science & engineering cloud funded by the National Science Foundation Award #ACI Jetstream Overview.
Pti.iu.edu /jetstream Award # A national science & engineering cloud funded by the National Science Foundation Award #ACI Prepared for the.
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 7 2/23/2015.
Pti.iu.edu /jetstream Award # A national science & engineering cloud funded by the National Science Foundation Award #ACI
Customized cloud platform for computing on your terms !
Statewide IT Conference, Bloomington IN (October 7 th, 2014) The National Center for Genome Analysis Support, IU and You! Carrie Ganote (Bioinformatics.
Next Generation Cyberinfrastructures for Next Generation Sequencing and Genome Science AAMC 2013 Information Technology in Academic Medicine Conference.
Craig Stewart 23 July 2009 Cyberinfrastructure in research, education, and workforce development.
Big Red II & Supporting Infrastructure Craig A. Stewart, Matthew R. Link, David Y Hancock Presented at IUPUI Faculty Council Information Technology Subcommittee.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Genomics, Transcriptomics, and Proteomics: Engaging Biologists Richard LeDuc Manager, NCGAS eScience, Chicago 10/8/2012.
The National Center for Genome Analysis Support as a Model Virtual Resource for Biologists Internet2 Network Infrastructure for the Life Sciences Focused.
Leveraging the National Cyberinfrastructure for Top Down Mass Spectrometry Richard LeDuc.
- Raghavi Reddy.  With traditional desktop computing, we run copies of software programs on our own computer. The documents we create are stored on our.
September 6, 2013 A HUBzero Extension for Automated Tagging Jim Mullen Advanced Biomedical IT Core Indiana University.
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. The IQ-Table & Collection Viewer A.
RNA-Seq 2013, Boston MA, 6/20/2013 Optimizing the National Cyberinfrastructure for Lower Bioinformatic Costs: Making the Most of Resources for Publicly.
608D CloudStack 3.0 Omer Palo Readiness Specialist, WW Tech Support Readiness May 8, 2012.
Pti.iu.edu /jetstream Award # funded by the National Science Foundation Award #ACI Jetstream - A self-provisioned, scalable science and.
July 18, 2012 Campus Bridging Security Challenges from “Panel: Security for Science Gateways and Campus Bridging”
Pti.iu.edu /jetstream Award # funded by the National Science Foundation Award #ACI Jetstream Overview – XSEDE ’15 Panel - New and emerging.
INDIANAUNIVERSITYINDIANAUNIVERSITY Spring 2000 Indiana University Information Technology University Information Technology Services Please cite as: Stewart,
November 18, 2015 Quarterly Meeting 30Aug2011 – 1Sep2011 Campus Bridging Presentation.
February 27, 2007 University Information Technology Services Research Computing Craig A. Stewart Associate Vice President, Research Computing Chief Operating.
A national science & engineering cloud funded by the National Science Foundation Award #ACI Craig Stewart ORCID ID Jetstream.
Pti.iu.edu /jetstream Award # A national science & engineering cloud funded by the National Science Foundation Award #ACI
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Update on EAGER: Best Practices and.
Award # funded by the National Science Foundation Award #ACI Jetstream: A Distributed Cloud Infrastructure for.
Jetstream: A new national research and education cloud Jeremy Fischer ORCID Senior Technical Advisor, Collaboration.
A national science & engineering cloud funded by the National Science Foundation Award #ACI Craig Stewart ORCID ID Jetstream.
Craig Stewart ORCID ID Jetstream Principal Investigator Executive Director, Indiana University Pervasive Technology Institute Presented.
1 A national science & engineering cloud funded by the National Science Foundation Award #ACI Craig Stewart ORCID ID Jetstream.
Remote & Collaborative Visualization. TACC Remote Visualization Systems Longhorn – Dell XD Visualization Cluster –256 nodes, each with 48 GB (or 144 GB)
© Trustees of Indiana University Released under Creative Commons 3.0 unported license; license terms on last slide. Informatics Tools at the Indiana CTSI.
Award # funded by the National Science Foundation Award #ACI Jetstream: INFO-590, Science Gateways Architecture.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Funded by the National Science Foundation Award #ACI Jetstream: Adding Cloud-based Computing to the National Cyberinfrastructure Matthew
Jetstream Overview Jetstream: A national research and education cloud Jeremy Fischer ORCID Senior Technical Advisor,
Introduction to Data Analysis with R on HPC Texas Advanced Computing Center Feb
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
1 Campus Bridging: What is it and why is it important? Barbara Hallock – Senior Systems Analyst, Campus Bridging and Research Infrastructure.
Jetstream: A national research and education cloud Jeremy Fischer ORCID Senior Technical Advisor, Collaboration and.
Research & Academic Computing Indiana University Statewide IT Conference 11 September 2003 Indianapolis IN.
The CLoud Infrastructure for Microbial Bioinformatics
Jetstream: A science & engineering cloud Mike Lowe
Tools and Services Workshop
Joslynn Lee – Data Science Educator
Matt Link Associate Vice President (Acting) Director, Systems
funded by the National Science Foundation Award #ACI
Bridges and Clouds Sergiu Sanielevici, PSC Director of User Support for Scientific Applications October 12, 2017 © 2017 Pittsburgh Supercomputing Center.
Tools and Services Workshop Overview of Atmosphere
Accessing Jetstream via the OpenStack Command Line Interface George Turner, Chief Systems Architect Pervasive Technologies Institute, UITS/RT, Indiana.
Usage of Openstack Cloud Computing Architecture in COE Seowon Jung Systems Administrator, COE
Chapter 18 MobileApp Design
University of Technology
Bioinformatic analysis using Jetstream, a cloud computing environment
XSEDE’s Campus Bridging Project
Managing Clouds with VMM
* Introduction to Cloud computing * Introduction to OpenStack * OpenStack Design & Architecture * Demonstration of OpenStack Cloud.
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
Azure Container Service
Presentation transcript:

New Ventures in Research, Engineering, and Educational Computing New Ventures in Research, Engineering, and Educational Computing. George Turner, Chief Systems Architect Research Technologies/UITS, Pervasive Technologies Institute Indiana University Open Research Cloud Declaration Workshop Boston, MA 11 May2017 funded by the National Science Foundation Award #ACI-1445604

What is Jetstream? User-friendly, widely accessible cloud environment User-selectable library of preconfigured virtual machines Interactive computing Software maintained by domain specialist No need for system administration skills Programmable cyberinfrastructure Go beyond batch computing Implement modern cloud computing techniques https://kb.iu.edu/d/ayep The primary goal of the XD program is to enable major advances in science and engineering research, in the integration of research and education, and in broadening participation in science and engineering by under-represented groups, by providing researchers and educators with usable access to extreme-scale digital resources beyond those typically available on a typical campus, together with the interfaces, consulting support, and training necessary to facilitate their use.

Platform Overview Atmosphere API Globus Auth Atmo Services XSEDE Accounting OpenStack Ceph Web App OpenStack API access Platform Overview Agave API access (work in progress) S3 access to Ceph (work in progress) Indiana University TACC Note the multiple touch points for users, won’t dive deep into the architecture

Platform Overview Atmosphere API Globus Auth Atmo Services XSEDE Accounting OpenStack Ceph Web App OpenStack API access Platform Overview Agave API access (work in progress) S3 access to Ceph (work in progress) Indiana University TACC Note the multiple touch points for users, won’t dive deep into the architecture

Platform Overview Atmosphere API Globus Auth Atmo Services XSEDE Accounting OpenStack Ceph Web App OpenStack API access Platform Overview Agave API access (work in progress) S3 access to Ceph (work in progress) Indiana University TACC Note the multiple touch points for users, won’t dive deep into the architecture

What is Jetstream? Reproducibility: store, publish via IU Scholarworks (DOI) Cloudy: clouds are more the just virtual machines (VM) Old way: robust (expensive) infrastructure, weak (cheap) software Cloudy way: commodity infrastructure, robust software Cows, not pets : pets have state, you name them, put forth great amount of care cows do not have state, you intend to have high turnover, you give them numbers instead of names Primary goal is to expand the user base of NSF’s eXtreme Digital (XD) program resources beyond the current community of users. https://kb.iu.edu/d/ayep The primary goal of the XD program is to enable major advances in science and engineering research, in the integration of research and education, and in broadening participation in science and engineering by under-represented groups, by providing researchers and educators with usable access to extreme-scale digital resources beyond those typically available on a typical campus, together with the interfaces, consulting support, and training necessary to facilitate their use.

What is Jetstream? (cont) “Long tail” of the Science Large HPC systems requiring sophisticated distributed memory programming skills ~3% researchers supported by the NSF Problem size Capable users few many Everyone else Mostly node level parallelism

What is Jetstream? (cont) Software layers Atmosphere web interface library of images, genertic, domain specific simplify VM administration Openstack: software tools for building and managing cloud computing platforms for public and private clouds. KVM hypervisor: what the VMs run on Ceph: storage platform that stores data on a single distributed computer cluster, and provides interfaces for object-, block- and file-level storage. Operating systems: CentOS, Ubuntu, Windows? Applications; e.g. software developed by the domain specialist, gateways, etc.

Jetstream System Overview (cont.) Jetstream-IU Internet2 XSEDE Jetstream-AZ 100 Gb/s Jetstream-TACC 10 Gb/s

Production Cloud Hardware (per site) Number Specifications Function (IU) Dell PowerEdge M630 blades 320 2X Intel E5-2680v3 “Haswell” 24 cores @ 2.5 GHz 128 GB RAM 2 TB local disk Compute hosts OpenStack services R630 1U server 7 Cluster management High Availability Databases RabbitMQ R730xd 2U servers 20 64 GB RAM 48 TB storage for Ceph pool ~1 PB Ceph storage Dell S6000-ON network switches 9 32+2 40 Gb/s ports Top of Rack Spine

Jetstream’s Atmosphere Interface (no login required at this point) https://use.jetstream-cloud.org/

Jetstream’s Atmosphere Interface Pick identity provider Globus Auth under the hood

Jetstream’s Atmosphere Interface Authenticate with the chosen identity provider

Jetstream’s Atmosphere Interface user’s home dashboard

Jetstream’s Atmosphere Interface user’s project dashboard

Jetstream’s Atmosphere Interface user’s project details

Jetstream’s Atmosphere Interface starting an instance

Jetstream’s Atmosphere Interface pick an image from library

Jetstream’s Atmosphere Interface Instance’s details; e.g. name, flavor, allocation, etc.

Jetstream’s Atmosphere Interface Instance is building

Jetstream’s Atmosphere Interface …still building

Jetstream’s Atmosphere Interface …still building

Jetstream’s Atmosphere Interface Instance ready for use

Jetstream’s Atmosphere Interface User’s instance dashboard

Jetstream’s Atmosphere Interface User’s instance dashboard Instance’s details

Jetstream’s Atmosphere Interface User’s instance dashboard Action requests

Jetstream’s Atmosphere Interface User’s instance dashboard Action requests Open Web Shell

Jetstream’s Atmosphere Interface Web Shell access to instance External ssh access also available

Jetstream’s Atmosphere Interface User’s instance dashboard Action requests Open Web Desktop

VNC access also available Jetstream’s Atmosphere Interface Web desktop access to instance VNC access also available

Challenges of the past; thoughts for the future Proposed as an easy to use platform for domain researchers; but, huge demand for domain developers ready to implement cloudy technologies Need to make the case that cloudy resources have benefit

Challenges of the past; thoughts for the future (cont.) Users not aware of new (cloudy) ways of doing things They’re responsible for start/stopping instances Burning up your allocation unintentionally What’s ephemeral & what’s persistent You blew away the instance and the data was not on persistent storage The difference between file systems and objects There is not a giant workbench (scratch) filesystem Shared volumes are critical

Challenges of the past; thoughts for the future (cont.) Users expectations Jetstream/OpenStack is not AWS We will not have the fancy bells-&-whistles Support from within the domain; this is a plus

Challenges of the past; thoughts for the future (cont.) Protocol for image templates IU, BRIDGES, Nectar collaborating Can’t be a library of images; bit rot Cloud-init good starting point distro differences Beyond the image

Questions? Project website: http://jetstream-cloud.org/ Project email: jethelp@iu.edu Direct email: turnerg@iu.edu License Terms Turner, G.. 2017. Jetstream,New Ventures in Research, Engineering and Educational Computing: Open Research Cloud Declaration, Boston MA. Also available at: http://jetstream-cloud.org/publications.php Jetstream is supported by NSF award 1445604 (Craig Stewart, IU, PI) XSEDE is supported by NSF award 1053575 (John Towns, UIUC, PI) This research was supported in part by the Indiana University Pervasive Technology Institute, which was established with the assistance of a major award from the Lilly Endowment, Inc. Opinions presented here are those of the author(s) and do not necessarily represent the views of the NSF, IUPTI, IU, or the Lilly Endowment, Inc. Items indicated with a © are under copyright and used here with permission. Such items may not be reused without permission from the holder of copyright except where license terms noted on a slide permit reuse. Except where otherwise noted, contents of this presentation are copyright 2015 by the Trustees of Indiana University. This document is released under the Creative Commons Attribution 3.0 Unported license (http://creativecommons.org/licenses/by/3.0/). This license includes the following terms: You are free to share – to copy, distribute and transmit the work and to remix – to adapt the work under the following conditions: attribution – you must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). For any reuse or distribution, you must make clear to others the license terms of this work.

Challenges of the past; thoughts for the future (cont.)