Teaching How to Scale Science (and People) Using Cloud Resources Nirav Merchant The University of Arizona

Slides:



Advertisements
Similar presentations
E-Science Data Information and Knowledge Transformation Thoughts on Education and Training for E-Science Based on edikt project experience Dr. Denise Ecklund.
Advertisements

PROBLEM-BASED LEARNING & CAPACITY BUILDING
Supporting Research on Campus - Using Cyberinfrastructure (CI) Public research use of ICT has rapidly increased in the past decade, requiring high performance.
Global Health and health Informatics: Serving the underserved Paul Biondich, MD MS Regenstrief Institute & OpenMRS.
Supporting and Hosting Web- Based Learning Systems Educause 2001 Charlene Douglas – Director Kathryn Gomm - Training Manager Sharon McCarrager – Accessibility.
What is Cloud Computing? o Cloud computing:- is a style of computing in which dynamically scalable and often virtualized resources are provided as a service.
“Grandpa’s up there somewhere.”. Making your IT skills virtual What it takes to move your services to the cloud Erik Mitchell | Kevin Gilbertson | Jean-Paul.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
VAP What is a Virtual Application ? A virtual application is an application that has been optimized to run on virtual infrastructure. The application software.
IPlant Collaborative Powering a New Plant Biology iPlant Collaborative Powering a New Plant Biology.
VSS supports knowledge management Knowledge based organizations and improving the absorption capacity.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
Dot.edu: An e-learning Infrastructure for the University of Wisconsin System and Beyond CUMREC 2002 Charlene Douglas – Director Kathryn Gomm – Training.
 Prototype for Course on Web Security ETEC 550.  Huge topic covering both system/network architecture and programming techniques.  Identified lack.
Data to Discovery The iPlant Collaborative Community Cyberinfrastructure for Life Science Nirav Merchant iPlant / University.
PCGRID ‘08 Workshop, Miami, FL April 18, 2008 Preston Smith Implementing an Industrial-Strength Academic Cyberinfrastructure at Purdue University.
Overview of the HUBzero Platform
Enabling Cloud and Grid Powered Image Phenotyping Nirav Merchant iPlant Collaborative
Cloud Computing. Cloud Computing defined Dynamically scalable, device-independent and task-centric computing resources are provided online, with all charges.
IT ISSUES & TRENDS, 2015 Faculty Technology Day Wednesday, August 19, 2025.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
Customized cloud platform for computing on your terms ! Nirav Merchant
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Objectives.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
VCL Virtual Computing Labora tory An Opportunity to Lead Technical Economic Pedagogical Teaching & Learning Research & Development Outreach & Engagement.
Cloud Strategy made Simple David G. Fletcher. 2 Hybrid Cloud Approach Utah is building a private cloud to provision services from its virtualized infrastructure.
Enterprise Cloud Computing
IT&MS MAY 2015 ALL STAFF. WELCOME TO THE MIDDLE OF SPRING! Mid Quarter Check-in  Overview  Service Management  Spiceworks  Project Planning and Timelines.
UF-SNRE Program in Land Use Dynamics A new synthesis of disciplines for the delivery of science for a sustainable future.
Overview of the NEH’s Digital Humanities Initiative
MicroAge Cloud Servers The cloud platform for the modern business Small and midsize businesses (SMBs) want the power and flexibility to run their business.
Unleash your inner (data) scientist : The ability and audacity to scale your science with extensible cyberinfrastructure Nirav Merchant The University.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Office 365 What Is The Right Fit For You? Andrew Riley, MCTS, MCTIP, ITIL v3.
Computer Science Infrastructure Security for Virtual Cloud Computing Peng Ning 04/08/111BITS/ Financial Services Roundtable Supported by the US National.
Transforming Science Through Data-driven Discovery Workshop Overview Ohio State University MCIC Jason Williams – Lead, CyVerse – Education, Outreach, Training.
An Innovative Internship Model ( version 3 1/3/2012 )
Transforming Science Through Data-driven Discovery Using CyVerse Cyberinfrastructure to Enable Data Intensive Research, Collaboration, and Education Joslynn.
Chapter 6: Securing the Cloud
Scaling Compute with R in CyVerse
VCL Virtual Computing Laboratory An Opportunity to Lead
ICT22 – 2016: Technologies for Learning and Skills ICT24 – 2016: Gaming and gamification Francesca Borrelli DG CONNECT, European Commission BRUXELLES.
How IoT Initiatives are Changing Product Development.
Vision for a Research Presence
Give Your Data the Edge A Scalable Data Delivery Platform
CyVerse Tools and Services
Tools and Services Workshop
University of Chicago and ANL
Customized cloud platform for computing on your terms !
Joslynn Lee – Data Science Educator
What is Cloud Computing - How cloud computing help your Business?
architecting the DIGITAL enterprise
Tools and Services Workshop Overview of Atmosphere
A Web-enabled Approach for generating data processors
Harvard CRM Service Strategy
Week 01 Comp 7780 – Class Overview.
Enterprise Application Stores
Cloud Computing Student Access and Success Through a DOL Grant
NAV In The Cloud: Exploring Options for a Cloud-based Deployment
Cyberinfrastructure for the Life Sciences
Virtualization, Cloud Computing, and TeraGrid
SILICE SinnoLAB Model- Social Start-up Generator 1.2
School of Education Opportunity for Discovery, Learning & Engagement
Karen Bartleson, President, IEEE Standards Association
Storing and Accessing G-OnRamp’s Assembly Hubs outside of Galaxy
PD Goals Program Overview December, 2012
PD Goals Program Overview December, 2012
An enterprise solution to managing and transferring technology and knowledge A Springboard Project.
OU BATTLECARD: Oracle Identity Management Training
Presentation transcript:

Teaching How to Scale Science (and People) Using Cloud Resources Nirav Merchant The University of Arizona

Topic Coverage: My Motivation (for teaching cloud) Pragmatic approaches (as a cloud provider and consumer) Exemplar courses and workshops (that leverage cloud) Teaching complex stack/technology through project based learning (my experience) Future direction

Motivation for teaching/using cloud: Early 2000, many students from our Bioinformatics course (MCB 416/516 with David Mount) with computational expertise were getting (postdoc and faculty) placement at academic institutes with limited (no) institutional computational infrastructure U. Arizona awarded NSF IGERT & NIH PERT, both with active engagement with local community colleges and PUI These new postdoc and faculty wanted to include “bioinformatics at scale” as part of courses they were designing. Owning infrastructure was not possible (staff expertise, budget, data center space etc.) Computational thinking had become central to our course work Amazon AWS ~2006, opportunity to use on demand resources NSF funds iPlant 2008, opportunity to fill usability gap for cloud

Scaling users (and building communities) Most lab have local/in-house computational expert(s): Produces custom code, pipelines, investigates new methods Teaches other how to use them (Occasionally) Victim of their own success (denial of service attacks on their availability) External collaborators start sending data Cannot install your custom code on their systems Faculty wanting to teach hands on topics (in courses and workshops) requiring complex application stacks Cannot spend significant time s/w instalation (taking time away from primary focus/science)

CyVerse: What the users are looking for

Expectation Management in the cloud

Paradigm Shift from Producer to User Innovation Open source book on his website & MIT Courseware online

Empowering users: CyVerse Atmosphere (a managed cloud)

Cloud Computing Not a singular technology component Not a black box or alien technology Not a “elixir of scalability”, “panacea for Big Data” etc. It cannot keep growing and scaling without planning (and architecting your application) Unfortunate victim of marketing hype Further complicated by use of jargon, TLA, private cloud, community cloud, hybrid cloud …

What is cloud computing ?

Cloud Computing…is Amazingly flexible technology It’s a platform that comprise of many uniquely flexible components Allows us to create “purpose built appliances” Allows us to finally “script our infrastructure” Allows mixing and matching of components that you need to do your science Opens up many new avenues and approaches for teaching topics that require complex (pre configured) software tools and data

Covering cloud concepts: Beginners: Typically looking for a “bigger” laptop or desktop in the “cloud” Need admin/sudo privileges to customize software installation (and local IT will not give it) Intermediate: Typically Looking for custom components (sql, web application server) Need to manage workflows (analysis pipelines) Advanced: Technology specific (spark, yarn etc.) Need to manage lifecycle of application and data Edge computing needs

The landscape: Do not get distracted

14 Cloud Computing Zen Don’t get frustrated… –This is cutting (bleeding) edge technology –There will be plenty of moments Be patient… –Instructions/infrastructure keep changing (s/w version) Be flexible… –There will be unanticipated issues along the way Be constructive… –Use wiki, forums and share knowledge – Make everyone’s experience better Be creative… –There is more than one way to do it (TIM-TOWTDI)

? ? Communication Teamwork Responsibility Collaboration I hate people Teamwork Responsibility Collaboration Communication Interdisciplinary Teams Distributed Computing High-performance Computing Data Management Code Management Virtualization Documentation Team Science Applied Cyberinfrastructure Concepts

Annotating 12 Rice Genomes (2012) Ground water movement simulation (2013) Ecosystem energy mass transfer (2014) Exoplanet hunting (2015) Dynamic Mapping of Mosquito (Zika) Abundance (2016) Project based Learning

Topic Coverage: From ideas to solutions (in the cloud)

Scaling users and Ideas