Software infrastructure for a National Research Platform

Slides:



Advertisements
Similar presentations
Implementing Tableau Server in an Enterprise Environment
Advertisements

Peter Berrisford RAL – Data Management Group SRB Services.
Creating HIPAA-Compliant Medical Data Applications with Amazon Web Services Presented by, Tulika Srivastava Purdue University.
Tom Lewis Director, Academic & Collaborative Applications University of Washington.
Power BI Sites and Mobile BI. What You Will Learn Sharing and Collaboration Introducing Power BI Exploring Power BI Features and Services Partner Opportunities.
High Performance Computing Course Notes Grid Computing.
Collaboration on Large Datasets using Globus Rachana Ananthakrishnan University of Chicago.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
4b.1 Grid Computing Software Components of Globus 4.0 ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson, slides 4b.
SaaS, PaaS & TaaS By: Raza Usmani
Monitor Linux OS health & performance Monitor log files Monitor JEE app servers Monitor line-of-business applications Monitor databases and web.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
PhD course - Milan, March /09/ Some additional words about cloud computing Lionel Brunie National Institute of Applied Science (INSA) LIRIS.
Tim Vander Kooi Systems
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
ArcGIS Server for Administrators
Communicating Security Assertions over the GridFTP Control Channel Rajkumar Kettimuthu 1,2, Liu Wantao 3,4, Frank Siebenlist 1,2 and Ian Foster 1,2,3 1.
Module 2: Overview of IIS 7.0 Application Server.
Configuring and Troubleshooting Identity and Access Solutions with Windows Server® 2008 Active Directory®
System Center Lesson 4: Overview of System Center 2012 Components System Center 2012 Private Cloud Components VMM Overview App Controller Overview.
Text Microsoft to Or Tweet #uktechdays Questions?
ALCF Argonne Leadership Computing Facility GridFTP Roadmap Bill Allcock (on behalf of the GridFTP team) Argonne National Laboratory.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
Research data management using Globus ESIP Summer Meeting 2015 Rachana Ananthakrishnan University of Chicago
Globus and ESGF Rachana Ananthakrishnan University of Chicago
Globus.org/genomics Globus Galaxies Science Gateways as a Service Ravi K Madduri, University of Chicago and Argonne National Laboratory
SQL Server 2012 Session: 1 Session: 4 SQL Azure Data Management Using Microsoft SQL Server.
Globus online Delivering a scalable service Steve Tuecke Computation Institute University of Chicago and Argonne National Laboratory.
© 2014 VMware Inc. All rights reserved. Cloud Archive for vCloud ® Air™ High-level Overview August, 2015 Date.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
WINDOWS AZURE AND THE HYBRID CLOUD. Hybrid Concepts and Cloud Services.
Azure Active Directory Uday Hegde 2016 Redmond Summit | Identity Without Boundaries May 26, 2016 Group Program Manager, Azure AD
TOWARDS AN ARCHITECTURE FOR NATIONAL DATA SERVICES Ian Foster Director, Computation Institute Argonne National Laboratory & The University of
Enhancements to Galaxy for delivering on NIH Commons
Accessing the VI-SEEM infrastructure
Grid and Cloud Computing
Run Azure Services in your datacenter
Deploying Web Application
Computing Clusters, Grids and Clouds Globus data service
Section 6 Object Storage Gateway (RADOS-GW)
Amazon Storage- S3 and Glacier
SaaS Application Deep Dive
Introduction to Data Management in EGI
Simple Storage Service
Study course: “Computing clusters, grids and clouds” Andrey Y. Shevel
Viet Tran Institute of Informatics Slovakia
Research Data Archive - technology
AWS DevOps Engineer - Professional dumps.html Exam Code Exam Name.
Get Amazon AWS-DevOps-Engineer-Professional Exam Real Questions - Amazon AWS-DevOps-Engineer-Professional Dumps Realexamdumps.com
Getting Started.
Getting Started.
AWS Cloud Computing Masaki.
Cloud Web Filtering Platform
Microsoft Virtual Academy
SharePoint Online Authentication Patterns
Microsoft Virtual Academy
Technical Capabilities
Office 365 Development.
The Anatomy and The Physiology of the Grid
Azure Active Directory
The Anatomy and The Physiology of the Grid
Designing IIS Security (IIS – Internet Information Service)
Features Overview.
SCCM in hybrid world Predrag Jelesijević Microsoft 7/6/ :17 AM
Microsoft Virtual Academy
Microsoft Virtual Academy
STATEL an easy way to transfer data
Microsoft Virtual Academy
IoT Security and Privacy
Presentation transcript:

Software infrastructure for a National Research Platform Ian Foster The University of Chicago Argonne National Laboratory Talk at 1st National Research Platform Workshop Aug 7-8, 2017 Bozeman, Montana

Congratulations, you have a Science DMZ! Credit: Eli Dart

What you really want is a science accelerator Software Infrastructure High-speed data ingest Secure data sharing Data publication Smart instruments Ultra-scale collaboration Software transmutes silicon into discoveries

A strong software infrastructure is… Accessible — trivially usable by all Ubiquitous — it goes where you need it Performant — fast end to end Secure — all resources are protected Reliable — you can count on it Programmable — you can build on it Manageable — it supports sys admins, too Sustainable — it will be there tomorrow

Accessible means trivially usable by all Compute Facility Instrument Globus transfers files reliably, securely 2 Globus controls access to shared files on existing storage; no need to move files to cloud storage! 4 Curator reviews and approves; dataset published on campus or other system 7 Accessible Ubiquitous Performant Secure Reliable Programmable Manageable Sustainable Transfer Researcher selects files to share, selects user or group, and sets access permissions 3 Collaborator logs in to Globus and accesses shared files; no local account required; download via Globus 5 Share Researcher assembles dataset; describes it with Dublin core & domain-specific metadaa 6 Researcher initiates transfer request; or requested automatically by script, science gateway 1 Publication Repository A U P P S R M S PURPOSE SOFTWARE Peers, collaborators search and discover datasets; transfer and share using Globus 8 Access via web browser, command line, or REST API Use any storage Use existing identity Publish Personal Computer Discover

Ubiquitous means it goes where you need it 10,000+ active endpoints Native packages Installs in seconds Linux, Windows, MacOS GPFS, Lustre, OrangeFS, … AWS S3, Ceph RadosGW Spectra Logic BlackPearl Google Drive, HPSS Accessible Ubiquitous Performant Secure Reliable Programmable Manageable Sustainable A U P P S R M S PURPOSE SOFTWARE Amazon Glacier

Performant means fast end to end Specialized protocols Auto-configuration Parallel DTNs File system optimizations Tape system optimizations Accessible Ubiquitous Performant Secure Reliable Programmable Manageable Sustainable 1PB in 1.002 days, ArgonneNCSA R. Kettimuthu et al.

Secure means all resources are protected Globus service is itself highly secure Best-practice cloud security Third-party security reviews Globus platform ensures your services are secure Accept credentials from 300+ identity providers Control proxy credential lifetimes Industry-standard OAuth-2 and OIDC protocols Data encryption Build secure services with controlled delegation Accessible Ubiquitous Performant Secure Reliable Programmable Manageable Sustainable A U P P S R M S PURPOSE SOFTWARE

Reliable means you can count on it Each transfer is monitored, retried upon failure Protocols support restart Fail over on multiple DTNs Service is cloud hosted, with replication, dynamic failover, monitoring 99.5% uptime over past three years Accessible Ubiquitous Performant Secure Reliable Programmable Manageable Sustainable A U P P S R M S PURPOSE SOFTWARE

Programmable means you can build on it Globus Auth API … Globus Transfer API Globus Connect Data Publication & Discovery File Sharing File Transfer & Replication Accessible Ubiquitous Performant Secure Reliable Programmable Manageable Sustainable Web Command line A U P P S R M S PURPOSE SOFTWARE GET /endpoint/go%23ep1 PUT /endpoint/vas#my_endpt 200 OK X-Transfer-API-Version: 0.10 Content-Type: application/json … Integrate file transfer and sharing capabilities into scientific web apps, portals, gateways, etc. Use institutional ID systems in external web applications REST API

Programmable means you can build on it Jupyter Notebooks Globus Auth API … Globus Transfer API Globus Connect Data Publication & Discovery File Sharing File Transfer & Replication Accessible Ubiquitous Performant Secure Reliable Programmable Manageable Sustainable Python SDK Integrate file transfer and sharing capabilities into scientific web apps, portals, gateways, etc. Use institutional ID systems in external web applications

Programmable means automation globus.org Recurring transfers with sync option Copy /ingest Daily @ 3:30am Accessible Ubiquitous Performant Secure Reliable Programmable Manageable Sustainable Data distribution .../my_share --/cohort045 --/cohort096 --/cohort127 Shared Endpoint Staging area cleanup Shared Endpoint 1. Check if successful transfer 2. Delete data from staging area .../distribute

Programmable means automation Accessible Ubiquitous Performant Secure Reliable Programmable Manageable Sustainable ARM Climate Research Facility

Manageable means it helps sys admins, too Low admin costs Priority support Usage reporting Management console Alternative identity provider Training materials Constant innovation Accessible Ubiquitous Performant Secure Reliable Programmable Manageable Sustainable A U P P S R M S PURPOSE SOFTWARE

Sustainable means it will be there tomorrow Operated by professionals at the University of Chicago Supported by subscriptions from >65 institutions Accessible Ubiquitous Performant Secure Reliable Programmable Manageable Sustainable Picture of team

Raising the bar on research software quality 5 major services 13 national labs use Globus 290PB transferred 10,000 active endpoints 50 Bn files processed 70,000 registered users 99.5% uptime 65+ institutional subscribers 1 PB largest single transfer to date 3 months longest continuously managed transfer 300+ federated campus identities 12,000 active users/year Accessible Ubiquitous Performant Secure Reliable Programmable Manageable Sustainable

Easier More Better Get more data to more people faster Authentication Software infrastructure for a national research platform Get more data to more people faster Easier Authentication Transfer Sharing Publication Administration More Users Time Data Storage Better Collaboration Ideas Innovation Software transmutes hardware into discoveries

Thank you to our sponsors! globus.org Our subscribers U.S. DEPARTMENT OF ENERGY