LARGE SCALE DEPLOYMENT OF DAP AND DTS Rob Kooper Jay Alemeda Volodymyr Kindratenko.

Slides:



Advertisements
Similar presentations
Cloud Service Models and Performance Ang Li 09/13/2010.
Advertisements

SLA-Oriented Resource Provisioning for Cloud Computing
System Center 2012 R2 Overview
Take your CMS to the cloud to lighten the load Brett Pollak Campus Web Office UC San Diego.
Virtual Machine Usage in Cloud Computing for Amazon EE126: Computer Engineering Connor Cunningham Tufts University 12/1/14 “Virtual Machine Usage in Cloud.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 6 2/13/2015.
Clouds from FutureGrid’s Perspective April Geoffrey Fox Director, Digital Science Center, Pervasive.
IDC HPC User Forum Conference Appro Product Update Anthony Kenisky, VP of Sales.
Linux Clustering A way to supercomputing. What is Cluster? A group of individual computers bundled together using hardware and software in order to make.
FI-WARE – Future Internet Core Platform FI-WARE Cloud Hosting July 2011 High-level description.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
1 AppliedMicro X-Gene ® ARM Processors Optimized Scale-Out Solutions for Supercomputing.
Research Computing with Newton Gerald Ragghianti Nov. 12, 2010.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
MATE-EC2: A Middleware for Processing Data with Amazon Web Services Tekin Bicer David Chiu* and Gagan Agrawal Department of Compute Science and Engineering.
High Performance Computing with cloud Xu Tong. About the topic Why HPC(high performance computing) used on cloud What’s the difference between cloud and.
Capacity Planning in SharePoint Capacity Planning Process of evaluating a technology … Deciding … Hardware … Variety of Ways Different Services.
NSF Vision and Strategy for Advanced Computational Infrastructure Vision: NSF Leadership in creating and deploying a comprehensive portfolio…to facilitate.
Design Discussion Rain: Dynamically Provisioning Clouds within FutureGrid Geoffrey Fox, Andrew J. Younge, Gregor von Laszewski, Archit Kulshrestha, Fugang.
Cloud Computing Why is it called the cloud?.
SUMMER VACATION SCHOLARSHIP | IM&T Scientific Computing in the Cloud.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Opensource for Cloud Deployments – Risk – Reward – Reality
1 1 Hybrid Cloud Solutions (Private with Public Burst) Accelerate and Orchestrate Enterprise Applications.
1 1 Hybrid Cloud Solutions (Private with Public Burst) Accelerate and Orchestrate Enterprise Applications.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 7 2/23/2015.
Introduction To Windows Azure Cloud
CLOUD COMPUTING 2.0 Finally, the promise of the cloud has arrived v 1.8.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
OOI CI R2 Life Cycle Objectives Review Aug 30 - Sep Ocean Observatories Initiative OOI CI Release 2 Life Cycle Objectives Review CyberPoPs & Network.
+ CS 325: CS Hardware and Software Organization and Architecture Cloud Architectures.
Experimenting with FutureGrid CloudCom 2010 Conference Indianapolis December Geoffrey Fox
Science Clouds and FutureGrid’s Perspective June Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 2.
Ceph Storage in OpenStack Part 2 openstack-ch,
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
Sandor Acs 05/07/
FutureGrid Connection to Comet Testbed and On Ramp as a Service Geoffrey Fox Indiana University Infra structure.
Looking Ahead: A New PSU Research Cloud Architecture Chuck Gilbert - Systems Architect and Systems Team Lead Research CI Coordinating Committee Meeting.
Agenda Motion Imagery Challenges Overview of our Cloud Activities -Big Data -Large Data Implementation Lessons Learned Summary.
SALSASALSASALSASALSA FutureGrid Venus-C June Geoffrey Fox
Computing Research Testbeds as a Service: Supporting large scale Experiments and Testing SC12 Birds of a Feather November.
Load Rebalancing for Distributed File Systems in Clouds.
KAASHIV INFOTECH – A SOFTWARE CUM RESEARCH COMPANY IN ELECTRONICS, ELECTRICAL, CIVIL AND MECHANICAL AREAS
 Cloud Computing technology basics Platform Evolution Advantages  Microsoft Windows Azure technology basics Windows Azure – A Lap around the platform.
The Evolution of the Italian HPC Infrastructure Carlo Cavazzoni CINECA – Supercomputing Application & Innovation 31 Marzo 2015.
Architecture of a platform for innovation and research Erik Deumens – University of Florida SC15 – Austin – Nov 17, 2015.
Building on virtualization capabilities for ExTENCI Carol Song and Preston Smith Rosen Center for Advanced Computing Purdue University ExTENCI Kickoff.
Introduction to Data Analysis with R on HPC Texas Advanced Computing Center Feb
New Computing Effort at CSU Fresno ATLAS Group Cui Lin, Yongsheng Gao California State University, Fresno 10/11/2011 at SMU Workshop LHC.
Extreme Scale Infrastructure
Azure.
Organizations Are Embracing New Opportunities
What is HPC? High Performance Computing (HPC)
ISDA + OpenStack Rob Kooper.
Status and Challenges: January 2017
Heterogeneous Computation Team HybriLIT
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Cloud Data platform (Cloud Application Development & Deployment)
Bridges and Clouds Sergiu Sanielevici, PSC Director of User Support for Scientific Applications October 12, 2017 © 2017 Pittsburgh Supercomputing Center.
Traditional Enterprise Business Challenges
AWS. Introduction AWS launched in 2006 from the internal infrastructure that Amazon.com built to handle its online retail operations. AWS was one of the.
Azure.
NSF : CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science PI: Geoffrey C. Fox Software: MIDAS HPC-ABDS.
Versatile HPC: Comet Virtual Clusters for the Long Tail of Science SC17 Denver Colorado Comet Virtualization Team: Trevor Cooper, Dmitry Mishin, Christopher.
Clouds from FutureGrid’s Perspective
Cloud Computing Architecture
IBM Power Systems.
Presentation transcript:

LARGE SCALE DEPLOYMENT OF DAP AND DTS Rob Kooper Jay Alemeda Volodymyr Kindratenko

The need for scaling How can we scale? How can DAP architecture scale? How can DTS architecture scale? What options do we have to scale? Amazon solution for scaling XSEDE solution for scaling Cloud solution for scaling

Finite Resources CPUMemoryDiskNetwork

Scalability A system whose performance improves after adding hardware, proportionally to the capacity added, is said to be a scalable system.

Scaling Up And Out Scale UP (vertically) Adding resources to a single system “Speed” Performance Moor’s Law Scale OUT (horizontally) Cloud Adding nodes to the system Nodes can be commodity hardware (vs HPC) Increase software complexity Increase management complexity

Elasticity Need ability to grow/shrink on demand Based on workload add or remove resources Keep requirements small If many people use one service bring up more of those Don’t bring up services that people don’t use

Software Server Architecture Software Server Image Magic Image Magic Open Office ffmepg 3D Studio … … Polyglot Unknown Format Data Unknown Format Data Useable Data Useable Data

Medici 2.0 Architecture Frontend Webapp Load Balancer MongoD B HTTP HTML JSON HTTP HTML JSON External Services Frontend Webapp Frontend Webapp Event Bus (rabbitMQ) Extractor (Java) Extractor (Python) Elastic search Filesystem …… MongoD B Elastic search Elastic search

How to grow? More servers at ISDA Funding is in Brown Dog Not sustainable Commercial Clouds Amazon, … XSEDE NSF funded HPC computation NCSA Cloud infrastructure

AWS Web Application Reference Architecture

AWS Batch Processing Reference Architecture

Pricing Small machine (1CPU, 2GB) Linux $0.026 per Hour Windows $0.036 per Hour Server is approx. $10,000 and can hold 20 VMs Average lifespan 5 years (~ $500 per VM) Equals around 2 years of Amazon time But cheaper if we only need it 8 hours per day! And 7 hours/day in case of windows.

XSEDE Resources Jay Alameda National Center for Supercomputing Applications 23 July 2014

What is XSEDE Integrating service for wide variety of High Performance Computing (HPC) and Visualization and Data Analysis (RDAV) resources – Front line support – Uniform documentation – Extended collaborative support – Training, education and outreach services – Allocations

Variety of HPC and RDAV resources Dynamic list at erview erview – Overview, and expiration dates for each resource – Traditional clusters – Visualization and data analysis resources – Storage resources – High throughput resources – Testbeds – Services

Potentially Interesting Resources for Browndog Testbed resource “FutureGrid” – Production through 9/30/2014 – Partitioned into HPC Infrastructure as a Service (IaaS) – Nimbus – Openstack – Eucalyptus Dedicated – Layer Platform as a Service (PaaS) (eg, MapReduce, Hadoop) on top of these partitions

Potentially Interesting Resources for Browndog - 2 Service resource “Quarry” – Web service hosting environment – Resource end date not specified – Available for XRAC allocations with web-service component Storage: either NSF home directories, or lustre based storage. – OpenVZ provides virtual hosting of RPM based linux distributions – Persistent virtual machine

New XSEDE Resource: Comet Long-tail science system hosted at San Diego Supercomputer Center Builds on experience with SDSC Gordon (flash memory, persistent storage nodes), and SDSC Trestles (long-tail science) – 99% of jobs in 2012 used < 2048 cores – These jobs consumed half of the total core hours across NSF resources.

Comet Partially designed to pick up FutureGrid use (virtual clusters) Gateway hosting nodes and virtual machine repository Optimized for jobs within a rack Continues access to flash memory (Gordon) Capacity computing: computing for the 99% of XSEDE jobs

Comet virtualization Leverage experience and expertise from FutureGrid Virtual machine jobs scheduled like batch jobs Flexible software environments for new communities and applications Virtual machine repository Virtual HPC cluster (multi-(whole)-node), miminum latency and overhead penalty

XSEDE and BrownDog Premise: BrownDog will become an integral part of a researcher’s workflow Question: Should BrownDog evolve into an XSEDE resource provider, to provide data services for XSEDE?

National Center for Supercomputing Applications University of Illinois at Urbana-Champaign ISL Resources Volodymyr Kindratenko Innovative Systems Laboratory

Hadoop

OpenStack Cloud

Virtual Lab for Advanced Design

High memory node Dell PowerEdge R920 CPU Intel Xeon E7-4860v2 2.6 GHz (4) RAM3 TB Storage 2x 300 GB 10,000 RPM SAS 6 Gbps HDD 4x 800 GB SAS Read-Intensive MLC 12 Gbps SSD 6x 1 TB 7,200 RPM Near-Line SAS 6 Gbps HDD Interconnect 6x 1 Gbps Ethernet 2x 10 Gbps Ethernet CPU0 CPU1 CPU2 CPU3 RAM PHC QPI PCIe, DMA

Other systems GPU Server 8 NVIDIA C2050 GPUs Intel Xeon Phi Server 2 Xeon Phi 7120 (Knights Corner) application accelerators HPC cluster 8 nodes