Science of Cloud Computing Panel Cloud2011 Washington DC July 5 2011 Geoffrey Fox

Slides:



Advertisements
Similar presentations
SALSA HPC Group School of Informatics and Computing Indiana University.
Advertisements

FutureGrid Overview NSF PI Science of Cloud Workshop Washington DC March Geoffrey Fox
Cosmic Issues and Analysis of External Comments on FutureGrid TG11 Salt Lake City July Geoffrey Fox
Clouds from FutureGrid’s Perspective April Geoffrey Fox Director, Digital Science Center, Pervasive.
Future Grid Introduction March MAGIC Meeting Gregor von Laszewski Community Grids Laboratory, Digital Science.
Authors: Thilina Gunarathne, Tak-Lon Wu, Judy Qiu, Geoffrey Fox Publish: HPDC'10, June 20–25, 2010, Chicago, Illinois, USA ACM Speaker: Jia Bao Lin.
Clouds will win! Geoffrey Fox Director,
Student Visits August Geoffrey Fox
Overview Presented at OGF31 Salt Lake City, July 2011 Geoffrey Fox, Gregor von Laszewski, Renato Figueiredo Contact:
SALSASALSASALSASALSA Digital Science Center June 25, 2010, IIT Geoffrey Fox Judy Qiu School.
Data Intensive Applications on Clouds The Second International Workshop on Data Intensive Computing in the Clouds (DataCloud-SC11)
FutureGrid Summary TG’10 Pittsburgh BOF on New Compute Systems in the TeraGrid Pipeline August Geoffrey Fox
Panel Session The Challenges at the Interface of Life Sciences and Cyberinfrastructure and how should we tackle them? Chris Johnson, Geoffrey Fox, Shantenu.
3DAPAS/ECMLS panel Dynamic Distributed Data Intensive Analysis Environments for Life Sciences: June San Jose Geoffrey Fox, Shantenu Jha, Dan Katz,
FutureGrid Summary FutureGrid User Advisory Board TG’10 Pittsburgh August Geoffrey Fox
Big Data and Clouds: Challenges and Opportunities NIST January Geoffrey Fox
SCSI: Platforms & Foundations: Cyberinfrastructure Socially Coupled Systems & Informatics: Science, Computing & Decision Making in a Complex Interdependent.
X-Informatics Cloud Technology (Continued) March Geoffrey Fox Associate.
FutureGrid: an experimental, high-performance grid testbed Craig Stewart Executive Director, Pervasive Technology Institute Indiana University
FutureGrid: an experimental, high-performance grid testbed Craig Stewart Executive Director, Pervasive Technology Institute Indiana University
FutureGrid Overview CTS Conference 2011 Philadelphia May Geoffrey Fox
Cloud Data mining and FutureGrid SC10 New Orleans LA AIST Booth November Geoffrey Fox
Science Clouds and CFD NIA CFD Conference: Future Directions in CFD Research, A Modeling and Simulation Conference August.
FutureGrid SOIC Lightning Talk February Geoffrey Fox
FutureGrid and US Cyberinfrastructure Collaboration with EU Symposium on transatlantic EU-U.S. cooperation in the field of large scale research infrastructures.
Experimenting with FutureGrid CloudCom 2010 Conference Indianapolis December Geoffrey Fox
Cloud Architecture for Earthquake Science 7 th ACES International Workshop 6th October 2010 Grand Park Otaru Otaru Japan Geoffrey Fox
Science Clouds and FutureGrid’s Perspective June Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox
FutureGrid Overview Geoffrey Fox
Bioinformatics on Cloud Cyberinfrastructure Bio-IT April Geoffrey Fox
Biomedical Cloud Computing iDASH Symposium San Diego CA May Geoffrey Fox
Future Grid FutureGrid Overview Dr. Speaker. Future Grid Future GridFutureGridFutureGrid The goal of FutureGrid is to support the research on the future.
FutureGrid Overview Geoffrey Fox
FutureGrid: an experimental, high-performance grid testbed Craig Stewart Executive Director, Pervasive Technology Institute Indiana University
What’s Hot in Clouds? Analyze (superficially) the ~140 Papers/Short papers/Workshops/Posters/Demos in CloudCom Each paper may fall in more than one category.
Future Grid FutureGrid Overview Geoffrey Fox SC09 November
FutureGrid Overview Geoffrey Fox
FutureGrid SC10 New Orleans LA IU Booth November Geoffrey Fox
FutureGrid Overview Geoffrey Fox
Some remarks on Use of Clouds to Support Long Tail of Science July XSEDE 2012 Chicago ILL July 2012 Geoffrey Fox.
FutureGrid SOIC Lightning Talk February Geoffrey Fox
FutureGrid Cyberinfrastructure for Computational Research.
Building Effective CyberGIS: FutureGrid Marlon Pierce, Geoffrey Fox Indiana University.
RAIN: A system to Dynamically Generate & Provision Images on Bare Metal by Application Users Presented by Gregor von Laszewski Authors: Javier Diaz, Gregor.
FutureGrid Computing Testbed as a Service Overview July Geoffrey Fox for FutureGrid Team
SALSASALSASALSASALSA FutureGrid Venus-C June Geoffrey Fox
Research in Grids and Clouds and FutureGrid Melbourne University September Geoffrey Fox
FutureGrid TeraGrid Science Advisory Board San Diego CA July Geoffrey Fox
FutureGrid Overview Geoffrey Fox
Tutorial Presented at TG2011 Geoffrey Fox, Gregor von Laszewski, Renato Figueiredo, Kate Keahey, Andrew Younge Contact:
FutureGrid BOF Overview TG 11 Salt Lake City July Geoffrey Fox
SALSASALSASALSASALSA Clouds Ball Aerospace March Geoffrey Fox
FutureGrid NSF September Geoffrey Fox
Computing Research Testbeds as a Service: Supporting large scale Experiments and Testing SC12 Birds of a Feather November.
Recipes for Success with Big Data using FutureGrid Cloudmesh SDSC Exhibit Booth New Orleans Convention Center November Geoffrey Fox, Gregor von.
Science Applications on Clouds and FutureGrid June Cloud and Autonomic Computing Center Spring 2012 Workshop Cloud.
Security: systems, clouds, models, and privacy challenges iDASH Symposium San Diego CA October Geoffrey.
Future Grid Future Grid Overview. Future Grid Future GridFutureGridFutureGrid The goal of FutureGrid is to support the research that will invent the future.
HPC in the Cloud – Clearing the Mist or Lost in the Fog Panel at SC11 Seattle November Geoffrey Fox
Cloud Cyberinfrastructure and its Challenges & Applications 9 th International Conference on Parallel Processing and Applied.
Bioinformatics on Cloud Cyberinfrastructure Bio-IT April Geoffrey Fox
Directions in eScience Interoperability and Science Clouds June Interoperability in Action – Standards Implementation.
Private Public FG Network NID: Network Impairment Device
Geoffrey Fox, Shantenu Jha, Dan Katz, Judy Qiu, Jon Weissman
Digital Science Center Overview
FutureGrid: a Grid Testbed
Clouds from FutureGrid’s Perspective
Big Data Architectures
Services, Security, and Privacy in Cloud Computing
Cloud versus Cloud: How Will Cloud Computing Shape Our World?
Presentation transcript:

Science of Cloud Computing Panel Cloud2011 Washington DC July Geoffrey Fox Director, Digital Science Center, Pervasive Technology Institute Associate Dean for Research and Graduate Studies, School of Informatics and Computing Indiana University Bloomington

Science of or with Cloud Computing? In that cloud computing is likely to dominate “mainstream” data center computing, any science of useful computing implies there is a science of clouds Research issues for Clouds: – Computer Science – Computational Science – are clouds useful for science – if clouds provide most cost effective computing, lets hope they are Clouds provide Infrastructure – elastic computing on demand – useful and important and cost effective Clouds were developed for large scale data services and associated platforms are transformational for data enabled science – Clouds match the data deluge

Clouds and Grids/HPC Synchronization/communication Performance Grids > Clouds > HPC Systems Clouds appear to execute effectively Grid workloads but are not easily used for closely coupled HPC applications Service Oriented Architectures and workflow appear to work similarly in both grids and clouds Assume for immediate future, science supported by a mixture of – Clouds – Grids/High Throughput Systems (moving to clouds as convenient) – Supercomputers (“MPI Engines”) going to exascale

Authentication and Authorization: Provide single sign in to All system architectures Workflow: Support workflows that link job components between Grids and Clouds. Provenance: Continues to be critical to record all processing and data sources Data Transport: Transport data between job components on Grids and Commercial Clouds respecting custom storage patterns like Lustre v HDFS Program Library: Store Images and other Program material Blob: Basic storage concept similar to Azure Blob or Amazon S3 DPFS Data Parallel File System: Support of file systems like Google (MapReduce), HDFS (Hadoop) or Cosmos (dryad) with compute-data affinity optimized for data processing Table: Support of Table Data structures modeled on Apache Hbase/CouchDB or Amazon SimpleDB/Azure Table. There is “Big” and “Little” tables – generally NOSQL SQL: Relational Database Queues: Publish Subscribe based queuing system Worker Role: This concept is implicitly used in both Amazon and TeraGrid but was (first) introduced as a high level construct by Azure. Naturally support Elastic Utility Computing MapReduce: Support MapReduce Programming model including Hadoop on Linux, Dryad on Windows HPCS and Twister on Windows and Linux. Need Iteration for Datamining Software as a Service: This concept is shared between Clouds and Grids Web Role: This is used in Azure to describe user interface and can be supported by portals in Grid or HPC systems

MapReduce “File/Data Repository” Parallelism Instruments Disks Map 1 Map 2 Map 3 Reduce Communication Map = (data parallel) computation reading and writing data Reduce = Collective/Consolidation phase e.g. forming multiple global sums as in histogram Portals /Users Iterative MapReduce Map Map Reduce Reduce Reduce

Typical iterative data analysis Typical MapReduce runtimes incur extremely high overheads – New maps/reducers/vertices in every iteration – File system based communication Long running tasks and faster communication in Twister (Iterative MapReduce) enables it to perform close to MPI Time for 20 iterations Why Iterative MapReduce? K-means map reduce Compute the distance to each data point from each cluster center and assign points to cluster centers Compute new cluster centers Compute new cluster centers User program

The Power of Cloud Platforms: Twister4Azure Architecture

Research Issues for (Iterative) MapReduce Quantify and Extend that Data analysis for Science seems to work well on Iterative MapReduce and clouds so far. – Iterative MapReduce spans all architectures as unifying idea Performance and Fault Tolerance Trade-offs; – Writing to disk each iteration (as in Hadoop) naturally lowers performance but increases fault-tolerance – Integration of GPU’s Security and Privacy technology and policy essential for use in many biomedical applications Storage: multi-user data parallel file systems have scheduling and management – NOSQL and SciDB on virtualized and HPC systems Data parallel Data analysis languages: Sawzall and Pig Latin more successful than HPF? Scheduling: How does research here fit into scheduling built into clouds and Iterative MapReduce (Hadoop) – important load balancing issues for MapReduce for heterogeneous workloads

Traditional File System? Typically a shared file system (Lustre, NFS …) used to support high performance computing Big advantages in flexible computing on shared data but doesn’t “bring computing to data” S Data S S S Compute Cluster C C C C C C C C C C C C C C C C Archive Storage Nodes

Data Parallel File System? No archival storage and computing brought to data C Data C C C C C C C C C C C C C C C File1 Block1 Block2 BlockN …… Breakup Replicate each block File1 Block1 Block2 BlockN …… Breakup Replicate each block

FutureGrid key Concepts I FutureGrid is an international testbed modeled on Grid5000 Supporting international Computer Science and Computational Science research in cloud, grid and parallel computing (HPC) – Industry and Academia – Note much of current use Education, Computer Science Systems and Biology/Bioinformatics The FutureGrid testbed provides to its users: – A flexible development and testing platform for middleware and application users looking at interoperability, functionality, performance or evaluation – Each use of FutureGrid is an experiment that is reproducible – A rich education and teaching platform for advanced cyberinfrastructure (computer science) classes

FutureGrid key Concepts II Rather than loading images onto VM’s, FutureGrid supports Cloud, Grid and Parallel computing environments by dynamically provisioning software as needed onto “bare-metal” using Moab/xCAT –Image library for MPI, OpenMP, Hadoop, Dryad, gLite, Unicore, Globus, Xen, ScaleMP (distributed Shared Memory), Nimbus, Eucalyptus, OpenNebula, KVM, Windows ….. Growth comes from users depositing novel images in library FutureGrid has ~4000 (will grow to ~5000) distributed cores with a dedicated network and a Spirent XGEM network fault and delay generator Image1 Image2 ImageN … LoadChooseRun

FutureGrid: a Grid/Cloud/HPC Testbed Private Public FG Network NID : Network Impairment Device

FutureGrid Partners Indiana University (Architecture, core software, Support) Purdue University (HTC Hardware) San Diego Supercomputer Center at University of California San Diego (INCA, Monitoring) University of Chicago/Argonne National Labs (Nimbus) University of Florida (ViNE, Education and Outreach) University of Southern California Information Sciences (Pegasus to manage experiments) University of Tennessee Knoxville (Benchmarking) University of Texas at Austin/Texas Advanced Computing Center (Portal) University of Virginia (OGF, Advisory Board and allocation) Center for Information Services and GWT-TUD from Technische Universtität Dresden. (VAMPIR) Red institutions have FutureGrid hardware

5 Use Types for FutureGrid ~122 approved projects over last 10 months Training Education and Outreach (11%) – Semester and short events; promising for non research intensive universities Interoperability test-beds (3%) – Grids and Clouds; Standards; Open Grid Forum OGF really needs Domain Science applications (34%) – Life sciences highlighted (17%) Computer science (41%) – Largest current category Computer Systems Evaluation (29%) – TeraGrid (TIS, TAS, XSEDE), OSG, EGI, Campuses Clouds are meant to need less support than other models; FutureGrid needs more user support ……. 15

Software Components Portals including “Support” “use FutureGrid” “Outreach” Monitoring – INCA, Power (GreenIT) Experiment Manager: specify/workflow Image Generation and Repository Intercloud Networking ViNE Virtual Clusters built with virtual networks Performance library Rain or Runtime Adaptable InsertioN Service for images Security Authentication, Authorization, “Research” Above and below Nimbus OpenStack Eucalyptus