Https://portal.futuregrid.org Directions in eScience Interoperability and Science Clouds June 19 2012 Interoperability in Action – Standards Implementation.

Slides:



Advertisements
Similar presentations
International Conference on Cloud and Green Computing (CGC2011, SCA2011, DASC2011, PICom2011, EmbeddedCom2011) University.
Advertisements

Clouds from FutureGrid’s Perspective April Geoffrey Fox Director, Digital Science Center, Pervasive.
CLOUD COMPUTING AN OVERVIEW & QUALITY OF SERVICE Hamzeh Khazaei University of Manitoba Department of Computer Science Jan 28, 2010.
FutureGrid Image Repository: A Generic Catalog and Storage System for Heterogeneous Virtual Machine Images Javier Diaz, Gregor von Laszewski, Fugang Wang,
Applied Architectures Eunyoung Hwang. Objectives How principles have been used to solve challenging problems How architecture can be used to explain and.
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
Cyberinfrastructure Supporting Social Science Cyberinfrastructure Workshop October Chicago Geoffrey Fox
Virtual Clusters Supporting MapReduce in the Cloud Jonathan Klinginsmith School of Informatics and Computing.
3DAPAS/ECMLS panel Dynamic Distributed Data Intensive Analysis Environments for Life Sciences: June San Jose Geoffrey Fox, Shantenu Jha, Dan Katz,
1 Challenges Facing Modeling and Simulation in HPC Environments Panel remarks ECMS Multiconference HPCS 2008 Nicosia Cyprus June Geoffrey Fox Community.
Big Data and Clouds: Challenges and Opportunities NIST January Geoffrey Fox
X-Informatics Cloud Technology (Continued) March Geoffrey Fox Associate.
1 1 Hybrid Cloud Solutions (Private with Public Burst) Accelerate and Orchestrate Enterprise Applications.
Science Clouds and CFD NIA CFD Conference: Future Directions in CFD Research, A Modeling and Simulation Conference August.
Science of Cloud Computing Panel Cloud2011 Washington DC July Geoffrey Fox
Getting Access to FutureGrid CTS Conference 2011 Philadelphia May Geoffrey Fox
Clouds for Sensors and Data Intensive Applications May st International Workshop on Data-intensive Process Management.
Experimenting with FutureGrid CloudCom 2010 Conference Indianapolis December Geoffrey Fox
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Science Clouds and FutureGrid’s Perspective June Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox
OpenQuake Infomall ACES Meeting Maui May Geoffrey Fox
EXPOSE GOOGLE APP ENGINE AS TASKTRACKER NODES AND DATA NODES.
Science Applications on Clouds June Cloud and Autonomic Computing Center Spring 2012 Workshop Cloud Computing: from.
1 © 2009 Cisco Systems, Inc. All rights reserved.Cisco Confidential Cloud Computing – The Value Proposition Wayne Clark Architect, Intelligent Network.
Scientific Computing Environments ( Distributed Computing in an Exascale era) August Geoffrey Fox
ICETE 2012 Joint Conference on e-Business and Telecommunications Hotel Meliá Roma Aurelia Antica, Rome, Italy July
FutureGrid Connection to Comet Testbed and On Ramp as a Service Geoffrey Fox Indiana University Infra structure.
Image Generation and Management on FutureGrid CTS Conference 2011 Philadelphia May Geoffrey Fox
Magellan: Experiences from a Science Cloud Lavanya Ramakrishnan.
Some remarks on Use of Clouds to Support Long Tail of Science July XSEDE 2012 Chicago ILL July 2012 Geoffrey Fox.
SALSASALSASALSASALSA FutureGrid Venus-C June Geoffrey Fox
ISERVOGrid Architecture Working Group Brisbane Australia June Geoffrey Fox Community Grids Lab Indiana University
SALSASALSASALSASALSA Clouds Ball Aerospace March Geoffrey Fox
X-Informatics MapReduce February Geoffrey Fox Associate Dean for Research.
Programming Models for Technical Computing on Clouds and Supercomputers (aka HPC) May Cloud Futures 2012 May 7–8,
Scientific Computing Supported by Clouds, Grids and HPC(Exascale) Systems June HPC 2012 Cetraro, Italy Geoffrey Fox.
SALSASALSASALSASALSA Cloud Panel Session CloudCom 2009 Beijing Jiaotong University Beijing December Geoffrey Fox
Big Data Open Source Software and Projects ABDS in Summary IV: Level 7 I590 Data Science Curriculum August Geoffrey Fox
Internet of Things (Smart Grid) Storm Archival Storage – NOSQL like Hbase Streaming Processing (Iterative MapReduce) Batch Processing (Iterative MapReduce)
Virtual Appliances CTS Conference 2011 Philadelphia May Geoffrey Fox
Computing Research Testbeds as a Service: Supporting large scale Experiments and Testing SC12 Birds of a Feather November.
Recipes for Success with Big Data using FutureGrid Cloudmesh SDSC Exhibit Booth New Orleans Convention Center November Geoffrey Fox, Gregor von.
Security: systems, clouds, models, and privacy challenges iDASH Symposium San Diego CA October Geoffrey.
Web Technologies Lecture 13 Introduction to cloud computing.
Big Data to Knowledge Panel SKG 2014 Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China August Geoffrey Fox
HPC in the Cloud – Clearing the Mist or Lost in the Fog Panel at SC11 Seattle November Geoffrey Fox
1 Cloud Systems Panel at HPDC Boston June Geoffrey Fox Community Grids Laboratory, School of informatics Indiana University
Panel Discussion Software Defined Ecosystems June BigSystem Software-Defined Ecosystems at HPDC Vancouver Canada Geoffrey Fox.
Big Data Open Source Software and Projects ABDS in Summary II: Layer 5 I590 Data Science Curriculum August Geoffrey Fox
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Big Data Workshop Summary Virtual School for Computational Science and Engineering July Geoffrey Fox
Panel: Beyond Exascale Computing
Private Public FG Network NID: Network Impairment Device
Geoffrey Fox, Shantenu Jha, Dan Katz, Judy Qiu, Jon Weissman
Status and Challenges: January 2017
Recap: introduction to e-science
University of Technology
NSF : CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science PI: Geoffrey C. Fox Software: MIDAS HPC-ABDS.
FutureGrid Computing Testbed as a Service
Digital Science Center Overview
Cloud DIKW based on HPC-ABDS to integrate streaming and batch Big Data
Clouds from FutureGrid’s Perspective
Services, Security, and Privacy in Cloud Computing
Department of Intelligent Systems Engineering
$1M a year for 5 years; 7 institutions Active:
3 Questions for Cluster and Grid Use
Panel on Research Challenges in Big Data
Cloud versus Cloud: How Will Cloud Computing Shape Our World?
Big Data, Simulations and HPC Convergence
Convergence of Big Data and Extreme Computing
Presentation transcript:

Directions in eScience Interoperability and Science Clouds June Interoperability in Action – Standards Implementation in VENUS-C & the context of the SIENA Roadmap OGF35 at HPDC 2012 Delft Geoffrey Fox Director, Digital Science Center, Pervasive Technology Institute Associate Dean for Research and Graduate Studies, School of Informatics and Computing Indiana University Bloomington

Successes in eScience I Basic Supercomputer Architecture now being extended to Exascale – Grand Challenge activity produced consensus Basic OGF standards such as JSDL, BES, SAGA, GridFTP Software as a Service Use of Services Use of Workflow Use of Portals Say “use of” as details not agreed 2

More on Successes Appliances/Roles in Clouds (see Venus-C later) – Images defined explicitly (by construction) or implicitly by content Value added Platforms such as MPI, parallel domain specific Libraries, (Iterative) MapReduce, Queues, Tables and other NOSQL data models, Object Stores, HDFS/GFS style file systems PaaS delivered by tools/libraries/roles? Other good important general standards in security, OVF, accounting, networking 3

What Platforms to use in Clouds HDFS style file system to collocate data and computing Or Object Stores as basic scalable storage Queues to manage multiple tasks Tables to track job information MapReduce and Iterative MapReduce for parallelism Services for everything Portals as User Interface Appliances and Roles as customized images Software environments/tools like Google App Engine, memcached, Workflow to link multiple services (functions) 4

What to use in Grids and Supercomputers? Portals, Services and Workflow as in clouds MPI and GPU/multicore threaded parallelism Wonderful libraries supporting parallel linear algebra, particle evolution, partial differential equation solution Parallel I/O for high performance in an application Wide area File System (e.g. Lustre) supporting file sharing This is a rather different style of PaaS from clouds – we should unify? 5

Comments No agreement on problem to solve e.g. what is architecture for data intensive problems, role of clouds(!) Certainly no agreement on even style of workflow Services can be WSDL or REST Confusion as to architecture level being standardized – User or developer? – e.g. clouds may be built on federated infrastructure; that must be hidden from user 6

Some Standards Futures In general look for a few key SIMPLE concepts From past, SQL and MPI standardization very successful – suggesting that Cloud PaaS standards should be looked at – MapReduce – NOSQL data models Needs to be done at right time De facto standard “Hadoop” versus “real” standard What “roles” are important: Worker, Web, Grid, Worker + I/O, MPI, MapReduce, GPU – need a study? Roles v. Libraries v. Standard Interfaces GPU related standards: OpenACC extends OpenMP 7

Using Science Clouds in a Nutshell High Throughput Computing; pleasingly parallel; grid applications Multiple users (long tail of science) and usages (parameter searches) Internet of Things (Sensor nets) as in cloud support of smart phones (Iterative) MapReduce including “most” data analysis Exploiting elasticity and platforms (HDFS, Object Stores, Queues..) Use worker roles, services, portals (gateways) and workflow Good Strategies: – Build the application as a service; – Build on existing cloud deployments/roles such as Hadoop; – Use PaaS if possible; (This is not clearly eScience strategy – uses IaaS?) – Design for failure; (Not much work on what this means. Are there tools?) – Use as a Service (e.g. SQLaaS) where possible; (WHAT should be Provided) – Address Challenge of Moving Data (Need Production large scale Science Cloud) 8

Cosmic Comments Does Cloud + MPI Engine cover the future? – Will current High throughput computing and cloud concepts merge? Need Data analytics libraries for HPC and Clouds Does a “modest-size private science cloud” make sense – Too small to be elastic Should governments fund use of commercial clouds (or build their own) – Most science doesn’t have privacy issues motivating some private clouds Most interest in clouds from “new” applications such as life sciences Recent cloud infrastructure (Eucalyptus 3, OpenStack Essex) much improved More employment opportunities in clouds than HPC and Grids; so cloud related activities popular with students 9