Elastic-R A cloud platform for web computing, real-time collaboration, rapid applications development and reproducible modelling Karim Chine Cloud Era.

Slides:



Advertisements
Similar presentations
automated single login access to Novell storage resources
Advertisements

Cloud Computing Computer Science Innovations, LLC.
Software Installation Deck Big Data Workshop Saturday March 10 th, 2012.
The Access Grid Ivan R. Judson 5/25/2004.
1 Mixing Public and private clouds a Practical Perspective Maarten Koopmans Nordunet Conference 2009 Maarten Koopmans Nordunet Conference 2009.
CollectionSpace for Technology Service Providers and Developers October 22,
University of St Andrews School of Computer Science Experiences with a Private Cloud St Andrews Cloud Computing co-laboratory James W. Smith Ali Khajeh-Hosseini.
Cloud Resource Broker for Scientific Community By: Shahzad Nizamani Supervisor: Peter Dew Co Supervisor: Karim Djemame Mo Haji.
Building and using REST information services Rion Dooley.
Ivan Pleština Amazon Simple Storage Service (S3) Amazon Elastic Block Storage (EBS) Amazon Elastic Compute Cloud (EC2)
Cloud Computing for Education & Cloud Learning Minjuan Wang to BT Research Center (Abu Dhabi) Educational Technology San Diego State University
Immersive Teaching and Research in Data Sciences via Cloud Computing Cloud Era Ltd 13 June 2013 Karim Chine.
Leveraging scriptable infrastructures, Towards a paradigm shift in software for data science Cloud Era Ltd 14 June 2013 Karim.
NG-CHC Northern Gulf Coastal Hazards Collaboratory Simulation Experiment Integration Sandra Harper 1, Manil Maskey 1, Sara Graves 1, Sabin Basyal 1, Jian.
Lecture 12 IaaS Chapter 3 of the second reference book.
CHANGING THE WAY IT WORKS Cloud Computing 4/6/2015 Presented by S.Ganesh ( )
1 Cloud Computing Prof. Ravi Sandhu Executive Director and Endowed Chair April 12, © Ravi Sandhu World-Leading.
Amazon. Cloud computing also known as on-demand computing or utility computing. Similar to other utility providers like electric, water, and natural gas,
, Towards a universal platform for research and education in the cloud Karim Chine, Cloud Era Ltd 13 December 2013 Elastic-R e-Age 2013.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
Matt Bertrand Building GIS Apps in the Cloud. Infrastructure - Provides computer infrastructure, typically a platform virtualization environment, as a.
Nikolay Tomitov Technical Trainer SoftAcad.bg.  What are Amazon Web services (AWS) ?  What’s cool when developing with AWS ?  Architecture of AWS 
Modeling Public Pensions with Mathematica and Python II
M.A.Doman Model for enabling the delivery of computing as a SERVICE.
Cloud computing Tahani aljehani.
Amazon EC2 Quick Start adapted from EC2_GetStarted.html.
R and the Cloud Cloud Era Ltd 27 June 2013 Karim Chine Deuxièmes rencontres R.
What the Cloud can do for Computational Life Sciences: Biocep-R's Unified Perspective Karim Chine
 Cloud computing is one of the more recent technologies that many businesses, individuals and other industry organizations believe to by one of the keys.
Cloud Computing.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
Accessing the Amazon Elastic Compute Cloud (EC2) Angadh Singh Jerome Braun.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
The Blue Book pages 19 onwards
DISTRIBUTED COMPUTING
Introduction to Cloud Computing
| nectar.org.au NECTAR TRAINING Module 1 Overview of cloud computing and NeCTAR services.
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
Customized cloud platform for computing on your terms ! Nirav Merchant
Accelerating Scientific Exploration Using Workflow Automation Systems Terence Critchlow (LLNL) Ilkay Altintas (SDSC) Scott Klasky(ORNL) Mladen Vouk (NCSU)
Portal for ArcGIS An Introduction
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks C. Loomis (CNRS/LAL) M.-E. Bégin (SixSq.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Virtual Appliances CTS Conference 2011 Philadelphia May Geoffrey Fox
Datalayer Notebook Allows Data Scientists to Play with Big Data, Build Innovative Models, and Share Results Easily on Microsoft Azure MICROSOFT AZURE ISV.
| nectar.org.au NECTAR TRAINING Module 1 Overview of cloud computing and NeCTAR services.
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
Web Technologies Lecture 13 Introduction to cloud computing.
Toward a common data and command representation for quantum chemistry Malcolm Atkinson Director 5 th April 2004.
Launch Amazon Instance. Amazon EC2 Amazon Elastic Compute Cloud (Amazon EC2) provides resizable computing capacity in the Amazon Web Services (AWS) cloud.
INFSO-RI JRA2 Test Management Tools Eva Takacs (4D SOFT) ETICS 2 Final Review Brussels - 11 May 2010.
Large-scale accelerator simulations: Synergia on the Grid turn 1 turn 27 turn 19 turn 16 C++ Synergia Field solver (FFT, multigrid) Field solver (FFT,
Information Networks. Internet It is a global system of interconnected computer networks that link several billion devices worldwide. It is an international.
© 2015 MetricStream, Inc. All Rights Reserved. AWS server provisioning © 2015 MetricStream, Inc. All Rights Reserved. By, Srikanth K & Rohit.
INTRODUCTION TO AMAZON WEB SERVICES (EC2). AMAZON WEB SERVICES  Services  Storage (Glacier, S3)  Compute (Elastic Compute Cloud, EC2)  Databases (Redshift,
JIA’2016 March 25th 2016 A platform at the crossroads of data science, big data and IoT.
Customized cloud platform for computing on your terms !
Evolution of Internet.
Platform as a Service.
Cloudy with a Chance of Data
Polly Baker Division Director: Data, Mining, and Visualization
Module 01 ETICS Overview ETICS Online Tutorials
In this session… Introduce what we’re talking about
The Blue Book pages 19 onwards
Presentation transcript:

Elastic-R A cloud platform for web computing, real-time collaboration, rapid applications development and reproducible modelling Karim Chine Cloud Era Ltd BD 04 February 2011

o Open-source (GPL) software environment for statistical computing and graphics o Lingua franca of data analysis. o Repositories of contributed R packages related to a variety of problem domains in life sciences, social sciences, finance, econometrics, chemo metrics, etc. are growing at an exponential rate. o R Website: o CRAN Task View: o CRAN packages : o Bioconductor: o R Metrics: Scientific Computing Environments office.microsoft.com

From: John Fox, Aspects of the Social Organization and Trajectory of the R Project, R Journal-Feb 2009 The ‘s Success Story

"Give me a place to stand, and I shall move the earth with a lever" Scientific/Statistical Computing Software, HPC and Usability

Extract from the NetSolve/GridSolve Description Document The emergence of Grid computing as the prototype of a next generation cyberinfrastructure for science has excited high expectations for its potential as an accelerator of discovery, but it has also raised questions about whether and how the broad population of research professionals, who must be the foundation of such productivity, can be motivated to adopt this new and more complex way of working. The rise of the new era of scientific modeling and simulation has, after all, been precipitous, and many science and engineering professionals have only recently become comfortable with the relatively simple world of the uniprocessor workstations and desktop scientific computing tools. In that world, software packages such as Matlab and Mathematica represent general-purpose scientific computing environments (SCEs) that enable users — totaling more than a million worldwide — to solve a wide variety of problems through flexible user interfaces that can model in a natural way the mathematical aspects of many different problem domains. Moreover, the ongoing, exponential increase in the computing resources supplied by the typical workstation makes these SCEs more and more powerful, and thereby tends to reduce the need for the kind of resource sharing that represents a major strength of Grid computing [1]. Certainly there are various forces now urging collaboration across disciplines and distances, and the burgeoning Grid community, which aims to facilitate such collaboration, has made significant progress in mitigating the well-known complexities of building, operating, and using distributed computing environments. But it is unrealistic to expect the transition of research professionals to the Grid to be anything but halting and slow if it means abandoning the SCEs that they rightfully view as a major source of their productivity. We therefore believe that Grid computing’s prospects for success will tend to rise and fall according to its ability to interface smoothly with the general purpose SCEs that are likely to continue to dominate the toolbox of its targeted user base. Arnold, D. and Agrawal, S. and Blackford, S. and Dongarra, J. and Miller, M. and Seymour, K. and Sagi, K. and Shi, Z. and Vadhiyar, S.

Computational Components R packages : CRAN, Bioconductor, Wrapped C,C++,Fortran code Scilab modules, Matlab Toolkits, etc. Open source or commercial Computational Resources Hardware & OS agnostic computing engine : R, Scilab,.. Clusters, grids, private or public clouds free: academic grids or pay-per-use: EC2, Azure Computational User Interfaces Workbench within the browser Built-in views / Plugins / Spreadsheets Collaborative views Open source or commercial Computational Scripts R / Python / Groovy On client side: interactivity.. On server side: data transfer.. Stateful or stateless, automatic mapping of R data objects and functions Computational Application Programming Interfaces Java / SOAP / REST, Stateless and stateful Computational Data Storage Local, NFS, FTP, Amazon S3, Amazon EBS free or commercial Generated Computational Web Services Elastic-R Elastic-R is a ubiquitous plug-and-play platform for scientific and statistical computing

Public Clouds Private Cloud Elastic-R portal: single facade to public and private clouds

Elastic-R is a collaborative Virtual Research Environment. Users can share their machine instances, stateful remote engines, data,..

Reproducible research: A scientist can snapshot her computational environment and her data. She can archive the snapshot or share it with others. Elastic-R AMI 1 R BioC 2.5 Elastic-R AMI 2 R BioC 2..3 Elastic-R AMI 3 R 2.8+BioC 2.0 Elastic-R Amazon Machine Images Elastic-R EBS 1 Data Set XXX Elastic-R EBS 2 Data Set YYY Elastic-R EBS 3 Data Set ZZZ Elastic-R EBS 4 Data Set VVV Elastic-R AMI 2 R BioC 2.3 Elastic-R EBS 4 Data Set VVV Amazon Elastic Block Stores Elastic-R AMI 2 R BioC 2.3 Elastic-R EBS 4 Data Set VVV Elastic-R.org

Anatomy of an Elastic-R machine instance on Amazon EC2 HTTPS Restful WS over SSL SSH Restful WS over SSL SOAP over SSL Heartbeat Restful WS over SSL

The scientist can control any number of stateful R engines from within an R session on the cloud or on his machine. He can use them for parallel computing

Software+Services=Applications convergence + ubiquitous collaboration. The server-side toolkit: R + spreadsheet models + virtual gui widgets.

Elastic-R on Infrastructure-as-a-Service style Cloud

Amazon Virtual Private Cloud Subnet 2 Subnet 3 Subnet 1 The Elastic-R portal itself is an EC2 machine instance. Any number of portals can be run on EC2 for decentralized and private collaboration

T1T3T2 getData LogOn Login Pwd Options SessionID associated with a reserved Elastic-R Engine Retrieve Data logOff ES ES on2 ES on3 f ( ES ) ES on1 T1,T2,T3 : Generated Stateful Web Services for R functions T1,T2 & T3 LogOn, getData : R-SOAP methods ES : ExpressionSet ESon1, ESon2, ESon3 : ExpressionSet Object Names f = T3 o T2 o T1 remove ESonx « Clean » Elastic-R Engine Put Elastic-R Engine back in the Pool kill Elastic-R Engine Stateful generated Web Services : Elastic-R for workflow workbenches

Generate token Deliver token Use token Activate token Launch machine instance Register machine instance Use R console Call R Engine XXYYZZ AWS Credentials + Private Key One Amazon account and many users : Elastic-R signed tokens

Elastic-R Portal : Articles about the project: Chine K. (2010). Open Science in the Cloud: Towards a Universal Platform for Scientific and Statistical Computing. In Handbook of Cloud Computing. (Chapter 19). Springer US. Karim Chine, "Learning Math and Statistics on the Cloud, Towards an EC2-Based Google Docs-like Portal for Teaching / Learning Collaboratively with R and Scilab," icalt, pp , th IEEE International Conference on Advanced Learning Technologies, 2010 Karim Chine, "Scientific Computing Environments in the age of virtualization, toward a universal platform for the Cloud" pp , 2009 IEEE International Workshop on Open Source Software for Scientific Computation (OSSC), 2009 Karim Chine, "Biocep, Towards a Federative, Collaborative, User-Centric, Grid-Enabled and Cloud- Ready Computational Open Platform" escience,pp , 2008 Fourth IEEE International Conference on eScience, 2008 Linkedin Group: Links

Thank you !

Node 5 : EC2 virtual machine 2 Remote Objects Registry Node 1: Windows XP Front-end host Node 4 : EC2 virtual machine 1 Node 2: Mac OS Node 3: 64 bits Server / Linux Supervisor Cloudbursting via Amazon Web Services Perl Scripts  logOn  Use R  logOff.NET Appli  logOn  Use R  logOff R-HTTPR-SOAP Parallel Computing Applications  Borrow Rs  Use Rs  Release Rs Web Application  Borrow R  Generate Graphics/Data  Release R Pool B Pool A Pool C Elastic-R SOA platform

Elastic-R for clusters/grids