Immersive Teaching and Research in Data Sciences via Cloud Computing Cloud Era Ltd 13 June 2013 Karim Chine.

Slides:



Advertisements
Similar presentations
Programming with Android: SDK install and initial setup Luca Bedogni Marco Di Felice Dipartimento di Scienze dellInformazione Università di Bologna.
Advertisements

Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)
1 Towards an Open Service Framework for Cloud-based Knowledge Discovery Domenico Talia ICAR-CNR & UNIVERSITY OF CALABRIA, Italy Cloud.
Remote Educational Programming Of Robots (REPOR) Tord Fauskanger Aurelie Aurilla Bechina Arntzen Dag Samuelsen Buskerud University College.
1 Mixing Public and private clouds a Practical Perspective Maarten Koopmans Nordunet Conference 2009 Maarten Koopmans Nordunet Conference 2009.
Sensor Web Enablement and GEOSS Presented by: Terence van Zyl.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
17 Copyright © 2005, Oracle. All rights reserved. Deploying Applications by Using Java Web Start.
Copyright CompSci Resources LLC Web-Based XBRL Products from CompSci Resources LLC Virginia, USA. Presentation by: Colm Ó hÁonghusa.
Designing Services for Grid-based Knowledge Discovery A. Congiusta, A. Pugliese, Domenico Talia, P. Trunfio DEIS University of Calabria ITALY
Grid Initiatives for e-Science virtual communities in Europe and Latin America The VRC-driven GISELA Science Gateway Diego Scardaci.
E-Marketplaces.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
DISTRICT AND SCHOOL ASSESSMENT & TECHNOLOGY COORDINATOR ONLINE TESTING WEBINAR FEBRUARY 7 AND 9, 2012 Washington Online Testi ng OSPI Office of Superintendent.
Introduction to HTML, XHTML, and CSS
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
Foundations of Programming and Problem Solving Introduction.
University of St Andrews School of Computer Science Experiences with a Private Cloud St Andrews Cloud Computing co-laboratory James W. Smith Ali Khajeh-Hosseini.
|epcc| NeSC Workshop Open Issues in Grid Scheduling Ali Anjomshoaa EPCC, University of Edinburgh Tuesday, 21 October 2003 Overview of a Grid Scheduling.
NGS computation services: API's,
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
Cloud Resource Broker for Scientific Community By: Shahzad Nizamani Supervisor: Peter Dew Co Supervisor: Karim Djemame Mo Haji.
ZMQS ZMQS
Introduction Lesson 1 Microsoft Office 2010 and the Internet
© 2009 IBM Corporation iEA16 Defining and Aligning Requirements using System Architect and DOORs Paul W. Johnson CEO / President Pragmatica Innovations.
Niagara Portal Introduction January 2007 Scott Muench - Technical Sales Manager.
Software change management
Mind Mapping Techniques to Create Proposals APMP Colorado Chapter March 6, 2012 James J. Franklin San Diego PMI Chapter PMI is a registered trade and service.
The Platform as a Service Model for Networking Eric Keller, Jennifer Rexford Princeton University INM/WREN 2010.
INTRODUCTION TO SIMULATION WITH OMNET++ José Daniel García Sánchez ARCOS Group – University Carlos III of Madrid.
Project Supervisor: Dr. Sanath Jayasena Project Coordinator: Mr. Shantha Fernando Athukorala A.U.B Dissanayake C.P. Kumara M.G.C.P. Priyadarshana G.V.J.
© 2005 AT&T, All Rights Reserved. 11 July 2005 AT&T Enhanced VPN Services Performance Reporting and Web Tools Presenter : Sam Levine x111.
My ELAS December 2012.
Cloud Computing for Education & Cloud Learning Minjuan Wang to BT Research Center (Abu Dhabi) Educational Technology San Diego State University
Enterprise Directorate General European Commission Supporting NCPs’ activities Irja Vounakis
1 UML ++ Mohamed T IBRAHIM University of Greenwich -UK.
Suggested Course Outline Cloud Computing Bahga & Madisetti, © 2014Book website:
Xsede eXtreme Science and Engineering Discovery Environment Ron Perrott University of Oxford 1.
Chapter 1 Introduction to Visual Basic Programming and Applications 1 Exploring Microsoft Visual Basic 6.0 Copyright © 1999 Prentice-Hall, Inc. By Carlotta.
1 US activities and strategy :NSF Ron Perrott. 2 TeraGrid An instrument that delivers high-end IT resources/services –a computational facility – over.
1. 2 Captaris Workflow Microsoft SharePoint User Group 16 May 2006.
Getting Familiar with Web Pages 1 2 The Internet Worldwide collection of interconnected computer networks that enables businesses, organizations, governments,
Contents 2 Engagement Overview Migrating to Hyper-V from VMware Consider if time allows.
Leveraging scriptable infrastructures, Towards a paradigm shift in software for data science Cloud Era Ltd 14 June 2013 Karim.
Elastic-R A cloud platform for web computing, real-time collaboration, rapid applications development and reproducible modelling Karim Chine Cloud Era.
1 Chapter 11: Data Centre Administration Objectives Data Centre Structure Data Centre Structure Data Centre Administration Data Centre Administration Data.
Addition 1’s to 20.
25 seconds left…...
The e-Research framework for South Africa developed by Fernihough (2011), after in depth interviews with various.
Week 1.
We will resume in: 25 Minutes.
14-1 © Prentice Hall, 2004 Chapter 14: OOSAD Implementation and Operation (Adapted) Object-Oriented Systems Analysis and Design Joey F. George, Dinesh.
1 Implementing DDIEditor in the Danish Data Archive - Demonstration and gained experience Part of session: Recent Developments in the DDI Implementation.
1 Cloud Computing Prof. Ravi Sandhu Executive Director and Endowed Chair April 12, © Ravi Sandhu World-Leading.
, Towards a universal platform for research and education in the cloud Karim Chine, Cloud Era Ltd 13 December 2013 Elastic-R e-Age 2013.
R and the Cloud Cloud Era Ltd 27 June 2013 Karim Chine Deuxièmes rencontres R.
What the Cloud can do for Computational Life Sciences: Biocep-R's Unified Perspective Karim Chine
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Enterprise Cloud Computing
Information Systems in Organizations 5.2 Cloud Computing.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
1 This Changes Everything: Accelerating Scientific Discovery through High Performance Digital Infrastructure CANARIE’s Research Software.
CyberGIS Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
JIA’2016 March 25th 2016 A platform at the crossroads of data science, big data and IoT.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
IBM Predictive Analytics Virtual Users’ Group Meeting March 30, 2016
Recap: introduction to e-science
Presentation transcript:

Immersive Teaching and Research in Data Sciences via Cloud Computing Cloud Era Ltd 13 June 2013 Karim Chine

22 Outline Introduction The Rise of Data Science What is missing on the Cloud for Research and Education? Elastic-R: The Next Generation Data Science Platform Elastic-R: Design and Technologies Overview Rethinking Virtual Research and Teaching Demo Conclusion

33 Introduction Science and the 4th Paradigm The promises of Grid Computing for e-Research The e-Science program (UK) The NSF-funded cyber infrastructure (USA) e-Infrastructure and ICT Calls (EC) Experimental Science Theoretical Science Computational Science e-Science / Data-intensive Science

4 Introduction The Dancing Bear The townspeople gather to see the wondrous sight as the massive, lumbering beast shambles and shuffles from paw to paw. The bear is really a terrible dancer, and the wonder isn't that the bear dances well but that the bear dances at all The tragic failure of the Grid paradigm No democratic access for all scientists, a tool for Particle Physicists and Geeky scientists No sustainable infrastructures No SLA, best effort support No flexibility, no isolation, only admins can install software, X.509 certificates, restrictive policies No interaction design, no user-centric approach

55 Introduction The Data Deluge and the challenges of Big Data *

6 The Rise of Data Science * Data Scientists Led by NASA Star Most Sought for Century: Jobs By Aki Ito - Jun 11, :01 AM ET (Bloomberg.com) Harvard Business Review last year called this profession “the sexiest job of the 21st century” One measure of demand: Hours billed for work in statistical analysis grew by 522 percent in the first quarter compared with the same period in 2011

77 What is missing on the Cloud for Research and Education? Science-as-a-Service: (SCEs) Scientific Computing Environments-as-a-service, Models-as-a-service… Generalized Real-time Collaboration, including interaction with SCEs and with Data One consistent platform for homogenous access to Science Clouds and new abstractions for stateful access to remote infrastructure/Data Reproducibility and traceability of data analysis and computational research Science Gateways building and publishing for everyone

8 Elastic-R: The Next Generation Data Science Platform Arduino / Raspberry pi Democratizing Electronics Elastic-R Democratizing Data Science

99 Elastic-R: The Next Generation Data Science Platform Elastic-R is:  A link between the data scientist/the educator/the student and the Infrastructure-as-a-Service  A consistent set of concepts, architectures, frameworks and interaction design models to wrap and complete the existing public and private clouds with the missing capabilities to radically improve the data scientist productivity  « Super Glue » between the virtual IT capabilities, the software capabilities, the mathematical/statistical capabilities and the man- machine interaction components  A Data Operating System enabling infinite combinations for analysing data and building rapidly Data Science-centric applications and services

10 Elastic-R: The Next Generation Data Science Platform Elastic-R is:  A hyper-dropbox allowing to run a large spectrum of capabilities next to the centrally stored and ubiquitously accessible data.  A platform using the cloud to introduce traceability and reproducibility to the data science arena.  A federation paradigm for scientific clouds  A framework for building on-the-fly user-defined software-as-a-service

11 Elastic-R: Design and Technologies Overview

12 Elastic-R: Design and Technologies Overview Robot submarine dives to the deepest part of the ocean controlled by a 7-mile cable as thin as single human hair

13 Elastic-R: Design and Technologies Overview Remote Java/R Processes  Events-driven Remote Objects/Engines  R, Python, Mathematica, Matlab, Scilab,..  Collaborative Spreadsheets  Collaborative Scientific Graphics Canvas  Collaborative Dashboard with collaborative widgets

14 Elastic-R: Design and Technologies Overview

15 Elastic-R: Design and Technologies Overview Elastic-R AMI 1 R 2.10 BioC 2.5 Elastic-R AMI 2 R 2.9 BioC 2.3 Elastic-R AMI 3 R 2.8 BioC 2.0 Elastic-R Amazon Machine Images Elastic-R EBS 1 Data Set XXX Elastic-R EBS 2 Data Set YYY Elastic-R EBS 3 Data Set ZZZ Elastic-R EBS 4 Data Set VVV Elastic-R AMI 2 R 2.9 BioC 2.3 Elastic-R EBS 4 Data Set VVV Amazon Elastic Block Stores Elastic-R AMI 2 R 2.9 BioC 2.3 Elastic-R.org Elastic-R EBS 4 Data Set VVV

16 Modes of access Individuals with AWS accounts  Standard AMIs (Amazon Machine Images): paid-per-use to Amazon. For Academic use and trial purposes  Paid AMI: Paid-per-use software model. For business users Individuals without AWS accounts  Trial tokens, purchased tokens, tokens granted by other users.  Resources (data science engines) shared by other users  Individual subscribtions Companies/Educational & Research Institutions  Dedicated Platform and AMIs on an Amazon VPC (Virtual Private Cloud). Paid via subscribtion

17 Rethinking Virtual Research and Teaching Free researchers from the IT service dictatorship. Give researchers self-service access to the IT resources they need Allow Researchers to share without restrictions Make real-time collaboration a free and ubuiquitous service Allow researchers to produce and publish to the web advanced applications/services without recourse to developers/admins

18 Rethinking Virtual Research and Teaching Make the cloud an ecosystem for Open Science where all research artifacts can be produced, discovered and reused Allow Researchers to « sell » the software/models/algorithms/techniques they invent seamlessly Bridge the gap between the different computational research tools: interconnect SCEs, workflow workbenches, Documents editors…

19 Rethinking Virtual Research and Teaching Provide affordable and reliable tools for remote education  High-quality voice and video chat for a large number of users  Self-service collaboration tools: Editors, White boards, IDEs, etc.  Modules for Traceability/reproducibilty Extend existing on-line courses platforms to include capabilities such as:  Companion software environments in SaaS mode  Collaborative problem solving tools  Interactive courses  Tokens for Ready-to-run e-Learning applications  E-Learning environments’ visual designers

20 Demo Register to Elastic-R academic and trial portal ( ) Create Data Science Engines using trial tokens Work with R, Python and Scientific Spreadsheets in the browser Share Data Science Engine and Collaborate Use The Visual and Collaborative Scientific Applications designer to create and publish to the web an interactive dashboard Connect to the remote Data Science Engine from withing a local R session, push and pull data, execute commands and show impact on the dashboard

21 Conclusion Elastic -R unlocks the potential of the cloud for Data Scientists and Educators With Elastic-R, the cloud becomes a cyberspace for collaborative research and sharing and an eco-system suited for open Science, open innovation and open education Elastic-R improves dramatically the productivity of the Data Scientists: The entire Data Science Factory chain, from resources acquisition to services and applications publishing, becomes under their direct control Elastic-R provides Analytics-as-a-Service platform that can extend any existing portal or application

22 What to do Next Register to Elastic-R and try the HTML 5 Workbench and the collaboration Download the R package elasticR and use it to access the cloud from local R sessions Download the Java SDK and try to create your first Analytical application using AWS and the most advanced tools for programming with data. Get in touch with me to explore potential partnerships and collaborations

23 Contact details Karim Chine