Cloud Computing for Research Fabrizio Gagliardi Microsoft Research.

Slides:



Advertisements
Similar presentations
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Advertisements

ELIDA Panel on Data Marketplaces and digital preservation Fabrizio Gagliardi, Vassilis Christophides, David Giaretta, Robert Sharpe, Elena Simperl.
EInfrastructures (Internet and Grids) US Resource Centers Perspective: implementation and execution challenges Alan Blatecky Executive Director SDSC.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation,
Client+Cloud The Future of Research Dr. Daniel A. Reed Corporate Vice President Extreme Computing Group & Technology Strategy and Policy.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 6 2/13/2015.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Emerging Platform#6: Cloud Computing B. Ramamurthy 6/20/20141 cse651, B. Ramamurthy.
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Integrated Scientific Workflow Management for the Emulab Network Testbed Eric Eide, Leigh Stoller, Tim Stack, Juliana Freire, and Jay Lepreau and Jay Lepreau.
What is Cloud Computing? o Cloud computing:- is a style of computing in which dynamically scalable and often virtualized resources are provided as a service.
King Cloud by akakamu "Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable computing resources.
The Cloud: Demystified Neil Cattermull Frontier Technology.
SaaS, PaaS & TaaS By: Raza Usmani
Be Smart, Use PwrSmart What Is The Cloud?. Where Did The Cloud Come From? We get the term “Cloud” from the early days of the internet where we drew a.
Cloud computing Tahani aljehani.
An Introduction to Cloud Computing. The challenge Add new services for your users quickly and cost effectively.
Plan Introduction What is Cloud Computing?
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
PhD course - Milan, March /09/ Some additional words about cloud computing Lionel Brunie National Institute of Applied Science (INSA) LIRIS.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
Customized cloud platform for computing on your terms !
“Clouds: a construction zone” (and Why PaaS is the future…) Matt Thompson General Manager, Developer & Platform Evangelism Microsoft.
Cloud Computing. What is Cloud Computing? Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable.
1 European policies for e- Infrastructures Belarus-Poland NREN cross-border link inauguration event Minsk, 9 November 2010 Jean-Luc Dorel European Commission.
Software Architecture
OpenQuake Infomall ACES Meeting Maui May Geoffrey Fox
Communicate with All Workers Involved in the Process of Delivering High-Quality Health Care by Choosing Dossier365 on the Azure Platform MICROSOFT AZURE.
Cloud and VENUS-C Fabrizio Gagliardi Microsoft Research Europe External Research, Director.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
What is the cloud ? IT as a service Cloud allows access to services without user technical knowledge or control of supporting infrastructure Best described.
EMI INFSO-RI JRA1.5-Infra-Cloud Computing Task Force Shahbaz Memon (JUELICH)
Securely Synchronize and Share Enterprise Files across Desktops, Web, and Mobile with EasiShare on the Powerful Microsoft Azure Cloud Platform MICROSOFT.
OpenField Consolidates Stadium Data, Provides CRM and Analysis Functions for an Intelligent, End-to-End Solution COMPANY PROFILE : OPENFIELD Founded by.
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
Chad Collins CEO Henry Chan CTO In Latin, nubifer means “bringing the clouds”
Chapter 8 – Cloud Computing
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware A Cloud Computing Methodology Study of.
User Scenarios in VENUS-C Focus on Structural Analysis Ignacio Blanquer I3M - UPV.
HPC in the Cloud – Clearing the Mist or Lost in the Fog Panel at SC11 Seattle November Geoffrey Fox
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
Directions in eScience Interoperability and Science Clouds June Interoperability in Action – Standards Implementation.
4a. Aula 2o. Período de Livro texto Copyright © 2012, Elsevier Inc. All rights reserved March 5, 2012 Prof. Kai Hwang, USC Cloud Roles in.
Building on virtualization capabilities for ExTENCI Carol Song and Preston Smith Rosen Center for Advanced Computing Purdue University ExTENCI Kickoff.
Bioinformatics Computation in the Cloud A Joint Collaboration Between Microsoft’s External Research and eXtreme Computing Groups
Cloud Computing: Concepts, Technologies and Business Implications B. Ramamurthy & K. Madurai &
EGI-InSPIRE EGI-InSPIRE RI The European Grid Infrastructure Steven Newhouse Director, EGI.eu Project Director, EGI-InSPIRE 29/06/2016CoreGrid.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
Agenda  What is Cloud Computing?  Milestone of Cloud Computing  Common Attributes of Cloud Computing  Cloud Service Layers  Cloud Implementation.
Towards a scientific cloud for Europe Åke Edlund, PhD KTH/CSC/PDC Cloud Group Lead Leader of VENUS-C WP2 – Scientific and International.
Connected Infrastructure
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
Organizations Are Embracing New Opportunities
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
An Introduction to Cloud Computing
Recap: introduction to e-science
Connected Infrastructure
Cloud Computing Dr. Sharad Saxena.
Introduction to D4Science
DeFacto Planning on the Powerful Microsoft Azure Platform Puts the Power of Intelligent and Timely Planning at Any Business Manager’s Fingertips Partner.
Clouds from FutureGrid’s Perspective
Cloud Helps Company Scale to Demand for Growing Healthcare Provider Field MINI-CASE STUDY “Microsoft Azure gives us the opportunity to focus on the task.
Cloud Computing for Research
Presentation transcript:

Cloud Computing for Research Fabrizio Gagliardi Microsoft Research

Introduction Happy to be given the chance of addressing so many smart and young scientists My talk is not about computer science, but about computing technology which could enable a new and better way of supporting computational science I will also cover recent experience in attracting EU funding to support new experimental computing initiative in this field

The Cloud A model of computation and data storage based on “pay as you go” access to “unlimited” remote data center capabilities A cloud infrastructure provides a framework to manage scalable, reliable, on-demand access to applications A cloud is the “invisible” backend to many of our mobile applications Historical roots in today’s Internet apps –Search, , social networks –File storage

The Cloud is built on massive data centers Range in size from “edge” facilities to megascale. Economies of scale Approximate costs for a small size center (1K servers) and a larger, 100K server center. Each data center is 11.5 times the size of a football field TechnologyCost in small-sized Data Center Cost in Large Data Center Ratio Network$95 per Mbps/ Month $13 per Mbps/ month 7.1 Storage$2.20 per GB/ Month $0.40 per GB/ month 5.7 Administration~140 servers/ Administrator >1000 Servers/ Administrator 7.1

Conquering complexity. Building racks of servers & complex cooling systems all separately is not efficient Package and deploy into bigger units Microsoft Advances in DC Deployment

DCs vs Grids, Clusters and Supercomputer Apps Supercomputers –High parallel, tightly synchronized MPI simulations Clusters –Gross grain parallelism, single administrative domains Grids –Job parallelism, throughput computing, heterogeneous administrative domains Cloud –Scalable, parallel, resilient web services InternetInternet MPI communication Map Reduce Data Parallel HPC SupercomputerData Center based Cloud

Changing the way we do research The Rest of Us –Use laptops –Got data, now what? –And it is really is about data, not the FLOPS… » Our data collections are not as big as we wished » When data collection does grow large, not able to analyze –Tools are limited, must dedicate resources to build analysis tools Paradigm shifts for research –The ability to marshal needed resources on demand » Without caring or knowing how it gets done… –Funding agencies can request grantees to archive research data in common public repositories –The cloud can support very large numbers of users or communities in a flexible way

F ocus Client + Cloud for Research Seamless interaction Cloud is the lens that magnifies the power of desktop Persist and share data from client in the cloud Analyze data initially captured in client tools, such as Excel –Analysis as a service (think SQL, Map-Reduce, R/MatLab) –Data visualization generated in the cloud, display on client –Provenance, collaboration, other ‘core’ services…

The Clients+Cloud Platform At one time the “client” was a PC + browser Now: –The Phone –The laptop/tablet –The TV/Surface/Media wall And the future: –The instrumented room –Aware and active surfaces –Voice and gesture recognition –Knowledge of where we are –Knowledge of our health

Petabytes Doubling every 2 years The Future: an Explosion of Data ExperimentsArchivesLiteratureSimulationsInstruments The Challenge: Enable Discovery. Deliver the capability to mine, search and analyze this data in near real time. Enhance our Lives Participate in our own health care. Augment experience with deeper understanding.

Complex models Multidisciplinary interactions Wide temporal and spatial scales Large multidisciplinary data Real-time steams Structured and unstructured Distributed communities Virtual organizations Socialization and management Diverse expectations Client-centric and infrastructure-centric Changing Nature of Discovery

Infrastructure as a Service (IaaS) –Provide a data center and a way to host client VMs and data. Platform as a Service (PaaS) –Provide a programming environment to build a cloud application –The cloud deploys and manages the app for the client Software as a Service (SaaS) –Delivery of software from the cloud to the desktop The Cloud Landscape

Some examples based on MSR ongoing Cloud Research Engagements

AzureBLAST* selects DBs and input sequence Web Role Input Splitter Worker Role Input Splitter Worker Role BLAST Execution Worker Role #n BLAST Execution Worker Role #n ….…. Combiner Worker Rol e Combiner Worker Rol e Genome DB 1 Genom e DB K BLAST DB Configuration Azure Blob Storage BLAST Execution Worker Role #1 BLAST Execution Worker Role #1 Seamless Experience Evaluate data and invoke computational models from Excel. Computationally heavy analysis done close to large database of curated data. Scalable for large, surge computationally heavy analysis. Test local, run on the cloud. *The Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases

Making Excel the user interface to the cloud

AzureMODIS – Azure Service for Remote Sensing Geoscience 5 TB (~600K files) upload of 9 different imagery products from 15 different locations (~6 days of download) 4 TB reprojected harmonized imagery ~35000 cpu hours 50 GB reduced science variable results ~18000 cpu hours (~14 hour download) 50 GB additional reduced science analysis results ~18000 cpu hours (~14 hour download)

Newcastle University, UK -Paul Watson Investigating applicability of commercial clouds for scientific research Build a working prototype for use-cases in chemo-informatics Uses Microsoft technologies to build science-related services (Windows Azure, Silverlight…) Exploits Azure and Amazon Clouds Built initial proof-of-concept Silverlight UI for basic Quantitative Structure- Analysis Relationship (QSAR) modeling Demonstrated ability to scale QSAR computations in Windows Azure Project JUNIOR

Dynamic shared state multiplayer games, virtual worlds, social networks –New modalities of collaboration

Reaching Out: Azure Research Engagement project In the U.S. Memorandum of Understanding with the National Science Foundation –Provide a substantial Azure resource as a donation to NSF –NSF will provide funding to researchers to use this resource In Europe –We interested in direct engagement with the thought leaders in the U.K., France and Germany –EC engagements where possible (VENUS-C to start off) In both we provide our engagement team –We provide workshops, tutorials, best practices and shared services, learn from this community, shape policy… In Asia –We wish to explore possibilities

VENUS-C Virtual multidisciplinary EnviroNments USing Cloud infrastructures Funding Scheme: Combination of Collaborative Project and Coordination and Support Action: Integrated Infrastructure Initiative (I3) Program Topic: INFRA Distributed Computing Infrastructures

Consortium 1 (co)Engineering Ingegneria Informatica S.p.a.ENGIT 2European Microsoft Innovation CentreEMICDE 3European Charter of Open Grid ForumOGF.eeigUK 4Barcelona Supercomputing Center – Centro Nacional de SupercomputaciónBSC-CNSES 5Universidad Politecnica de ValenciaUPVES 6Kungliga Tekniska HoegskolanKTHSE 7University of the AegeanAEGGR 8TechnionTECHIL 9Centre for Computational and Systems BiologyCoSBiIT 10University of NewcastleNCLUK 11Consiglio Nazionale delle RicercheCNRIT 12CollaboratorioCOLBIT

Goals 1. Create a platform that enables user applications to leverage cloud computing principles and benefits 2. Leverage the state of the art to early adopters quickly, incrementally enable interop with existing DCI and push the state of the art where needed 3. Create a sustainable infrastructure that enables cloud computing paradigms for the user communities inside the project, the ones from the open-call for applications, as well as others

Supporting multiple basic research disciplines Biomedicine: Integrating widely used tools for Bioinformatics (UPV), System Biology (CosBI) and Drug Discovery (NCL) into the VENUS-C infrastructure Civil Protection and Emergency: Early fire risk detection (AEG), through an application that will run models on the VENUS-C infrastructure, based on multiple data sources. Civil Engineering: Support complex computing tasks on Building Information Management for green constructions (provided by COLB) and dynamic building structure analysis (provided by UPV). D4Science: Integrating computing through VENUS-C on data repositories (CNR). In particular focus will be on Marine Biodiversity through Aquamaps.

Open Call for 20 e-Science Applications 20K€ funding each (in addition to Azure Compute, Storage and Network Resources) -> porting applications to the cloud -> education and traning -> scalability tests

Thanks for your attention