Scientific Computing at Amazon: Disruptive Innovations in Distributed Computing. Dave Ward, Principal Product Manager; Adam Gray, Senior Product Manager.


Innovation #1:

42

Building your own virtual programmable datacenter

ec2-run-instances

On Demand Global Infrastructure

Programmable

Elastic

Instance Types

Standard (m1) High Memory (m2) High CPU (c1)

High Performance

“Our 40-instance (m2.2xlarge) cluster can scan, filter, and aggregate 1 billion rows in 950 milliseconds.” Mike Driscoll – Meta Markets

Cluster Computing

MPI

Bandwidth Intensive

Cluster Compute Instance

cc1.4xlarge: 2 x Intel Xeon (cores w/ HT), 23 GB RAM, 1.7 TB disk, HVM

linpack

231 November 2010

451 June 2011

New Cluster Compute Instances

cc2.8xlarge: 2 x Intel Xeon (16 cores w/ HT), 60.5 GB RAM, 3.4 TB disk, HVM

linpack

42 November 2011

Innovation #2:

Lowering the cost of developing a distributed system

Case Study: Amazon’s Associates Program

Text Links Enhanced Links

how much to pay each associate?

orders → c++ app → bi-hourly flat files

orders → c++ app → bi-hourly flat files → c++ app → daily aggregations

orders → c++ app → bi-hourly flat files → c++ app → daily aggregations → c++ app → to payments service…

“just one more Q4”

distributed computing

[Chart: Difficulty vs. Number of Machines]

distributed computing is hard

distributed computing requires god-like engineers

Hadoop is… The MapReduce computational paradigm

Hadoop is… The MapReduce computational paradigm … implemented as an Open-source, Scalable, Fault-tolerant, Distributed System

Person   Start     End
Bob      00:44:48  00:45:11
Charlie  02:16:02  02:16:18
Charlie  11:16:59  11:17:17
Charlie  11:17:24  11:17:38
Bob      11:23:10  11:23:25
Alice    16:26:46  16:26:54
David    17:20:28  17:20:45
Alice    18:16:53  18:17:00
Charlie  19:33:44  19:33:59
Bob      21:13:32  21:13:43
David    22:36:22  22:36:34
Alice    23:42:01  23:42:11

Person   Start     End       Duration
Bob      00:44:48  00:45:11  23
Charlie  02:16:02  02:16:18  16
Charlie  11:16:59  11:17:17  18
Charlie  11:17:24  11:17:38  14
Bob      11:23:10  11:23:25  15
Alice    16:26:46  16:26:54  8
David    17:20:28  17:20:45  17
Alice    18:16:53  18:17:00  7
Charlie  19:33:44  19:33:59  15
Bob      21:13:32  21:13:43  11
David    22:36:22  22:36:34  12
Alice    23:42:01  23:42:11  10

map

Person   Duration
Bob      23
Charlie  16
Charlie  18
Charlie  14
Bob      15
Alice    8
David    17
Alice    7
Charlie  15
Bob      11
David    12
Alice    10

Person   Duration
Alice    8
Alice    7
Alice    10
Bob      23
Bob      15
Bob      11
Charlie  16
Charlie  18
Charlie  14
Charlie  15
David    12
David    17

reduce

Person   Total
Alice    25
Bob      49
Charlie  63
David    29
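The map, sort and reduce steps shown on these slides can be sketched in a single Python process. This is only an illustration of the paradigm, not Hadoop itself: the function names (`map_call`, `shuffle`, `reduce_durations`, `run_job`) are hypothetical, and the durations are assumed to be in seconds.

```python
from collections import defaultdict
from datetime import datetime

# Call records from the slides: (person, start, end).
CALLS = [
    ("Bob", "00:44:48", "00:45:11"),
    ("Charlie", "02:16:02", "02:16:18"),
    ("Charlie", "11:16:59", "11:17:17"),
    ("Charlie", "11:17:24", "11:17:38"),
    ("Bob", "11:23:10", "11:23:25"),
    ("Alice", "16:26:46", "16:26:54"),
    ("David", "17:20:28", "17:20:45"),
    ("Alice", "18:16:53", "18:17:00"),
    ("Charlie", "19:33:44", "19:33:59"),
    ("Bob", "21:13:32", "21:13:43"),
    ("David", "22:36:22", "22:36:34"),
    ("Alice", "23:42:01", "23:42:11"),
]


def seconds_between(start, end):
    """Elapsed seconds between two HH:MM:SS times on the same day."""
    fmt = "%H:%M:%S"
    delta = datetime.strptime(end, fmt) - datetime.strptime(start, fmt)
    return int(delta.total_seconds())


def map_call(record):
    """Map step: turn one (person, start, end) row into (person, duration)."""
    person, start, end = record
    return person, seconds_between(start, end)


def shuffle(pairs):
    """Group values by key and order the keys, as the sorted slide shows."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(sorted(groups.items()))


def reduce_durations(person, durations):
    """Reduce step: collapse one person's durations into a total."""
    return person, sum(durations)


def run_job(records):
    mapped = [map_call(record) for record in records]
    grouped = shuffle(mapped)
    return dict(reduce_durations(p, d) for p, d in grouped.items())


totals = run_job(CALLS)
print(totals)
```

Because each `map_call` depends only on its own row and each `reduce_durations` only on one person's group, both steps could run on separate machines, which is the property a framework like Hadoop exploits.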

Hadoop is… The MapReduce computational paradigm

Hadoop is… The MapReduce computational paradigm … implemented as an Open-source, Scalable, Fault-tolerant, Distributed System

distributed computing requires god-like engineers

distributed computing (with Hadoop) requires talented, not god-like, engineers

how much to pay each associate?

orders → c++ app → bi-hourly flat files → c++ app → daily aggregations → c++ app → to payments service…

Person   Total
Alice    25
Bob      49
Charlie  63
David    29

Orders Filter S3 Other Services

Orders Filter S3 Hadoop Cluster

[Chart: Difficulty vs. Number of Machines] More data? Smarter engineers.

[Chart: Difficulty vs. Number of Machines] More data? Smarter engineers. More data? More boxes.

Hadoop lowers the cost of developing a distributed system.

What about the cost of operating a distributed system?

November traffic at amazon.com

76% 24%

Orders Filter S3 Hadoop Cluster

Amazon Elastic Compute Cloud “provides resizable compute capacity in the cloud.”

Amazon Elastic MapReduce = Amazon EC2 + Hadoop

Orders Filter S3 Hadoop Cluster

Filter S3 EMR Cluster Orders

Filter S3 EMR Cluster Orders

Filter S3 Orders

Filter S3 Orders

Amazon EC2 lowers the cost of operating a distributed system.

Hadoop lowers the cost of developing a distributed system.

Amazon Elastic MapReduce changes the economics of data processing.

AMAZON ELASTIC MAPREDUCE: Managed Apache Hadoop Service; Removes MUCK from Big Data processing; Provides tight integration with AWS services

> elastic-mapreduce --create --instance-type m1.large \
    --instance-count name "My Hadoop Cluster" \
    --jar s3://elasticmapreduce/samples/cloudburst/cloudburst.jar

What is big data?

[Chart: Number of datasets vs. Dataset size; regions labeled: fits on a single machine, Big Data, Extremely Big Data]

[Chart: Difficulty vs. Dataset size; regions labeled: Extremely valuable, Marginally valuable]

cheap experimentation

Innovation #3:

Lowering the cost of accessing data

Over 50 free data sets

Nearly 1 PB of free data

Stored at no cost to providers; also free access to consumers

1000 Genomes Project (110 TB)
Common Crawl Corpus (60 TB)
Sloan Digital Sky Survey (180 GB)
United States Census (200 GB)
Million Song Dataset (500 GB)
Google Books Corpus (2.2 TB)
Marvel Universe Social Graph (50 GB)

aws.amazon.com/datasets

Innovation #4: Creating a Market for Capacity

Finding Research Dollars for AWS

Educators

Up to $100 per Student in AWS Credits for intro courses

Researchers

Infrastructure Credits (EC2, S3, …)

4 Grant Review Cycles Per Year

February 10, 2012

Students

Student Organizations, Self Learning, Entrepreneurial Projects

aws.amazon.com/education

Stretching your Research Dollars (even further) on AWS

On-Demand

Reserved

Spot

Unused EC2 Capacity

Bid

July 2011

Interruption

July 2011

Manage Interruption
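The bid and interruption slides can be illustrated with a deliberately simplified model. It assumes an instance keeps running while the hourly Spot price stays at or below the bid, is charged the Spot price (not the bid) for each hour it runs, and is interrupted as soon as the price rises above the bid. The function name `run_on_spot` and the price series are hypothetical, and real billing details are left out.

```python
def run_on_spot(bid, hourly_prices):
    """Simulate one Spot instance under a simplified pricing model.

    Returns (hours_run, cost, interrupted).
    """
    hours_run = 0
    cost = 0.0
    for price in hourly_prices:
        if price > bid:
            # Market price exceeded the bid: the instance is reclaimed.
            return hours_run, cost, True
        hours_run += 1
        cost += price  # pay the market price for this hour, not the bid
    return hours_run, cost, False


# Hypothetical hourly Spot prices in dollars.
prices = [0.20, 0.22, 0.25, 0.40, 0.21]
hours, cost, interrupted = run_on_spot(bid=0.30, hourly_prices=prices)
print(hours, round(cost, 2), interrupted)
```

A higher bid makes interruption less likely in this model but does not by itself raise the hourly charge, which is why the slides treat interruption as something to manage.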

Grid Computing

MIT StarCluster

Harvard Medical School Lab of Personalized Medicine

Temple University Spot MPI

Elastic MapReduce

Save Time and Money

Scenario #1: Allocate 4 instances. Job Flow Duration: 14 Hours
#1: Cost without Spot: 4 instances * 14 hrs * $0.50 = $28

Scenario #2: Add 5 Spot Instances. Job Flow Duration: 7 Hours
#2: Cost with Spot: 4 instances * 7 hrs * $0.50 = $14, plus 5 instances * 7 hrs * $0.25 = $8.75; Total = $22.75

Time Savings: 50%. Cost Savings: ~19%
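The arithmetic behind the two scenarios can be checked with a few lines of Python. This is a sketch using the slide's illustrative prices ($0.50 per instance-hour on demand, $0.25 on Spot). It assumes the Spot price stays constant for the whole job, and the helper name `job_cost` is hypothetical.

```python
ON_DEMAND_PRICE = 0.50  # dollars per instance-hour, from the slide
SPOT_PRICE = 0.25       # dollars per instance-hour, from the slide


def job_cost(instances, hours, price_per_hour):
    """Cost of running `instances` machines for `hours` at a flat rate."""
    return instances * hours * price_per_hour


# Scenario 1: 4 on-demand instances for 14 hours.
scenario_1 = job_cost(4, 14, ON_DEMAND_PRICE)

# Scenario 2: the same 4 on-demand instances plus 5 Spot instances,
# finishing in 7 hours.
scenario_2 = job_cost(4, 7, ON_DEMAND_PRICE) + job_cost(5, 7, SPOT_PRICE)

time_savings = 1 - 7 / 14
cost_savings = 1 - scenario_2 / scenario_1

print(f"Scenario 1: ${scenario_1:.2f}")
print(f"Scenario 2: ${scenario_2:.2f}")
print(f"Time savings: {time_savings:.0%}, cost savings: {cost_savings:.2%}")
```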

Queue-Based Architecture: Applications, Queue, Amazon EC2 Spot, Amazon EC2 On-Demand / Reserved

Checkpointing
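One way to picture checkpointing is a worker that records each finished task so that a replacement worker can resume after an interruption instead of starting over. The sketch below simulates this locally with a JSON file. The names `run_worker`, `load_checkpoint` and `Interrupted` are hypothetical, and a real deployment would presumably keep the task list in a durable queue and the checkpoint in shared storage rather than on local disk.

```python
import json
import os
import tempfile


class Interrupted(Exception):
    """Raised to simulate a Spot instance being reclaimed mid-job."""


def load_checkpoint(path):
    """Return the list of task ids already completed, or [] if none."""
    if os.path.exists(path):
        with open(path) as f:
            return json.load(f)
    return []


def run_worker(tasks, checkpoint_path, interrupt_after=None):
    """Process pending tasks, saving progress after each one.

    Returns (completed_task_ids, number_processed_in_this_run).
    """
    done = load_checkpoint(checkpoint_path)
    pending = [t for t in tasks if t not in done]
    processed = 0
    for task in pending:
        if interrupt_after is not None and processed == interrupt_after:
            raise Interrupted(f"interrupted after {processed} tasks")
        # Stand-in for the real work on this task.
        done.append(task)
        with open(checkpoint_path, "w") as f:
            json.dump(done, f)
        processed += 1
    return done, processed


checkpoint = os.path.join(tempfile.mkdtemp(), "checkpoint.json")
tasks = list(range(6))

try:
    run_worker(tasks, checkpoint, interrupt_after=4)
except Interrupted:
    pass  # a replacement worker picks up from the checkpoint

completed, processed_after_restart = run_worker(tasks, checkpoint)
print(completed, processed_after_restart)
```

In this run the second worker only handles the two tasks that were still pending, so the interruption costs a restart but not the work already checkpointed.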

30,000+ Cores 95,078 Instance Hours

$1,279/hour

We are Hiring! FT/Interns: amazon.com/careers Experienced: aws.amazon.com/jobs