SilverLining. Stuff we're covering Hardware infrastructure and scaling Cloud platform as a service The SilverLining Project.

Slides:



Advertisements
Similar presentations
A presentation by Kelvin Gan & Liam McMurray Web Services, University of Bath.
Advertisements

Large Scale Computing Systems
Cloud Computing at GES DISC Presented by: Long Pham Contributors: Aijun Chen, Bruce Vollmer, Ed Esfandiari and Mike Theobald GES DISC UWG May 11, 2011.
Emerging Platform#6: Cloud Computing B. Ramamurthy 6/20/20141 cse651, B. Ramamurthy.
Google App Engine Cloud B. Ramamurthy 7/11/2014CSE651, B. Ramamurthy1.
© 2011 Citrusleaf. All rights reserved.1 A Real-Time NoSQL DB That Preserves ACID Citrusleaf Srini V. Srinivasan Brian Bulkowski VLDB, 09/01/11.
What is Cloud Computing? o Cloud computing:- is a style of computing in which dynamically scalable and often virtualized resources are provided as a service.
Zuora Inc.: Venturing Into CLOUD COMPUTING By Kaitlin Farrell BUS 550.
Accelerate adoption, provide customer insights to engineering, and deliver knowledge to the IT Pro community.
Live for today as if it is your last day but plan for tomorrow as if it will last forever!
5th Edition, Irv Englander
A T AXONOMY AND S URVEY OF C LOUD C OMPUTING S YSTEMS Reporter: Steven Chen Date: 2010/10/27 1.
Modeling Public Pensions with Mathematica and Python II
M.A.Doman Model for enabling the delivery of computing as a SERVICE.
Engineering the Cloud Andrew McCombs March 10th, 2011.
Cloud Don McGregor Research Associate MOVES Institute
Addition to Networking.  There is no unique and standard definition out there  Cloud Computing is a general term used to describe a new class of network.
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
An Introduction to Cloud Computing. The challenge Add new services for your users quickly and cost effectively.
Google App Engine and Java Application: Clustering Internet search results for a person Aleksandar Kartelj Faculty of Mathematics,
Platform as a Service (PaaS)
Google AppEngine. Google App Engine enables you to build and host web apps on the same systems that power Google applications. App Engine offers fast.
SENG 422 Lab 4 Cloud Computing Time: ELW B220 from (4:00 - 6:50) every Tuesday TA: Philip Baback Alipour Ph.D. Candidate in Electrical, Computer Engineering.
Google App Engine Danail Alexiev Technical Trainer SoftAcad.bg.
Cross Platform Mobile Backend with Mobile Services James
Effectively Explaining the Cloud to Your Colleagues.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
What is Cloud Computing? Cloud computing is the delivery of computing capabilities as a service, making access to IT resources like compute power, networking.
Cloud Computing Saneel Bidaye uni-slb2181. What is Cloud Computing? Cloud Computing refers to both the applications delivered as services over the Internet.
PhD course - Milan, March /09/ Some additional words about cloud computing Lionel Brunie National Institute of Applied Science (INSA) LIRIS.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
Lecture 8 – Platform as a Service. Introduction We have discussed the SPI model of Cloud Computing – IaaS – PaaS – SaaS.
Cloud Computing Definitions Cloud The set of hardware, networks, storage, services and interfaces that combine to deliver computing as a service Cloud.
GIS and Cloud Computing. Flickr  Upload and manage your photos online  Share your photos with your family and friends  Post your photos everywhere.
Geographic Information Systems Cloud GIS. ► The use of computing resources (hardware and software) that are delivered as a service over the Internet ►
1 NETE4631 Using Google Web Services and Using Microsoft Cloud Services Lecture Notes #7.
Software Architecture
M.A.Doman Short video intro Model for enabling the delivery of computing as a SERVICE.
EXPOSE GOOGLE APP ENGINE AS TASKTRACKER NODES AND DATA NODES.
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
Prepared By : Bhavin Tank(S.Y.B.Sc.(IT)) College of Computer Science & IT, Junagadh Cloud Computing.
Cloud Computing Presented by Boyoung Kim.
Company small business cloud solution Client UNIVERSITY OF BEDFORDSHIRE.
Enterprise Cloud Computing
Take enterprise virtualization to the next level
HUSKY CONSULTANTS FRANKLIN VALENCIA WIOLETA MILCZAREK ANTHONY GAGLIARDI JR. BRIAN CONNERY.
Paperless Timesheet Management Project Anant Pednekar.
My project  Small-Medium Enterprises (SMEs)  faces goods distribution problems  needs necessary resources, money and technical expertise, to purchase.
GOOGLE APP ENGINE By Muktadiur Rahman. Contents  Cloud Computing  What is App Engine  Why App Engine  Development with App Engine  Quote & Pricing.
Lecture III: Challenges for software engineering with the cloud CS 4593 Cloud-Oriented Big Data and Software Engineering.
Information Systems in Organizations 5.2 Cloud Computing.
Mark Gilbert Microsoft Corporation Services Taxonomy Building Block Services Attached Services Finished Services.
1 NETE4631 Using Google Web Services Lecture Notes #6.
Information Systems in Organizations 5.2 Cloud Computing.
Cloud Computing from a Developer’s Perspective Shlomo Swidler CTO & Founder mydrifts.com 25 January 2009.
From infrastructure to applications Where cloud computing is at and where it’s headed.
KAASHIV INFOTECH – A SOFTWARE CUM RESEARCH COMPANY IN ELECTRONICS, ELECTRICAL, CIVIL AND MECHANICAL AREAS
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
Google App Engine. Contents Overview Getting Started Databases Inter-app Communications Modes.
Platform as a Service (PaaS)
Overview: Cloud Datacenters
Platform as a Service (PaaS)
Dr.S.Sridhar, Director, RVCT, RVCE, Bangalore
Platform as a Service (PaaS)
What is Cloud Computing - How cloud computing help your Business?
Dr.S.Sridhar, Director, RVCT, RVCE, Bangalore
Andrew McCombs March 10th, 2011
The Improvement of PaaS Platform ZENG Shu-Qing, Xu Jie-Bin 2010 First International Conference on Networking and Distributed Computing SQUARE.
Google App Engine Danail Alexiev
Cloud Computing: Concepts
Presentation transcript:

SilverLining

Stuff we're covering Hardware infrastructure and scaling Cloud platform as a service The SilverLining Project

Some context We work at a university Funding based on projects Biodiversity web apps and APIs Focus on software (not hardware)

Infrastructure Applications depend on infrastructure Infrastructure that "just works" is expensive More money for infrastructure means less money for application development Degenerates without long-term funding Unreliability is bad for applications Increasingly bad user experience over time

$1.6M USD total budget to 17 institutions $245k USD (30.6% of direct costs) for infrastructure

$1.6M USD total budget to 17 institutions $245k USD (30.6% of direct costs) for infrastructure $100k USD (12.6% of direct costs) for core application development o DiGIR provider, DiGIR portal

MaNIS, ORNIS, HerpNet, FishNet $7.6M USD combined budgets, 71 institutions $196k USD annual operating cost

MaNIS, ORNIS, HerpNet, FishNet $7.6M USD combined budgets, 71 institutions $196k USD annual operating cost $179k USD (92%) for infrastructure

Infrastructure as a Problem (IaaP)

Unsustainable Creates a barrier to innovation And this is all before scaling comes into play!

Scalability "The ability for infrastructure to reliably handle heavy request loads in a high performance way."

IaaP at scale

Scaling up Scale up vertically with a server upgrade Scale out horizontally with more servers

Scaling up

Scaling DiGIR networks MaNIS, ORNIS, HerpNet, FishNet ~85 million records ~100 servers

Scaling DiGIR networks MaNIS, ORNIS, HerpNet, FishNet ~85 million records ~100 servers s

Query: All records with a point

Response: Error: IO problem

"Scaling is hard." - Alex Payne

"Scaling is hard." - Alex Payne al3x.net/2010/07/27/node.html

Scaling in the small Handling dozens or requests per second Scaling up vertically is sufficient Performance improvements are software related al3x.net/2010/07/27/node.html

Scaling in the large Billions of requests per week (Google) Millions of active users (Facebook) Data centers worldwide with millions of servers al3x.net/2010/07/27/node.html

Are we scaling large or small? GBIF ~220 million records eBird ~2 million new records per month Undigitized collections ~2.5 billion records

Scaling in the "small-ish" We're at the brink! IaaP is in the way, scaling is making it worse Where's the silver lining in all of this?

Platform as a Service (PaaS) en.wikipedia.org/wiki/Platform_as_a_service Conceptually quite simple: Computing power over the Internet No servers to maintain Pay for use Scales large (even if your application is small) Provided by companies such as Amazon, Microsoft, Google

SilverLining silver-lining.googlecode.com Experiments, metrics, prototypes (not products) Picked Google App Engine PaaS with biodiversity data Simple Darwin Core Bulk loading, storage MapReduce - indexes, validation, statistics Optimize for resource efficiency, search performance

Cost comparison Total annual operating costs of vertebrate networks: Current architecture: USD $195,600 Projected App Engine: USD $19,540

Cost comparison Total annual operating costs of vertebrate networks: Current architecture: USD $195,600 Projected App Engine: USD $19,540 Total cost for SilverLining work to date: 50 cents

App Engine code.google.com/appengine Develop scalable web apps on Google's infrastructure No servers or hardware to maintain and free quotas Standards based Java and Python SDKs IDE support for Eclipse, NetBeans, IntelliJ Local development server Integrated support for unit testing

App Engine constraints Practical constraints for performance and scalability The datastore is not a relational database Query can only use inequality filters on 1 property Fails: year >= 1980 and year 10 Solution: Set membership queries

Set membership queries Before: year >= 1980 and year 10 After: year "within 1 year" of 1981 and elevation > 10 List for "within 1 year" of 1980: [1979, 1980, 1981]

Aggregation and synchronization code.google.com/p/pubsubhubbub code.google.com/apis/feed/push Fast aggregation via API Subscribe to changes at the source Changes pushed automatically

What's the end game? PaaS instead of IaaP SaaS (software as a solution) BaaS (biodiversity applications at scale) Aaron Steele John Wieczorek

What's the end game? PaaS instead of IaaP SaaS (software as a solution) BaaS (biodiversity applications at scale) Any QaaC? (Questions as a challenge) Aaron Steele John Wieczorek