A walk in cloud (and look for databases) Jian Xu DMM DB-talk, Feb 2010.

Slides:



Advertisements
Similar presentations
A Ridiculously Easy & Seriously Powerful SQL Cloud Database Itamar Haber AVP Ops & Solutions.
Advertisements

A presentation by Kelvin Gan & Liam McMurray Web Services, University of Bath.
Cloud Computing Development. Shallow Introduction.
Trust Management of Services in Cloud Environments:
Suggested Course Outline Cloud Computing Bahga & Madisetti, © 2014Book website:
Introduction to Data Center Computing Derek Murray October 2010.
STUDY ON OPENSTACK BY JAI KRISHNA. LIST OF COMPONENTS Introduction Components Architecture Where it is used.
Database Architectures and the Web
WEB & MOBILE CLOUD APP With Bootstrap, Backbone, Pusher, AWS, Slim Gabriele Mittica –
C-Store: Data Management in the Cloud Jianlin Feng School of Software SUN YAT-SEN UNIVERSITY Jun 5, 2009.
ICS (072)Database Systems: A Review1 Database Systems: A Review Dr. Muhammad Shafique.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 17 Client-Server Processing, Parallel Database Processing,
© 2009 IBM Corporation ® IBM Software Group Introduction to Cloud Computing Vivek C Agarwal IBM India Software Labs.
Presented by Sujit Tilak. Evolution of Client/Server Architecture Clients & Server on different computer systems Local Area Network for Server and Client.
Cloud computing Tahani aljehani.
Platform as a Service (PaaS)
Google AppEngine. Google App Engine enables you to build and host web apps on the same systems that power Google applications. App Engine offers fast.
Manage & Configure SQL Database on the Cloud Haishi Bai Technical Evangelist Microsoft.
Cloud Computing Source:
Ch 4. The Evolution of Analytic Scalability
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Opensource for Cloud Deployments – Risk – Reward – Reality
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
Database Design, Application Development, and Administration, 5 th Edition Copyright © 2011 by Michael V. Mannino. All rights reserved. Chapter 18 Client-Server.
PhD course - Milan, March /09/ Some additional words about cloud computing Lionel Brunie National Institute of Applied Science (INSA) LIRIS.
Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over the Internet. Cloud is the metaphor for.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
MapReduce April 2012 Extract from various presentations: Sudarshan, Chungnam, Teradata Aster, …
+ CS 325: CS Hardware and Software Organization and Architecture Cloud Architectures.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Cloud Databases Matt Gregg Bob Guidinger. Cloud 101 What do we mean by Cloud Databases? Why do we have them? o Alternative to IT infrastructure investment.
Computing on the Cloud Jason Detchevery March 4 th 2009.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Introduction to Hadoop and HDFS
Amazon Web Services BY, RAJESH KANDEPU. Introduction  Amazon Web Services is a collection of remote computing services that together make up a cloud.
Cloud Computing & Amazon Web Services – EC2 Arpita Patel Software Engineer.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
System Center Lesson 4: Overview of System Center 2012 Components System Center 2012 Private Cloud Components VMM Overview App Controller Overview.
GOOGLE APP ENGINE By Muktadiur Rahman. Contents  Cloud Computing  What is App Engine  Why App Engine  Development with App Engine  Quote & Pricing.
Configuring, Managing and Maintaining Windows Server® 2008 Servers Course 6419A.
Web Technologies Lecture 13 Introduction to cloud computing.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
Cloud Computing from a Developer’s Perspective Shlomo Swidler CTO & Founder mydrifts.com 25 January 2009.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
Prof. Jong-Moon Chung’s Lecture Notes at Yonsei University
SAS® Viya™ Overview ANDRĖ DE WAAL, GLOBAL ACADEMIC PROGRAM
Platform as a Service (PaaS)
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
Organizations Are Embracing New Opportunities
Platform as a Service (PaaS)
Platform as a Service (PaaS)
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
The Improvement of PaaS Platform ZENG Shu-Qing, Xu Jie-Bin 2010 First International Conference on Networking and Distributed Computing SQUARE.
CEG7380 Cloud Computing Lecture 1
Java in the cloud PaaS Platform in Comparison
Outline Virtualization Cloud Computing Microsoft Azure Platform
Ch 4. The Evolution of Analytic Scalability
Introduction to Apache
Overview of big data tools
Syllabus and Introduction Keke Chen
Cloud Computing: Concepts
Anjuman College of Engineering & Technology Computer Science & Engineering Department Subject Code: BECSE408T Subject Name: (ELECTIVE-III)Clustering &
Microsoft Virtual Academy
Presentation transcript:

A walk in cloud (and look for databases) Jian Xu DMM DB-talk, Feb 2010

outline Cloud products Architecture Database issues Data integration issues Other topics

Cloud products Google App Engine – Amazon Elastic Compute Cloud (EC2) – Yahoo! Cloud Testbed / M45 – WestGrid – Microsoft Azure –

What’s in common ? A cluster of centrally managed computing resources Provide application hosting & storage Provide API / SDK Encourage deployment of applications that runs in parallel. The goal : fast, scalable & cheap computing

Comparison to cluster/grid Share similar hardware deployment [IBM] Cloud hides resource specification: – Only API is exposed [google, Amazon] – User transparent to underlying resources Cluster exposes resource configuration – “Utility” computing – Less or no service support

Source : [Foster 09]

Cloud Architecture Cluster / Grid system as hardware platform – IBM cluster line Virtualization [Barham03] – Amazon Machine Image Unified storage and data access model – Google’s bigtable – Amazon’s Simple DB

8 Cloud Infrastructure & Application Provider Standards Based Interfaces Virtualized Infrastructure Web 2.0 Platform (image deployment, integrated security, workload mgmt., high-availability) Applications & Services Content & Data User Request Management/Self Service Security: Identity, Access, Integrity, Isolation, Audit & Compliance Usage Accounting License Management Image Lifecycle Management Provisioning Performance Management Availability/Backup/ Restore Service Lifecycle Management Service Management Virtual Resources & Aggregations SMP Servers Network Hardware Storage Servers System Resources Blades Storage Virtualized Applications Server Virt. Storage Virt. Network Virt. Virtualized Infrastructure End User Requests … Service Catalog Mashup Interface Design & Build Image Library (Store) Deployment Operational Lifecycle of Images Web 2.0 Solution Tools Subscribers IBM Architectural Model for Cloud Computing [Web2.0 expo]

Where is database ? Database as a service Above / below / crossing virtualization layer File system v.s. Database G’s bigtable -- (???) A’s S3 -- SimpleDB, RDS(mysql based) M’s Azure Table – SQL Azure

Handling big^big data Is database too weak? Map-reduce v.s. Distri. Databases [stonebraker 2010] Database accepting map-reduce [chen 2009] Why databases are missing in clouds General purpose v.s. specific use Programming model Work-flow administration

Cloud is like database Cloud User does not know how resources are allocated balances resource among user applications [LINQ] Database User does not care how DBMS executes an SQL query Handles large number of concurrent queries SQL standard

New Database Architectures? DBMS for High-density clusters, Virtualization platform. [Ibrahim 09] SSD & SSD-Matrix as storage / high speed cache / virtualized storage GPU aided computing or DBMS on GPU [He 08]

How about Data Integration? Domain heterogeneous data sources Need to scale More data sources co-operate in query processing Larger scope of schema Data integration as a cloud service

Our preliminary model

Opportunities with Cloud Cloud gathers data sources into highly available clusters. Cloud reduces the overhead of replicating data sources. Cloud simplifies the underlay communication. Peer-to-Peer model maintains autonomy of individual data sources and scales

Other topics Security & privacy New business model Inter-cloud data exchange

References [stonebraker 2010] Michael Stonebraker. MapReduce and Parallel DBMSs, Friends or Foes?, CACM 2010 [Ibrahim 09] Shadi Ibrahim et, al. CloudLet: Towards MapReduce Implementation on Virtual Machines. HPDC09 [chen 09] Qiming Chen et, al. Efficiently Support MapReduce-like Computation Models Inside Parallel DBMS. IDEAS 2009 [Loebman] Analyzing Massive Astrophysical Datasets: Can Pig/Hadoop or a Relational DBMS Help? [He08] Mars: A MapReduce Framework on Graphics Processors

Reference (2) [LINQ] Michael Isard, Yuan Yu. Distributed Data-Parallel Computing Using a High-Level Programming Language. Sigmod 09 [Pavlo 09] Andrew Pavlo et, al. A Comparison of Approaches to Large-Scale Data Analysis. Sigmod 09 [Foster 09 ] Ian Foster et,al. Cloud Computing and Grid Computing 360-Degree Compared. [Web2.0 expo] Scott Gerard. Maximize your web2.0 experience with cloud computing. (presentation on web2.0 expo 2009) [Barham 03] Xen and Art of Virtualization, SOSP 2003