Cloud Storage Yizheng Chen. Outline Cassandra Hadoop/HDFS in Cloud Megastore.

Slides:



Advertisements
Similar presentations
From Startup to Enterprise A Story of MySQL Evolution Vidur Apparao, CTO Stephen OSullivan, Manager of Data and Grid Technologies April 2009.
Advertisements

Chen Zhang Hans De Sterck University of Waterloo
Megastore: Providing Scalable, Highly Available Storage for Interactive Services. Presented by: Hanan Hamdan Supervised by: Dr. Amer Badarneh 1.
Dynamo: Amazon’s Highly Available Key-value Store
CASSANDRA-A Decentralized Structured Storage System Presented By Sadhana Kuthuru.
Data Management in the Cloud Paul Szerlip. The rise of data Think about this o For the past two decades, the largest generator of data was humans -- now.
Cassandra A Decentralized, Structured Storage System Avinash Lakshman and Prashant Malik Facebook Published: April 2010, Volume 44, Issue 2 Communications.
AMAZON’S KEY-VALUE STORE: DYNAMO DeCandia,Hastorun,Jampani, Kakulapati, Lakshman, Pilchin, Sivasubramanian, Vosshall, Vogels: Dynamo: Amazon's highly available.
Map/Reduce in Practice Hadoop, Hbase, MongoDB, Accumulo, and related Map/Reduce- enabled data stores.
A Hadoop Overview. Outline Progress Report MapReduce Programming Hadoop Cluster Overview HBase Overview Q & A.
Cassandra Structured Storage System over a P2P Network Avinash Lakshman, Prashant Malik.
Cloud Storage Theo Benson. Outline Distributed storage – Commodity server, limited resources, – Geodistribution, scalable, reliable Cassandra [FB] – High.
Hadoop YARN in the Cloud Junping Du Staff Engineer, VMware China Hadoop Summit, 2013.
NoSQL Databases: MongoDB vs Cassandra
© 2009 VMware Inc. All rights reserved Big Data’s Virtualization Journey Andrew Yu Sr. Director, Big Data R&D VMware.
HBase Presented by Chintamani Siddeshwar Swathi Selvavinayakam
Cassandra Database Project Alireza Haghdoost, Jake Moroshek Computer Science and Engineering University of Minnesota-Twin Cities Nov. 17, 2011 News Presentation:
 Need for a new processing platform (BigData)  Origin of Hadoop  What is Hadoop & what it is not ?  Hadoop architecture  Hadoop components (Common/HDFS/MapReduce)
Google Bigtable A Distributed Storage System for Structured Data Hadi Salimi, Distributed Systems Laboratory, School of Computer Engineering, Iran University.
A Decentralized Structure Storage Model - Avinash Lakshman & Prashanth Malik - Presented by Srinidhi Katla CASSANDRA.
Inexpensive Scalable Information Access Many Internet applications need to access data for millions of concurrent users Relational DBMS technology cannot.
Thanks to our Sponsors! To connect to wireless 1. Choose Uguest in the wireless list 2. Open a browser. This will open a Uof U website 3. Choose Login.
Cloud Storage: All your data belongs to us! Theo Benson This slide includes images from the Megastore and the Cassandra papers/conference slides.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
Windows Azure SQL Database and Storage Name Title Organization.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Cloud MapReduce : a MapReduce Implementation on top of a Cloud Operating System Speaker : 童耀民 MA1G Authors: Huan Liu, Dan Orban Accenture.
HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.
Distributed Indexing of Web Scale Datasets for the Cloud {ikons, eangelou, Computing Systems Laboratory School of Electrical.
Hadoop Basics -Venkat Cherukupalli. What is Hadoop? Open Source Distributed processing Large data sets across clusters Commodity, shared-nothing servers.
Introduction to Hadoop and HDFS
VLDB2012 Hoang Tam Vo #1, Sheng Wang #2, Divyakant Agrawal †3, Gang Chen §4, Beng Chin Ooi #5 #National University of Singapore, †University of California,
Cloud Computing Cloud Data Serving Systems Keke Chen.
High Throughput Computing on P2P Networks Carlos Pérez Miguel
IMDGs An essential part of your architecture. About me
Cassandra - A Decentralized Structured Storage System
WINDOWS AZURE STORAGE SERVICES A brief comparison and overview of storage services offered by Microsoft.
Cassandra – A Decentralized Structured Storage System Lecturer : Prof. Kyungbaek Kim Presenter : I Gde Dharma Nugraha.
CEPH: A SCALABLE, HIGH-PERFORMANCE DISTRIBUTED FILE SYSTEM S. A. Weil, S. A. Brandt, E. L. Miller D. D. E. Long, C. Maltzahn U. C. Santa Cruz OSDI 2006.
CS 347Lecture 9B1 CS 347: Parallel and Distributed Data Management Notes 13: BigTable, HBASE, Cassandra Hector Garcia-Molina.
Introduction to Hbase. Agenda  What is Hbase  About RDBMS  Overview of Hbase  Why Hbase instead of RDBMS  Architecture of Hbase  Hbase interface.
VMware vSphere Configuration and Management v6
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
HADOOP DISTRIBUTED FILE SYSTEM HDFS Reliability Based on “The Hadoop Distributed File System” K. Shvachko et al., MSST 2010 Michael Tsitrin 26/05/13.
NoSQL Or Peles. What is NoSQL A collection of various technologies meant to work around RDBMS limitations (mostly performance) Not much of a definition...
Nov 2006 Google released the paper on BigTable.
Technology Drill Down: Windows Azure Platform Eric Nelson | ISV Application Architect | Microsoft UK |
An Introduction to Super-Scalability But first…
Department of Computer Science, Johns Hopkins University EN Instructor: Randal Burns 24 September 2013 NoSQL Data Models and Systems.
VICTORIA UNIVERSITY OF WELLINGTON Te Whare Wananga o te Upoko o te Ika a Maui SWEN 432 Advanced Database Design and Implementation Cassandra Architecture.
BIG DATA/ Hadoop Interview Questions.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
St. Petersburg, 2016 Openstack Disk Storage vs Amazon Disk Storage Computing Clusters, Grids and Cloud Erasmus Mundus Master Program in PERCCOM Author:
BI 202 Data in the Cloud Creating SharePoint 2013 BI Solutions using Azure 6/20/2014 SharePoint Fest NYC.
Amirhossein Saberi May CASSANDRA NAME A daughter of the Trojan king Priam, who was given the gift of prophecy by Apollo. When she cheated him, however,
Why NO-SQL ?  Three interrelated megatrends  Big Data  Big Users  Cloud Computing are driving the adoption of NoSQL technology.
1 Gaurav Kohli Xebia Breaking with DBMS and Dating with Relational Hbase.
Plan for Final Lecture What you may expect to be asked in the Exam?
and Big Data Storage Systems
Cassandra - A Decentralized Structured Storage System
Introduction to Distributed Platforms
INTRODUCTION TO PIG, HIVE, HBASE and ZOOKEEPER
CS122B: Projects in Databases and Web Applications Winter 2017
Dynamo: Amazon’s Highly Available Key-value Store
NOSQL.
Introduction to PIG, HIVE, HBASE & ZOOKEEPER
آزمايشگاه سيستمهای هوشمند علی کمالی زمستان 95
Outline Virtualization Cloud Computing Microsoft Azure Platform
Introduction to Apache
Cloud Computing for Data Analysis Pig|Hive|Hbase|Zookeeper
Presentation transcript:

Cloud Storage Yizheng Chen

Outline Cassandra Hadoop/HDFS in Cloud Megastore

Overview Extension of Bigtable with aspects of Dynamo Motivations: – High Availability – High Write Throughput – Fail Tolerance

Data Model Table is a multi dimensional map indexed by key (row key). Columns are grouped into Column Families. 2 Types of Column Families – Simple – Super (nested Column Families) Each Column has – Name – Value – Timestamp

Data Model keyspace settings column family settings column namevaluetimestamp * Figure taken from Eben Hewitt’s (author of Oreilly’s Cassandra book) slides.

System Architecture Partitioning How data is partitioned across nodes Replication How data is duplicated across nodes

Partitioning Nodes are logically structured in Ring Topology. Hashed value of key associated with data partition is used to assign it to a node in the ring. Hashing rounds off after certain value to support ring structure. Lightly loaded nodes moves position to alleviate highly loaded nodes.

Replication Each data item is replicated at N (replication factor) nodes. Different Replication Policies – Rack Unaware – replicate data at N-1 successive nodes after its coordinator – Rack Aware – uses ‘Zookeeper’ to choose a leader which tells nodes the range they are replicas for – Datacenter Aware – similar to Rack Aware but leader is chosen at Datacenter level instead of Rack level.

Partitioning and Replication 01 1/2 F E D C B A N=3 h(key2) h(key1) 9 * Figure taken from Avinash Lakshman and Prashant Malik (authors of the paper) slides.

Local Persistence Relies on local file system for data persistency. Write operations happens in 2 steps – Write to commit log in local disk of the node – Update in-memory data structure. Read operation – Looks up in-memory ds first before looking up files on disk. – Uses Bloom Filter (summarization of keys in file store in memory) to avoid looking up files that do not contain the key.

Read Operation Query Closest replica Cassandra Cluster Replica A Result Replica BReplica C Digest Query Digest Response Result Client Read repair if digests differ * Figure taken from Avinash Lakshman and Prashant Malik (authors of the paper) slides.

Outline Cassandra Hadoop/HDFS in Cloud Megastore

Motivation online services should be highly scalable (millions of users) responsive to users highly available provide a consistent view of data (update should be visible immediately and durable) it is challenging for the underlying storage system to meet all these potential conflicting requirements

Motivation Existing solutions not sufficient NoSQL vs. RDBMS

NoSQL (Bigtable, HBase, Cassandra) – Merits: + Highly scalable – Limitations: ‐ Less features to build applications (transaction at the granularity of single key, poor schema support and query capability, limited API (put/get)) ‐ Loose consistency models with asynchronous replication (eventual consistency)

NoSQL vs. RDBMS RDBMS (mysql) – Merits: + Mature, rich data management features, easy to build applications + Synchronous replication comes with strong transactional semantics – Limitations: ‐ Hard to scale ‐ Synchronous replication may have performance and scalability issues - may not have fault‐tolerant replication mechanisms

Contribution Megastore combines the merits from both NoSQL and RDBMS – it is both scalable and highly available, – allows fast development and provides enhanced replication consistency A Paxos‐based wide‐area synchronous replication method for primary user data on every write across datacenters to meet scalability and performance requirements

Scale Axis vs. Wide Replication Axis For scalability and throughput, based on the observation that most internet services can be suitably partitioned, Megastore partitions the database into entity groups (small sub‐databases), each with its replicated log stored in per‐replica/per‐datacenter Bigtable.

More Features adds rich primitives to Bigtable such as ACID transactions, several kinds of indexes and queues under the constraint of tolerable latency and scalability requirements – ease the application development – provides transactional features within an entity group, but only gives limited consistency guarantees across entity groups – Operations across entity groups usually use asynchronous – messaging and rely on two‐phase commits

Consistency In an entity group which provides serializable ACID semantics, read and write could happen at the same time – leverages the feature of Bigtable – multiple values can be stored in a cell with multiple timestamps – Megastore also provides different kinds of read with different consistency Current read Snapshot read Inconsistent read

Outline Cassandra Hadoop/HDFS in Cloud Megastore

Hadoop in Cloud Hadoop on a virtualized infrastructure – Benefits: Datacenter efficiency (server consolidation), Multi-tenancy – multiple application servers consolidated onto one physical server, with little or no decrease in overall performance Simple to operate – Rapid Provisioning/deployment – Easy clone/move of Cluster (encapsulate an application into an image) High Availability (HA) – No more single point of failure – Any affected virtual machines are restarted on another host in a VMware cluster – Virtual machines may be moved from server to server with no downtime using VMware vMotion™ Elastic Scaling – On demand shrink/expand – Resource guarantee – Independent scaling of compute and data

Hadoop in Cloud Hadoop on a virtualized infrastructure – Concerns: Data locality for performance: move computation to data Replica placement for reliability – What’s different in virtualized environment: Extra redundancy: Windows Azure Blob Storage VMs on the same physical host are affected by the same hardware failure The “network” between VMs on the same physical host is better

Hadoop in Cloud How Hadoop could be running on a virtual platform – Solutions from VMWare: Hadoop Virtualization Extensions (HVE) – Make Hadoop virtualization-aware – Enhance the support for failure and locality topologies Serengeti project in CloudFoundry (open source PaaS) – – Enable the rapid deployment of an Apache Hadoop cluster (HDFS, MapReduce, Pig, Hive,..) on a virtual platform – Several options Add/remove VMs (virtual nodes) Grow/shrink a VM Separate compute from data – 2012 Hadoop Summit by Richard McDougall, Application Infrastructure CTO/ Big Data Chief Architect at VMWare

Add/remove VMs (virtual nodes) Virtualization Host Virtual Hadoop Node Other Workload VMDK Datanode Task Tracker Slot Virtual Hadoop Node VMDK Datanode Task Tracker Slot * Powering off the Hadoop VM would in effect fail the datanode * Adding a node would require TBs of data replication

Grow/shrink a VM Virtualization Host Virtual Hadoop Node Other Workload VMDK Datanode Task Tracker Slot Add/Remove Slots

Separate compute from data Virtual Hadoop Node Datanode Virtualization Host Virtual Hadoop Node Other Workload VMDK Task Tracker Slot Virtual Hadoop Node VMDK Task Tracker Slot Virtual Hadoop Node Virtual Hadoop Node Task Tracker Slot Truly Elastic Hadoop: Scalable through virtual nodes; Enable shared workloads

Dataflow with separated Compute/Data Virtualization Host Virtual Hadoop Node VMDK Datanode Virtual Hadoop Node NodeManager Slot Virtual Switch Virtual NIC NIC Drivers

Separate Compute Clusters Virtual Hadoop Queue Host Distributed File System (HDFS, KFS, GPFS, MAPR, Isilon,…) Namespace Virtual Hadoop Virtual Hadoop Virtual Hadoop Public Secret Data Layer Runtime Layer Coke Pepsi

Separate Compute Clusters Separate virtual clusters per tenant – Performance isolation: SLA – Configuration isolation: enable deployment of multiple Hadoop runtime versions – Security isolation: stronger VM-grade security and resource isolation