Amirhossein Saberi May 2016. CASSANDRA NAME A daughter of the Trojan king Priam, who was given the gift of prophecy by Apollo. When she cheated him, however,

Slides:



Advertisements
Similar presentations
Chen Zhang Hans De Sterck University of Waterloo
Advertisements

CASSANDRA-A Decentralized Structured Storage System Presented By Sadhana Kuthuru.
Large Scale Computing Systems
Brewer’s Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services Authored by: Seth Gilbert and Nancy Lynch Presented by:
Data Management in the Cloud Paul Szerlip. The rise of data Think about this o For the past two decades, the largest generator of data was humans -- now.
A Survey of Distributed Database Management Systems Brady Kyle CSC
Cassandra Structured Storage System over a P2P Network Avinash Lakshman, Prashant Malik.
Cloud Storage Yizheng Chen. Outline Cassandra Hadoop/HDFS in Cloud Megastore.
NoSQL Databases: MongoDB vs Cassandra
Distributed Systems Fall 2010 Replication Fall 20105DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Cassandra Database Project Alireza Haghdoost, Jake Moroshek Computer Science and Engineering University of Minnesota-Twin Cities Nov. 17, 2011 News Presentation:
Overview Distributed vs. decentralized Why distributed databases
Distributed Systems Fall 2009 Replication Fall 20095DV0203 Outline Group communication Fault-tolerant services –Passive and active replication Highly.
Presentation by Krishna
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
CS346: Advanced Databases
NoSQL Database.
Distributed Databases
Cloud Storage: All your data belongs to us! Theo Benson This slide includes images from the Megastore and the Cassandra papers/conference slides.
1 Yasin N. Silva Arizona State University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Databases with Scalable capabilities Presented by Mike Trischetta.
Getting Biologists off ACID Ryan Verdon 3/13/12. Outline Thesis Idea Specific database Effects of losing ACID What is a NoSQL database Types of NoSQL.
Distributed Indexing of Web Scale Datasets for the Cloud {ikons, eangelou, Computing Systems Laboratory School of Electrical.
Modern Databases NoSQL and NewSQL Willem Visser RW334.
Molecular Transactions G. Ramalingam Kapil Vaswani Rigorous Software Engineering, MSRI.
1 Yasin N. Silva Arizona State University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Apache Cassandra - Distributed Database Management System Presented by Jayesh Kawli.
NoSQL Databases Oracle - Berkeley DB. Content A brief intro to NoSQL About Berkeley Db About our application.
Cassandra - A Decentralized Structured Storage System
Cassandra – A Decentralized Structured Storage System Lecturer : Prof. Kyungbaek Kim Presenter : I Gde Dharma Nugraha.
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 12 Distributed Database Management Systems.
CAP Theorem Justin DeBrabant CIS Advanced Systems - Fall 2013.
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
NoSQL Or Peles. What is NoSQL A collection of various technologies meant to work around RDBMS limitations (mostly performance) Not much of a definition...
{ Tanya Chaturvedi MBA(ISM) Hadoop is a software framework for distributed processing of large datasets across large clusters of computers.
Distributed databases A brief introduction with emphasis on NoSQL databases Distributed databases1.
VICTORIA UNIVERSITY OF WELLINGTON Te Whare Wananga o te Upoko o te Ika a Maui SWEN 432 Advanced Database Design and Implementation Cassandra Architecture.
Look Mom! – NoSQL Charles Nurse | DotNetNuke Corp.
Plan for Final Lecture What you may expect to be asked in the Exam?
and Big Data Storage Systems
Cloud Computing and Architecuture
Cassandra - A Decentralized Structured Storage System
PGT(CS) ,KV JHAGRAKHAND
An Open Source Project Commonly Used for Processing Big Data Sets
Cassandra Tools and Config Files
CS122B: Projects in Databases and Web Applications Winter 2017
A free and open-source distributed NoSQL database
Introduction In the computing system (web and business applications), there are enormous data that comes out every day from the web. A large section of.
Dynamo: Amazon’s Highly Available Key-value Store
Cassandra Transaction Processing
Modern Databases NoSQL and NewSQL
NOSQL.
The NoSQL Column Store used by Facebook
Database Concepts.
Christian Stark and Odbayar Badamjav
Introduction to NewSQL
NOSQL databases and Big Data Storage Systems
Central Florida Business Intelligence User Group
A Comparison of SQL and NoSQL Databases
1 Demand of your DB is changing Presented By: Ashwani Kumar
NOSQL and CAP Theorem.
NoSQL Databases An Overview
Distributed P2P File System
Database Systems Summary and Overview
CSE 482 Lecture 5: NoSQL.
H-store: A high-performance, distributed main memory transaction processing system Robert Kallman, Hideaki Kimura, Jonathan Natkins, Andrew Pavlo, Alex.
Transaction Properties: ACID vs. BASE
NoSQL & Document Stores
NoSQL databases An introduction and comparison between Mongodb and Mysql document store.
Presentation transcript:

Amirhossein Saberi May 2016

CASSANDRA NAME A daughter of the Trojan king Priam, who was given the gift of prophecy by Apollo. When she cheated him, however, he turned this into a curse by causing her prophecies, though true, to be disbelieved.

RDBMS VS NOSQL Vertical scalingHorizontal scaling

WHAT'S WRONG WITH RELATIONAL DATABASE?

ACID GUARANTEE TRANSACTIONS ARE PROCESSED A tomic: Atomic means “all or nothing” C onsistent: Consistent means that data moves from one correct state to another correct state I solated: Isolated means that transactions executing concurrently will not become entangled with each other D urable: Once a transaction has succeeded, the changes will not be lost

INTRODUCTION Original authors: Avinash Lakshman, Prashant Malik Developer: Apache Software Foundation (Initially developed by Facebook to power their Inbox Search feature) Initial release: 2008 Stable release: 3.4 / March 8,2016 Written in: Java Type: Database / NoSQL License: Apache License 2.0 (permissive free software license) Apache Cassandra is an open source distributed database management system designed to handle large amounts of data across many commodity servers, providing high availability with no single point of failure.

MAIN FEATURES Decentralized: Data located, stored, and maintained in a single location Supports replication and multi data center replication Scalability (Elastic Scaling) Fault-tolerant Tunable consistency: Strong consistency or eventual? MapReduce support: programming model and an associated implementation for processing and generating large data sets with a parallel, distributed algorithm on a cluster. Cassandra Query language (CQL) Fast write

WHAT PROBLEM DO YOU HAVE? RDBMS, NoSQL. The horse, the car, the plane. They each build on prior art, they each attempt to solve certain problems, and so they’re each good at certain things—and less good at others. They each coexist, even now. Do u need to fully support transactions with ACID guarantee? Do u need to have high availability and horizontal scalability?

CAP THEOREM Consistency (all nodes see the same data at the same time - every read would get you the most recent write) Availability (a guarantee that every request receives a response about whether it succeeded or failed) Partition tolerance (the system continues to operate despite arbitrary partitioning due to network failures)

CASSANDRA IN CAP

DATA MODEL MySQL Cassandra

REPLICATION Simple Strategy:Specifies a simple replication factor for the cluster. Network Topology Strategy: Using this option, you can set the replication factor for each data-center independently. Old Network Topology Strategy: This is a legacy replication strategy.

INSTALLATION 1.Download & Install JAVA 2.Download & Install Cassandra

COMMANDS bin]$ cqlsh Connected to Test Cluster at :9042. [cqlsh | Cassandra | CQL spec | Native protocol v3] Use HELP for help. cqlsh> -CREATE KEYSPACE “KeySpace Name WITH replication = {'class': ‘Strategy name’, 'replication_factor' : ‘No.Of replicas’}; -CREATE TABLE tablename( column1 name datatype PRIMARYKEY, column2 name data type, column3 name data type. ) -INSERT INTO (,....) VALUES (,....) USING -SELECT FROM

REFERENCES