Modern Databases NoSQL and NewSQL

Modern Databases NoSQL and NewSQL Willem Visser RW334

Relational DBs Cannot Handle Web-Scale, or Can They?
- To be honest, the jury is out on this one
- NoSQL: an attempt at using non-relational solutions
- NewSQL: scaling relational DBs

The NoSQL Movement
- NoSQL means "Not Only SQL"; it is not "No SQL"
- "Not only relational" would have been a better name
- Use the right tools (DBs) for the job
- It is more like a feature set, or even the negation of a feature set

Definition from nosql-database.org
"Next Generation Databases mostly addressing some of the points: being non-relational, distributed, open-source and horizontally scalable. The original intention has been modern web-scale databases. The movement began early 2009 and is growing rapidly. Often more characteristics apply such as: schema-free, easy replication support, simple API, eventually consistent / BASE (not ACID), a huge amount of data and more. So the misleading term "nosql" (the community now translates it mostly with "not only sql") should be seen as an alias to something like the definition above."

NoSQL
http://nosql-database.org/
- Non-relational
- Scalability
  - Vertically: add more resources (CPU, memory, disk) to a single machine
  - Horizontally: add more machines (storage nodes)
- Collection of structures: hashtables, maps, dictionaries
- No predefined schema
- No join operations
- CAP, not ACID
  - CAP: Consistency, Availability and Partition tolerance (but not all three at once!)
  - ACID: Atomicity, Consistency, Isolation and Durability

Advantages of NoSQL
- Cheap, easy to implement
- Data are replicated and can be partitioned
- Easy to distribute
- Don't require a schema
- Can scale up and down
- Quickly process large amounts of data
- Relax the data consistency requirement (CAP)
- Can handle web-scale data, whereas relational DBs cannot

Disadvantages of NoSQL
- New and sometimes buggy
- Data is generally duplicated, with potential for inconsistency
- No standardized schema
- No standard format for queries
- No standard language
- Difficult to impose complicated structures
- Depend on the application layer to enforce data integrity
- No guarantee of support
- Too many options: which one, or ones, to pick?

NoSQL Presentation
- Introduction to NoSQL by John Nunemaker: http://glennas.wordpress.com/2011/03/11/introduction-to-nosql-john-nunemaker-presentation-from-june-2010/
- Added it to our pages at:
  - Movie: http://www.cs.sun.ac.za/rw334/nosql.mp4
  - Slides: http://www.cs.sun.ac.za/rw334/whynosql.pdf

NoSQL Options
- Key-Value Stores
  - This is technology you know and love and use all the time, a hashmap for example
  - Put(key, value)
  - value = Get(key)
- Examples
  - Redis (my favorite!!), an in-memory store
  - Memcached
  - and 100s more
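
The Put/Get interface on this slide can be sketched in a few lines of Python. This is only an illustration built on a plain dict: the class name `KeyValueStore`, its methods and the sample key are assumptions made for the example, not the API of Redis or Memcached.

```python
from typing import Any, Dict, Optional


class KeyValueStore:
    """Toy in-memory key-value store backed by a dict (a hash map)."""

    def __init__(self) -> None:
        self._data: Dict[str, Any] = {}

    def put(self, key: str, value: Any) -> None:
        # The value is opaque to the store: it is saved as-is, never inspected.
        self._data[key] = value

    def get(self, key: str) -> Optional[Any]:
        # Lookup is by exact key only; a missing key yields None.
        return self._data.get(key)


store = KeyValueStore()
store.put("user:42", {"name": "Ada"})
print(store.get("user:42"))   # the stored value
print(store.get("user:99"))   # None, the key was never stored
```

The only access path is the key itself, which is what makes such stores simple to partition across machines.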

Column Stores
- Not to be confused with the relational-DB version of this (Sybase IQ etc.)
- Multi-dimensional map
- Not all entries are relevant each time
- Column families
- Examples
  - Cassandra
  - HBase
  - Amazon SimpleDB
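
A rough way to picture the "multi-dimensional map" with column families is a nested dictionary: row key, then column family, then column name. The sketch below is a simplified model under that assumption; the names `table` and `read_column` and the sample rows are made up, and real systems such as Cassandra or HBase add timestamps, persistence and distribution on top.

```python
from collections import defaultdict

# row key -> column family -> column name -> value
table = defaultdict(lambda: defaultdict(dict))

# Rows are sparse: each row stores only the columns that matter for it.
table["user:1"]["profile"]["name"] = "Ada"
table["user:1"]["profile"]["email"] = "ada@example.org"
table["user:2"]["profile"]["name"] = "Linus"
table["user:2"]["activity"]["last_login"] = "monday"


def read_column(row_key, family, column):
    """Return a cell value, or None when the row does not store that column."""
    # .get() is used so that reads never create empty rows or families.
    return table.get(row_key, {}).get(family, {}).get(column)


print(read_column("user:1", "profile", "email"))
print(read_column("user:1", "activity", "last_login"))  # None: not stored for this row
```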

Document Stores
- Key-document stores
- However, the document can be seen as a value, so you can consider this a superset of key-value
- The big difference is that in document stores one can also query on the document, i.e. the document portion is structured (not just a blob of data)
- Examples
  - MongoDB
  - CouchDB
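
The difference from a pure key-value store can be shown with a small sketch: documents are still fetched by key, but because they are structured they can also be filtered on their fields. The `collection` data and the `find` helper are hypothetical and only mimic the idea; this is not the query API of MongoDB or CouchDB.

```python
# Hypothetical collection: document id -> structured document (not an opaque blob).
collection = {
    "p1": {"title": "Intro to NoSQL", "author": "alice", "year": 2011},
    "p2": {"title": "Graph stores", "author": "bob", "year": 2012},
    "p3": {"title": "NewSQL", "author": "bob", "year": 2013},
}


def find(docs, **criteria):
    """Return the ids of documents whose fields match every given criterion."""
    return sorted(
        doc_id
        for doc_id, doc in docs.items()
        if all(doc.get(field) == wanted for field, wanted in criteria.items())
    )


print(collection["p1"]["title"])             # key-value style access by id
print(find(collection, author="bob"))        # query on a field inside the document
print(find(collection, author="bob", year=2013))
```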

Graph Stores
- Use a graph structure
- Labeled, directed, attributed multi-graph
  - Label for each edge
  - Directed edges
  - Multiple attributes per node
  - Multiple edges between nodes
- Relational DBs can model graphs, but an edge requires a join, which is expensive
- Example: Neo4j
  http://www.infoq.com/articles/graph-nosql-neo4j
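
A labeled, directed, attributed multi-graph can be modelled minimally as a node table plus per-node lists of outgoing edges. The sketch below assumes that layout; `PropertyGraph` and its methods are illustrative names and do not reflect how Neo4j stores or queries data.

```python
from collections import defaultdict


class PropertyGraph:
    """Toy labeled, directed, attributed multi-graph."""

    def __init__(self):
        self.nodes = {}                     # node id -> attribute dict
        self.out_edges = defaultdict(list)  # node id -> [(label, target id), ...]

    def add_node(self, node_id, **attrs):
        # Each node can carry any number of attributes.
        self.nodes[node_id] = attrs

    def add_edge(self, source, label, target):
        # Edges are directed and labeled; using a list permits several
        # edges between the same pair of nodes (multi-graph).
        self.out_edges[source].append((label, target))

    def neighbours(self, source, label):
        # Following an edge is a direct lookup from the source node, no join.
        return [t for (l, t) in self.out_edges.get(source, []) if l == label]


g = PropertyGraph()
g.add_node("alice", age=30, city="Stellenbosch")
g.add_node("bob", age=25)
g.add_node("carol")
g.add_edge("alice", "KNOWS", "bob")
g.add_edge("alice", "WORKS_WITH", "bob")   # second edge between the same nodes
g.add_edge("alice", "KNOWS", "carol")
print(g.neighbours("alice", "KNOWS"))
```

In a relational model the same traversal would typically join an edge table against a node table for every hop.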

451 Group Report (Not Free)
http://blogs. the451group
SPRAIN characteristics
- Scalability: hardware economics
- Performance: MySQL limitations
- Relaxed consistency: CAP theorem
- Agility: polyglot persistence
- Intricacy: big data, total data
- Necessity: open source
All NoSQL and NewSQL systems are evaluated according to SPRAIN

Polyglot Persistence
- Using different DB technologies for different storage requirements
- http://martinfowler.com/bliki/PolyglotPersistence.html

NewSQL
- Just like NoSQL, it is more of a movement than a specific product or even product family
- The “New” refers to the vendors and not the SQL
- Goal(s):
  - Bring the benefits of the relational model to distributed architectures (VoltDB, ScaleDB, etc.), or
  - Improve relational DB performance so that horizontal scaling is no longer required (Tokutek, ScaleBase, etc.)
- “SQL-as-a-service”: Amazon RDS, Microsoft SQL Azure, Google Cloud SQL

1 Year From Now
- The terms NoSQL and NewSQL will no longer be there
- Focus will be on how to map problems onto solutions
- Whether it is SQL, NoSQL or NewSQL will hopefully be irrelevant