NoSQL, No SQL!!, No, SQL? Raj Nair, Penton. Variety is the spice of life Key-Value stores Document stores ColumnFam ily Graph Hybrid Spice can lead to.

Slides:



Advertisements
Similar presentations
Data Management in the Cloud Paul Szerlip. The rise of data Think about this o For the past two decades, the largest generator of data was humans -- now.
Advertisements

A Fast Growing Market. Interesting New Players Lyzasoft.
Dos and don’ts of Columnstore indexes The basis of xVelocity in-memory technology What’s it all about The compression methods (RLE / Dictionary encoding)
Relational Database Alternatives NoSQL. Choosing A Data Model Relational database underpin legacy applications and meet business needs However, companies.
NoSQL Databases: MongoDB vs Cassandra
Introduction to Backend James Kahng. Install Node.js.
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
NoSQL Database.
Platform as a Service (PaaS)
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1 Preview of Oracle Database 12 c In-Memory Option Thomas Kyte
A Study in NoSQL & Distributed Database Systems John Hawkins.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
Introduction To Windows Azure Cloud
Training Workshop Windows Azure Platform. Presentation Outline (hidden slide): Technical Level: 200 Intended Audience: Developers Objectives (what do.
HBase A column-centered database 1. Overview An Apache project Influenced by Google’s BigTable Built on Hadoop ▫A distributed file system ▫Supports Map-Reduce.
Getting Biologists off ACID Ryan Verdon 3/13/12. Outline Thesis Idea Specific database Effects of losing ACID What is a NoSQL database Types of NoSQL.
Goodbye rows and tables, hello documents and collections.
September 2011Copyright 2011 Teradata Corporation1 Teradata Columnar.
1 © 2012 OpenLink Software, All rights reserved. Virtuoso - Column Store, Adaptive Techniques for RDF Orri Erling Program Manager, Virtuoso Openlink Software.
KIT – University of the State of Baden-Württemberg and National Large-scale Research Center of the Helmholtz Association Institute of Applied Informatics.
NoSQL Databases NoSQL Concepts SoftUni Team Technical Trainers Software University
Cloud Computing Clase 8 - NoSQL Miguel Johnny Matias
Methodological Foundations of Biomedical Informatics (BMSC-GA 4449) Himanshu Grover.
 70s - Database access is hard and depends on the app  80s – Relational databases come on the scene  90s – Object oriented programming and DBs  00s.
Cassandra - A Decentralized Structured Storage System
Discussion MySQL&Cassandra ZhangGang 2012/11/22. Optimize MySQL.
When bet365 met Riak and discovered a true, “always on” database.
MySQL to NoSQL Data Modeling Challenges in Supporting Scalability ΧΑΡΟΚΟΠΕΙΟ ΠΑΝΕΠΙΣΤΗΜΙΟ - ΤΜΗΜΑ ΠΛΗΡΟΦΟΡΙΚΗΣ ΚΑΙ ΤΗΛΕΜΑΤΙΚΗΣ ΠΜΣ "Πληροφορική και Τηλεματική“
Development of Hybrid SQL/NoSQL PanDA Metadata Storage PanDA/ CERN IT-SDC meeting Dec 02, 2014 Marina Golosova and Maria Grigorieva BigData Technologies.
Large-scale Linked Data Management Marko Grobelnik, Andreas Harth (Günter Ladwig), Dumitru Roman Big Linked Data Tutorial Semantic Days 2012.
MongoDB is a database management system designed for web applications and internet infrastructure. The data model and persistence strategies are built.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Lecture 8: Databases and Data Infrastructure CS 6071 Big Data Engineering, Architecture, and Security Fall 2015, Dr. Rozier.
NOSQL Implementation and examples Maciej Matuszewski.
Clusterpoint Margarita Sudņika ms RDBMS & NoSQL Databases & tables → Document stores Columns, rows → Schemaless documents Scales UP → Scales UP.
NoSQL Or Peles. What is NoSQL A collection of various technologies meant to work around RDBMS limitations (mostly performance) Not much of a definition...
Nov 2006 Google released the paper on BigTable.
NOSQL DATABASE Not Only SQL DATABASE
NoSQL Cheng Lei Department of Electrical and Computer Engineering University of Victoria Mar 05, 2015.
Cloudera Kudu Introduction
NoSQL: Graph Databases. Databases Why NoSQL Databases?
Technology Drill Down: Windows Azure Platform Eric Nelson | ISV Application Architect | Microsoft UK |
An Introduction to Super-Scalability But first…
JSON C# Libraries Parsing JSON Files “Deserialize” OR Generating JSON Files “Serialize” JavaScriptSerializer.NET Class JSON.NET.
Information Eastman. Business Process Skills Order to Cash, Forecasting & Budgeting, etc. Process Modeling Project Management Technical Skills.
CS422 Principles of Database Systems Introduction to NoSQL Chengyu Sun California State University, Los Angeles.
Group members: Phạm Hoàng Long Nguyễn Huy Hùng Lê Minh Hiếu Phan Thị Thanh Thảo Nguyễn Đức Trí 1 BIG DATA & NoSQL Topic 1:
BIG DATA/ Hadoop Interview Questions.
Oracle Announced New In- Memory Database G1 Emre Eftelioglu, Fen Liu [09/27/13] 1 [1]
Abstract MarkLogic Database – Only Enterprise NoSQL DB Aashi Rastogi, Sanket V. Patel Department of Computer Science University of Bridgeport, Bridgeport,
1 Analysis on the performance of graph query languages: Comparative study of Cypher, Gremlin and native access in Neo4j Athiq Ahamed, ITIS, TU-Braunschweig.
DreamFactory for Microsoft Azure Is an Open Source REST API Platform That Enables Mobilization of Data in Minutes across Frameworks and Storage Methods.
Dive into NoSQL with Azure Niels Naglé Hylke Peek.
1 Gaurav Kohli Xebia Breaking with DBMS and Dating with Relational Hbase.
Neo4j: GRAPH DATABASE 27 March, 2017
and Big Data Storage Systems
Data Platform and Analytics Foundational Training
Cloud Computing and Architecuture
5/9/2018 7:28 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS.
CSE 775 – Distributed Objects Bekir Turkkan & Habib Kaya
MongoDB Er. Shiva K. Shrestha ME Computer, NCIT
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
NOSQL databases and Big Data Storage Systems
Designed for Big Data Visual Analytics, Zoomdata Allows Business Users to Quickly Connect, Stream, and Visualize Data in the Microsoft Azure Platform MICROSOFT.
11/18/2018 2:14 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Explore the Azure Cosmos DB with .NET Core 2.0
Overview of big data tools
relational thoughts on NoSql
NoSQL Overview + Elasticsearch Quick Dive
Copyright © JanBask Training. All rights reserved Get Started with Hadoop Hive HiveQL Languages.
Presentation transcript:

NoSQL, No SQL!!, No, SQL? Raj Nair, Penton

Variety is the spice of life Key-Value stores Document stores ColumnFam ily Graph Hybrid Spice can lead to heartburn

General Advantages Programmer friendly Web friendly No pre-defined schemasBetter

General Challenges More onus on the applicationLose the power of SQLDifferent needs can require different stores How much do you care about “immediate consistency” ?Good knowledge of access patterns for schema design

Key-Value stores Simple Fast Uses hash table/dict of keys and values Super fast for key based access Session Management High frequency atomic operations Caching

Document stores Rich Powerful Uses JSON format Structure Indexing capabilities REST interfaces Non-key based queries Schema All of that at Scale

Columnar Stores (Special Mention) Physical storage is “column” basedRows only materialized in memory Great for analysis/warehouse type workload Billions of rows, you want only a handful of columns Faster aggregation

Column Family Flexible High Scale Uses No format imposed Read/write Low latency Logs Messaging Temporal/TimeSeries

Graph databases Niche Uses Model Pairwise relations Networked systems Triple stores or RDF Recommendation engine base (matches on dating sites?)

“Other” Postgres – has built KV storage, HStoreRDF / triple stores – specialized graph storesXML storesNew SQL – yeah really!!

My “Hypothesis” – Long Tail, Data Applications using data Data available for active use PB100 TB50 TB500 TB500 GB Single server RDBMS Clustered RDBMS Upto low 100s of GB 100’s of GB to few TBs ** Not to Scale ** Illustrative only NoSQL

Workload Economics Mins to hours $ per GB <200 ms $$ per GB $$$$ per GB In-memory - Few GBs Operational NoSQL - 100’s GBs to few TBs Analytic NoSQL - TBs to 100’s TB Hadoop - TBs to PBs > 200 ms to 2 secs $$$ per GB Few seconds to minutes * Nieman Marcus approach presented at TDWI Solution Summit 2014

Popular Document Stores MongoDBCouchDB/CouchBase BenefitsDeveloper friendly Indexing Operations friendly, Easy scale out Caching support ChallengesHarder to scale, sharding, keys, write locks Unique query design, complex index creation CAPLeans towards “C”Leans towards “A” So when do I use it? - Your RDBMS is growing out of a single server environment or you are in clustered RDBMS mode - You’d rather respond correctly or not give an answer at all - You are anticipating a 2-4 node cluster - When low latency is a high priority for you - Additionally, you want richness of document store with query flexibility -Scale out management needs to be friendly -You prefer that app always gets a response

ColumnFamily HBaseCassandra BenefitsReally scales!! Columns and Rows!SQL “like” query language ChallengesLeaves a lot to the application code No query language Unique query design, complex index creation CAPLeans towards “C”Leans towards “A” So when do I use it? When eventual consistency is not an option, but you can live with “less availability” When you have a finite number of key- based access patterns When your programmers are comfortable buidling queries You are working on the Hadoop stack and prefer strong Hadoop integration When you need When query patterns are more complicated and you need to use secondary indexes When your developers prefer an “SQL like” interface for queries

Everyone has a say Dictionaries, variables, objects, arrays How many systems? Who has the skills? Stability Easy to meet changing needs, flexibility SLAs App Developer Ops Business Customer