NoSQL Systems Overview (as of November 2011).

Slides:



Advertisements
Similar presentations
No SQL is not about SQL No SQL is a Zoo.. Key-Value Stores Wide Column Stores Document Stores Graph Databases.
Advertisements

Large Scale Computing Systems
Data Management in the Cloud Paul Szerlip. The rise of data Think about this o For the past two decades, the largest generator of data was humans -- now.
Map/Reduce in Practice Hadoop, Hbase, MongoDB, Accumulo, and related Map/Reduce- enabled data stores.
NoSQL and Review 1. Big Data (some old numbers) Facebook:  130TB/day: user logs  TB/day: 83 million pictures Google: > 25 PB/day processed data.
Jennifer Widom NoSQL Systems Overview (as of November 2011 )
Reporter: Haiping Wang WAMDM Cloud Group
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
NoSQL Database.
CS 405G: Introduction to Database Systems 24 NoSQL Reuse some slides of Jennifer Widom Chen Qian University of Kentucky.
Massively Parallel Cloud Data Storage Systems S. Sudarshan IIT Bombay.
1 Yasin N. Silva Arizona State University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Project By: Anuj Shetye Vinay Boddula. Introduction Motivation HBase Our work Evaluation Related work. Future work and conclusion.
AN INTRODUCTION TO NOSQL DATABASES Karol Rástočný, Eduard Kuric.
SQL vs NOSQL Discussion
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
WTT Workshop de Tendências Tecnológicas 2014
Distributed Indexing of Web Scale Datasets for the Cloud {ikons, eangelou, Computing Systems Laboratory School of Electrical.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Introduction to Hadoop and HDFS
Modern Databases NoSQL and NewSQL Willem Visser RW334.
NoSQL Not Only SQL Edel Sherratt. What is NoSQL? Not Only SQL Large volumes of data No schema Partition tolerance – scale by adding more commodity servers.
Changwon Nati Univ. ISIE 2001 CSCI5708 NoSQL looks to become the database of the Internet By Lawrence Latif Wed Dec Nhu Nguyen and Phai Hoang CSCI.
Cloud Computing Clase 8 - NoSQL Miguel Johnny Matias
Jennifer Widom NoSQL Systems Motivation. Jennifer Widom NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
IBM Research ® © 2007 IBM Corporation A Brief Overview of Hadoop Eco-System.
Nov 2006 Google released the paper on BigTable.
NoSQL Systems Motivation. NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis.
NOSQL DATABASE Not Only SQL DATABASE
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT IT Monitoring WG Technology for Storage/Analysis 28 November 2011.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
NoSQL: Graph Databases. Databases Why NoSQL Databases?
Data and Information Systems Laboratory University of Illinois Urbana-Champaign Data Mining Meeting Mar, From SQL to NoSQL Xiao Yu Mar 2012.
NoSQL databases A brief introduction NoSQL databases1.
Introduction to NoSQL Databases Chyngyz Omurov Osman Tursun Ceng,Middle East Technical University.
Group members: Phạm Hoàng Long Nguyễn Huy Hùng Lê Minh Hiếu Phan Thị Thanh Thảo Nguyễn Đức Trí 1 BIG DATA & NoSQL Topic 1:
An Introduction to Big Data (With a strong focus on Apache) Nick Burch Senior Developer, Alfresco Software VP ConCom, ASF Member.
NoSQL: Graph Databases
Introduction to Mongo DB(NO SQL data Base)
COMP 5323 Web Database Technologies and Applications 2014
CS 405G: Introduction to Database Systems
Gergely Lukács Pázmány Péter Catholic University
Mail call Us: / / Hadoop Training Sathya technologies is one of the best Software Training Institute.
NoSQL: Graph Databases
and Big Data Storage Systems
Hadoop.
CSE 775 – Distributed Objects Bekir Turkkan & Habib Kaya
An Open Source Project Commonly Used for Processing Big Data Sets
CS122B: Projects in Databases and Web Applications Winter 2017
NoSQL Database and Application
Modern Databases NoSQL and NewSQL
NOSQL.
RDBMSs The NoSQL movement
NOSQL databases and Big Data Storage Systems
Hadoop and NoSQL at Thomson Reuters
Massively Parallel Cloud Data Storage Systems
1 Demand of your DB is changing Presented By: Ashwani Kumar
NOSQL and CAP Theorem.
NoSQL Databases An Overview
NoSQL Systems Motivation.
NoSQL Databases Antonino Virgillito.
NoSQL Not Only SQL University of Kurdistan Faculty of Engineering
Database Systems Summary and Overview
CSE 482 Lecture 5: NoSQL.
April 13th – Semi-structured data
Charles Tappert Seidenberg School of CSIS, Pace University
Introduction to NoSQL Database Systems
NoSQL & Document Stores
NoSQL databases An introduction and comparison between Mongodb and Mysql document store.
Presentation transcript:

NoSQL Systems Overview (as of November 2011)

NoSQL Systems: Overview Not every data management/analysis problem is best solved exclusively using a traditional DBMS “NoSQL” = “Not Only SQL”

NoSQL Systems: Overview Alternative to traditional relational DBMS Flexible schema Quicker/cheaper to set up Massive scalability Relaxed consistency  higher performance & availability No declarative query language  more programming Relaxed consistency  fewer guarantees

NoSQL Systems: Overview Several incarnations MapReduce framework Key-value stores Document stores Graph database systems

NoSQL Systems: Overview MapReduce Framework Originally from Google, open source Hadoop No data model, data stored in files User provides specific functions System provides data processing “glue”, fault-tolerance, scalability

NoSQL Systems: Overview Map and Reduce Functions Map: Divide problem into subproblems Reduce: Do work on subproblems, combine results

NoSQL Systems: Overview MapReduce Architecture

NoSQL Systems: Overview MapReduce Example: Web log analysis Each record: UserID, URL, timestamp, additional-info Task: Count number of accesses for each domain (inside URL)

NoSQL Systems: Overview MapReduce Example (modified #1) Each record: UserID, URL, timestamp, additional-info Task: Total “value” of accesses for each domain based on additional-info

NoSQL Systems: Overview MapReduce Example (modified #2) Each record: UserID, URL, timestamp, additional-info Separate records: UserID, name, age, gender, … Task: Total “value” of accesses for each domain based on user attributes

NoSQL Systems: Overview MapReduce Framework No data model, data stored in files User provides specific functions System provides data processing “glue”, fault-tolerance, scalability

NoSQL Systems: Overview MapReduce Framework Schemas and declarative queries are missed Hive – schemas, SQL-like query language Pig – more imperative but with relational operators Both compile to “workflow” of Hadoop (MapReduce) jobs Dryad allows user to specify workflow Also DryadLINQ language

NoSQL Systems: Overview Key-Value Stores Extremely simple interface Data model: (key, value) pairs Operations: Insert(key,value), Fetch(key), Update(key), Delete(key) Implementation: efficiency, scalability, fault-tolerance Records distributed to nodes based on key Replication Single-record transactions, “eventual consistency”

NoSQL Systems: Overview Key-Value Stores Extremely simple interface Data model: (key, value) pairs Operations: Insert(key,value), Fetch(key), Update(key), Delete(key) Some allow (non-uniform) columns within value Some allow Fetch on range of keys Example systems Google BigTable, Amazon Dynamo, Cassandra, Voldemort, HBase, …

NoSQL Systems: Overview Document Stores Like Key-Value Stores except value is document Data model: (key, document) pairs Document: JSON, XML, other semistructured formats Basic operations: Insert(key,document), Fetch(key), Update(key), Delete(key) Also Fetch based on document contents Example systems CouchDB, MongoDB, SimpleDB, …

NoSQL Systems: Overview Graph Database Systems Data model: nodes and edges Nodes may have properties (including ID) Edges may have labels or roles

NoSQL Systems: Overview Graph Database Systems Interfaces and query languages vary Single-step versus “path expressions” versus full recursion Example systems Neo4j, FlockDB, Pregel, … RDF “triple stores” can map to graph databases

NoSQL Systems: Overview “NoSQL” = “Not Only SQL” Not every data management/analysis problem is best solved exclusively using a traditional DBMS Current incarnations MapReduce framework Key-value stores Document stores Graph database systems

NoSQL Systems Overview (as of November 2011)