A Study in NoSQL & Distributed Database Systems
John Hawkins

Topics to Cover
- What is NoSQL (and why use it)
- Types of NoSQL
- OrientDB
- Distributed Databases

NoSQL Movement: What is it all about?
- NoSQL is a term for a movement in database design away from traditional relational database models.
- With the emergence of big data and cloud computing, traditional databases and schema-driven data design can be too constraining.

Reasons for NoSQL Databases
- Schema-less data storage
- Quick data storage and traversal
- Easier to program
- Better performance
- Easily distributed

Three Popular NoSQL Designs
- Key / Value Store
- Document Database
- Graph Database

Key / Value Store
- Key / value store databases allow a value to be associated with, and looked up by, a key.
- A key maps to a single value, but that value can itself be a collection (such as a list) holding several items.
- Data can be stored in the native data type of a particular programming language.
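
As a rough illustration of the lookup model described on this slide, here is a minimal in-memory sketch. The `KeyValueStore` class and the sample keys are hypothetical and do not correspond to any particular product; a real store would add persistence and networking.

```python
class KeyValueStore:
    """Toy in-memory key/value store (hypothetical, for illustration only)."""

    def __init__(self):
        self._data = {}  # maps each key to one value

    def put(self, key, value):
        # Writing to an existing key replaces the previous value.
        self._data[key] = value

    def get(self, key, default=None):
        # Lookup is by key only; the store does not query inside values.
        return self._data.get(key, default)


store = KeyValueStore()
# The value can be any native type, including a dict or a list.
store.put("user:42", {"name": "Ada", "langs": ["Python", "Java"]})
store.put("visits", 3)
store.put("visits", 4)  # replaces the earlier value

print(store.get("user:42")["name"])
print(store.get("visits"))
print(store.get("missing"))
```

Storing a list or dict under one key is how several items end up associated with a single key in this sketch.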

Document Database
- Document databases store information in documents such as JSON or XML.
- The document format implies the relationship between data points in the document.
- Most documents create hierarchies of data inside themselves.
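
The hierarchy inside a document can be shown with a small JSON example. This is only a sketch using Python's standard `json` module; the document contents (an order with a customer and line items) are made-up sample data, not taken from the presentation.

```python
import json

# A hypothetical JSON document: nesting expresses how the data points relate.
raw = """
{
  "order_id": 1001,
  "customer": {"name": "Ada", "city": "London"},
  "items": [
    {"sku": "A-1", "qty": 2},
    {"sku": "B-7", "qty": 1}
  ]
}
"""

doc = json.loads(raw)

# Relationships are read by walking the hierarchy, not by joining tables.
customer_city = doc["customer"]["city"]
total_qty = sum(item["qty"] for item in doc["items"])

print(customer_city)
print(total_qty)
```

Because the customer and the items live inside the order document, no separate schema or join is needed to relate them.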

Graph Database
- Graph databases store all of their information in nodes (vertices) and edges.
- Graph traversal is how you “query” the database.
- Relationship information about nodes is stored in the edges.
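
To make "traversal as query" concrete, here is a minimal sketch of a graph held in plain Python structures. The node names, edge labels, and helper functions (`neighbors`, `reachable`) are all hypothetical; real graph databases provide their own traversal languages.

```python
from collections import deque

# Nodes hold data; edges hold the relationship as (source, label, target).
nodes = {
    "alice": {"kind": "person"},
    "bob": {"kind": "person"},
    "carol": {"kind": "person"},
    "orientdb": {"kind": "database"},
}
edges = [
    ("alice", "knows", "bob"),
    ("bob", "knows", "carol"),
    ("carol", "uses", "orientdb"),
]


def neighbors(node, label=None):
    """Targets of outgoing edges from node, optionally filtered by label."""
    return [dst for src, lbl, dst in edges
            if src == node and (label is None or lbl == label)]


def reachable(start):
    """Breadth-first traversal along outgoing edges, in visit order."""
    seen = [start]
    queue = deque([start])
    while queue:
        current = queue.popleft()
        for nxt in neighbors(current):
            if nxt not in seen:
                seen.append(nxt)
                queue.append(nxt)
    return seen


print(neighbors("alice", "knows"))
print(reachable("alice"))
```

The "query" here is the walk itself: starting at one node and following edges answers questions such as "what can alice reach?".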

OrientDB
- Combines graph database and document database designs.
- Uses JSON documents to store information in the nodes and edges of the graph.
- Provides an HTTP REST API to access / edit the database.

OrientDB
- Runs on the Java Virtual Machine, which allows it to run on almost any modern machine.
- Has client APIs for C / C++, Ruby, PHP, and Java.
- Because it is accessed over HTTP, it can be easily distributed across multiple machines.

Distributed Databases
- Often, as databases grow larger, it is necessary to expand the hardware powering them.
- Distributed databases take advantage of cheaper hardware by having multiple computers work together rather than building one large machine.

Replication
- Replication copies the entire database across all nodes in the distributed system.
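
A minimal sketch of full replication, assuming each node can be modelled as a Python dict and every write is applied to every node synchronously. The `ReplicatedStore` class is hypothetical; real systems also have to handle ordering, conflicts, and network failures.

```python
class ReplicatedStore:
    """Toy full replication: every node holds a complete copy (illustrative)."""

    def __init__(self, replica_count):
        self.replicas = [{} for _ in range(replica_count)]

    def write(self, key, value):
        # Each write is sent to every replica, which costs network traffic
        # and memory on every node.
        for replica in self.replicas:
            replica[key] = value

    def read(self, key, replica_index=0):
        # Any replica can serve a read because each one has all the data.
        return self.replicas[replica_index][key]


rstore = ReplicatedStore(3)
rstore.write("a", 1)
rstore.write("b", 2)

# Two keys stored on three nodes means six stored entries in total.
copies_before_failure = sum(len(r) for r in rstore.replicas)

# Simulate losing node 0; the data is still readable from node 1.
rstore.replicas[0].clear()
survivor_value = rstore.read("a", replica_index=1)

print(copies_before_failure)
print(survivor_value)
```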

Sharding
- Sharding divides the data inside the database and partitions pieces of it to different nodes.
- Databases can be sharded horizontally (by rows) or vertically (by columns).
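
Horizontal sharding can be sketched as routing each row to one node based on a hash of its key. This is one common approach and not necessarily what any given database does; the `ShardedStore` class and the `user:N` keys are hypothetical. `zlib.crc32` is used because, unlike Python's built-in `hash()` for strings, it gives the same result on every run.

```python
import zlib


class ShardedStore:
    """Toy horizontal sharding: each row lives on exactly one node."""

    def __init__(self, shard_count):
        self.shards = [{} for _ in range(shard_count)]

    def _shard_for(self, key):
        # A stable hash of the key picks the shard that owns the row.
        return zlib.crc32(key.encode("utf-8")) % len(self.shards)

    def write(self, key, row):
        self.shards[self._shard_for(key)][key] = row

    def read(self, key):
        # Only the owning shard is consulted.
        return self.shards[self._shard_for(key)].get(key)


sstore = ShardedStore(3)
for i in range(9):
    sstore.write(f"user:{i}", {"id": i})

shard_sizes = [len(shard) for shard in sstore.shards]
print(shard_sizes, sum(shard_sizes))
print(sstore.read("user:4"))
```

Vertical sharding would instead split each row's columns across nodes; it is not shown here.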

Pros / Cons of Each
Sharding
- Pros: Fast data writing / reading. Low memory overhead.
- Cons: Potential data loss.
Replication
- Pros: Fast data reading. High data reliability.
- Cons: High network overhead. High memory overhead.
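
The memory and data-loss trade-offs above can be put into rough numbers. The sketch below assumes an idealized cluster: data is spread evenly, replication means a full copy on every node, and sharding keeps no extra copies. The function names and the 120 GB / 4 node figures are made up for illustration.

```python
def storage_per_node(total_gb, node_count, strategy):
    """Data each node must hold under the idealized assumptions above."""
    if strategy == "replication":
        return total_gb  # every node stores the whole database
    if strategy == "sharding":
        return total_gb / node_count  # each node stores only its slice
    raise ValueError(f"unknown strategy: {strategy}")


def data_lost_if_one_node_fails(total_gb, node_count, strategy):
    """Data that becomes unavailable when a single node is lost."""
    if strategy == "replication":
        # Other nodes still hold full copies, unless there was only one node.
        return 0.0 if node_count > 1 else float(total_gb)
    if strategy == "sharding":
        return total_gb / node_count  # the failed node's slice is gone
    raise ValueError(f"unknown strategy: {strategy}")


for strategy in ("replication", "sharding"):
    print(strategy,
          storage_per_node(120, 4, strategy),
          data_lost_if_one_node_fails(120, 4, strategy))
```

In practice many systems combine both techniques, replicating each shard to limit data loss, which this sketch does not model.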

NoSQL Distributed Databases
- Many NoSQL database systems natively support distributed database designs.
- This is part of what makes NoSQL databases so appealing.

In Summary
- NoSQL is a movement away from relational databases.
- NoSQL databases allow programmers to easily traverse and manipulate data.
- Databases like OrientDB are readily available and free to use.
- Distributed databases take full advantage of a cluster of less expensive hardware.

Any Questions?
