First Indico Workshop Database Technology Pedro Ferreira 27-29 May 2013 CERN.

Slides:



Advertisements
Similar presentations
Introduction to MongoDB
Advertisements

2 Proprietary & Confidential What is Sharding Benefits of Sharding Alternatives of Sharding When to start Sharding Agenda.
Mastering Galera Data Masters. Special Thanks To… 1010 NE 2 nd Ave Miami, FL
Relational Database Alternatives NoSQL. Choosing A Data Model Relational database underpin legacy applications and meet business needs However, companies.
NoSQL Databases: MongoDB vs Cassandra
Benchmarking Cloud Serving Systems with YCSB Brian F. Cooper, Adam Silberstein, Erwin Tam, Raghu Ramakrishnan, Russell Sears Yahoo! Research Presenter.
Introduction to Backend James Kahng. Install Node.js.
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
CMU SCS Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications C. Faloutsos – A. Pavlo How to Scale a Database System.
Neo4j Adam Foust.
What is MongoDB? Developed by 10gen It is a NoSQL database A document-oriented database It uses BSON format.
Google AppEngine. Google App Engine enables you to build and host web apps on the same systems that power Google applications. App Engine offers fast.
How WebMD Maintains Operational Flexibility with NoSQL Rajeev Borborah, Sr. Director, Engineering Matt Wilson – Director, Production Engineering – Consumer.
AN INTRODUCTION TO NOSQL DATABASES Karol Rástočný, Eduard Kuric.
NoSQL for the SQL Server Pro
ZhangGang, Fabio, Deng Ziyan /31 NoSQL Introduction to Cassandra Data Model Design Implementation.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 10 Database Performance Tuning and Query Optimization.
:: Conférence :: NoSQL / Scalabilite Etat de l’art Samuel BERTHE10 Mars 2014Epitech Nantes.
Goodbye rows and tables, hello documents and collections.
Modern Databases NoSQL and NewSQL Willem Visser RW334.
PostgreSQL and relational databases As well as assignment 4…
Moohanad Hassan Maedeh Pishvaei. Introduction Open Source Apache foundation project Relational DB: SQL Server CouchDB : JSON document-oriented DB (NoSQL)
NoSQL Not Only SQL Edel Sherratt. What is NoSQL? Not Only SQL Large volumes of data No schema Partition tolerance – scale by adding more commodity servers.
Changwon Nati Univ. ISIE 2001 CSCI5708 NoSQL looks to become the database of the Internet By Lawrence Latif Wed Dec Nhu Nguyen and Phai Hoang CSCI.
NoSQL Databases Oracle - Berkeley DB Rasanjalee DM Smriti J CSC 8711 Instructor: Dr. Raj Sunderraman.
Cloud Computing Clase 8 - NoSQL Miguel Johnny Matias
NoSQL Databases Oracle - Berkeley DB. Content A brief intro to NoSQL About Berkeley Db About our application.
Methodological Foundations of Biomedical Informatics (BMSC-GA 4449) Himanshu Grover.
Massively Distributed Database Systems - Distributed DBS Spring 2014 Ki-Joune Li Pusan National University.
2005 JACoW Team Meeting Thomas Baron/Jose Benito Gonzalez – CERN – IT Managing Events with Indico.
Discussion MySQL&Cassandra ZhangGang 2012/11/22. Optimize MySQL.
Iran Hutchinson.  I work for InterSystems who drives the new NoSQL project. 
MS Access: Database Concepts Instructor: Vicki Weidler Assistant: Joaquin Obieta.
MongoDB Jer-Shuan Lin.
NOSQL Implementation and examples Maciej Matuszewski.
MongoDB First Light. Mongo DB Basics Mongo is a document based NoSQL. –A document is just a JSON object. –A collection is just a (large) set of documents.
NoSQL Or Peles. What is NoSQL A collection of various technologies meant to work around RDBMS limitations (mostly performance) Not much of a definition...
NoSQL Systems Motivation. NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis.
NOSQL DATABASE Not Only SQL DATABASE
NoSQL: Graph Databases. Databases Why NoSQL Databases?
Data and Information Systems Laboratory University of Illinois Urbana-Champaign Data Mining Meeting Mar, From SQL to NoSQL Xiao Yu Mar 2012.
NoSQL databases A brief introduction NoSQL databases1.
Big Data Yuan Xue CS 292 Special topics on.
CMPE 226 Database Systems May 3 Class Meeting Department of Computer Engineering San Jose State University Spring 2016 Instructor: Ron Mak
Context Aware RBAC Model For Wearable Devices And NoSQL Databases Amit Bansal Siddharth Pathak Vijendra Rana Vishal Shah Guided By: Dr. Csilla Farkas Associate.
Data Tier Options NWEN304 Advanced Network Applications.
Group members: Phạm Hoàng Long Nguyễn Huy Hùng Lê Minh Hiếu Phan Thị Thanh Thảo Nguyễn Đức Trí 1 BIG DATA & NoSQL Topic 1:
Software Engineering for Business Information Systems (sebis) Department of Informatics Technische Universität München, Germany wwwmatthes.in.tum.de Factors.
1 Analysis on the performance of graph query languages: Comparative study of Cypher, Gremlin and native access in Neo4j Athiq Ahamed, ITIS, TU-Braunschweig.
Big Data Yuan Xue CS 292 Special topics on.
Dive into NoSQL with Azure Niels Naglé Hylke Peek.
NoSql An alternative option in the DevEvenings ORM Smackdown Tarn Barford
Solr Power FTW Alex #solrnosql. What Will I Cover? Who I am What Bazaarvoice does SOLR and NoSQL Can SOLR handle 20K queries per second?
1 Gaurav Kohli Xebia Breaking with DBMS and Dating with Relational Hbase.
NoSQL: Graph Databases
2 Phase Commit Protocol In transaction processing, databases, and computer networking, the two-phase commit protocol (2PC) is a type of atomic commitment.
NoSQL: Graph Databases
and Big Data Storage Systems
Cloud Computing and Architecuture
Bridging SQL and NoSQL Rupal Dhillon U
Modern Databases NoSQL and NewSQL
NOSQL databases and Big Data Storage Systems
Massively Parallel Cloud Data Storage Systems
1 Demand of your DB is changing Presented By: Ashwani Kumar
11/18/2018 2:14 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
RealStudio and Cloud Databases
Database Futures Workshop cern. ch/conferenceDisplay
NoSQL databases An introduction and comparison between Mongodb and Mysql document store.
The Database World of Azure
Presentation transcript:

First Indico Workshop Database Technology Pedro Ferreira May 2013 CERN

The Future Let’s start from the end A more «social» Indico User-oriented approach Personal home page, user-tailored information More data in less time Need for an adequate DB

Avoiding disaster

ZODB Why not? It’s object-oriented It’s been around for a long time We know how to use it We are already using it for some of these things

ZODB An object-oriented database IndexTransactions O1 O2 O1O4 O2 O3 O4 O3 … O1 time

ZODB The good parts It’s simple No need for ORM or mapping layers Tightly integrated with Python It is ACID – things work as expected

ZODB The bad parts No server-side queries No built-in indexing – application level Data recovery is slow (latency, unpickling, setting object state) No way to fetch more than an object at once (pre-load)

ZODB Yes, there’s more… Has to be packed regularly No caching on the server side (besides OS cache) FileStorage Replication is not Open Source - money RelStorage could work, but requires migration It’s a niche product

Don’t Panic! We’re working on it!

NG DB Project The Next Generation DB for Indico Aims to find DB infrastructure that can support growth 6 month initial phase (tech preview/boilerplate – end 2013) Tech Survey, Evaluation, Prototyping…

NG DB Project The criteria Availability (OSS) Scalability/Replication Ease of use/development Transactions/Consistency Community/Momentum Costs

The Contestants The NoSQL crowd Key-value stores Riak, Redis, Voldemort Document-oriented MongoDB, CouchDB Column-oriented Cassandra, Hbase Neo4J

Key-Value Just a mapping structure user1Pedro Ferreira user2Alberto Resco user3{name: {first:“Jose Benito”, last: “Gonzalez”}

Column-oriented

Document-oriented Closer to the OO philosophy user1Pedro Ferreira user2 name Genevacity Alberto Resconame Runninghobbies Cycling

The Contestants The usual (relational) suspects MySQL MariaDB Drizzle Percona PostgreSQL

Relational vs. NoSQL A very coarse comparison RelationalNoSQL ACIDYesNot usually PhilosophyGeneral- purpose Table-oriented Problem-specific Normally closer to OO MaturityDecadesPretty recent ConsistencyNormally strong Normally eventual

Relational vs. NoSQL The problems RelationalNoSQL Requires ORM Different philosophy Lack of transactions Eventual consistency Too simplistic

Typical query «All events in a user’s favourite categories» Relational Simple join between two tables MongoDB Either replicate data or use DB refs (slow!)

Hybrid Approach The best of two worlds? ZODB - excellent storage for business objects SELECT style queries… ZODB as primary storage? Already kind of doing that (Redis) Need for transition period

Hybrid Approach No such thing as a free lunch… Keeping data consistent Multiple DB calls per request Yet another thing to install

Conclusion Research project Still a lot of ground to cover Hard to evaluate the hype Hybrid could be a good option

Pedro Ferreira