Summary of “ Oracle does about-face on NoSQL ” Jaikumar Vijayan, ComputerWorld, Oct 4th, 2011 Presented by: James Klassen.

Slides:



Advertisements
Similar presentations
DataGarage: Warehousing Massive Performance Data on Commodity Servers
Advertisements

Dan Bassett, Jonathan Canfield December 13, 2011.
Big Data Training Course for IT Professionals Name of course : Big Data Developer Course Duration : 3 days full time including practical sessions Dates.
HadoopDB Inneke Ponet.  Introduction  Technologies for data analysis  HadoopDB  Desired properties  Layers of HadoopDB  HadoopDB Components.
Real-Time Big Data Use Cases John Leach CTO, Splice Machine.
C-Store: Data Management in the Cloud Jianlin Feng School of Software SUN YAT-SEN UNIVERSITY Jun 5, 2009.
HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads Azza Abouzeid1, Kamil BajdaPawlikowski1, Daniel Abadi1, Avi.
HadoopDB An Architectural Hybrid of Map Reduce and DBMS Technologies for Analytical Workloads Presented By: Wen Zhang and Shawn Holbrook.
Observation Pattern Theory Hypothesis What will happen? How can we make it happen? Predictive Analytics Prescriptive Analytics What happened? Why.
 Need for a new processing platform (BigData)  Origin of Hadoop  What is Hadoop & what it is not ?  Hadoop architecture  Hadoop components (Common/HDFS/MapReduce)
Running Hadoop-as-a-Service in the Cloud
Kapil Bakshi Distinguished Architect
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
CMU SCS Carnegie Mellon Univ. Dept. of Computer Science /615 - DB Applications C. Faloutsos – A. Pavlo How to Scale a Database System.
PARALLEL DBMS VS MAP REDUCE “MapReduce and parallel DBMSs: friends or foes?” Stonebraker, Daniel Abadi, David J Dewitt et al.
David Gibbs and Govardhan Tanniru Georgia State University Department of Computer Science P.O. Box 3965 Atlanta, GA
Better Performance for Big Data Shuya Zhang; Shyam Sundar Somasundaram [10/03/13] 1 [1] Bhasker Allene, Marco Righini, “Better Performance for Big Data”
SYSTEMS SUPPORT FOR GRAPHICAL LEARNING Ken Birman 1 CS6410 Fall /18/2014.
Tyson Condie.
MapReduce April 2012 Extract from various presentations: Sudarshan, Chungnam, Teradata Aster, …
H ADOOP DB: A N A RCHITECTURAL H YBRID OF M AP R EDUCE AND DBMS T ECHNOLOGIES FOR A NALYTICAL W ORKLOADS By: Muhammad Mudassar MS-IT-8 1.
Cloud Databases Matt Gregg Bob Guidinger. Cloud 101 What do we mean by Cloud Databases? Why do we have them? o Alternative to IT infrastructure investment.
:: Conférence :: NoSQL / Scalabilite Etat de l’art Samuel BERTHE10 Mars 2014Epitech Nantes.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
Oracle Challenges Parallelism Limitations Parallelism is the ability for a single query to be run across multiple processors or servers. Large queries.
HadoopDB project An Architetural hybrid of MapReduce and DBMS Technologies for Analytical Workloads Anssi Salohalla.
Hadoop Basics -Venkat Cherukupalli. What is Hadoop? Open Source Distributed processing Large data sets across clusters Commodity, shared-nothing servers.
W HAT IS H ADOOP ? Hadoop is an open-source software framework for storing and processing big data in a distributed fashion on large clusters of commodity.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
HadoopDB Presenters: Serva rashidyan Somaie shahrokhi Aida parbale Spring 2012 azad university of sanandaj 1.
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1.
When bet365 met Riak and discovered a true, “always on” database.
Oracle’s Big Plans For Big Data Analysis By Doug Henschen, InformationWeek, Oct 4 th, 2011 Presented by Group 7: Sam Tucker and Ayoung Noh.
Spatial Tajo Supporting Spatial Queries on Apache Tajo Slideshare Shorten URL : goo.gl/j0VLXpgoo.gl/j0VLXp.
THE ART OF PARALLEL HETEROGENEOUS DATA TRANSFORMATION Deema Alswaimil, Eric DeVeau 06/18/2012.
Introduction to Hbase. Agenda  What is Hbase  About RDBMS  Overview of Hbase  Why Hbase instead of RDBMS  Architecture of Hbase  Hbase interface.
Clustrix Parallelized Clustered Database Presented by Ashutosh Dhiman and David Gloe Source: Mellor, Chris. “Size doesn't matter to database thrusting.
Review of technologies for developing geospatial applications with a focus on open source (FOSS4G) and their implementation of cloud computing application.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
A Comparison of Approaches to Large-Scale Data Analysis Andrew Pavlo, Erik Paulson, Alexander Rasin, Daniel J. Abadi, David J. Dewitt, Samuel Madden, Michael.
History & Motivations –RDBMS History & Motivations (cont’d) … … Concurrent Access Handling Failures Shared Data User.
CPS 216: Advanced Database Systems Shivnath Babu.
BACS 287 Big Data & NoSQL 2016 by Jones & Bartlett Learning LLC.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
Microsoft Azure and DataStax: Start Anywhere and Scale to Any Size in the Cloud, On- Premises, or Both with a Leading Distributed Database MICROSOFT AZURE.
Cloud Distributed Computing Environment Hadoop. Hadoop is an open-source software system that provides a distributed computing environment on cloud (data.
1 HBASE – THE SCALABLE DATA STORE An Introduction to HBase XLDB Europe Workshop 2013: CERN, Geneva James Kinley EMEA Solutions Architect, Cloudera.
Efficient Data Management Tools for the Heterogeneous Big Data Warehouse Autors: Aleksandr Alekseev (Programmer), Victoria Osipova (Associate professor),
BIG DATA/ Hadoop Interview Questions.
Abstract MarkLogic Database – Only Enterprise NoSQL DB Aashi Rastogi, Sanket V. Patel Department of Computer Science University of Bridgeport, Bridgeport,
Microsoft Ignite /28/2017 6:07 PM
1 Cloud-Native Data Warehousing Bob Muglia. 2 Scenarios with affinity for cloud Gartner 2016 Predictions: By 2018, six billion connected things will be.
1 Gaurav Kohli Xebia Breaking with DBMS and Dating with Relational Hbase.
Big Data Enterprise Patterns
Hadoop Aakash Kag What Why How 1.
Introduction to Distributed Platforms
CS122B: Projects in Databases and Web Applications Winter 2017
Operational & Analytical Database
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
David Ostrovsky | Couchbase
Tools for Processing Big Data Jinan Al Aridhee and Christian Bach
Ch 4. The Evolution of Analytic Scalability
Overview of big data tools
Big Data Young Lee BUS 550.
Zoie Barrett and Brian Lam
Charles Tappert Seidenberg School of CSIS, Pace University
Dark Data Are we at risk?.
NoSQL & Document Stores
Presentation transcript:

Summary of “ Oracle does about-face on NoSQL ” Jaikumar Vijayan, ComputerWorld, Oct 4th, 2011 Presented by: James Klassen

Summary May 2011 Oracle Released paper “Debunking NoSQL Hype”. (No longer available) [1] Oct 2011 Oracle Released Big Data Appliance based on open source stack of Apache Hadoop (MapReduce) and R (statistical engine). [1] “Oracle's entry, in a sense validates the NoSQL space” [1]

Why Not NoSQL? But, MapReduce doesn't allow some SQL features that are hard to run reliably in parallel. [3] Joins, Full ACID compliance & Transactions (across tables) [3] “There's no doubt that traditional database management products will continue to be around for many years” [1]

Why NoSQL? In this case NoSQL means MapReduce architecture. MapReduce enables massive parallel queries. [2] Supports shared-nothing commodity clusters [3] Goals: Performance, Fault Tolerance, Heterogeneous Environment

Economic Impact New companies forming to support NoSQL (10Gen, DataStax, Couchbase, …) [1] 10Gen raised $50 million, customers “Viacom, Disney, SAP, FourSquare, Shutterfly” [1]

Relationship With Course Relevant to chapters of Big Data, MapReduce, Cloud

References [1] Jaikumar Vijayan, ComputerWorld, Oct 2011, _NoSQL? [2] Google, 2011, [3] Azza Abouzeid et.al., HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads,