BigData - NoSQL Hadoop - Couchbase

Slides:



Advertisements
Similar presentations
MLAN Maguire Local Area Network Version 2.0, May 1998.
Advertisements

The Lucernex Cloud: A software-as-a-service solution delivered via the Cloud What is the Cloud? Cloud Computing is the future of all software applications,
Setting Big Data Capabilities Free How to Make Business on Big Data? Stig Torngaard, Partner Platon.
Technical Evangelist Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase.
BarcelonaJS / April 4th, 2013 Couchbase & Javascript MapReduce, Node.js, Angular Tugdual “Tug” Grall Technical Evangelist BarcelonaJS / April 4th, 2013.
A Fast Growing Market. Interesting New Players Lyzasoft.
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
Business Intelligence Technology and Career Options Paul Boal Director - Data Management Mercy ( April 7, 2014.
Lecture-8/ T. Nouf Almujally
Apache Spark and the future of big data applications Eric Baldeschwieler.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
Introduction à Couchbase Server 2.0 Tugdual Grall
Database Systems – CRM DEFINITIONS CRM - Customer Relationship Management CRM usually refers to a strategic solution that helps businesses identify the.
Contents HADOOP INTRODUCTION AND CONCEPTUAL OVERVIEW TERMINOLOGY QUICK TOUR OF CLOUDERA MANAGER.
Distributed Systems Fall 2014 Zubair Amjad. Outline Motivation What is Sqoop? How Sqoop works? Sqoop Architecture Import Export Sqoop Connectors Sqoop.
Hadoop IT Services Hadoop Users Forum CERN October 7 th,2015 CERN IT-D*
03 | Express and Databases
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
Harnessing Big Data with Hadoop Dipti Sangani; Madhu Reddy DBI210.
An Introduction To Big Data For The SQL Server DBA.
CPSC8985 FA 2015 Team C3 DATA MIGRATION FROM RDBMS TO HADOOP By Naga Sruthi Tiyyagura Monika RallabandiRadhakrishna Nalluri.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Our experience with NoSQL and MapReduce technologies Fabio Souto.
Microsoft Ignite /28/2017 6:07 PM
1 Cloud-Native Data Warehousing Bob Muglia. 2 Scenarios with affinity for cloud Gartner 2016 Predictions: By 2018, six billion connected things will be.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Hadoop Introduction. Audience Introduction of students – Name – Years of experience – Background – Do you know Java? – Do you know linux? – Any exposure.
From RDBMS to Hadoop A case study Mihaly Berekmeri School of Computer Science University of Manchester Data Science Club, 14th July 2016 Hayden Clark,
SocialBoards Self-Service, Multichannel Support Ticket Notifications in Microsoft Office 365 Groups Help Customer Care Teams to Provide Better Care OFFICE.
A presentation on ElasticSearch
OMOP CDM on Hadoop Reference Architecture
Pipe Engineering.
Business Intelligence in the age of analytics
Microsoft Dynamics 365 for Sales Guidance for selling to SMB customers
Data Platform and Analytics Foundational Training
Business Discovery, Monitoring & Reporting Data Flow iCLM UI Operator Systems OCS IN CDR PCC CRM Marketing Operations CSR Monitoring Marketing Integration.
Data Platform Modernization
From DBA to DPA – Becoming a Data Platform Administrator
Big Data A Quick Review on Analytical Tools
TDWI EXECUTIVE SUMMIT From Traditional to Modern: How Rakuten Marketing Realized the Promise of a New Generation of BI September 21, 2015 Donald Krapohl.
CS122B: Projects in Databases and Web Applications Winter 2017
BIG DATA IN ENGINEERING APPLICATIONS
IBM Tivoli Web Site Analyzer Training Document
SmartHOTEL Planner Add-In for Outlook: Office 365 Integration Enhances Room Planning, Booking, and Guest Management for Small Hotels and B&Bs OFFICE 365.
of our Partners and Customers
Sqoop Mr. Sriram
Hadoopla: Microsoft and the Hadoop Ecosystem
Microsoft Azure Enables Banner Creator Tool to Change the Online Advertisement Industry “Running the BannerFlow platform on Azure enables us to develop.
Operational & Analytical Database
Dremel.
Booklet365 Office 365 Outlook Add-In Makes Easy Work of Managing Schedules for Fitness Gyms, Sports Associations, Trainers, and Their Customers Partner.
Enabling Scalable and HA Ingestion and Real-Time Big Data Insights for the Enterprise OCJUG, 2014.
The Contemporary Firm 550 By: Beatriz Guzman
Project Project mid-term report due on 25th October at midnight Format
© 2016 Global Market Insights, Inc. USA. All Rights Reserved Fuel Cell Market size worth $25.5bn by 2024Low Power Wide Area Network.
Hadoop Market
SocialBoards Self-Service, Multichannel Support Ticket Notifications in Microsoft Office 365 Groups Help Customer Care Teams to Provide Better Care OFFICE.
+Vonus: An Intuitive, Cloud-Based Point-of-Sale Solution That’s Powered by Microsoft Office 365 with Tools to Increase Sales Using Social Media OFFICE.
Storage Systems for Managing Voluminous Data
Big Data - in Performance Engineering
Data Platform Modernization
File Manager for Microsoft Office 365, SharePoint, and OneDrive: Extensible Via Custom Connectors in Enterprise Deployments, Ideal for End Users OFFICE.
It’s Always a Hard Choice
Azure's Performance, Scalability, SQL Servers Automate Real Time Data Transfer at Low Cost MINI-CASE STUDY “Azure offers high performance, scalable, and.
Intro to NoSQL Databases
Security Information and Event Management (SIEM) Solution Runs on Microsoft Azure Power “We are so happy to be using Microsoft Azure to make our security.
Replace with Application Image
Big DATA.
Business Intelligence
Presentation transcript:

BigData - NoSQL Hadoop - Couchbase Tugdual “Tug” Grall Technical Evangelist email: tug@couchbase.com twitter: @tgrall

About me Web Tugdual “Tug” Grall @tgrall Couchbase http://blog.grallandco.com tgrall NantesJUG co-founder Pet Project : http://www.resultri.com Tugdual “Tug” Grall Couchbase Technical Evangelist eXo CTO Oracle Developer/Product Manager Mainly Java/SOA Developer in consulting firms

$30B Database Market Being Disrupted 95% <50%? Relational Technology Relational Technology Other Relational Technology Relational Technology NoSQL Technology The database industry is about $30B today and is dominated by companies like Oracle, IBM, and Microsoft Relational technology has dominated the industry for the last 40 years and is the technology underpinning for 95% of the industry today. We believe the database industry is being disrupted. In 10 – 15 years we believe relational technology will make up a much smaller percentage of the industry. It’s too early to tell whether it will be 50%, 40%, or 30% percent but it seems clear to me it will be much small than 95% We believe most of the future operational database growth will be NoSQL 2012 2027 All new database growth will be NoSQL

Operational vs. Analytic Databases Get insights from data Real-time, Interactive Databases Fast access to data NoSQL There are two types of databases. Each is focused on a very different problem. Analytic databases were referred to in the past as OLAP databases. They are focused on looking through every record in a huge database to answer a question or gain an insight about the data contained in it. These analyses are batch processes that access every piece of data in the database, are very “read” heavy, and produce results in seconds, minutes, or sometimes days. For analytic databases, “real time” means an analysis takes a few seconds to run. Real-time interactive databases are often referred to as operational databases. They store a lot of data but usually much less than an analytic database. They must provide access to individual records in a database in milliseconds so that users of an application get good response time. Since the requirements of each database is very different, the architectures and capabilities of each are very different as well. When I refer to NoSQL in my presentation, I am referring to real-time, interactive databases. This is the type of NoSQL database Couchbase provides. Couchbase MongoDB Cassandra Hbase Cloudera Hortonworks Mapr

What Is Biggest Data Management Problem Driving Use of NoSQL in Coming Year? 49% 35% 29% 16% 12% 11% Lack of flexibility/ rigid schemas Inability to scale out data Performance challenges Cost All of these Other Source: Couchbase Survey, December 2011, n = 1351.

Hadoop & NoSQL

What is Sqoop? Sqoop is a tool designed to transfer data between Hadoop and relational databases. You can use Sqoop to import data from a relational database management system (RDBMS) such as MySQL or Oracle into the Hadoop Distributed File System (HDFS), transform the data in Hadoop MapReduce, and then export the data back into an RDBMS. sqoop.apache.org

What is Sqoop? Traditional ETL T Data Application Data

What is Sqoop? A different paradigm Application Data Data

What is Sqoop? A very scalable different paradigm Data Application

What is Sqoop? Where did the Transform go? Application Data T T T T T

Sqoop Details Sqoop Default connection is via JDBCLots of custom connectorsCouchbase, VoltDB, VerticaTeradata, NetezzaOracle, MySQL, Postgres

Ad and offer targeting 40 milliseconds to respond with the decision. profiles, real time campaign statistics 3 2 1 profiles, campaigns events

Moving Parts

Content and Recommendation Targeting

Content Driven Site: Moving Parts

Couchbase

Couchbase Server Core Principles Easy Scalability Consistent High Performance Grow cluster without application changes, without downtime with a single click Consistent sub-millisecond read and write response times with consistent high throughput Always On 24x365 Flexible Data Model No downtime for software upgrades, hardware maintenance, etc. JSON document model with no fixed schema.

Couchbase Handles Real World Scale

Q&A