Technical Evangelist Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase.

Slides:



Advertisements
Similar presentations
Inner Architecture of a Social Networking System Petr Kunc, Jaroslav Škrabálek, Tomáš Pitner.
Advertisements

Big Data Training Course for IT Professionals Name of course : Big Data Developer Course Duration : 3 days full time including practical sessions Dates.
Real-Time Big Data Use Cases John Leach CTO, Splice Machine.
INTEGRATING BIG DATA TECHNOLOGY INTO LEGACY SYSTEMS Robert Cooley, Ph.D.CodeFreeze 1/16/2014.
BigData Tools Seyyed mohammad Razavi. Outline  Introduction  Hbase  Cassandra  Spark  Acumulo  Blur  MongoDB  Hive  Giraph  Pig.
BarcelonaJS / April 4th, 2013 Couchbase & Javascript MapReduce, Node.js, Angular Tugdual “Tug” Grall Technical Evangelist BarcelonaJS / April 4th, 2013.
A Fast Growing Market. Interesting New Players Lyzasoft.
FAST FORWARD WITH MICROSOFT BIG DATA Vinoo Srinivas M Solutions Specialist Windows Azure (Hadoop, HPC, Media)
One Billion Rows Per Second: Analytics for the Digital Media Markets XLDB October 19, 2011 MICHAEL DRISCOLL CO-FOUNDER &
Big Data Workflows N AME : A SHOK P ADMARAJU C OURSE : T OPICS ON S OFTWARE E NGINEERING I NSTRUCTOR : D R. S ERGIU D ASCALU.
Mihai Pintea. 2 Agenda Hadoop and MongoDB DataDirect driver What is Big Data.
NoSQL and NewSQL Justin DeBrabant CIS Advanced Systems - Fall 2013.
Business Intelligence Technology and Career Options Paul Boal Director - Data Management Mercy ( April 7, 2014.
Business Intelligence: The Next Big Thing (Really!) John Bair CTO, Ajilitee Sep 14, 2012 Presented to TDWI St. Louis Chapter.
1 Yasin N. Silva Arizona State University This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright © 2007 Quest Software The Changing Role of SQL Server DBA’s Bryan Oliver SQL Server Domain Expert Quest Software.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
Planning for a Web site Project ITS Web Services - Wendy Dascoli November 1, 2007.
Introduction à Couchbase Server 2.0 Tugdual Grall
` tuplejump The data engineering platform. A startup with a vision to simplify data engineering and empower the next generation of data powered miracles!
:: Conférence :: NoSQL / Scalabilite Etat de l’art Samuel BERTHE10 Mars 2014Epitech Nantes.
NoSQL continued CMSC 461 Michael Wilson. MongoDB  MongoDB is another NoSQL solution  Provides a bit more structure than a solution like Accumulo  Data.
Database Systems – CRM DEFINITIONS CRM - Customer Relationship Management CRM usually refers to a strategic solution that helps businesses identify the.
Distributed Indexing of Web Scale Datasets for the Cloud {ikons, eangelou, Computing Systems Laboratory School of Electrical.
Presented by John Dougherty, Viriton 4/28/2015 Infrastructure and Stack.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Contents HADOOP INTRODUCTION AND CONCEPTUAL OVERVIEW TERMINOLOGY QUICK TOUR OF CLOUDERA MANAGER.
Distributed Systems Fall 2014 Zubair Amjad. Outline Motivation What is Sqoop? How Sqoop works? Sqoop Architecture Import Export Sqoop Connectors Sqoop.
Introduction to Sqoop. Table of Contents Sqoop - Introduction Integration of RDBMS and Sqoop Sqoop use case Sample sqoop commands Key features of Sqoop.
When bet365 met Riak and discovered a true, “always on” database.
Data and SQL on Hadoop. Cloudera Image for hands-on Installation instruction – 2.
Hadoop IT Services Hadoop Users Forum CERN October 7 th,2015 CERN IT-D*
An Introduction to Predictive Analytics with Big Data and Open Source tools Joe Heary CTO & VP of Technical Operations Zimmerman Associates, Inc. (ZAI)
03 | Express and Databases
Nov 2006 Google released the paper on BigTable.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT IT Monitoring WG Technology for Storage/Analysis 28 November 2011.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT IT Monitoring WG Update on the tool hunt & MonALISA monitoring.
Scalable data access with Impala Zbigniew Baranowski Maciej Grzybek Daniel Lanza Garcia Kacper Surdy.
1 HBASE – THE SCALABLE DATA STORE An Introduction to HBase XLDB Europe Workshop 2013: CERN, Geneva James Kinley EMEA Solutions Architect, Cloudera.
One Billion Rows Per Second: Analytics for the Digital Media Markets STRATA SUMMIT NYC September 21, 2011 MICHAEL DRISCOLL CO-FOUNDER &
Big Data Analytics with Excel Peter Myers Bitwise Solutions.
Information Eastman. Business Process Skills Order to Cash, Forecasting & Budgeting, etc. Process Modeling Project Management Technical Skills.
Big Data Yuan Xue CS 292 Special topics on.
HADOOP Course Content By Mr. Kalyan, 7+ Years of Realtime Exp. M.Tech, IIT Kharagpur, Gold Medalist. Introduction to Big Data and Hadoop Big Data › What.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
Harnessing Big Data with Hadoop Dipti Sangani; Madhu Reddy DBI210.
An Introduction To Big Data For The SQL Server DBA.
What is it and why it matters? Hadoop. What Is Hadoop? Hadoop is an open-source software framework for storing data and running applications on clusters.
CPSC8985 FA 2015 Team C3 DATA MIGRATION FROM RDBMS TO HADOOP By Naga Sruthi Tiyyagura Monika RallabandiRadhakrishna Nalluri.
Microsoft Partner since 2011
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Our experience with NoSQL and MapReduce technologies Fabio Souto.
Dive into NoSQL with Azure Niels Naglé Hylke Peek.
Microsoft Ignite /28/2017 6:07 PM
OMOP CDM on Hadoop Reference Architecture
Business Intelligence in the age of analytics
BigData - NoSQL Hadoop - Couchbase
From DBA to DPA – Becoming a Data Platform Administrator
Big Data A Quick Review on Analytical Tools
TDWI EXECUTIVE SUMMIT From Traditional to Modern: How Rakuten Marketing Realized the Promise of a New Generation of BI September 21, 2015 Donald Krapohl.
An Open Source Project Commonly Used for Processing Big Data Sets
Integrating with Dynamics 365
CS122B: Projects in Databases and Web Applications Winter 2017
Sqoop Mr. Sriram
Insights driven Customer Experience
© 2016 Global Market Insights, Inc. USA. All Rights Reserved Fuel Cell Market size worth $25.5bn by 2024Low Power Wide Area Network.
Hadoop Market
Storage Systems for Managing Voluminous Data
03 | Building a Backend with Socket.IO and Mongo
Presentation transcript:

Technical Evangelist Tugdual “Tug” Grall BigData - NoSQL Hadoop - Couchbase

About me Tugdual “Tug” Grall - Couchbase - Technical Evangelist - eXo - CTO - Oracle - Developer/Product Manager - Mainly Java/SOA - Developer in consulting firms Web tgrall NantesJUG co-founder Pet Project :

<50%? % Relational Technology $30B Database Market Being Disrupted 2012 All new database growth will be NoSQL Relational Technology NoSQL Technology Other

Cloudera Hortonworks Mapr Operational vs. Analytic Databases Couchbase MongoDB Cassandra Hbase AnalyticDatabases Get insights from data Real-time, Interactive Databases Fast access to data NoSQL

What Is Biggest Data Management Problem Driving Use of NoSQL in Coming Year? Lack of flexibility/ rigid schemas Inability to scale out data Performance challengesCostAll of theseOther 49 % 35 % 29 % 16 % 12 % 11 % Source: Couchbase Survey, December 2011, n = 1351.

Hadoop & NoSQL

What is Sqoop? Sqoop is a tool designed to transfer data between Hadoop and relational databases. You can use Sqoop to import data from a relational database management system (RDBMS) such as MySQL or Oracle into the Hadoop Distributed File System (HDFS), transform the data in Hadoop MapReduce, and then export the data back into an RDBMS. sqoop.apache.org

What is Sqoop? Traditional ETL Application Data T

What is Sqoop? A different paradigm Data Application Data

What is Sqoop? A very scalable different paradigm Data Application Data Application Data Application Data

What is Sqoop? Where did the Transform go? Application Data TTTTTTTTTTTT

Sqoop Details Sqoop Default connection is via JDBCLots of custom connectorsCouchbase, VoltDB, VerticaTeradata, NetezzaOracle, MySQL, Postgres

Ad and offer targeting events profiles, campaigns profiles, real time campaign statistics 40 milliseconds to respond with the decision

Moving Parts

Content and Recommendation Targeting

Content Driven Site: Moving Parts

Couchbase

Couchbase Server Core Principles Easy Scalability Consistent High Performance Always On 24x365 Grow cluster without application changes, without downtime with a single click Consistent sub-millisecond read and write response times with consistent high throughput No downtime for software upgrades, hardware maintenance, etc. Flexible Data Model JSON document model with no fixed schema.

Couchbase Handles Real World Scale

Q&A