1 Apache Spark and Its Role in the Enterprise Data Hub Mike Olson, Chief Strategy Officer,

Slides:



Advertisements
Similar presentations
1 1 Apache Hadoop and the Emergence of the Enterprise Data Hub Eli Collins, Chief Technologist ©2014 Cloudera, Inc. All rights reserved.
Advertisements

© 2009 VMware Inc. All rights reserved Big Data’s Virtualization Journey Andrew Yu Sr. Director, Big Data R&D VMware.
Hortonworks Eric Baldeschwieler – CEO © Hortonworks Inc Architecting the Future of Big Data June 29, 2011.
Observation Pattern Theory Hypothesis What will happen? How can we make it happen? Predictive Analytics Prescriptive Analytics What happened? Why.
Microsoft Ignite /16/2017 5:47 PM
An Information Architecture for Hadoop Mark Samson – Systems Engineer, Cloudera.
Hadoop tutorials. Todays agenda Hadoop Introduction and Architecture Hadoop Distributed File System MapReduce Spark 2.
SM STRATA PRESENTATION Tim Garnto - SVP Engineering, edo Interactive Rob Rosen – Big Data Field Lead, Pentaho.
Apache Spark and the future of big data applications Eric Baldeschwieler.
This presentation was scheduled to be delivered by Brian Mitchell, Lead Architect, Microsoft Big Data COE Follow him Contact him.
Committed to Deliver….  We are Leaders in Hadoop Ecosystem.  We support, maintain, monitor and provide services over Hadoop whether you run apache Hadoop,
© Copyright 2013 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Apache and Hadoop are.
` tuplejump The data engineering platform. A startup with a vision to simplify data engineering and empower the next generation of data powered miracles!
Hadoop tutorials. Todays agenda Hadoop Introduction and Architecture Hadoop Distributed File System MapReduce Spark Cluster Monitoring 2.
Contents HADOOP INTRODUCTION AND CONCEPTUAL OVERVIEW TERMINOLOGY QUICK TOUR OF CLOUDERA MANAGER.
Distributed Systems Fall 2014 Zubair Amjad. Outline Motivation What is Sqoop? How Sqoop works? Sqoop Architecture Import Export Sqoop Connectors Sqoop.
The World Moves Fast, and Data is Driving: Big Data and - #GILSV September 2013 | Silicon - #GILSV September.
DELIVERING THE ENTERPRISE FABRIC FOR BIG DATA Aiaz Kazi SVP, Platform Strategy and Adoption
How Companies are Using Spark And where the Edge in Big Data will be Matei Zaharia.
SCAPE Rainer Schmidt SCAPE Training Event September 16 th – 17 th, 2013 The British Library Building Scalable Environments Technologies and SCAPE Platform.
1 © Cloudera, Inc. All rights reserved. Partner Solution Overview 1 Partner Logo Full Color Partner Logo Full Color.
Hadoop IT Services Hadoop Users Forum CERN October 7 th,2015 CERN IT-D*
Matthew Winter and Ned Shawa
Breaking points of traditional approach What if you could handle big data?
1 © Cloudera, Inc. All rights reserved. Engines, Algorithms, and Data Models Josh Wills | Senior Director of Data Science From Dimensional Modeling to.
Microsoft Azure and DataStax: Start Anywhere and Scale to Any Size in the Cloud, On- Premises, or Both with a Leading Distributed Database MICROSOFT AZURE.
Streaming Relational Internal & external Non-relational NoSQL MobileReports Natural language queryDashboardsApplications Orchestration Machine learningModeling.
1 © Cloudera, Inc. All rights reserved. Alexander Bibighaus| Director of Engineering The Future of Data Management with Hadoop and the Enterprise Data.
Harnessing Big Data with Hadoop Dipti Sangani; Madhu Reddy DBI210.
Making Data Work for Everyone Gordon Phillips May 28, 2014.
BIG DATA/ Hadoop Interview Questions.
Apache Hadoop on Windows Azure Avkash Chauhan
Cloudera Enterprise on Microsoft Azure. Data is driving business transformation CUSTOMER & CHANNEL DATA-DRIVEN PRODUCTS SECURITY, RISK & COMPLIANCE In.
Data Analytics and Hadoop Service in IT-DB Visit of Cloudera - April 19 th, 2016 Luca Canali (CERN) for IT-DB.
Microsoft Partner since 2011
Ignite in Sberbank: In-Memory Data Fabric for Financial Services
Microsoft Ignite /28/2017 6:07 PM
Raju Subba Open Source Project: Apache Spark. Introduction Big Data Analytics Engine and it is open source Spark provides APIs in Scala, Java, Python.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Business Insights Play briefing deck.
11/19/2017 9:41 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Data Lake and HAWQ Integration
Connected Infrastructure
WPC047 Data ON THE ROAD: the Azure part
Data Platform and Analytics Foundational Training
PROTECT | OPTIMIZE | TRANSFORM
5/9/2018 7:28 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS.
Smart Building Solution
Apache Spot (Incubating)
TDWI EXECUTIVE SUMMIT From Traditional to Modern: How Rakuten Marketing Realized the Promise of a New Generation of BI September 21, 2015 Donald Krapohl.
Melbourne Azure Meetup
Smart Building Solution
Microsoft SharePoint Server 2016
Connected Infrastructure
Data Platform and Analytics Foundational Training
Data Analytics and CERN IT Hadoop Service
9/21/2018 3:41 AM BRK3180 Architect your big data solutions with SQL Data Warehouse & Azure Analysis Services Josh Caplan & Matt Usher Program Managers.
Hybrid Cloud Strategies for Big Data
Data Analytics and CERN IT Hadoop Service
Designed for Big Data Visual Analytics, Zoomdata Allows Business Users to Quickly Connect, Stream, and Visualize Data in the Microsoft Azure Platform MICROSOFT.
Federico Perrero – Plant Manager
Technical Capabilities
Azure Machine Learning on Databricks
Big-Data Analytics with Azure HDInsight
Introduction to Azure Data Lake
Productive + Hybrid + Intelligent + Trusted
How Dell, SAP and SUSE Deliver Value Quickly
Copyright © JanBask Training. All rights reserved Get Started with Hadoop Hive HiveQL Languages.
SQL Server 2019 Bringing Apache Spark to SQL Server
Dimension Load Patterns with Azure Data Factory Data Flows
Presentation transcript:

1 Apache Spark and Its Role in the Enterprise Data Hub Mike Olson, Chief Strategy Officer,

2 ©2014 Cloudera, Inc. All rights reserved. Spark Unifies and Simplifies Hadoop Batch Processing Stream Processing Machine Learning

3 ©2014 Cloudera, Inc. All rights reserved. Developing and supporting Spark together to ensure customer success

4 ©2014 Cloudera, Inc. All rights reserved. Spark at Cloudera October 2013 February 2014 July 2014 Databricks and Cloudera partner Spark support added to CDH Continuing support & innovation

5 ©2014 Cloudera, Inc. All rights reserved. Spark is a Core Component of Hadoop

6 ©2014 Cloudera, Inc. All rights reserved. Fully Integrated into CDH Integrated and supported part of our platform Diverse use cases in production Well-trained support and external trainings 3 RD PARTY APPS STORAGE BATCH PROCESSING INTERACTIVE SQL SEARCH ENGINE MACHINE LEARNING STREAM PROCESSING WORKLOAD MANAGEMENT FILESYSTEMONLINE NOSQL

7 ©2014 Cloudera, Inc. All rights reserved. Customer Adoption Search personalization through machine learning investigations Fast processing of millions of stock positions and future scenarios Genomics research using Spark pipelines Predictive modeling of disease conditions

8 What’s Next?

9 The only hands-on deep dive into building unified applications with Spark Cloudera Developer Training for Apache Spark Public GA: Aug 5, Redwood City ©2014 Cloudera, Inc. All rights reserved.

10 ©2014 Cloudera, Inc. All rights reserved. Simplifies and speeds up complex cluster deployments Includes Cloudera Enterprise and ScaleMP's Versatile SMP (vSMP) architecture Built on the Intel(R) Xeon(R) processor-based Dell R920 hardware Optimized for Spark Dell In-Memory Appliances for Cloudera Enterprise

11 ©2014 Cloudera, Inc. All rights reserved. Spark as the Standard Processing Engine

12 ©2014 Cloudera, Inc. All rights reserved. The Hive and Spark communities are coming together to drive consolidation in the Hadoop ecosystem Bringing the Communities Together

13 ©2014 Cloudera, Inc. All rights reserved. Hive on Spark

14 ©2014 Cloudera, Inc. All rights reserved. Architecture SPARK BATCH PROCESSING STREAM PROCESSING HIVE Parser, Metastore, Semantic Analyser, Logical Plan, Optimizer, Task execution layer HDFS MRTez

15 ©2014 Cloudera, Inc. All rights reserved. Our SQL on Hadoop Vision SQL BI and SQL Analytics Batch Processing Mixed Spark and SQL Applications

16 Mike Thank you! ©2014 Cloudera, Inc. All rights reserved.