Published by Lindsay Chambers. Modified over 9 years ago.
1 Apache Spark and Its Role in the Enterprise Data Hub
Mike Olson, Chief Strategy Officer, Cloudera
mike.olson@cloudera.com, @mikeolson
2 Spark Unifies and Simplifies Hadoop
Batch Processing
Stream Processing
Machine Learning
©2014 Cloudera, Inc. All rights reserved.
3 Developing and supporting Spark together to ensure customer success
4 Spark at Cloudera
October 2013: Databricks and Cloudera partner
February 2014: Spark support added to CDH
July 2014: Continuing support & innovation
5 Spark is a Core Component of Hadoop
6 Fully Integrated into CDH
Integrated and supported part of our platform
Diverse use cases in production
Well-trained support and external trainings
Platform diagram labels: 3RD PARTY APPS, STORAGE, BATCH PROCESSING, INTERACTIVE SQL, SEARCH ENGINE, MACHINE LEARNING, STREAM PROCESSING, WORKLOAD MANAGEMENT, FILESYSTEM, ONLINE NOSQL
7 Customer Adoption
Search personalization through machine learning investigations
Fast processing of millions of stock positions and future scenarios
Genomics research using Spark pipelines
Predictive modeling of disease conditions
8 What’s Next?
9 The only hands-on deep dive into building unified applications with Spark
Cloudera Developer Training for Apache Spark
Public GA: Aug 5, Redwood City
10 Dell In-Memory Appliances for Cloudera Enterprise
Simplifies and speeds up complex cluster deployments
Includes Cloudera Enterprise and ScaleMP's Versatile SMP (vSMP) architecture
Built on Intel(R) Xeon(R) processor-based Dell R920 hardware
Optimized for Spark
11 Spark as the Standard Processing Engine
12 Bringing the Communities Together
The Hive and Spark communities are coming together to drive consolidation in the Hadoop ecosystem
13 Hive on Spark
14 Architecture
Diagram labels: SPARK, BATCH PROCESSING, STREAM PROCESSING, HIVE (Parser, Metastore, Semantic Analyser, Logical Plan, Optimizer, Task execution layer), HDFS, MR, Tez
15 Our SQL on Hadoop Vision
SQL
BI and SQL Analytics
Batch Processing
Mixed Spark and SQL Applications
16 Thank you!
Mike Olson
mike.olson@cloudera.com, @mikeolson