Confidential – Oracle Internal/Restricted/Highly Restricted

Slides:



Advertisements
Similar presentations
1/17/20141 Leveraging Cloudbursting To Drive Down IT Costs Eric Burgener Senior Vice President, Product Marketing March 9, 2010.
Advertisements

System Center 2012 R2 Overview
Datalayer Notebook Allows Data Scientists to Play with Big Data, Build Innovative Models, and Share Results Easily on Microsoft Azure MICROSOFT AZURE ISV.
Microsoft Azure and DataStax: Start Anywhere and Scale to Any Size in the Cloud, On- Premises, or Both with a Leading Distributed Database MICROSOFT AZURE.
Axis AI Solves Challenges of Complex Data Extraction and Document Classification through Advanced Natural Language Processing and Machine Learning MICROSOFT.
Dato Confidential 1 Danny Bickson Co-Founder. Dato Confidential 2 Successful apps in 2015 must be intelligent Machine learning key to next-gen apps Recommenders.
The Derivitec Risk Portal Provides Powerful, Cost-Effective Risk Management Solutions, Powered by Azure, that Deploy in Minutes MICROSOFT AZURE ISV PROFILE:
data & analytics beyond dashboards
Going Serverless with AWS Lambda
Let's talk about Linux and Virtualization in 'vLAMP'
Module 2: Microsoft Azure overview
DocFusion 365 Intelligent Template Designer and Document Generation Engine on Azure Enables Your Team to Increase Productivity MICROSOFT AZURE APP BUILDER.
Introduction to Distributed Platforms
Nicho Joins Microsoft Azure Certified Program to Transform Brand Engagement, Boost Customer Acquisition and Conversions with Scalable Ease MICROSOFT AZURE.
Ralleo Enterprise-Grade Solution for Managing Change and Business Transformation Provides Opportunities to Better Analyze Real-Time Data MICROSOFT AZURE.
Partner Logo Veropath Offers a Next-Gen Expense Management SaaS Technology Solution, Built Specifically to Harness Big Data Analytics Capabilities in Azure.
What is Cloud Computing - How cloud computing help your Business?
FunnelCake Creates Next-Generation Marketing Operations Platform for Secure, Scalable Big Data Analytics, All Driven by Power of Microsoft Azure MICROSOFT.
Free Cloud Management Portal for Microsoft Azure Empowers Enterprise Users to Govern Their Cloud Spending and Optimize Cloud Usage and Planning MICROSOFT.
Trial.iO Makes it Easy to Provision Software Trials, Demos and Training Environments in the Azure Cloud in One Click, Without Any IT Involvement MICROSOFT.
NeoFirma Taps into the Microsoft Azure Cloud Platform to Deliver Digital Oilfield SaaS to North American Independent Oil and Gas Producers MICROSOFT AZURE.
Firefish Software for Professional Recruiters Stays Available Around the Clock from Any Device and Anywhere by Using the Microsoft Azure Platform Partner.
Cherwell Service Management is an IT Service Management Solution that Makes it Easier for Users to Capitalize on Power of Microsoft Azure MICROSOFT AZURE.
SMS+ on Microsoft Azure Provides Enhanced and Secure Text Messaging, with Audit Trail, Scalability, End-to-End Encryption, and Special Certifications MICROSOFT.
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
VMware és KVM környezetek változtatás nélkül a felhőben
Hosted on Azure, LoginRadius’ Customer Identity
Veeam Backup Repository
Cloud Computing.
SmartHOTEL Solutions Powered by Microsoft Azure Provide Hoteliers with Comprehensive, One-Stop Automated Management of All Booking Channels MICROSOFT AZURE.
OpenNebula Offers an Enterprise-Ready, Fully Open Management Solution for Private and Public Clouds – Try It Easily with an Azure Marketplace Sandbox MICROSOFT.
Take Control of Insurance Product Management: Build, Test, and Launch Any Product Globally 10x Faster, 10x More Cheaply with INSTANDA on Azure Partner.
Running on the Powerful Microsoft Azure Platform,
Designed for Big Data Visual Analytics, Zoomdata Allows Business Users to Quickly Connect, Stream, and Visualize Data in the Microsoft Azure Platform MICROSOFT.
The Sitecore® Experience Platform™ on Microsoft Azure
Yellowfin: An Azure-Compatible Business Intelligence Platform That Connects People with Their Data for Better Decision Making MICROSOFT AZURE APP BUILDER.
Scalable SoftNAS Cloud Protects Customers’ Mission-Critical Data in the Cloud with a Highly Available, Flexible Solution for Microsoft Azure MICROSOFT.
Logsign All-In-One Security Information and Event Management (SIEM) Solution Built on Azure Improves Security & Business Continuity MICROSOFT AZURE APP.
ResourceFirst Puts Emphasis on Communication, Uses Power of Azure to Bring Successful Resource and Portfolio Management to Companies Globally MICROSOFT.
On-Premises, or Deployed in a Hybrid Environment
Auth0 Is Identity Made Simple for Developers, Built by Developers and Supported by the High Availability and Performance of Microsoft Azure MICROSOFT AZURE.
I-POWER JAPAN Gives Small Businesses the Ability to Get Their Work Done from Anywhere, Even a Construction Site, by Using Microsoft Azure MICROSOFT AZURE.
Cloud Fleet Manager from Hanseaticsoft Enables Shipping Organizations to Meet Challenges and Improve Structures on the Azure Cloud Platform MICROSOFT AZURE.
DeFacto Planning on the Powerful Microsoft Azure Platform Puts the Power of Intelligent and Timely Planning at Any Business Manager’s Fingertips Partner.
Data Security for Microsoft Azure
Accelerate Your Self-Service Data Analytics
Excelian Grid as a Service Offers Compute Power for a Variety of Scenarios, with Infrastructure on Microsoft Azure and Costs Aligned to Actual Use MICROSOFT.
MyCloudIT Enables Partners to Drive Their Cloud Profitability Using CSP-Enabled Desktop Hosting Automation with Microsoft Azure and Office 365 MICROSOFT.
Introducing Qwory, a Business-to-Business Search Engine That’s Powered by Microsoft Azure and Detects Vital Contact Information for Businesses MICROSOFT.
Partner Logo Azure Provides a Secure, Scalable Platform for ScheduleMe, an App That Enables Easy Meeting Scheduling with People Outside of Your Company.
Crypteron is a Developer-Friendly Data Breach Solution that Allows Organizations to Secure Applications on Microsoft Azure in Just Minutes MICROSOFT AZURE.
Catalyze Redpoint Platform on Microsoft Azure
MARMIND’s New Service Delivers a Single Centralized Marketing Plan That Connects Teams, Campaigns and Outcomes by Using the Power of the Azure Platform.
Dell Data Protection | Rapid Recovery: Simple, Quick, Configurable, and Affordable Cloud-Based Backup, Retention, and Archiving Powered by Microsoft Azure.
Appcelerator Arrow: Build APIs in Minutes. Connect to Any Data Source
Cloud Analytics for Microsoft Azure
Microsoft Virtual Academy
XtremeData on the Microsoft Azure Cloud Platform:
Abiquo’s Hybrid Cloud Management Solution Helps Enterprises Maximise the Full Potential of the Microsoft Azure Platform MICROSOFT AZURE ISV PROFILE: ABIQUO.
Quasardb Is a Fast, Reliable, and Highly Scalable Application Database, Built on Microsoft Azure and Designed Not to Buckle Under Demand MICROSOFT AZURE.
Stephen W Thomas Using BizTalk Server as your Foundation to the Clouds
Single Cell’s Progenitor Powered by Microsoft Azure Improves Organisational Efficiency with Strategic Procurement, Contract Management, and Analytics MICROSOFT.
Guarantee Hyper-V, System Center Performance and Autoscale to Microsoft Azure with Application Performance Control System from VMTurbo MICROSOFT AZURE.
School Districts Can Analyze and Report on Data Across Multiple Systems with EdWire, a Powerful Integration Solution that Utilizes Microsoft Azure MICROSOFT.
Zendos Tecnologia Utilizes the Powerful, Scalable
Microsoft Virtual Academy
COMPANY PROFILE: REELWAY
SQL Server 2019 Bringing Apache Spark to SQL Server
OU BATTLECARD: Oracle Database 12c R2
Presentation transcript:

Confidential – Oracle Internal/Restricted/Highly Restricted

Big Data Made Simple with Oracle Data Flow Just Add Code Big Data Made Simple with Oracle Data Flow Carter Shanklin Product Management Oracle Confidential – Oracle Internal/Restricted/Highly Restricted

The Challenge of Big Data Flexible frameworks for ETL, SQL, ML and more. Process huge datasets. Retain data longer than ever. Extreme complexity. Lack of skilled operations teams. Difficult to align cost and value. Over the past 5 or so years, Big Data has led to amazing advancements across all industries. My personal favorite Big Data story is British Airway’s Know Me program, which by combining data across disparate inputs, is able to do things like predict when a customer is going to miss a flight and automatically offer to re-book them on another flight. There are hundreds of stories of how Big Data has changed the business landscape by bringing in data from many fronts, combining them and analyzing them. Everyone understand the potential of Big Data, but most people have not realized it. Why? Confidential – Oracle Internal/Restricted/Highly Restricted

Big Data Is Just Too Complicated! “70% of Hadoop deployments will fail to meet cost savings and revenue generation objectives due to skills and integration challenges.” - Merv Adrian, VP Research, Gartner “For three successive years the number of deployments for (Hadoop) projects for our clients has been about 15%. It's hardly moved.” - Merv Adrian, VP Research, Gartner Gartner has looked at this question in great detail, and reports that only about 15% of Big Data projects make it into production. The reason is complexity. Big Data infrastructures require deep operational expertise that is hard to find, hard to hire and hard to retain, so as much as companies want to use Big Data, they don’t have the skills and ability to do it. We need a better way to consume Big Data. Source: https://www.informationweek.com/big-data/software-platforms/big-data-tech-hadoop-and-spark- get-slow-start-in-enterprise/d/d-id/1331285?print=yes March 20, 2018 “...technologies such as Hadoop and Spark are new, they are difficult, and they are complicated.” - Merv Adrian, VP Research, Gartner Confidential – Oracle Internal/Restricted/Highly Restricted

Oracle Data Flow Makes Big Data Truly Easy 1 Fast: Launch Apache Spark® Jobs in Just Seconds. 2 Serverless: No Infrastructure to Deploy or Manage. 3 Complete: Process data in the Cloud or in your Datacenter. Realizing that complexity is the main challenge, we have begun development on Oracle Data Flow, which lets you run Apache Spark® jobs in a true Serverless model Spark as you may know has emerged as the leading Big Data processing framework, supporting SQL, ETL, Graph, ML and more, and is at the center of Modern Big Data. Data Flow focuses squarely on solving the main complexity challenge we all face with Big Data. With no infrastructure to manage, no OS to patch or tune, nothing to upgrade, the most complex aspects of Big Data are handled for you transparently and automatically, allowing you to focus on the analytics, focus on those analytics that make it so your customers just can’t live without you. On top of this, Data Flow launches almost instantly, which helps you stay agile whether you’re in test or dev Data Flow Supports any type of Spark job including SQL, Python, Java, Scala and R. And Data Flow can be Run via UI or API, for easy integration with your production pipelines or your favorite scheduler. Oracle Data Flow The easiest way to run Apache Spark® in the cloud. Serverless, so you never worry about any infrastructure. Any type of Spark job, no re-coding required. Access data from any source, wherever it lives. Availability targeted for 2019. Oracle Confidential – Restricted

Oracle Data Flow: Fast Launch 1 Fast: Launch Apache Spark® Jobs in Just Seconds. 2 Serverless: No Infrastructure to Deploy or Manage. 3 Complete: Process data in the Cloud or in your Datacenter. Let’s drill down into some of these points before we jump into a demo Oracle Data Flow Launch in Seconds, Simple UI and API Hadoop on Prem: Weeks to Months Hadoop in Cloud: 20 Minutes or More Oracle Confidential – Restricted

Oracle Data Flow: Serverless 1 Fast: Launch Apache Spark® Jobs in Just Seconds. 2 Serverless: No Infrastructure to Deploy or Manage. 3 Complete: Process data in the Cloud or in your Datacenter. Upgrading and re-certifying Hadoop can take weeks to months. HDFS upgrades touch all data and can lose/corrupt data Hadoop APIs are not backward compatible and frequently break apps YARN and HDFS performance characteristics differ substantially between versions, affecting post- upgrade SLAs Serverless lets you choose a compatible version for each job. No data upgrade. Roll development forward independently of production Application performance is isolated Net result is you can upgrade in hours/days rather than weeks/months Hadoop: 30% or more of your time spent keeping the lights on. Oracle Data Flow: No patches, easy to upgrade = spend 100% on innovation. MAY s m t w f 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 MAY s m t w f 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 Dev: Spark 2.4 Prod: Spark 2.3 Downtime: Upgrade OS Downtime: Upgrade Hadoop Switch to Spark 2.4 Certify New Hadoop Oracle Confidential – Restricted

Oracle Data Flow: Complete 1 Fast: Launch Apache Spark® Jobs in Just Seconds. 2 Serverless: No Infrastructure to Deploy or Manage. 3 Complete: Process data in the Cloud or in your Datacenter. Oracle Cloud Data Lake Oracle Object Storage SaaS Oracle Autonomous DB Co-Located Access Oracle Data Flow Compute Multi Cloud Oracle Data Flow S3 / WASB NoSQL RDBMS Compute Oracle Data Flow Hybrid Cloud Compute Object Storage EDW Private VPN Oracle Catalog

How Does it Work? Run Any Spark Job in 3 Steps 1 2 3 Sign Up Upload your Spark App Run! .py .jar SQL Data in Oracle Object Storage cloud.oracle.com Includes $300 of free trial credits Configure your app Select number of VMs Run! Oracle Confidential – Restricted

Oracle Data Flow: Access Data Wherever it Lives 1 2 3 In Oracle Cloud From Other Clouds From Your Datacenter Secure via VPN 1: Includes all the connectors you need to talk to Database or MySQL 2. Connect to any cloud to offload analytics 3. Peer a virtual private network to access your on -premises data for data that doesn’t live in the cloud Azure AWS Oracle Oracle Data Flow Oracle Autonomous Database MySQL Oracle Object Storage Oracle Data Flow Private VCN On-Prem EDW Oracle Confidential – Restricted

Oracle Data Flow: Secure Execution Control Your Spark jobs are isolated in private VMs. Data is always encrypted, at- rest and in-motion. Service Controller dfcs.oraclecloud.com Customer 1 Job Customer 2 Job Spark Driver Spark Executor Spark Executor Spark Driver Spark Executor Spark Executor Spark Executor Spark Executor Storage Container Storage Container Customer 2 Objects Customer 2 Objects Customer 2 Objects Oracle Confidential – Restricted

Oracle Data Flow: No Complex Capacity Planning Control Traditional Model: Noisy neighbors mean you provision to maximum expected load. Data Flow: Jobs run on VMs that come and go on-demand. No noisy, no expensive max-load provisioning. Hit SLAs while paying for only what you need. Service Controller dfcs.oraclecloud.com Customer 1 Job Customer 2 Job One of the most complex problems is sizing shared clusters to ensure predictable, consistent SLAs, when workloads are bursty and when workloads come and go. In traditional big data architectures you have to size your cluster to handle the busiest possible load, meaning you massively overspend on your big data architectures Spark Driver Spark Executor Spark Executor Spark Driver Spark Executor Spark Executor Spark Executor Spark Executor Storage Container Storage Container Customer 2 Objects Customer 2 Objects Customer 2 Objects Oracle Confidential – Restricted

Oracle Data Flow is API-Driven. Service driven entirely by REST APIs. Integrate with your applications or workflow engine of choice. Oracle Object Storage Oracle Data Flow Spark Job Spark Jobs REST APIs Secure Read / Write Oracle Confidential – Restricted

Big Data Made Really Simple DEMO Oracle Data Flow Big Data Made Really Simple Confidential – Oracle Internal/Restricted/Highly Restricted

Oracle Data Flow Demo: Convert XML Mess into a SQL Table for Reporting Documents Parquet Table Oracle Data Flow Apache Spark Processing Fully Serverless Oracle Object Storage Oracle Compute Oracle Object Storage Oracle Confidential – Restricted

Summary Oracle Data Flow Simplest Way to run Spark in the Cloud. Any type of Spark job, any Spark version. Infinitely scalable compute and storage. Pay for only what you use. Oracle Confidential – Restricted