THE BUSINESS CASE FOR AI, SPARK & MORE

Slides:



Advertisements
Similar presentations
Building an Operational Enterprise Architecture and Service Oriented Architecture Best Practices Presented by: Ajay Budhraja Copyright 2006 Ajay Budhraja,
Advertisements

Spark in the Hadoop Ecosystem Eric Baldeschwieler (a.k.a. Eric14)
Hosted on the Powerful Microsoft Azure Platform, Advent Countdown Lets Companies Run Reliable and Scalable Holiday Marketing Campaigns MICROSOFT AZURE.
IT AIN’T WHAT YOU DO TO DATA… IT’S WHAT YOU DO WITH IT Edd svds.com/StrataUK2015.
Datalayer Notebook Allows Data Scientists to Play with Big Data, Build Innovative Models, and Share Results Easily on Microsoft Azure MICROSOFT AZURE ISV.
Matthew Winter and Ned Shawa
Microsoft Azure and DataStax: Start Anywhere and Scale to Any Size in the Cloud, On- Premises, or Both with a Leading Distributed Database MICROSOFT AZURE.
DreamFactory for Microsoft Azure Is an Open Source REST API Platform That Enables Mobilization of Data in Minutes across Frameworks and Storage Methods.
Snip2Code: Search, Share and Collect Code Snippets Faster, Easier, Efficiently with Power of Microsoft Azure Platform MICROSOFT AZURE ISV PROFILE: SNIP2CODE.
Raju Subba Open Source Project: Apache Spark. Introduction Big Data Analytics Engine and it is open source Spark provides APIs in Scala, Java, Python.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Hadoop Javad Azimi May What is Hadoop? Software platform that lets one easily write and run applications that process vast amounts of data. It includes:
READ ME FIRST Use this template to create your Partner datasheet for Azure Stack Foundation. The intent is that this document can be saved to PDF and provided.
AuraPortal Cloud Helps Empower Organizations to Organize and Control Their Business Processes via Applications on the Microsoft Azure Cloud Platform MICROSOFT.
Univa Grid Engine Makes Work Management Automatic and Efficient, Accelerates Deployment of Cloud Services with Power of Microsoft Azure MICROSOFT AZURE.
IBM Predictive Analytics Virtual Users’ Group Meeting March 30, 2016
Organizations Are Embracing New Opportunities
Big Data is a Big Deal!.
Big Data Enterprise Patterns
PROTECT | OPTIMIZE | TRANSFORM
DocFusion 365 Intelligent Template Designer and Document Generation Engine on Azure Enables Your Team to Increase Productivity MICROSOFT AZURE APP BUILDER.
Big Data A Quick Review on Analytical Tools
Metis Data Science Meetup:
Working With Azure Batch AI
Status and Challenges: January 2017
Spark Presentation.
Primal and Microsoft Azure Deliver Personalized Content, Intelligence, and Analytics That Match Your Content to the Interests of Your Audience MICROSOFT.
Cherwell Service Management is an IT Service Management Solution that Makes it Easier for Users to Capitalize on Power of Microsoft Azure MICROSOFT AZURE.
Couchbase Server is a NoSQL Database with a SQL-Based Query Language
Enabling Scalable and HA Ingestion and Real-Time Big Data Insights for the Enterprise OCJUG, 2014.
Data Platform and Analytics Foundational Training
Nimble Streamer Helps Media Content Providers Create Streaming Networks Cost-Effectively and Easily by Utilizing Azure’s Worldwide Scalability MICROSOFT.
Hadoop Clusters Tess Fulkerson.
Get Real Value and Insights from Your Data: Biin Solutions Provides Predictive Analytics, IoT, and Business Intelligence with Microsoft Azure Power MICROSOFT.
OpenNebula Offers an Enterprise-Ready, Fully Open Management Solution for Private and Public Clouds – Try It Easily with an Azure Marketplace Sandbox MICROSOFT.
Sas is open (for business)
Take Control of Insurance Product Management: Build, Test, and Launch Any Product Globally 10x Faster, 10x More Cheaply with INSTANDA on Azure Partner.
Microsoft Azure Platform Powers New Elements Constellation Software Suite to Deliver Invaluable Insights From Your Data for Marketing and Sales MICROSOFT.
NSF : CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science PI: Geoffrey C. Fox Software: MIDAS HPC-ABDS.
OpenWorld 2018 Audio Recognition Using Oracle Data Science Platform
Introduction to Spark.
Oscar AP by Massive Analytic: A Precognitive Analytics Platform for Effortless Data-Driven Decisions. Now Available in Azure Marketplace MICROSOFT AZURE.
Designed for Big Data Visual Analytics, Zoomdata Allows Business Users to Quickly Connect, Stream, and Visualize Data in the Microsoft Azure Platform MICROSOFT.
Logsign All-In-One Security Information and Event Management (SIEM) Solution Built on Azure Improves Security & Business Continuity MICROSOFT AZURE APP.
Hosted on Microsoft Azure, Seismic is Drastically Changing How Enterprise Sales Teams Utilize Content to Accelerate Sales and Close Deals MICROSOFT AZURE.
DeFacto Planning on the Powerful Microsoft Azure Platform Puts the Power of Intelligent and Timely Planning at Any Business Manager’s Fingertips Partner.
Accelerate Your Self-Service Data Analytics
Principal Product Manager Oracle Data Science Platform
Your gateway to cloud innovation
Data science and machine learning at scale, powered by Jupyter
Appcelerator Arrow: Build APIs in Minutes. Connect to Any Data Source
XtremeData on the Microsoft Azure Cloud Platform:
Overview of big data tools
Abiquo’s Hybrid Cloud Management Solution Helps Enterprises Maximise the Full Potential of the Microsoft Azure Platform MICROSOFT AZURE ISV PROFILE: ABIQUO.
Big Data Young Lee BUS 550.
Enabling ML Based Research
Improve Patient Experience with Saama and Microsoft Azure
Using the Microsoft AI Platform for next generation applications
Technical Capabilities
2/19/2019 9:06 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Last.Backend is a Continuous Delivery Platform for Developers and Dev Teams, Allowing Them to Manage and Deploy Applications Easier and Faster MICROSOFT.
H2O is used by more than 14,000 companies
Pitch Deck.
Big-Data Analytics with Azure HDInsight
Databricks and End-to-End Processes Demo Links & Help
Thank you to our Sponsors
Customer 360.
The Intelligent Enterprise and SAP Business One
Presentation transcript:

THE BUSINESS CASE FOR AI, SPARK & MORE Sanjay Mathur, CEO sanjay@svds.com • @sanjaymathur • www.svds.com

Silicon Valley Data Science is a boutique consulting firm focused on transforming business through data science and engineering. We are a company of experienced software engineers, architects, data scientists, strategists, and designers who specialize in data-driven product development and experimentation. We work in blended teams trained in holistic, strategic, and agile approaches.

Come see us & say hello! Tuesday, 23 May Data 101: The Business Case for Deep Learning, Spark, and Friends with Sanjay Mathur Architecting a Data Platform with John Akred & Stephen O’Sullivan Developing a Modern Enterprise Data Strategy with John Akred & Scott Kurth Thursday, 25 May What’s Your Data Worth? with John Akred Ask Me Anything with John Akred, Scott Kurth & Stephen O’Sullivan

To view SVDS speakers and scheduling, or to receive a copy of our slides, go to: www.svds.com/StrataUK2017

… it’s really about agility BIG DATA … it’s really about agility

BUYING AGILITY Linear scale-out cost Opex vs capex Ease of purchase

Scale-out systems move us from managing scarcity to promoting utility

DEVELOPMENT AGILITY Architectural factors Schema on read Rapid deployment Mirror production setup Executes faster Programmer factors Fun to program Concision Easier to test Faster to write DEVELOPMENT AGILITY

The experimental enterprise Supports investigative work and builds a solid layer for production. Conducts experiments and responds to the changing environment. Makes foundational infrastructure readily accessible. http://www.forbes.com/sites/edddumbill/2014/02/05/the-experimental-enterprise/

Data Management Security, Operations, Data Quality, Meta Data Management and Data Lineage Analytics Low Latency Access Data Ingest Repository Persistence Offline Processing Real-Time Processing Batch Processing Data Services External Systems Data Acquisition Internal Data Platform

What is Docker? Container technology: bundles every part of an application Provides isolation for each application without the overhead of running a virtual machine Ships only the parts that are needed—leaves out the operating system Answers the question of “how do I get my data” VM – full isolation, guaranteed resources Docker – process isolation, and better resource sharing

WHY SHOULD BUSINESS CARE? Better use of server resource than virtual machines A fast and reliable way of deploying applications It’s the ideal packaging mechanism for scale-out distributed systems Easy for developers to work in an environment identical to production Sharing containers leads to innovation DevOps enabler with programmatic infrastructure components. Fast scale up and tear down.

What is Apache Spark? In-memory distributed computing platform Comes from Berkeley AMPlab In production with early adopters, now integral to every commercial Hadoop distribution Doesn’t need Hadoop, but runs easily on top

Use cases Managing a major retailer’s inventory across a diverse network of entities in near real time Managing and processing event streams for online gaming Supporting data science initiatives across massive data sets at a media analytics company

Why should business care? Enables use cases Hadoop didn’t provide, all in one platform streaming, interactive analytics, machine learning, graphs Fast Iteration time down, more productive Use existing cluster investment Sits on HDFS, can run under YARN (or use Amazon S3, or Cassandra)

Why should business care? (2) SparkSQL Use SQL skills and tools, e.g. Tableau Dataframes integrate external data sources into one context: RDBMS, Hive, JSON… Developer-friendly Concise and fluid to program Language integration: Scala, R, Python, Java

THE AI BOOM Image from https://devblogs.nvidia.com/parallelforall/deep-learning-for-computer-vision-with-matlab-and-cudnn/ Deep learning uses deep neural networks which have been around for a few decades; what’s changed in recent years is the availability of large labeled datasets and powerful GPUs. Neural networks are inherently parallel algorithms and GPUs with thousands of cores can take advantage of this parallelism to dramatically reduce computation time needed for training deep learning networks.

THE AI BOOM A long history of under-delivered promise—why? Traditionally, an encoding of knowledge Expert systems Current excitement is based on statistical methods Machine learning—many methods Deep learning—neural networks

AI USE CASES Cognition at scale Pattern recognition Predictive analytics Deep learning takes this further than before Extraordinary levels of composition, eg. Image captioning, generation

AI, WHY CARE? Leverage data previously “invisible” e.g. search images by text Automate low-value cognition tasks Call center triage Natural interfaces X.ai — scheduling bot Amazon Alexa, Google Now, Cortana, etc. Amazing open source toolkits and cloud APIs Google Tensorflow, MSFT, IBM Watson

What are Notebooks? Interactive documents that contain a program and its output Long history: Mathematica Particularly successful with data science Projects to watch Jupyter https://jupyter.org/ Apache Zeppelin https://zeppelin.incubator.apache.org/ Answers the question of “how do I get my data”

Demo goes here

Why should business care? Easy collaboration and sharing of data science Think “Docker for analysis” Easy access to data and compute resource A building block for more self-service analytical capabilities Commercial version of Notebooks + Spark is the Databricks Cloud

Data is your business

SILICON VALLEY’S DATA MACHINE

The experimental enterprise Supports investigative work and builds a solid layer for production. Conducts experiments and responds to the changing environment. Makes foundational infrastructure readily accessible. http://www.forbes.com/sites/edddumbill/2014/02/05/the-experimental-enterprise/

BECOME DATA NATIVE Can only win with situational awareness New architectures offer new opportunities Creation of data-driven value requires new approach Create an Experimental Enterprise Business must lead, and understand the potential of the technology

Sanjay Mathur• sanjay@svds. com • @sanjaymathur www. svds Sanjay Mathur• sanjay@svds.com • @sanjaymathur www.svds.com/StrataCA2017

Come see us & say hello! Tuesday, 23 May Data 101: The Business Case for Deep Learning, Spark, and Friends with Sanjay Mathur Architecting a Data Platform with John Akred & Stephen O’Sullivan Developing a Modern Enterprise Data Strategy with John Akred & Scott Kurth Thursday, 25 May What’s Your Data Worth? with John Akred Ask Me Anything with John Akred, Scott Kurth & Stephen O’Sullivan

To view speakers and scheduling, or to receive a copy of our slides, go to: www.svds.com/StrataCA2017