High Performance Data Scientist

Slides:



Advertisements
Similar presentations
What is Cloud Computing? o Cloud computing:- is a style of computing in which dynamically scalable and often virtualized resources are provided as a service.
Advertisements

Presented by Sujit Tilak. Evolution of Client/Server Architecture Clients & Server on different computer systems Local Area Network for Server and Client.
Cloud computing Tahani aljehani.
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
Platform as a Service (PaaS)
Google AppEngine. Google App Engine enables you to build and host web apps on the same systems that power Google applications. App Engine offers fast.
SENG 422 Lab 4 Cloud Computing Time: ELW B220 from (4:00 - 6:50) every Tuesday TA: Philip Baback Alipour Ph.D. Candidate in Electrical, Computer Engineering.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Lecture 8 – Platform as a Service. Introduction We have discussed the SPI model of Cloud Computing – IaaS – PaaS – SaaS.
Robert MacLean BBD Software Get Ready For The Cloud TRACK: Cloud & ALM.
Application and Data Portability in Cloud Computing The Cirrocumulus Way Ajith Ranabahu and Amit Sheth Ohio Center of Excellence in Knowledge enabled Computing.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Introduction to Cloud Computing
Cloud Computing. Cloud Computing defined Dynamically scalable, device-independent and task-centric computing resources are provided online, with all charges.
CLOUD COMPUTING Kevin Levy, Gunster David Zimmerman, IBM Marcos Galigarcía, Santander Private Banking.
Paperless Timesheet Management Project Anant Pednekar.
GOOGLE APP ENGINE By Muktadiur Rahman. Contents  Cloud Computing  What is App Engine  Why App Engine  Development with App Engine  Quote & Pricing.
Microsoft Cloud Computing. Topics to be covered 1.Environmental Features of windows azure 2.What is Cloud Computing 3.Roles in Cloud Computing 4.Benefits.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
1 NETE4631 Using Google Web Services Lecture Notes #6.
Amazon Web Services. Amazon Web Services (AWS) - robust, scalable and affordable infrastructure for cloud computing. This session is about:
Sausalito: An Application Server for RESTful Services in the Cloud Matthias Brantner & Donald Kossmann 28msec Inc.
Agenda  What is Cloud Computing?  Milestone of Cloud Computing  Common Attributes of Cloud Computing  Cloud Service Layers  Cloud Implementation.
Leverage Big Data With Hadoop Analytics Presentation by Ravi Namboori Visit
Clouding with Microsoft Azure
Prof. Jong-Moon Chung’s Lecture Notes at Yonsei University
Platform as a Service (PaaS)
Unit 3 Virtualization.
Let's talk about Linux and Virtualization in 'vLAMP'
Containers as a Service with Docker to Extend an Open Platform
Platform as a Service (PaaS)
Dr.S.Sridhar, Director, RVCT, RVCE, Bangalore
Platform as a Service (PaaS)
Dockerize OpenEdge Srinivasa Rao Nalla.
Foundations of Data Science
Tulika Chaudharie / Harikharan Krishnaraju
Working With Azure Batch AI
Google App Engine Mandeep Singh (37926)
Docker Birthday #3.
Cloud Computing & ANalytics
Spark Presentation.
Platform as a Service.
Logo here Module 3 Microsoft Azure Web App. Logo here Module Overview Introduction to App Service Overview of Web Apps Hosting Web Applications in Azure.
Cloud Computing Platform as a Service
Dr.S.Sridhar, Director, RVCT, RVCE, Bangalore
In-Memory Performance
Using docker containers
Oscar AP by Massive Analytic: A Precognitive Analytics Platform for Effortless Data-Driven Decisions. Now Available in Azure Marketplace MICROSOFT AZURE.
Cloud Computing.
Bringing Containers to Production
Open Source Toolkit for Turn-Key AI Cluster (Introduction)
Microsoft Connect /17/ :34 AM
Deep Learning Packages
Principal Product Manager Oracle Data Science Platform
Data science and machine learning at scale, powered by Jupyter
Appcelerator Arrow: Build APIs in Minutes. Connect to Any Data Source
: Infrastructure for Complete Machine Learning Lifecycle
Modern cloud PaaS for mobile apps, web sites, API's and business logic apps
Using the Microsoft AI Platform for next generation applications
Bringing Containers to Production
Technical Capabilities
Building and running HPC apps in Windows Azure
Agenda Need of Cloud Computing What is Cloud Computing
A platform for the Complete Machine Learning Lifecycle
Artificial intelligence for everyone
Anjuman College of Engineering & Technology Computer Science & Engineering Department Subject Code: BECSE408T Subject Name: (ELECTIVE-III)Clustering &
A DevOps process for deploying R to production
Docker for DBAs SQL Saturday 8/17/2019.
Containers on Azure Peter Lasne Sr. Software Development Engineer
Presentation transcript:

High Performance Data Scientist By : Swapnil Dubey

About me ... Swapnil Dubey : 9+ years - Domains - Ecommerce ,AdServing, Banking

Agenda ... Benefits of running DS on Cloud. Introduction to MLaaS. Demo: Running DS models using Tensorflow and keras on Google Cloud ML(Using GPUs). Discussion on how to develop a generic infinitely scalable infrastructure - Why?What?How? Demo: Running multiple R jobs to show auto scaling feature of the infrastructure. Kubernetes Federation + Ubernetes. Demo : Deploying application in Kubernetes Federation set up. Infrastructure for running production and experiment jobs for DS. Talk on Docker, Kubernetes Cross region kubernetes Cross cloud vendor kubernete

Benefits of Running DS in Cloud Unpredictable need of Infrastructure - Infinitely scalable infra for shorter duration Innovate and lead - New Languages, New Libraries - lots of experiments New hardware components - Running models on various versions of GPUs Improved security and compliance - running jobs in different regions of world Effective task execution - Fault tolerance, run-time dynamic scalability.

Introduction to MLaas ... Machine Learning as a Service Platform which can provide almost all the advantages of last slide. Famous vendors: Amazon Machine Learning, Google Cloud Machine Learning, Azure Machine Learning Different vendors - different pre-built libraries Amazon ML pre built libraries - Apache MXNet, TensorFlow, Caffe2 (and Caffe), Theano, Torch, Keras

Introduction to MLaas ... Machine Learning as a Service Platform which can provide almost all the advantages of last slide. Famous vendors: Amazon Machine Learning, Google Cloud Machine Learning, Azure Machine Learning Different vendors - different pre-built libraries Amazon ML pre built libraries - Apache MXNet, TensorFlow, Caffe2 (and Caffe), Theano, Torch, Keras Areas of Concerns : - How to use custom infrastructure ? - How to run job involving a non pre built library?

Demo Running DS models using Tensorflow and keras on Google Cloud ML(Using GPUs). Disadvantage : Completely new flavor of technology set is not achievable.

Cognitive APIs Advanced artificial intelligence features NLP,Speech recognition,Vision API etc. Demo : Google Vision API Very basic ….

Kevin Novak, Head Of Data Science@Uber Article By Kevin Novak The Purpose of Platforms in Data Science ,April-2016 “We will need special features, we’ll have more control, we’ve got great engineers who could build it for cheap, and of course, this is strategic to the business.”

Developing a Generic Infrastructure How to support an altogether different flavour of DS as well as non DS job on a Cloud Vendor? Constructing a numpy file. Running spark jobs for transformations Any new hypothetical task on any new technology How to work with different versions of languages supported out of the box? How to have an auto scale up and scale down infrastructure which is cost effective? How to have a cloud vendor independent deployment for you DS jobs? Running jobs in different regions.

Developing a Generic Infrastructure How to support an altogether different flavour of DS as well as non DS job on a Cloud Vendor? Constructing a numpy file. Running spark jobs for transformations Any new hypothetical task on any new technology How to work with different versions of languages supported out of the box? How to have an auto scale up and scale down infrastructure which is cost effective? How to have a cloud vendor independent deployment for you DS jobs? Running jobs in different regions.

Developing a Generic Infrastructure How to support an altogether different flavour of DS as well as non DS job on a Cloud Vendor? Constructing a numpy file. Running spark jobs for transformations Any new hypothetical task on any new technology How to work with different versions of languages supported out of the box? How to have an auto scale up and scale down infrastructure which is cost effective? How to have a cloud vendor independent deployment for you DS jobs? Running jobs in different regions.

Developing a Generic Infrastructure How to support an altogether different flavour of DS as well as non DS job on a Cloud Vendor? Constructing a numpy file. Running spark jobs for transformations Any new hypothetical task on any new technology How to work with different versions of languages supported out of the box? How to have an auto scale up and scale down infrastructure which is cost effective? How to have a cloud vendor independent deployment for you DS jobs? Running jobs in different regions.

In other words ... Horizontal dynamic runtime scaling Rolling updates of Tech stack Cloud native job like infrastructure Support for both persistent and non persistent applications

Using Native Cloud Compute And Storage ... Spawn a VM and run with inflated resources.

Docker & Kubernetes

(EC2 Container Registry/Google Container Registry) What we did ? Producer Consumer KOPS/ GKE Submit Python Job Python Job Topic spark Java Producer R Submit R jobs R Job Topic python Job Submission UI Producer Submit MPI Jobs MPI Job Topic Container Registry (EC2 Container Registry/Google Container Registry) Producer Submit Java Jobs Java Job Topic updateStatus (failed) Virtual Machines Pub/Sub submitJob updateStatus (submitted) updateStatus (running) Expected time to complete- 10 mins 1 Already hosted web app. 2. Python job 3. Upscaling and downscaling 4. Preemptible nodes- reducing the cost by 1/3 Admin Module(Virtual Machines)

Kubernetes Federation ...

Ubernetes ...

Summary : Developing a Generic Infrastructure How to support an altogether different flavour of DS as well as non DS job on a Cloud Vendor? Constructing a numpy file. Running spark jobs for transformations Any new hypothetical task on any new technology How to work with different versions of languages supported out of the box? How to have an auto scale up and scale down infrastructure which is cost effective? How to have a cloud vendor independent deployment for you DS jobs? Ubernetes Running jobs in different regions. - Federation

Catch me ... https://www.linkedin.com/in/swapnil-dubey-8133154a/ @SwapnilDubey14 swapnil.dubey@exadatum.com

Thanks!