DL (Deep Learning) Workspace

Slides:



Advertisements
Similar presentations
Using the WDK for Windows Logo and Signature Testing Craig Rowland Program Manager Windows Driver Kits Microsoft Corporation.
Advertisements

9/24/2017 7:27 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
What is it all about? .NET MeetUp in Prague, CZ (2017/7/19)
Microsoft Build /9/2017 5:00 AM © 2016 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY,
IT Operations Management
Job Scheduling and Runtime in DLWorkspace
1/27/2018 5:13 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
4/19/ :02 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
4/18/2018 3:49 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Data Platform and Analytics Foundational Training
4/24/ :07 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Fundamentals Sunny Sharma Microsoft
5/5/ :05 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Microsoft Virtual Academy
5/13/2018 1:53 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Microsoft Virtual Academy
Tooling Breakout Session
Working With Azure Batch AI
Windows Server 2016 Secure IaaS Microsoft Build /1/2018 4:00 AM
Docker Birthday #3.
Open Source Toolkit for Turn-Key AI Cluster (Introduction)
Developing Hybrid Apps on Microsoft Azure Stack
AI development using Data Science Virtual Machines (DSVM) in Azure
IoT at the Edge Technical guidance deck.
Easily manage SQL everywhere from anywhere with SQL Tools
TensorFlow on Kubernetes with GPU Enabled
DL (Deep Learning) Workspace
Deep Learning Platform as a Service
Microsoft Connect /23/2018 5:27 PM
IT Operations Management
DL (Deep Learning) Workspace
9/14/ :46 AM BRK3293 How the Portland Trail Blazers Use Personalization and Acxiom Data to Target Customers Chris Hoder Program Manager, AI + Research.
Cognitive Toolkit (CNTK) Cha Zhang, Principal Researcher Microsoft AI & Research With 150+ contributors.
Open Source Toolkit for Turn-Key AI Cluster (Introduction)
9/20/ :55 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
IoT at the Edge Technical guidance deck.
SQL Server BI on Windows Azure Virtual Machines
SQL Server OLTP with Microsoft Azure Virtual Machines
Azure Container Service - the most open container orchestration service yet Saurya Das Program Manager.
Azure Container Instances
Managing Specialized Devices With Windows Embedded Device Manager 2011
Xiaokai He Junyi Yi Rapid, Iterative Kubernetes Development with Java Using Visual Studio Code.
How To: Deep Learning Workspace
Open Source Toolkit for Turn-Key AI Cluster (Introduction)
11/17/2018 9:32 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Microsoft Virtual Academy
Microsoft Ignite NZ October 2016 SKYCITY, Auckland.
Bring existing desktop apps to UWP with the Desktop Bridge
New Developer Tools in Windows Embedded Standard 2011
Developing for the cloud with Visual Studio
Intro about Contanier and Docker Technology
Microsoft Virtual Academy
12/9/2018 Desktop Virtualization Corey Hynes Kyle Rosenthal President Technical Lead HynesITe Inc Spider Consulting @windowspcguy.
Microsoft Virtual Academy
From Source to Production: The Latest in Container Dev
Make Web Not War /Web Say(Hello); to the Microsoft Web Platform
Using the Microsoft AI Platform for next generation applications
Microsoft Connect /25/2019 1:20 PM
Building and running HPC apps in Windows Azure
Delivering great hardware solutions for Windows
LearnAI - Machine Learning on Azure
TechEd /23/2019 9:23 AM © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.
OpenStack Summit Berlin – November 14, 2018
Mark Quirk Head of Technology Developer & Platform Group
A DevOps process for deploying R to production
Deploying machine learning models at scale
Productive + Hybrid + Intelligent + Trusted
Containers on Azure Peter Lasne Sr. Software Development Engineer
SQL Server 2019 Bringing Apache Spark to SQL Server
Presentation transcript:

DL (Deep Learning) Workspace Turn-Key Cluster for DL Training, Exploration, Inferencing, etc.. Hongzhi Li, Jin Li, Sanjeev Mehrotra

DL Workspace: Shared Computing Cluster Training Data Analytics Interactive Exploration WebUI/Restful API Serving DL Workspace DL Workspace DL Workspace Individual machine Group Server Shared Cluster

DL workspace Turn-key cluster environment No-installation required Support Nvidia GPU/CPU Support popular DL toolkit: Caffe, CNTK, TensorFlow, MxNet, etc.. Support single machine and/or multi machine training Scenario Dev box (interactive exploration) for the group members Follow/repeat/extend DL experiments (ToDo) DL training Data analytics Inferencing/serving

Key Building Blocks

DL Workspace: Architecture (Modularly built) Interactive Workload Training Data Analytics Inferencing/Serving Filesystem Plug-in Container Orchestration + Pluggable Device Driver Nvidia GPU/Infiniband (FPGA, other GPU, etc..) Base OS or CPU Hardware FPGA GPU

Docker: Containerized Microservice Why docker Run anywhere (laptop, desktop, Azure, etc..) Streamline development & testing Lightweight (only the necessary processor in docker) Match well with Microservices Architecture Alternative:

Why Docker in DL workspace It contains everything you need to quickly start E.g., tensorflow docker contains: All binaries package used by tensorflow (e.g., curl, libpng, libzmq, zip, etc..) Python (with pip, jypyer, numpy, scipy, sklean, etc..) [ Optional, for –devel] Source code and build tools (bazel, etc..) Proper cuda libraries Different DL toolkit (of different version) may use different and conflict libraries (e.g., cudnn? Version) Docker nicely encapsulate everything needed by a workload (avoid dll hell)

How to use docker in DL Workspace: Public docker [quickest route]: Most major DL toolkit today (e.g., TensorFlow, CNTK, Caffe, MxNet) has publicly released docker that is directly useable in DL workspace Customized docker (e.g., TensorFlow with XLA support) Most major DL toolkit today (e.g., TensorFlow, CNTK, Caffe, MxNet) has released source Dockerfile to build their docker You can start with them, and to customize the build (e.g., to try out new/customized feature)

Kubernetes: Cluster Scheduling & Orchestration Why Top projects on github Significant Slack and Stack Overflow community [From our own experience] pretty stable platform, good code base quality, extensive unit test/stress test in code Alternative: CoreOS fleet, Swarm, DC/OS

How Kubernete Works?

What is a Kubernete Pod? A group of one or more containers and shared storage E.g., a distributed training pod can contain: A parameter server Multiple workers

Demo & Q/A

Sample User Case: Interactive Exploration

Backup

Targeted opportunity associated with potential AI use cases 5/15/2018 10:54 PM IDC Cognitive / AI software and services forecast Nov 2016, excludes Hardware and unclassified spend Our approach Include Software and Services spend as those seem relevant for a solutions approach Filter by: Overall forecasted opportunity size in 2020 Growth in opportunity size from 2016-2020  looking for $1B+ growth Include adjacent industries within total opportunity Use Case Sector / Industries 2016 2020 Growth, 2016-20 CAGR, 2016-20 1. Diagnosis and Treatment Systems Healthcare Providers $0.7B $6.2B +$5.4B +71% 2. Quality Management Investigation & Recommendation Systems Manufacturing $0.8B $5.4B +$4.6B +64% 3. Automated Customer Service Agents Retail $4.3B +$3.5B +51% 4. Fraud Analysis & Investigation Financial Services $3.7B +$2.9B +49% 5. Program Advisors & Recommendation Systems $0.4B $2.9B +$2.4B +60% 6. Automated Threat Intelligence & Prevention Systems $1.8B +$1.4B +42% 7. Merchandising for Omni Channel Operations $0.3B $1.4B +$1.1B +50% 8. Sales Process Recommendation & Automation Cross-Industry $0.2B $1.2B +$1.0B +57% © 2014 Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

AI Startup (China)