Open Source Toolkit for Turn-Key AI Cluster (Introduction)

Presentation transcript:

DL Workspace: This video introduces DL Workspace, an open source toolkit for turn-key AI cluster setup and operation.

DL Workspace is an open source toolkit for turn-key AI cluster setup. It is used for daily development and production work by Microsoft internal groups (e.g., Microsoft Cognitive Services, SwiftKey, Bing Relevance). It allows AI scientists to run both interactive and batch jobs (interactive exploration, training, inferencing, data analytics) on resources managed by the cluster. Operation is turn-key: software setup and cluster configuration are automatic. It provides out-of-box support for all major DL toolkits (TensorFlow, CNTK, Caffe, MXNet, etc.) and for big data analytics (Hadoop/Spark).

Workflow: The rest of the video explains how to launch a Spark job.

Authentication via OpenID: First, log in with your preferred provider through OpenID.

Once logged in, click “Submit New Job”.

Select a job template and make any optional adjustments.

Submit the job: Click the “Submit” button to schedule the job for execution.

Click “View and Manage Jobs”, then select the appropriate job ID to monitor the job you have just submitted.

You may need to wait from a few seconds (for a previously executed job) to several minutes (for a new job) for the job container to be scheduled, downloaded and launched.
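The waiting step can be sketched as a polling loop with increasing delays. This is only an illustration: the presentation does not show a DL Workspace API, so the `get_status` callable and the state names "queued" and "scheduling" are hypothetical placeholders for however job status is actually obtained.

```python
import time
from typing import Callable


def wait_until_running(
    get_status: Callable[[], str],
    timeout_s: float = 600.0,
    initial_delay_s: float = 2.0,
    max_delay_s: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll get_status() until the job leaves a waiting state.

    get_status is a hypothetical callable supplied by the caller; it is
    not part of any documented DL Workspace interface.
    """
    waiting_states = {"queued", "scheduling"}  # assumed state names
    waited = 0.0  # sum of requested sleep time, not wall-clock time
    delay = initial_delay_s
    while True:
        status = get_status()
        if status not in waiting_states:
            return status
        if waited >= timeout_s:
            raise TimeoutError(f"job still {status!r} after {waited:.0f}s")
        sleep(delay)
        waited += delay
        # Double the delay each round, capped at max_delay_s.
        delay = min(delay * 2, max_delay_s)
```

Starting with a short delay suits the few-second case, while the capped doubling avoids excessive polling when a new container takes minutes to download.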

Wait for Mapped Endpoints to appear.

Run the ssh command shown in the mapped endpoints to connect to the container, then go to the Spark directory.

You may then execute a Spark job against the DL Workspace backend.
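The presentation does not show the exact Spark command, so the following is a hedged sketch of how such a command line might be assembled with Spark's standard `spark-submit` launcher. The application file `my_job.py` and its argument are hypothetical, and using `yarn` as the master is an assumption based on the YARN portal mentioned in the transcript.

```python
import shlex
from typing import List, Sequence


def build_spark_submit(
    app: str,
    app_args: Sequence[str] = (),
    master: str = "yarn",          # assumption: the cluster runs Spark on YARN
    deploy_mode: str = "client",
    main_class: str = "",
) -> List[str]:
    """Assemble a spark-submit argument list without running it."""
    cmd = ["spark-submit", "--master", master, "--deploy-mode", deploy_mode]
    if main_class:
        # --class is only needed for JVM applications packaged as a jar.
        cmd += ["--class", main_class]
    cmd.append(app)
    cmd.extend(app_args)
    return cmd


# Hypothetical application file and argument, for illustration only.
command = build_spark_submit("my_job.py", ["100"])
print(shlex.join(command))
```

Inside the job container, the resulting list could be passed to `subprocess.run(command, check=True)` from the Spark directory.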

Here is the log and output of the executed Spark job. You can monitor executed Spark jobs through the standard YARN portal.

DL Workspace provides turn-key setup for AI clusters, in the public cloud or on-prem. It allows AI scientists to run jobs (interactive exploration, training, inferencing, data analytics) in a fully set up software and hardware environment, so they can jump directly to work. Using DL Workspace, AI scientists can easily collaborate, share job setups, and maximize productivity.