Predicting Azure Consumption using Ensemble Learning

Slides:



Advertisements
Similar presentations
MIX 09 4/15/ :14 PM © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
Advertisements

Session 1.
Feature: Customer Combiner and Modifier © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are.
Windows Azure SQL Data Sync Name Title Microsoft Corporation.
Microsoft Teams Behind the Scenes – Q&A
Azure Machine Learning Deploying and Managing Models in production
The story of an IoT solution
Use any Amazon S3 application with Azure Blob Storage
6/5/2018 1:30 PM THR1029 Spend less time managing data and more time with customers: Quick tour of Outlook Customer Manager Welly Lee
Azure Cloud Shell Magic of Modern Command-line Management
AI development using Data Science Virtual Machines (DSVM) in Azure
6/19/2018 2:57 AM THR3092 Monitor and investigate actions on your user and data with alerts, insights and reports Binyan Chen Program Manager II, Office.
Performing a Seamless Migration in Azure SQL DB
What a Real, Functioning DevOps Team Looks Like
Azure Machine Learning Algorithm Accuracy Enhancement, Tips and Tricks
7/22/2018 9:21 PM BRK3270 Building a Better Data Solution: Microsoft SQL Server and Azure Data Services Joey D’Antoni Principal Consultant Denny Cherry.
SQL Server on Linux on All-Flash Arrays
8/6/ :17 AM THR2214 Hybrid Cloud Activated A customer case study optimizing on-premises & Azure performance and cost Mor Cohen-Tal Senior Product.
Understanding Windows Analytics Update Compliance
Excel and Power BI Better Together Democratization of data
Expert-level Windows 10 deployment
Workflow Orchestration with Adobe I/O
How we got a traditional bank collaborating across boundaries
Find, try and get line-of-business apps on Microsoft AppSource
Automate all things! Microsoft Azure continuous deployment
9/14/ :46 AM BRK3293 How the Portland Trail Blazers Use Personalization and Acxiom Data to Target Customers Chris Hoder Program Manager, AI + Research.
Agile Planning with Visual Studio Team Services (VSTS)
Servicing Windows 10 in the Real World
9/22/2018 3:49 AM BRK2247 Learn from MVPs: Panel discussion on all things SharePoint and OneDrive © Microsoft Corporation. All rights reserved. MICROSOFT.
Confidence at speed: Visual Studio 2017 and your CI pipeline
Azure PowerShell Aaron Roney Senior Program Manager Cormac McCarthy
Continuous Delivery for Microsoft Azure
11/22/2018 1:43 PM THR3005 How to provide business insight from your data using Azure Analysis Services Peter Myers Bitwise Solutions © Microsoft Corporation.
Continuous Delivery with Visual Studio Team Services
Azure Advisor: Optimization in the best way
PowerShell Unplugged Jeffrey Snover Technical Fellow
Mobile Center and VSTS:​ Better together for your Mobile DevOps
Title of Presentation 12/2/2018 3:48 PM
Microsoft products for non-profits
Introduction to ASP.NET Core 1.0
Five cool things you can do with Windows PowerShell on Office 365
Microsoft To-Do Preview
Securely pass passwords into your deployment
Microsoft Exchange: Through the eyes of MVPs (Panel discussion)
MDM Migration Analysis Tool (MMAT)
Overview: Dynamics 365 for Project Service Automation
Virtual Reality with Azure and Unity
Understand your Azure cloud assets dependencies with BMC Discovery
Surviving identity management in a hybrid world
Sami Laiho AMA - Ask Me Anything
Breaking Down the Value of A Yammer Post: 20 Things to Do
8/04/2019 9:13 PM © 2006 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
Cool Microsoft Edge Tips and Tricks
When Bad Things Happen to Good Applications
Getting the most out of Azure resources with Azure Advisor
Manage your App Service resources using Command line tools
“Hey Mom, I’ll Fix Your Computer”
4/21/2019 7:09 AM THR2098 Unlock New Opportunities with Nintex Hawkeye Process Intelligence and Workflow Analytics Sr. Product.
4/28/2019 3:30 AM THR1061 Learn how Dynamics 365, Office 365 and related applications work together to transform the workplace Donna Edwards Solution Architect.
WINDOWS AZURE A LAP AROUND PLATFORM THE Steve Marx
Designing Bots that Fit Your Organization
Ask the Experts: Windows 10 deployment and servicing
Passwordless Service Accounts
Azure Networking inside and out
Digital Transformation: Putting the Jigsaw Together
WCF and .NET Framework Microservices in Containers
Diagnostics and troubleshooting in Azure App Service Support Center
Optimizing your content for search and discovery
Title of Presentation 5/24/2019 1:26 PM
Presentation transcript:

Predicting Azure Consumption using Ensemble Learning 5/20/2018 1:18 PM BRK2289 Predicting Azure Consumption using Ensemble Learning Siddharth Kumar Senior Data Scientist Manager Customer Growth and Analytics Team © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Session Overview Introduction Model Workflow Data Preparation 5/20/2018 1:18 PM Session Overview Introduction Model Workflow Data Preparation Feature Engineering Model Building Model Validation Model Deployment Learnings © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

5/20/2018 1:18 PM Context The objective was to build a machine learning solution to predict azure customers spend in the next 6 months Session Goals Understanding of data science challenges faced in real world How to solve a regression problem overcoming those challenges Tricks to improve the model performance using deep learning Develop intelligent solution using ML / Deep Learning on Microsoft AI Platform Scalable way to deploy ML models using Microsoft HDInsight clusters © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Model workflow Feature Engineering Data collection Model Validation 5/20/2018 1:18 PM Model workflow Data Preparation Data Pre processing Transformations Data collection Data source Identification Initial Dataset creation Feature Engineering Feature extraction Feature transformation Feature selection Model Building Model Architecture Model Stacking Model Validation Picking the right accuracy measures Model Deployment Using Microsoft Azure © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Data Preparation Population Selection Data Cleansing 5/20/2018 1:18 PM Data Preparation Identify the right sample based on business requirements Population Selection Noise/Outlier treatment Missing values Data Cleansing Normalization / Log transformation Aggregation and Encoding Data Transformation Constant values/ Zero variance Highly correlated Data Reduction © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Data Selection and Transformation 5/20/2018 1:18 PM Data Selection and Transformation Log Appropriate transformation of response variable can improve performance of model Transformation of predictors also helps for some algorithms The above transformation helped us in identifying the right population to be model © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Model workflow Feature Engineering Data collection Model Validation 5/20/2018 1:18 PM Model workflow Data Preparation Data Pre processing Transformations Data collection Data source Identification Initial Dataset creation Feature Engineering Feature extraction Feature transformation Feature selection Model Building Model Architecture Model Stacking Model Validation Picking the right accuracy measures Model Deployment Using Microsoft Azure © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Hypotheses based feature generation 5/20/2018 1:18 PM Hypotheses based feature generation Prior spend Customers with higher historic run rate (spending/rate) are more likely to spend more on Azure Prior Spend Customers subscribed to higher tier offers are likely to spend more on Azure What offer they subscribed to.. Customers from developed countries are likely to have higher spend Customers from tech dominant states/regions are more likely to spend more What country they belong to.. Customer associated with professional services, tech industry are more likely to spend more on Azure Which industry they belong to.. Offering Customer Spend Geography Industry © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Using Deep Learning for Feature Engineering 5/20/2018 1:18 PM Using Deep Learning for Feature Engineering Deep Features Autoencoders Input layer Hidden layers Output layer Input layer Encoded layer - Autoencoders are helpful in feature representations It is also used for feature reduction (dimensionality reduction) For this project, autoencoders lead to minor lift in accuracy (1%) - The hidden layers were used to generate non-linear features - The deep features led to 10% improvement in Mean Absolute Error (MAE) © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Model workflow Feature Engineering Data collection Model Validation 5/20/2018 1:18 PM Model workflow Data Preparation Data Pre processing Transformations Data collection Data source Identification Initial Dataset creation Feature Engineering Feature extraction Feature transformation Feature selection Model Building Model Architecture Model Stacking Model Validation Picking the right accuracy measures Model Deployment Using Microsoft Azure © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Model Architecture Gradient Boosting Machine Distributed Random Forest 5/20/2018 1:18 PM Gradient Boosting Machine GBM4 GBM2 GBM1 GBM1 GBM3 GBM5 Distributed Random Forest DRF4 DRF5 DRF1 DRF2 DRF3 Generalized Linear Model Generalized Linear Models Analytical dataset GLM4 GLM1 GLM2 GLM3 GLM5 Deep Neural Nets (ANN) DL4 DL1 DL2 DL3 DL5 Base learners Super learner © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Model workflow Feature Engineering Data collection Model Validation 5/20/2018 1:18 PM Model workflow Data Preparation Data Pre processing Transformations Data collection Data source Identification Initial Dataset creation Feature Engineering Feature extraction Feature transformation Feature selection Model Building Model Architecture Model Stacking Model Validation Picking the right accuracy measures Model Deployment Using Microsoft Azure © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Model Validation Picking the right performance metric is important 5/20/2018 1:18 PM Model Validation Picking the right performance metric is important Metric Formulae Pro Cons R^2 % the variance in response explained by predictors Can be misleading MAE Outlier resistant No sense of direction RMSE General purpose Metric Outlier sensitive Decision should be driven by the specific business use case © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Model Validation: our case 5/20/2018 1:18 PM Model Validation: our case Over Prediction Under Prediction Model Stats: R^2: 0.82 RMSE: 0.5963707 MAE: 0.247357 Spike at 100% indicates cases where the model failed to predict spend but there was some spend © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Model workflow Feature Engineering Data collection Model Validation 5/20/2018 1:18 PM Model workflow Data Preparation Data Pre processing Transformations Data collection Data source Identification Initial Dataset creation Feature Engineering Feature extraction Feature transformation Feature selection Model Building Model Architecture Model Stacking Model Validation Picking the right accuracy measures Model Deployment Using Microsoft Azure © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Scalable Model Deployment using Microsoft Azure 5/20/2018 1:18 PM Scalable Model Deployment using Microsoft Azure Data collection Data Preparation Feature Engineering Model Building Output Endpoint dashboards SQL Database SQL Database Storage blob Storage blob Microsoft PowerBI SQL Database *The pipeline was setup using Azure Data Factory © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Summary of Learnings 01 04 Select evaluation metric based on use case 5/20/2018 1:18 PM Summary of Learnings 01 Select evaluation metric based on use case Preprocessing is important Lorem Ipsum Model Evaluation Data preparation Spending time on Feature engineering leads to higher rewards Don’t use grid search for large datasets Model Tuning Feature Engineering Uncorrelated models make stacking useful Stacking Deep Learning Deep learning for feature engineering 04 Lorem Ipsum Learn many models, not just one Crowdsourcing Benchmark Establish a benchmark model © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Please evaluate this session Tech Ready 15 5/20/2018 Please evaluate this session From your Please expand notes window at bottom of slide and read. Then Delete this text box. PC or tablet: visit MyIgnite https://myignite.microsoft.com/evaluations Phone: download and use the Microsoft Ignite mobile app https://aka.ms/ignite.mobileapp Your input is important! © 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.