Advancements in Analytics with Azure Machine Learning James Wang Technical Evangelist Microsoft Taiwan Slide modified from https://github.com/Azure-Readiness/hol-azure-machine-learning.

Slides:



Advertisements
Similar presentations
Microsoft Azure ML Franck Mercier Architecte Solutions | DX | Microsoft
Advertisements

LLNL-PRES This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
1 © Goharian & Grossman 2003 Introduction to Data Mining (CS 422) Fall 2010.
Delivering on one of the old dreams of Microsoft co-founder Bill Gates: Computers that can see, hear and understand. John Platt Distinguished scientist.
B.Ramamurthy. Data Analytics (Data Science) EDA Data Intuition/ understand ing Big-data analytics StatsAlgs Discoveries / intelligence Statistical Inference.
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
Mehdi Ghayoumi Kent State University Computer Science Department Summer 2015 Exposition on Cyber Infrastructure and Big Data.
Anomaly detection with Bayesian networks Website: John Sandiford.
Knowledge Discovery and Data Mining Evgueni Smirnov.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
SOCIAL NETWORKS ANALYSIS SEMINAR INTRODUCTORY LECTURE #2 Danny Hendler and Yehonatan Cohen Advanced Topics in on-line Social Networks Analysis.
Machine Learning Tutorial Amit Gruber The Hebrew University of Jerusalem.
An Ensemble of Three Classifiers for KDD Cup 2009: Expanded Linear Model, Heterogeneous Boosting, and Selective Naive Bayes Members: Hung-Yi Lo, Kai-Wei.
Azure Machine Learning: From design to integration Peter Myers M355.
6.S093 Visual Recognition through Machine Learning Competition Image by kirkh.deviantart.com Joseph Lim and Aditya Khosla Acknowledgment: Many slides from.
Azure Machine Learning Introduction to Azure ML. Setting Expectations This presentation is for you if…  you hear the buzzword “Machine Learning” and.
Introduction to Azure Machine Learning and Data Mining algorithms Oleksandr Krakovetskyi CEO, DevRain Solutions PhD, Microsoft Regional
The United States Postal Service processed over 150 billion pieces of mail in 2013—far too much for efficient human sorting. But as recently as 1997,
Mustafa Gokce Baydogan, George Runger and Eugene Tuv INFORMS Annual Meeting 2011, Charlotte A Bag-of-Features Framework for Time Series Classification.
Cloud Analytics Platforms Christian Frey. About AIDA Our mission is to advance knowledge in data analytics through research, education and outreach Our.
Introducing Precictive Analytics
Oracle Advanced Analytics
Bhakthi Liyanage SQL Saturday Atlanta 15 July 2017
Energy Demand Forecasting
WPC047 Data ON THE ROAD: the Azure part
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Machine Learning with Spark MLlib
Azure Machine Learning
Machine Learning Models
Data Platform and Analytics Foundational Training
Microsoft Azure Machine Learning
Connected Living Connected Living What to look for Architecture
Azure Machine Learning
Microsoft Azure Machine Learning partner training
Machine Learning overview Chapter 18, 21
Make Predictions Using Azure Machine Learning Studio
A Personal Tour of Machine Learning and Its Applications
School of Computer Science & Engineering
Energy Demand Forecasting
Kathi Kellenberger Redgate
Connected Living Connected Living What to look for Architecture
Introduction to R Programming with AzureML
Azure ML and Cognitive Services
Neural Networks for Machine Learning Lecture 1e Three types of learning Geoffrey Hinton with Nitish Srivastava Kevin Swersky.
COMP61011 : Machine Learning Ensemble Models
Kathi Kellenberger Redgate
Kathi Kellenberger Redgate
Kathi Kellenberger Redgate Software
Vincent Granville, Ph.D. Co-Founder, DSC
Introduction to Azure Machine Learning Studio
Machine Learning & Data Science
Machine Learning Week 1.
Advanced Analytics. Advanced Analytics What is Machine Learning?
Prepared by: Mahmoud Rafeek Al-Farra
Machine Learning with Weka
Alain Goossens & Jean-Pierre Van Loo Data scientists – SII Belgium
Microsoft Virtual Academy
Overview of Machine Learning
Supervised vs. unsupervised Learning
Classification and Prediction
Lecture 6: Introduction to Machine Learning
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
What is this and how can I use it?
What is this and how can I use it?
CSE 491/891 Lecture 25 (Mahout).
Getting Started Using Azure ML
Basics of ML Rohan Suri.
Azure Machine Learning
Getting Started with Microsoft Azure Machine Learning
Presentation transcript:

Advancements in Analytics with Azure Machine Learning James Wang Technical Evangelist Microsoft Taiwan Slide modified from

Agenda What is Machine Learning What is Azure Machine Leaning (Hands-on) Azure Machine Learning Studio

What is Machine Learning ?

What is Machine Learning ? Using known data, develop a model to predict unknown data.

What is Machine Learning ? Using known data, develop a model to predict unknown data. Known Data: Big enough archive, previous observations, past data Unknown Data: Missing, Unseen, not existing, future data Model: Known data + ML Algorithms

From Learning to Machine Learning Human learning Machine learning observationlearningskilldataMLskill

From Learning to Machine Learning Machine learning What is skill for machine? Improving performance measure e.g. prediction accuracy dataMLskill

Skill

Microsoft & Machine Learning Bing maps launches What’s the best way to home? Kinect launches What does that motion “mean”? Azure Machine Learning GA What will happen next? Hotmail launches Which is junk? Bing search launches Which searches are most relevant? Skype Translator launches What is that person saying?

EXAMPLE

Classify a news article as (politics, sports, technology, health, …) Politics SportsTechHealth Model (Classification) Using known data, develop a model to predict unknown data.

Known data (Training data) Using known data, develop a model to predict unknown data. DocumentsLabels Tech Health Politics Sports Documents consist of unstructured text. Machine learning typically assumes a more structured format of examples Process the raw data

Known data (Training data) Using known data, develop a model to predict unknown data. Labels Documents Feature DocumentsLabels Tech Health Politics Sports Process each data instance to represent it as a feature vector Label Features Feature Vector

Feature vector Known data Data instance i.e. TF-IDF of key-words {0.40, 0.18, 0.08, 0.11, 0.7, 0.7, …..} : Health

Developing a Model Using known data, develop a model to predict unknown data. DocumentsLabels Tech Health Politics Sports Training data Train the Model Feature Vectors Base Model Adjust Parameters

Machine Learning Algorithms Supervised learning ( 監督式學習 ) Data with label This customer will like coffee This network traffic indicates a denial of service attack Unsupervised learning ( 非監督式學習 ) Data without label These customers are similar This network traffic is unusual

Common Classes of Algorithms Classification 分類 Regression 迴歸 Clustering 分群

What kind of algorithm you need? If you want to answer a YES|NO question, it is binary-classification If you want to answer a label question, it is multi-class-classification If you want to predict a numerical value, it is regression If you want to group data into similar observations, it is clustering

Classification 分類 Scenarios:  Which customer are more likely to buy, stay,  Which transactions|actions are fraudulent  Which quotes are more likely to become orders  Recognition of patterns: speech, speaker, image, movement, etc. Algorithms: Boosted Decision Tree, Decision Forest, Decision Jungle, Logistic Regression, SVM, ANN, etc. Classification

Clustering 分群 Scenarios: 所有分群問題 Algorithms: K-means Clustering

Regression 迴歸 Scenarios:  Stock prices prediction  Sales forecasts  Premiums on insurance based on different factors  Quality control: number of complaints over time based on product specs, utilization, etc.  Workforce prediction  Workload prediction Algorithms: Bayesian Linear, Linear Regression, Ordinal Regression, ANN, Boosted Decision Tree, Decision Forest Regression

What kind of algorithm you need? If you want to answer a YES|NO question, it is binary-classification ( 分類 ) If you want to answer a label question, it is multi-class-classification ( 分類 ) If you want to predict a numerical value, it is regression ( 迴歸分析 ) If you want to group data into similar observations, it is clustering ( 分群 )

Binary versus Multiclass Classification Does your customer want a yes|no answer? Binary examples click prediction yes|no over|under win|loss Multiclass examples kind of tree kind of network attack type of heart disease

Regression versus Classification Does your customer want to predict|estimate a number (regression) or apply a label|categorize (classification)? Regression problems Estimate household power consumption Estimate customer’s income Classification problems Power station will|will not meet demand Customer will respond to advertising

Which algorithm performs better?

Model’s Performance Known data with true labels Tech Health Politics Sports Tech Health Politics Sports Tech Health Politics Sports Training data 80% Testing data 20% Model’s Performance Difference between “True Labels” and “Predicted Labels” True labels Tech Health Politics Sports Predicted labels Train the Model Split Detach Test trained model with features Compare prediction with true labels +/-

Azure Machine Learning

Microsoft Azure Machine Learning “Make machine learning accessible to every enterprise, data scientist, developer, information worker, consumer, and device anywhere in the world.” Data Science is far too complex, need a simpler/scalable method. Problem definition Data storage Feature selection Model evaluation Deploy to application

Blobs and Tables Hadoop (HDInsight) Relational DB (Azure SQL DB) Data Clients Model is now a web service that is callable Monetize the API through our marketplace API Integrated development environment for Machine Learning ML STUDIO

Azure Machine Learning Studio

Load a Data Set Add Transformations and Filters Create the Experiment Path and apply Algorithms Save and Run the Model Publish the model Use the model Microsoft Azure Machine Learning

Thanks for your attention