Joseph E. Gonzalez Assistant UC Berkeley Dato Inc. Intelligent Services Serving Machine.

Slides:



Advertisements
Similar presentations
Recommender Systems & Collaborative Filtering
Advertisements

Ali Ghodsi UC Berkeley & KTH & SICS
Book Recommender System Guided By: Prof. Ellis Horowitz Kaijian Xu Group 3 Ameet Nanda Bhaskar Upadhyay Bhavana Parekh.
Imbalanced data David Kauchak CS 451 – Fall 2013.
1 NETE4631 Cloud deployment models and migration Lecture Notes #4.
Turning Data into Value Ion Stoica CEO, Databricks (also, UC Berkeley and Conviva) UC BERKELEY.
Berkeley Data Analytics Stack (BDAS) Overview Ion Stoica UC Berkeley UC BERKELEY.
Overview of Search Engines
Outline | Motivation| Design | Results| Status| Future
Clearstorydata.com Using Spark and Shark for Fast Cycle Analysis on Diverse Data Vaibhav Nivargi.
URLDoc: Learning to Detect Malicious URLs using Online Logistic Regression Presented by : Mohammed Nazim Feroz 11/26/2013.
USING HADOOP & HBASE TO BUILD CONTENT RELEVANCE & PERSONALIZATION Tools to build your big data application Ameya Kanitkar.
Tyson Condie.
Practical Machine Learning Pipelines with MLlib Joseph K. Bradley March 18, 2015 Spark Summit East 2015.
Search Engines and Information Retrieval Chapter 1.
1 A User-Guided Cognitive Agent for Wireless Service Selection in Pervasive Computing George Lee May 5, 2004 G. Lee, P. Faratin, S. Bauer, and J. Wroclawski.
M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.
Streaming Predictions of User Behavior in Real- Time Ethan DereszynskiEthan Dereszynski (Webtrends) Eric ButlerEric Butler (Cedexis) OSCON 2014.
What are the main differences and commonalities between the IS and DA systems? How information is transferred between tasks: (i) IS it may be often achieved.
 Text Representation & Text Classification for Intelligent Information Retrieval Ning Yu School of Library and Information Science Indiana University.
Internet Information Retrieval Sun Wu. Course Goal To learn the basic concepts and techniques of internet search engines –How to use and evaluate search.
1 CS 294: Big Data System Research: Trends and Challenges Fall 2015 (MW 9:30-11:00, 310 Soda Hall) Ion Stoica and Ali Ghodsi (
Introduction to SQL Server Data Mining Nick Ward SQL Server & BI Product Specialist Microsoft Australia Nick Ward SQL Server & BI Product Specialist Microsoft.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
BAGGING ALGORITHM, ONLINE BOOSTING AND VISION Se – Hoon Park.
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
Security Analytics Thrust Anthony D. Joseph (UCB) Rachel Greenstadt (Drexel), Ling Huang (Intel), Dawn Song (UCB), Doug Tygar (UCB)
Resilient Distributed Datasets: A Fault- Tolerant Abstraction for In-Memory Cluster Computing Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave,
Matthew Winter and Ned Shawa
Matei Zaharia, in collaboration with Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Haoyuan Li, Justin Ma, Murphy McCauley, Joshua Rosen, Reynold Xin,
The Three E’s of Big Data and What DB People can do About Them UC BERKELEY Michael Franklin – UC Berkeley Beckman Database Get Together October 14, 2013.
IntroGoalCrowdPredictionWrap-up 1/26 Learning Debugging and Hacking the User Remco Chang Assistant Professor Tufts University.
Dato Confidential 1 Danny Bickson Co-Founder. Dato Confidential 2 Successful apps in 2015 must be intelligent Machine learning key to next-gen apps Recommenders.
Fraud Detection with Machine Learning: A Case Study from Sift Science
Data Summit 2016 H104: Building Hadoop Applications Abhik Roy Database Technologies - Experian LinkedIn Profile:
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
IncApprox The marriage of incremental and approximate computing Pramod Bhatotia Dhanya Krishnan, Do Le Quoc, Christof Fetzer, Rodrigo Rodrigues* (TU Dresden.
Introduction to Machine Learning, its potential usage in network area,
9/24/2017 7:27 AM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Intelligent Services Serving Machine Learning
Image taken from: slideshare
Big Data Analytics and HPC Platforms
Dan Crankshaw UCB RISE Lab Seminar 10/3/2015
Machine Learning with Spark MLlib
Recommender Systems & Collaborative Filtering
Katie Goldstein Christopher Bilz Brian Northern Michael Yang
Designing a Scalable Data Cleaning Infrastructure
Prediction Serving Joseph E. Gonzalez Asst. Professor, UC Berkeley
Machine Learning overview Chapter 18, 21
Machine Learning overview Chapter 18, 21
R SE to the challenges of ntelligent systems
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
Introduction to Spark.
Spark Software Stack Inf-2202 Concurrent and Data-Intensive Programming Fall 2016 Lars Ailo Bongo
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Intro to Machine Learning
Layer-wise Performance Bottleneck Analysis of Deep Neural Networks
CMPT 733, SPRING 2016 Jiannan Wang
Team 2: Graham Leech, Austin Woods, Cory Smith, Brent Niemerski
Parallel Analytic Systems
iSRD Spam Review Detection with Imbalanced Data Distributions
Approaching an ML Problem
Where Intelligence Lives & Intelligence Management
Integrating Deep Learning with Cyber Forensics
CSE 491/891 Lecture 25 (Mahout).
CMPT 733, SPRING 2017 Jiannan Wang
The Student’s Guide to Apache Spark
Azure Machine Learning
Modeling IDS using hybrid intelligent systems
Presentation transcript:

Joseph E. Gonzalez Assistant UC Berkeley Dato Inc. Intelligent Services Serving Machine Learning

Contemporary Learning Systems Training DataModels Big

Contemporary Learning Systems MLlib Create MLC LIBSVM VW Oryx 2 BIDMach

Training DataModel What happens after we train a model? Dashboards and Reports Conference Papers Drive Actions

Training DataModel What happens after we train a model? Dashboards and Reports Conference Papers Drive Actions

Suggesting Items at Checkout Fraud Detection Cognitive Assistance Internet of Things Low-LatencyPersonalizedRapidly Changing

Train DataModel

Train DataModel Actions Serve Adapt

9 Machine Learning Intelligent Services

The Life of a Query in an Intelligent Service Web Serving Tier User Product Info Intelligent Service User Data Model Info Lookup Model Feature Lookup Feature Lookup Top-K Query Request: Items like x New Page Images … Top Items Content Request Feedback: Preferred Item Feedback μ σ ρ ∑ ∫ α β math

Essential Attributes of Intelligent Services Responsi ve Intelligent applications are interactive Adaptiv e ML models out-of-date the moment learning is done Manageabl e Many models created by multiple people

Responsive : Now and Always Compute predictions in < 20ms for complex under heavy query load with system failures. ModelsQueries Top K Features SELECT * FROM users JOIN items, click_logs, pages WHERE …

Experiment: End-to-end Latency in Spark MLlib To JSON HTTP Req.Feature Trans. Evaluate Model Encode PredictionHTTP Response 4

Count out of 1000 Latency measured in milliseconds NOP (Avg = 5.5, P99 = 20.6) Single Logistic Regression (Avg = 21.8, P99 = 38.6) Decision Tree (Avg = 22.4, P99 = 63.8) One-vs-all LR (10-class) (Avg = 137.7, P99 = 217.7) 100 Tree Random Forrest (Avg = 50.5, P99 = 73.4) 500 Tree Random Forrest (Avg = , P99 = 268.7) 500 Tree Random Forrest (Avg = 172.6, P99 = 268.7) AlexNet CNN (Avg = 418.7, P99 = 549.8) End-to-end Latency for Digits Classification 784 dimension input Served using MLlib and Dato Inc.

Adaptive to Change at All Scales Months Rate of Change Minutes Population Granularity of Data Session Shopping for Mom Shopping for Me

Adaptive to Change at All Scales New users and products Evolving interests and trends Months Rate of Change Minutes Population Granularity of Data Session Shopping for Mom Shopping for Me Population Law of Large Numbers  Change Slow Rely on efficient offline retraining  High-throughput Systems Months

Adaptive to Change at All Scales Months Rate of Change Minutes Population Granularity of Data Session Small Data  Rapidly Changing Low Latency  Online Learning Sensitive to feedback bias Shopping for Mom Shopping for Me

The Feedback Loop I once looked at cameras on Amazon … My Amazon Homepage Similar cameras and accessories Opportunity for Bandit Algorithms Bandits present new challenges: computation overhead complicates caching + indexing

Exploration / Exploitation Tradeoff Systems that can take actions can adversely bias future data. Opportunity for Bandits ! Bandits present new challenges: Complicates caching + indexing tuning + counterfactual reasoning

Management : Collaborative Development Teams of data-scientists working on similar tasks  “competing” features and models Complex model dependencies: Cat Photo isCat Cuteness Predictor Cat Classifier Animal Classifier Cute! isAnimal

Predictive Services UC Berkeley AMPLab Daniel Crankshaw, Xin Wang, Joseph Gonzalez Peter Bailis, Haoyuan, Zhao Zhang, Michael J. Franklin, Ali Ghodsi, and Michael I. Jordan

Predictive Services UC Berkeley AMPLab Daniel Crankshaw, Xin Wang, Joseph Gonzalez Peter Bailis, Haoyuan, Zhao Zhang, Michael J. Franklin, Ali Ghodsi, and Michael I. Jordan Active Research Project

Velox Model Serving System Focuses on the multi-task learning (MTL) domain [CIDR’15, LearningSys’15] Spam Classification Content Rec. Scoring Session 1: Session 2: Localized Anomaly Detection

Velox Model Serving System Personalized Models (Mulit-task Learning) [CIDR’15, LearningSys’15] Input Output “Separate” model for each user/context.

Personalized Models (Mulit-task Learning) Split Personalization Model Feature Model Velox Model Serving System [CIDR’15, LearningSys’15]

Hybrid Offline + Online Learning Split Personalization Model Feature Model Update the user weights online: Simple to train + more robust model Address rapidly changing user statistics Update feature functions offline using batch solvers Leverage high-throughput systems (Apache Spark) Exploit slow change in population statistics

Hybrid Online + Offline Learning Results Similar Test ErrorSubstantially Faster Training User Pref. Change Hybrid Offline Full Hybrid Offline Full

Evaluating the Model Split Cache Feature Evaluation Input

Evaluating the Model Split Cache Feature Evaluation Input Feature Caching Across Users Anytime Feature Evaluation Approximate Feature Hashing

Feature Caching Feature Hash Table Hash input: Compute feature: New input:

LSH Cache Coarsening Hash new input: Use Wrong Value!  LSH hash fn. New input Feature Hash Table

LSH Cache Coarsening Feature Hash Table Hash new input: Use Value Anyways!  Req. LSH Locality-Sensitive Hashing: Locality-Sensitive Caching:

Anytime Predictions Compute features asynchronously: if a particular element does not arrive use estimator instead Always able to render a prediction by the latency deadline __

No Coarsening Coarsening + Anytime Predictions Overly Coarsened More Features Approx. Expectation Better Best Coarser Hash Checkout our poster!

Spark Streamin g Spark SQL Graph X ML library BlinkDB MLbas e Velox Training Management + Serving Spark HDFS, S3, … Tachyon Model Manager Predictio n Service Mesos Part of Berkeley Data Analytics Stack

Predictive Services UC Berkeley AMPLab Daniel Crankshaw, Xin Wang, Peter Bailis, Haoyuan, Zhao Zhang, Michael J. Franklin, Ali Ghodsi, and Michael I. Jordan

Dato Predictive Services  Elastic scaling and load balancing of docker.io containers  AWS Cloudwatch Metrics and Reporting  Serves Dato Create models, scikit-learn, and custom python  Distributed shared caching: scale-out to address latency  REST management API: Demo? Production ready model serving and management system

Predictive Services UC Berkeley AMPLab Daniel Crankshaw, Xin Wang, Joseph Gonzalez Peter Bailis, Haoyuan, Zhao Zhang, Michael J. Franklin, Ali Ghodsi, and Michael I. Jordan Caching, Bandits, & Management Online/Offline Learning Latency vs. Accuracy Key Insights: Responsiv e AdaptiveManageable

Train Dat a Mode l Actions Serve Adapt Future of Learning Systems

Thank You Joseph E. Gonzalez Assistant UC Berkeley Dato