1
A Cloud System for Machine Learning Exploiting a Parallel Array DBMS
Carlos Ordonez Department of Computer Science University of Houston, USA
2
Our contribution: a cloud analytic system for machine learning
- Shared-nothing architecture backed by a parallel array DBMS; no HDFS, Hadoop, Spark, etc.
- In-DBMS data summarization: orders-of-magnitude performance improvement, with an even wider gap under GPU acceleration
3
System components and data flow
4
2-phase algorithm
5
System components and data flow
Summarization
6
Defining the input data set X
7
Data summarization: finding a compact description of the data set
A very useful technique in machine learning:
- saves space
- saves I/O
- saves execution time
- no sacrifice in accuracy
8
What to summarize? Introducing sufficient statistics
Matrix product => summation of vector outer products: X^T X = Σ_i x_i x_i^T, computable in one pass over the rows of X.
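As a minimal sketch (assuming the Gamma summarization matrix of this line of work, where each point is augmented as z_i = [1, x_i]), the sufficient statistics n, L = Σ x_i and Q = Σ x_i x_i^T can all be accumulated in a single matrix Γ = Σ_i z_i z_i^T in one pass:

```cpp
#include <vector>

// Sketch: Gamma = sum_i z_i * z_i^T, where z_i = [1, x_i] augments each
// d-dimensional point with a leading 1. Gamma packs n (count),
// L = sum x_i, and Q = sum x_i x_i^T into one (d+1)x(d+1) matrix.
std::vector<std::vector<double>> gamma_summary(
        const std::vector<std::vector<double>>& X) {
    const std::size_t d = X.empty() ? 0 : X[0].size();
    std::vector<std::vector<double>> G(d + 1, std::vector<double>(d + 1, 0.0));
    for (const auto& x : X) {
        std::vector<double> z(d + 1, 1.0);        // z[0] = 1 (augmented entry)
        for (std::size_t j = 0; j < d; ++j) z[j + 1] = x[j];
        for (std::size_t a = 0; a <= d; ++a)      // accumulate outer product z z^T
            for (std::size_t b = 0; b <= d; ++b)
                G[a][b] += z[a] * z[b];
    }
    return G;  // G[0][0] = n, G[0][1..d] = L, G[1..d][1..d] = Q
}
```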
10
Dense matrix algorithm: O(d² n)
11
Sparse matrix algorithm: O(d n) for hyper-sparse matrices
12
Array Storage in SciDB: by Chunks
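To illustrate chunked storage (chunk sizes here are made up for the example), each cell of a 2-D array maps to a chunk by integer division of its coordinates; chunks are the unit SciDB stores, reads, and distributes across workers:

```cpp
#include <cstddef>

// Sketch (assumed chunk sizes, for illustration only): a cell (i, j) of a
// 2-D array lives in chunk (i / chunkRows, j / chunkCols). Chunks, not
// cells, are the unit of storage, I/O, and distribution.
struct ChunkCoord { std::size_t row, col; };

ChunkCoord chunk_of(std::size_t i, std::size_t j,
                    std::size_t chunkRows, std::size_t chunkCols) {
    return {i / chunkRows, j / chunkCols};
}
```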
13
Parallel computation
[Diagram: the coordinator and each worker hold a summarization matrix over columns 1…d; each worker computes a partial result on its data partition and sends it to the coordinator.]
14
[Diagram: correct (OK) vs. incorrect (NO!) ways of aggregating the workers' partial results at the coordinator.]
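The parallel scheme can be sketched as a two-phase computation (function names are illustrative, not SciDB's operator API): each worker builds a partial Gamma over its horizontal partition, and the coordinator sums the small partial matrices. Because matrix addition is associative and commutative, the merged result equals the single-node Gamma:

```cpp
#include <cstddef>
#include <vector>

using Matrix = std::vector<std::vector<double>>;

// Phase 1 (worker side): summarize one horizontal partition of X.
Matrix partial_gamma(const std::vector<std::vector<double>>& part,
                     std::size_t d) {
    Matrix G(d + 1, std::vector<double>(d + 1, 0.0));
    for (const auto& x : part) {
        std::vector<double> z(d + 1, 1.0);            // z = [1, x]
        for (std::size_t j = 0; j < d; ++j) z[j + 1] = x[j];
        for (std::size_t a = 0; a <= d; ++a)
            for (std::size_t b = 0; b <= d; ++b)
                G[a][b] += z[a] * z[b];
    }
    return G;
}

// Phase 2 (coordinator side): add the (d+1)x(d+1) worker partials.
Matrix merge_gammas(const std::vector<Matrix>& partials, std::size_t d) {
    Matrix G(d + 1, std::vector<double>(d + 1, 0.0));
    for (const auto& P : partials)
        for (std::size_t a = 0; a <= d; ++a)
            for (std::size_t b = 0; b <= d; ++b)
                G[a][b] += P[a][b];
    return G;
}
```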
15
Linear speed up. Let Tj be the processing time using j nodes, where 1 ≤ j ≤ N. Under our main assumption, and provided Θ fits in main memory, our optimized algorithm achieves close to optimal speedup: T1/TN ≈ N.
16
Space complexity and parallel speedup
17
Benchmark
19
Optimize summarization with GPU
[Diagram: transfer data to the GPU, summarize on the GPU, transfer the result back.]
20
Optimize summarization with GPU
- The C++ operator code is annotated with OpenACC directives to run on the GPU; in the current implementation the CPU only performs I/O.
- Data is transferred from host memory to device (GPU) memory.
- The vector outer products are evaluated and aggregated on the GPU; the result is then transferred back.
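A sketch of what such an OpenACC-annotated kernel could look like (assumed structure, not the actual operator code; compilers without OpenACC support ignore the pragmas and run the loops on the CPU):

```cpp
#include <cstddef>

// Sketch: compute Gamma over a row-major n x d matrix X, writing the
// (d+1)x(d+1) result into G. The copyin clause moves X host->device,
// copyout brings G back; each (a, b) cell is a sum reduction over rows.
void gamma_acc(const double* X, std::size_t n, std::size_t d, double* G) {
    const std::size_t D = d + 1;
    #pragma acc parallel loop collapse(2) copyin(X[0:n*d]) copyout(G[0:D*D])
    for (std::size_t a = 0; a < D; ++a)
        for (std::size_t b = 0; b < D; ++b) {
            double s = 0.0;
            #pragma acc loop reduction(+:s)
            for (std::size_t i = 0; i < n; ++i) {
                double za = (a == 0) ? 1.0 : X[i * d + (a - 1)];  // z = [1, x_i]
                double zb = (b == 0) ? 1.0 : X[i * d + (b - 1)];
                s += za * zb;
            }
            G[a * D + b] = s;
        }
}
```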
22
Time saved by summarizing on GPU
n = 1M, d = 400
23
System components and data flow
Summarization Model
24
Linear regression
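A minimal sketch of the model phase, assuming the summarization pass has already produced Gamma = Z^T Z and c = Z^T y: linear regression then reduces to solving the small (d+1)×(d+1) normal equations Gamma·beta = c, with no further pass over X:

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Sketch: solve Gamma * beta = c by Gaussian elimination with partial
// pivoting. The system is only (d+1)x(d+1), so this step is cheap
// compared with the single summarization pass over the data.
std::vector<double> solve_normal_eq(std::vector<std::vector<double>> A,
                                    std::vector<double> c) {
    const std::size_t m = c.size();
    for (std::size_t k = 0; k < m; ++k) {        // forward elimination
        std::size_t p = k;                       // partial pivoting
        for (std::size_t r = k + 1; r < m; ++r)
            if (std::fabs(A[r][k]) > std::fabs(A[p][k])) p = r;
        std::swap(A[k], A[p]);
        std::swap(c[k], c[p]);
        for (std::size_t r = k + 1; r < m; ++r) {
            double f = A[r][k] / A[k][k];
            for (std::size_t j = k; j < m; ++j) A[r][j] -= f * A[k][j];
            c[r] -= f * c[k];
        }
    }
    std::vector<double> beta(m);
    for (std::size_t k = m; k-- > 0;) {          // back substitution
        double s = c[k];
        for (std::size_t j = k + 1; j < m; ++j) s -= A[k][j] * beta[j];
        beta[k] = s / A[k][k];
    }
    return beta;
}
```

For example, three points on the line y = 2x give Gamma = [[3, 6], [6, 14]] and c = [12, 28], yielding beta = (0, 2).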
25
Computing LR, SciDB vs. Spark
26
Future work
- Approach applicable in any parallel DBMS
- Square matrices
- Low-level GPU instructions to parallelize the vector outer products on the GPU for Gamma
- Improve fault tolerance during computation; avoid restarts