Published by Jeffry Ross. Modified over 8 years ago.
Exploiting Input Features for Controlling Tunable Approximate Programs
Sherry Zhou, Department of Electronic Engineering, Tsinghua University
Outline
– Introduction
– Input features for GEM and SGD
– Experiments on applying features to the error model
– Experiments on applying new features to the cost model
– Conclusion and future work
Overview of Control Method
Control Problem Formulation
Given:
– a tunable program,
– a set of possible inputs I, and
– a probability function p such that for any i ∈ I, p(i) is the probability of getting input i
For input i ∈ I, error bound ε > 0, and probability bound 0 ≤ π ≤ 1, find knob settings k1, k2 such that:
– Objective: minimize f_c(i, k1, k2)
– Subject to: Pr[f_e(i, k1, k2) ≤ ε] ≥ π
where f_c is the cost and f_e the error of running the program on input i with knobs (k1, k2).
Feasible region: the set of (k1, k2) satisfying Pr[f_e(i, k1, k2) ≤ ε] ≥ π
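The formulation above can be illustrated by an exhaustive search over discrete knob settings; a minimal sketch, assuming the error and cost models are given as callables (all names here are illustrative, not from the slides):

```python
def choose_knobs(knob1_vals, knob2_vals, cost, prob_error_ok, pi):
    """Brute-force the control problem: among knob settings whose predicted
    probability of meeting the error bound is at least pi (the feasible
    region), pick the one with the lowest predicted cost.

    cost(k1, k2) predicts running time for the current input;
    prob_error_ok(k1, k2) predicts Pr[error <= epsilon].
    Returns None when no setting is feasible."""
    feasible = [(k1, k2)
                for k1 in knob1_vals for k2 in knob2_vals
                if prob_error_ok(k1, k2) >= pi]
    if not feasible:
        return None
    return min(feasible, key=lambda ks: cost(*ks))
```

In practice the controller consults learned error and cost models rather than enumerating every setting, but the feasibility test and the objective are the same.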
Error and Cost Model
– The error model determines whether or not a knob setting is in the feasible region. Currently it is input-agnostic.
– The cost model uses input features and knob settings to predict running time. Currently it is input-specific but uses only simple features.
Problem We Focused On
Find and exploit computationally cheap input features to:
– make the error model input-aware and so improve its accuracy
– improve the cost model's accuracy
Two Benchmarks We Focused On
[Diagram: input features feed the GEM error model, the SGD error model, the GEM cost model, and the SGD cost model]
INPUT FEATURES
Features for GEM
Inputs are social networks. Simple features:
– Number of nodes
– Number of edges
– Number of clusters
Features for GEM
Sophisticated features that are still easy to compute:
– Leadership
– Sub-graph edge ratio
– Degree distribution
Feature: Leadership
Small groups are usually created by one individual who then recruits others.
L = Σᵢ (d_max − d_i) / ((n − 1)(n − 2))
Normalization: there are n − 1 non-maximal terms, each at most (n − 1) − 1, so Σᵢ ((n − 1) − 1) = (n − 1)(n − 2).
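The leadership formula can be computed directly from a graph's degree sequence; a minimal sketch (the normalization makes a star graph score 1 and a regular graph score 0):

```python
def leadership(degrees):
    """Leadership from a graph's degree sequence:
    L = sum_i (d_max - d_i) / ((n - 1)(n - 2)).
    Equals 1 for a star graph (one hub attached to every other node)
    and 0 for a regular graph (all degrees equal)."""
    n = len(degrees)
    if n < 3:
        return 0.0  # normalization is undefined for fewer than 3 nodes
    d_max = max(degrees)
    return sum(d_max - d for d in degrees) / ((n - 1) * (n - 2))
```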
Feature: Sub-Graph Edge Ratio
The sub-graph is composed of the high-degree nodes.
The feature is the ratio of the number of edges in the original graph to the number of edges in this sub-graph.
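A sketch of how this feature might be computed, assuming the sub-graph is induced by some top fraction of nodes ranked by degree (the fraction is an assumed knob, not specified on the slide):

```python
from collections import Counter

def subgraph_edge_ratio(edges, top_fraction):
    """Ratio of the number of edges in the original graph to the number in
    the sub-graph induced by the highest-degree nodes. top_fraction
    controls how many nodes count as high-degree (an assumed parameter)."""
    deg = Counter()
    for u, v in edges:
        deg[u] += 1
        deg[v] += 1
    k = max(1, int(len(deg) * top_fraction))
    top = {n for n, _ in deg.most_common(k)}
    sub = [e for e in edges if e[0] in top and e[1] in top]
    # If no edge survives, the ratio is unbounded.
    return float("inf") if not sub else len(edges) / len(sub)
```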
Feature: Degree Distribution
Take the scaled degree distribution as a vector and use K-means to cluster the vectors into groups.
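A self-contained sketch of this pipeline, using a scaled degree histogram as the vector and a minimal Lloyd's-algorithm K-means (bin count, seed, and iteration count are illustrative choices, not from the slides):

```python
import random

def degree_vector(degrees, n_bins=8):
    """Scaled degree distribution: the fraction of nodes whose degree
    falls in each of n_bins equal-width bins."""
    d_max = max(degrees)
    vec = [0.0] * n_bins
    for d in degrees:
        b = min(n_bins - 1, int(n_bins * d / (d_max + 1)))
        vec[b] += 1.0 / len(degrees)
    return vec

def kmeans(vectors, k, iters=20, seed=0):
    """Minimal Lloyd's algorithm; returns a cluster id for each vector."""
    rng = random.Random(seed)
    centers = [list(v) for v in rng.sample(vectors, k)]
    assign = [0] * len(vectors)
    for _ in range(iters):
        # Assign each vector to its nearest center (squared Euclidean).
        for i, v in enumerate(vectors):
            assign[i] = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(v, centers[c])))
        # Move each center to the mean of its members.
        for c in range(k):
            members = [v for i, v in enumerate(vectors) if assign[i] == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return assign
```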
Other Features Explored (High Computational Cost)
– Hop count: the minimal number of distinct links that forms a path connecting two nodes
– Clustering coefficient: the ratio of the number of links y connecting the d_i neighbors of node i to the maximum possible, ½ d_i (d_i − 1)
– Bonding: measures triadic closure in a graph
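The clustering coefficient above is the standard local clustering coefficient; a minimal sketch, with the adjacency given as a dict of neighbor sets:

```python
def clustering_coefficient(adj, i):
    """Local clustering coefficient of node i: the number of links among
    i's neighbors divided by the maximum possible, d_i * (d_i - 1) / 2.
    adj maps each node to the set of its neighbors."""
    neighbors = adj[i]
    d = len(neighbors)
    if d < 2:
        return 0.0  # no neighbor pair exists, coefficient conventionally 0
    # Count each neighbor pair once (u < v) and check whether it is linked.
    y = sum(1 for u in neighbors for v in neighbors
            if u < v and v in adj[u])
    return y / (d * (d - 1) / 2)
```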
Features in SGD
Inputs: training instances for an SVM classifier.
Feature: the ratio of the number of instances to the instance dimension.
– When the ratio < 1, it is easier to find a solution
– When the ratio > 1, it is harder to find a solution
APPLY FEATURES TO ERROR MODEL
Apply Features to Error Model
Directly add features as nodes in the Bayes network.
– High inference cost when the number of features is high
[Diagram: the Bayes network over knob1, knob2, and error, extended with Feature 1 and Feature 2 nodes]
GEM: Add Features to Bayes Network
Add features:
– Number of nodes
– Number of edges
GEM: Add Features to Bayes Network
Add features:
– Leadership
GEM: Add Features to Bayes Network
Add features:
– Sub-graph edge ratio
GEM: Apply Features to Error Model
Use features to classify inputs and, for each class, learn a different error model.
[Diagram: INPUT → CLASSIFIER → ERROR MODEL 1 … ERROR MODEL n]
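The classify-then-model structure can be sketched as a simple dispatcher (all names here are hypothetical, for illustration only):

```python
def make_classed_error_model(classify, models):
    """Wrap per-class error models behind a single predictor.
    classify maps an input's features to a class id; models maps each
    class id to an error model learned for that class (a callable over
    knob settings)."""
    def predict_error(features, k1, k2):
        # Route the input to the error model trained on its class.
        return models[classify(features)](k1, k2)
    return predict_error
```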
Results: Only Use Number of Nodes
The performance becomes worse.
Feature: Sub-Graph Edges
Features: Leadership
The performance is slightly improved.
Clustering Inputs Based on Knob Errors
Future Work
Features:
– What kinds of features are more useful for improving the error model? Is there an underlying principle?
– Would a combination of features be better?
COST MODEL
Result: Add Leadership Feature to Cost Model in GEM
Features in the cost model: leadership, number of nodes, and number of edges, compared against number of nodes and number of edges alone.
[Plots comparing the two feature sets]
SGD
Future Work
– What is the relationship between runtime and cache misses?
– Build an analytic model for runtime prediction
– Analyze the difference between the analytic model and the M5-tree model
– Provide guidance to the M5 learning algorithm
Conclusion
– Explored several features to improve the error and cost models
– Experiments showed that the error and cost models are improved