Haimonti Dutta 1 and Hillol Kargupta 2 1 Center for Computational Learning Systems (CCLS), Columbia University, NY, USA. 2 University of Maryland, Baltimore.

Slides:



Advertisements
Similar presentations
Load Balancing Parallel Applications on Heterogeneous Platforms.
Advertisements

Chapter 5: Linear Programming: The Simplex Method
Hadi Goudarzi and Massoud Pedram
Lecture 3 Linear Programming: Tutorial Simplex Method
Operation Research Chapter 3 Simplex Method.
L17 LP part3 Homework Review Multiple Solutions Degeneracy Unbounded problems Summary 1.
Linear Programming – Simplex Method
SIMPLEX METHOD FOR LP LP Model.
Sections 4.1 and 4.2 The Simplex Method: Solving Maximization and Minimization Problems.
Support Vector Machines
Computational Methods for Management and Economics Carla Gomes Module 8b The transportation simplex method.
Modeling and Analysis of Random Walk Search Algorithms in P2P Networks Nabhendra Bisnik, Alhussein Abouzeid ECSE, Rensselaer Polytechnic Institute.
Gossip Scheduling for Periodic Streams in Ad-hoc WSNs Ercan Ucan, Nathanael Thompson, Indranil Gupta Department of Computer Science University of Illinois.
The Simplex Algorithm An Algorithm for solving Linear Programming Problems.
The Transportation and Assignment Problems
CES 514 – Data Mining Lecture 8 classification (contd…)
ISPDC 2007, Hagenberg, Austria, 5-8 July On Grid-based Matrix Partitioning for Networks of Heterogeneous Processors Alexey Lastovetsky School of.
Support Vector Machines
5.6 Maximization and Minimization with Mixed Problem Constraints
D Nagesh Kumar, IIScOptimization Methods: M7L1 1 Integer Programming All Integer Linear Programming.
Easy Optimization Problems, Relaxation, Local Processing for a small subset of variables.
Operations Research Models
Chapter 3 Linear Programming Methods 高等作業研究 高等作業研究 ( 一 ) Chapter 3 Linear Programming Methods (II)
Indiana GIS Conference, March 7-8, URBAN GROWTH MODELING USING MULTI-TEMPORAL IMAGES AND CELLULAR AUTOMATA – A CASE STUDY OF INDIANAPOLIS SHARAF.
Simplex method (algebraic interpretation)
Simplex Algorithm.Big M Method
ECE 556 Linear Programming Ting-Yuan Wang Electrical and Computer Engineering University of Wisconsin-Madison March
Public Policy Modeling Simplex Method Tuesday, October 13, 2015 Hun Myoung Park, Ph.D. Public Management & Policy Analysis Program Graduate School of International.
1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.
Knowledge-Based Breast Cancer Prognosis Olvi Mangasarian UW Madison & UCSD La Jolla Edward Wild UW Madison Computation and Informatics in Biology and Medicine.
Garrett Poppe, Liv Nguekap, Adrian Mirabel CSUDH, Computer Science Department.
Computer Science and Engineering Parallelizing Defect Detection and Categorization Using FREERIDE Leonid Glimcher P. 1 ipdps’05 Scaling and Parallelizing.
Linear Programming Revised Simplex Method, Duality of LP problems and Sensitivity analysis D Nagesh Kumar, IISc Optimization Methods: M3L5.
TagLearner: A P2P Classifier Learning System from Collaboratively Tagged Text Documents Haimonti Dutta 1, Xianshu Zhu 2, Tushar Muhale 2, Hillol Kargupta.
3.3 Implementation (1) naive implementation (2) revised simplex method
Mechanical Engineering Department 1 سورة النحل (78)
Linear Programming Erasmus Mobility Program (24Apr2012) Pollack Mihály Engineering Faculty (PMMK) University of Pécs João Miranda
Simplex Method Continued …
1 Network Models Transportation Problem (TP) Distributing any commodity from any group of supply centers, called sources, to any group of receiving.
Optimal Component Analysis Optimal Linear Representations of Images for Object Recognition X. Liu, A. Srivastava, and Kyle Gallivan, “Optimal linear representations.
1 Iterative Integer Programming Formulation for Robust Resource Allocation in Dynamic Real-Time Systems Sethavidh Gertphol and Viktor K. Prasanna University.
Illustration: 3-Party Secure Sum Compare, match, and analyze data from different organizations without disclosing the private data to any other party Experimental.
1 THE REVISED SIMPLEX METHOD CONTENTS Linear Program in the Matrix Notation Basic Feasible Solution in Matrix Notation Revised Simplex Method in Matrix.
Linear Inequalities and Linear Programming Chapter 5 Dr.Hayk Melikyan/ Department of Mathematics and CS/ 5.5 Dual problem: minimization.
OR Chapter 7. The Revised Simplex Method  Recall Theorem 3.1, same basis  same dictionary Entire dictionary can be constructed as long as we.
IT Applications for Decision Making. Operations Research Initiated in England during the world war II Make scientifically based decisions regarding the.
Nonlinear Knowledge in Kernel Approximation Olvi Mangasarian UW Madison & UCSD La Jolla Edward Wild UW Madison.
A Kernel Approach for Learning From Almost Orthogonal Pattern * CIS 525 Class Presentation Professor: Slobodan Vucetic Presenter: Yilian Qin * B. Scholkopf.
The minimum cost flow problem. Solving the minimum cost flow problem.
Massive Support Vector Regression (via Row and Column Chunking) David R. Musicant and O.L. Mangasarian NIPS 99 Workshop on Learning With Support Vectors.
Business Mathematics MTH-367 Lecture 14. Last Lecture Summary: Finished Sec and Sec.10.3 Alternative Optimal Solutions No Feasible Solution and.
1 Simplex algorithm. 2 The Aim of Linear Programming A Linear Programming model seeks to maximize or minimize a linear function, subject to a set of linear.
GOOD MORNING CLASS! In Operation Research Class, WE MEET AGAIN WITH A TOPIC OF :
::Network Optimization:: Minimum Spanning Trees and Clustering Taufik Djatna, Dr.Eng. 1.
Computer Science and Engineering Parallelizing Feature Mining Using FREERIDE Leonid Glimcher P. 1 ipdps’04 Scaling and Parallelizing a Scientific Feature.
Knowledge-Based Nonlinear Support Vector Machine Classifiers Glenn Fung, Olvi Mangasarian & Jude Shavlik COLT 2003, Washington, DC. August 24-27, 2003.
1 Two-Phase Simplex Method file Simplex3_AMII_05b_gr Rev. 1.4 by M. Miccio on December 17, 2014 from a presentation at the Fuqua School of Business MIT.
Introduction to Machine Learning, its potential usage in network area,
Data Transformation: Normalization
Linear Programming Revised Simplex Method, Duality of LP problems and Sensitivity analysis D Nagesh Kumar, IISc Optimization Methods: M3L5.
The minimum cost flow problem
Network Simplex Animations
Perturbation method, lexicographic method
Linear programming Simplex method.
Richard Maclin University of Minnesota - Duluth
Linear programming Simplex method.
Chapter 5 Transportation, Assignment, and Transshipment Problems
LINEAR PROGRAMMING Example 1 Maximise I = x + 0.8y
Refining Rules Incorporated into Knowledge-Based Support Vector Learners via Successive Linear Programming Richard Maclin University of Minnesota - Duluth.
Parallel Programming in C with MPI and OpenMP
Presentation transcript:

Haimonti Dutta 1 and Hillol Kargupta 2 1 Center for Computational Learning Systems (CCLS), Columbia University, NY, USA. 2 University of Maryland, Baltimore County, Baltimore, MD. Also affiliated to Agnik, LLC, Columbia, MD. Distributed Linear Programming and Resource Management for Data Mining in Distributed Environments

Motivation Support Vector (Kernel) Regression An illustration Support Vector Kernel Regression Find a function f(x)=y to fit a set of example data points Problem can be phrased as constrained optimization task Solved using a standard LP solver

Motivation contd.. Knowledge Based Kernel Regression In addition to sample points, give advice If (x ≥3) and (x ≤5) Then (y≥5) Rules add constraints about regions Constraints added to LP and a new solution (with advice constraints) can be constructed Fung, Mangasarian and Shavlik,”Knowledge Based Support Vector Machine Classifiers”, NIPS, Mangasarian, Shavlik and Wild, “Knowledge Based Kernel Approximation”, JMLR, 5, 1127 – 1141, Figure adapted from McLain, Shavlik, Walker and Torrey, “Knowledge-based Support Vector Regression for Reinforcement Learning”, IJCAI, 2005

Distributed Data Mining Applications – An example of Scientific Data Mining in Astronomy Distributed data and computing resources on the National Virtual Observatory P2P Data Mining on homogeneously partitioned sky survey H Dutta, Empowering Scientific Discovery by Distributed Data Mining on the Grid Infrastructure, Ph.D Thesis, UMBC, Maryland, Need for distributed optimization strategies

Road Map Motivation Related Work Framing an Linear Programming problem The simplex algorithm The distributed simplex algorithm Experimental Results Conclusion and Directions of Future Work

Related Work Resource Discovery in Distributed Environments Imantichi, “Resource Discovery in Large Resource Sharing Experiments”, Ph.D. Thesis, University of Chicago, Livny and Solomon, “Matchmaking: Distributed Resource Management for high throughput computing”, HPDC, Optimization Techniques Yarmish, “Distributed Implementation of the Simplex Method”, Ph.D. Thesis, CIS Polytechnic University, Hall and McKinnon, “Update procedures for parallel revised simplex methods, Tech Report, University of Edinburg, UK, 1992 Craig and Reed, “Hypercube Implementation of the Simplex Algorithm”, ACM, pages 1473 – 1482, 1998.

The Optimization Problem 7 Assumptions: n nodes in the network The network is static Dataset D i at node i Processing Cost at i-th node – ν i per record Transportation Cost between i and j – μ ij Amount of Data Transferred between nodes – x ij Cost Function Z = Σ ij μ ij x ij + ν i x ij = Σ ij c ij x ij

Framing the Linear Programming Problem: An illustration Objective Function z = 6.03x x x x x x x 45 Constraints C(X) = ∑ ij µ ij x ij + ν j x ij = ∑ ij c ij x ij, C ij = µ ij + ν ij x 12 + x 14 + x 15 ≤ 300; x 12 + x 25 + x 23 ≤ 600; x 15 +x 25 +x 45 ≤ 300 ; x 23 +x 34 ≤ 300; 0 ≤ x 12 ≤ D 1 ; 0 ≤ x 23 ≤ D 2 ; 0 ≤ x 15 ≤ D 1 ; 0 ≤ x 14 ≤ D 1 ; 0 ≤ x 25 ≤ D 2 ; 0 ≤ x 34 ≤ D 3 ; 0 ≤ x 45 ≤ D GB GB NodeV

The Simplex Algorithm Find x 1 ≥ 0, x 2 ≥ 0, …., x n ≥ 0 and Min z = c 1 x 1 + c 2 x 2 + …. + c n x n Satisfying Constraints A 1 x 1 + A 2 x 2 + ….. + A n x n = B The Simplex Algorithm a 11 a 12 ….a 1n b1b1 a 21 a 22 ….a 2n b2b2 …. a m1 a m2 …a mn bmbm c1c1 c2c2 …cncn z The simplex tableau

The Simplex Algorithm – Contd … 10 The Problem Maximize z = x 1 + 2x 2 – x 3 Subject to 2x 1 + x 2 + x 3 ≤ 14 4x 1 +2x 2 +3x 3 ≤ 28 2x 1 +5 x 2 +5x 3 ≤ 30 The Steps of the Simplex Algorithm (Dantzig) Obtain a canonical representation (Introduce Slack Variables) Find a Column Pivot Find a Row Pivot Perform Gauss Jordan Elimination

The simplex tableau and iterations x 1 x 2 x 3 s 1 s 2 s 3 B Pivot Column Canonical Representation 14/1= 14 28/2=14 30/5= 6 Pivot Row

Simplex iterations contd … Perform Gauss Jordan Elimination The Final Tableau 8/ /58 16/ /516 2/511001/56 -1/503002/ / /160 -1/85 017/80-1/ /1601/163/813

Road Map Motivation Related Work Framing an Linear Programming problem The simplex algorithm The distributed simplex algorithm Experimental Results Conclusions and Future Work

The Distributed Problem – An Example 14 Node1Node 2 Node 5 Node 4 Node 3 x 12 +x 15 +x 14 +2x 25 ≤300 x 12 +2x 15 -x 25 =2 300 GB x 12 +x 23 +x 25 ≤600 2x 25 -x 12 -x 23 =4 600 GB x 15 +x 25 +x 45 ≤300 x 25 -2x 15 -x 45 =5 300 GB x x 25 ≤ GB x 23 +x 34 ≤ GB Each site observes different constraints, but wants to solve the same objective function z = 6.03x x x x x x x 45

Distributed Canonical Representation 15 An initialization step No of basic variables to add = Total no of constraints in the system Build a spanning tree in the network Perform a distributed sum estimation algorithm Builds a canonical representation exactly identical to the one if data was centralized

The Distributed Algorithm for solving the LP problem 16 Steps involved: Estimate Column pivot Estimate Row pivot (requires communication with neighbors) Perform Gauss Jordan elimination

Illustration of the Distributed Algorithm x 12 x 23 x 15 x 14 x 25 x 34 x 45 s1s2s3s4s5s6s7s8B Node1Node 2 Node 5 Node 4 Node 3 x 12 x 23 x 15 x 14 x 25 x 34 x 45 s1s2s3s4s5s6s7s8B Column pivot selection is done at each node

Distributed Row Pivot selection 18 Protocol Push Min (gossip based) Minimum estimation problem Iteration t-1: {m r } values sent to node i m ti = min {{m r }, current row pivot} Termination: All nodes have exactly the same minimum value

Analysis of Protocol Push Min 19 Based on spread of an epidemic in a large population Suseptible, infected and dead nodes The “epidemic” spreads exponentially fast Node1Node 2 Node 5 Node 4 Node 3

Comments and Discussions 20 Assume η no of nodes in the network Communication Complexity is O(no of iterations of simplex X η ) Worst case Simplex may require exponential no of iterations. For most practical purposes it is λ m ( λ <4)

Road Map Motivation Related Work Framing an Linear Programming problem The simplex algorithm The distributed simplex algorithm Experimental Results Conclusion and Directions of Future Work

Experimental Results Artificial Data Set Simulated constraint matrices at each node Used Distributed Data Mining Toolkit (DDMT) developed at University of Maryland, Baltimore County (UMBC) for simulating the network structure Two different metrics for evaluation: TCC (Total Communication Cost in the network) Average Communication Cost per Node (ACCN)

Communication Cost Average Communication Cost Per Node versus Number of Nodes in the network

More Experimental Results …. TCC versus No of Variables at each node TCC versus No of constraints at each node

Conclusions and Future Work Resource management and pattern recognition present formidable challenges on distributed systems Present a distributed algorithm for resource management based on the simplex algorithm Test our algorithm on simulated data Future Work Incorporation of dynamics of the network Testing the algorithm on a real distributed network Effect of size and structure of network on the mining results Examine the trade-off between accuracy and communication cost incurred before and after using distributed simplex on a mining task like classification or clustering

Selected Bibliography G.B.Dantzig, “Linear Programming and Extensions”. Princeton University Press, Princeton, NJ, 1963 Kargupta and Chan,”Advances in Distributed and Parallel Knowledge Discovery”, AAAI Press, Menlo Park, CA, A. L. Turinsky. “Balancing Cost and Accuracy in Distributed Data Mining”. PhD thesis, University of Illinois at Chicago., Haimonti Dutta, “Empowering Scientific Discovery by Distributed Data Mining on the Grid Infrastructure”, Ph.D. Thesis, UMBC, Mangasarian, “Mathematical Programming in Data Mining”, DMKD, Vol 42, pg 183 – 201, 1997.

Questions ?