Combinatorial Optimization on the Computational Grid Experiments on Grid5000 Nouredine Melab ( ) Member of Grid5000 steering committee Laboratoire.

Slides:



Advertisements
Similar presentations
Christian Delbe1 Christian Delbé OASIS Team INRIA -- CNRS - I3S -- Univ. of Nice Sophia-Antipolis November Automatic Fault Tolerance in ProActive.
Advertisements

Using Parallel Genetic Algorithm in a Predictive Job Scheduling
Multi-Objective Optimization NP-Hard Conflicting objectives – Flow shop with both minimum makespan and tardiness objective – TSP problem with minimum distance,
Distributed Process Scheduling Summery Distributed Process Scheduling Summery BY:-Yonatan Negash.
Workshop on HPC in India Grid Middleware for High Performance Computing Sathish Vadhiyar Grid Applications Research Lab (GARL) Supercomputer Education.
CISC October Goals for today: Foster’s parallel algorithm design –Partitioning –Task dependency graph Granularity Concurrency Collective communication.
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
Using Metacomputing Tools to Facilitate Large Scale Analyses of Biological Databases Vinay D. Shet CMSC 838 Presentation Authors: Allison Waugh, Glenn.
Computer Science Department 1 Load Balancing and Grid Computing David Finkel Computer Science Department Worcester Polytechnic Institute.
1 IOE/MFG 543 Chapter 14: General purpose procedures for scheduling in practice Sections : Dispatching rules and filtered beam search.
Parallel Programming Models and Paradigms
Introduction to Evolutionary Computation  Genetic algorithms are inspired by the biological processes of reproduction and natural selection. Natural selection.
A cooperative parallel tabu search algorithm for the quadratic assignment problem Ya-Tzu, Chiang.
1 A hybrid particle swarm optimization algorithm for optimal task assignment in distributed system Peng-Yeng Yin and Pei-Pei Wang Department of Information.
©Silberschatz, Korth and Sudarshan18.1Database System Concepts Centralized Systems Run on a single computer system and do not interact with other computer.
Diffusion scheduling in multiagent computing system MotivationArchitectureAlgorithmsExamplesDynamics Robert Schaefer, AGH University of Science and Technology,
A Grid-enabled Branch and Bound Algorithm for Solving Challenging Combinatorial Optimization Problems Authors: M. Mezmaz, N. Melab and E-G. Talbi Presented.
Chapter 2 Computer Clusters Lecture 2.1 Overview.
Taxonomy of Hybrid Metaheuristics Presented by: Xiaojun Bao & Lijun Wang Presented by: Xiaojun Bao & Lijun Wang School of Engineering School of Engineering.
Metaheuristics The idea: search the solution space directly. No math models, only a set of algorithmic steps, iterative method. Find a feasible solution.
Software framework for metaheuristics Parallel Cooperative Optimization Research Group Laboratoire d’Informatique Fondamentale de Lille
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
ADLB Update Recent and Current Adventures with the Asynchronous Dynamic Load Balancing Library Rusty Lusk Mathematics and Computer Science Division Argonne.
Optimized Java computing as an application for Desktop Grid Olejnik Richard 1, Bernard Toursel 1, Marek Tudruj 2, Eryk Laskowski 2 1 Université des Sciences.
Work Stealing and Persistence-based Load Balancers for Iterative Overdecomposed Applications Jonathan Lifflander, UIUC Sriram Krishnamoorthy, PNNL* Laxmikant.
Neural and Evolutionary Computing - Lecture 10 1 Parallel and Distributed Models in Evolutionary Computing  Motivation  Parallelization models  Distributed.
DISTRIBUTED COMPUTING
Optimized Search Heuristics: a Survey Susana Fernandes Universidade do Algarve Faro, Portugal Helena Ramalhinho Lourenço Universitat Pompeu.
ProteinShop: A Tool for Protein Structure Prediction and Modeling Silvia Crivelli Computational Research Division Lawrence Berkeley National Laboratory.
SUMA: A Scientific Metacomputer Cardinale, Yudith Figueira, Carlos Hernández, Emilio Baquero, Eduardo Berbín, Luis Bouza, Roberto Gamess, Eric García,
Scheduling Many-Body Short Range MD Simulations on a Cluster of Workstations and Custom VLSI Hardware Sumanth J.V, David R. Swanson and Hong Jiang University.
Loosely Coupled Parallelism: Clusters. Context We have studied older archictures for loosely coupled parallelism, such as mesh’s, hypercubes etc, which.
Optimization Problems - Optimization: In the real world, there are many problems (e.g. Traveling Salesman Problem, Playing Chess ) that have numerous possible.
1 Multiprocessor and Real-Time Scheduling Chapter 10 Real-Time scheduling will be covered in SYSC3303.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Doshisha Univ., Kyoto, Japan CEC2003 Adaptive Temperature Schedule Determined by Genetic Algorithm for Parallel Simulated Annealing Doshisha University,
1 M. Tudruj, J. Borkowski, D. Kopanski Inter-Application Control Through Global States Monitoring On a Grid Polish-Japanese Institute of Information Technology,
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
1 CMPE 511 HIGH PERFORMANCE COMPUTING CLUSTERS Dilek Demirel İşçi.
More on Adaptivity in Grids Sathish S. Vadhiyar Source/Credits: Figures from the referenced papers.
October 2008 Integrated Predictive Simulation System for Earthquake and Tsunami Disaster CREST/Japan Science and Technology Agency (JST)
Ant Algorithm and its Applications for Solving Large Scale Optimization Problems on Parallel Computers Stefka Fidanova Institute for Information and Communication.
FATCOP: A Mixed Integer Program Solver Michael FerrisQun Chen Department of Computer Sciences University of Wisconsin-Madison Jeff Linderoth, Argonne.
Tutorial on ParadisEO: Parallel Models for the TSP
DISTRIBUTED COMPUTING
Outline Why this subject? What is High Performance Computing?
1 Fault-Tolerant Mechanism for Hierarchical Branch and Bound Algorithm Université A/Mira de Béjaïa CEntre de Recherche sur l’Information Scientifique et.
A N I N - MEMORY F RAMEWORK FOR E XTENDED M AP R EDUCE 2011 Third IEEE International Conference on Coud Computing Technology and Science.
1 OASIS Team, INRIA Sophia-Antipolis/I3S CNRS, Univ. Nice Christian Delbé Data Grid Explorer 15/09/03 Large Scale Emulation Mobility in ProActive.
1 ParadisEO-MOEO for a Bi-objective Flow-Shop Scheduling Problem May 2007 E.-G. Talbi and the ParadisEO team
Background Computer System Architectures Computer System Software.
1 Hierarchical Parallelization of an H.264/AVC Video Encoder A. Rodriguez, A. Gonzalez, and M.P. Malumbres IEEE PARELEC 2006.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 April 28, 2005 Session 29.
Use of Performance Prediction Techniques for Grid Management Junwei Cao University of Warwick April 2002.
Multi-cellular paradigm The molecular level can support self- replication (and self- repair). But we also need cells that can be designed to fit the specific.
IPDPS 2003, Nice, France Agent-Based Grid Load Balancing Using Performance-Driven Task Scheduling Junwei Cao (C&C Research Labs, NEC Europe Ltd., Germany)
INTRODUCTION TO HIGH PERFORMANCE COMPUTING AND TERMINOLOGY.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
A new tri-objective model for the flexible docking problem
Introduction to Parallel Computing: MPI, OpenMP and Hybrid Programming
OpenMosix, Open SSI, and LinuxPMI
Parallel Cooperative Optimization Research Group
Parallel Programming in C with MPI and OpenMP
metaheuristic methods and their applications
Metaheuristic methods and their applications. Optimization Problems Strategies for Solving NP-hard Optimization Problems What is a Metaheuristic Method?
Multi-Objective Optimization
IOE/MFG 543 Chapter 14: General purpose procedures for scheduling in practice Sections : Dispatching rules and filtered beam search.
Parallel Programming in C with MPI and OpenMP
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Combinatorial Optimization on the Computational Grid Experiments on Grid5000 Nouredine Melab ( ) Member of Grid5000 steering committee Laboratoire d’Informatique Fondamentale de Lille Parallel Cooperative Optimization Research Group INRIA DOLPHIN Project

Combinatorial optimization problems High-dimensional and complex optimization problems in many areas of industrial concern Parallel hybrid optimization methods allow to efficiently provide effective solutions, but they remain insufficient for large problems … … Need of large scale parallelism (Grid computing) (Multi-Objective) Const. (Mono-Objective) ()

A taxonomy of optimization methods Exact algorithmsHeuristics Branch and X Dynamic Programming A* Specific Heuristics Meta-heuristics Single Solution Population of solutions Local Search Simulated Annealing Tabu Search Evolutionary Algorithms Scatter, Swarm search Near-optimal solutions for large problem instances Optimal solutions for small problem instances

Design and implementation of Grid-based algorithms … Meta-heuristics (near-optimal) Parallel hybrid design … solving challenging problems in combinatorial optimization Exact algorithms Parallel design Implementation Cooperation Implementation Protein Structure Prediction Flow-Shop scheduling problem Supported by ANR-GRID DOCK Supported by ACI-GRID DOC-G Combinatorial Optimization on the Computational Grid Experiments on Grid5000 Supported by ANR-GRID CHOC

Meta-heuristics: Parallel models and hybridization mechanisms Parallel models They allow to improve efficiency and effectiveness Population-based meta-heuristics Island model, parallel evaluation of the population, parallel evaluation of a single solution Single solution-based meta-heuristics Multi-start model, parallel exploration of the neighborhood, parallel evaluation of a single solution Hybridization mechanisms … … allow to combine different methods for better robustness and effectiveness, but are CPU-time intensive N. Melab, E-G. Talbi, S. Cahon, E. Alba and G. Luque. Parallel Meta-heuristics: Algorithms and Frameworks. Chapter 6 in “Parallel Combinatorial Optimization”, Wiley Series on Parallel and Distributed Computing, ISBN: , Nov 2006.

“Gridification” of parallel hybrid meta-heuristics Major properties of computational grids Multi-administrative domain, heterogeneity, dynamic availability of resources, large scale Major adaptations of the different models and mechanisms Asynchronous design and implementation Granularity management and load balancing Checkpointing-based fault tolerance (a memory for each model) Adaptation of the parameters of each model (e.g. migration topology for the island model) N. Melab, S. Cahon and E-G. Talbi. Grid Computing for Parallel Bioinspired Algorithms. Journal of Parallel and Distributed Computing (JPDC), Elsevier Science, Vol.66(8), Pages , 2006.

Our contributions  Multi-Objective EO (MOEO) for the design of multi-objective evolutionary algorithms  Moving Objects (MO) for the design of local search algorithms  ParadisEO for parallel hybrid metaheuristics PARAllel and DIStributed Evolving Objects  Message passing (MPI, PVM)  Clusters, Networks of Workstations,  Multi-programming (PThreads)  Shared Memory Multi-processors (SMP)  Parallel distributed computing  Clusters of SMPs (CLUMPS)  Grid computing  Condor-MW and Globus (MPICH-G2) EO MOMOEO PVM, PThreads MPI (LAM, CH) Condor-MW Globus  S. Cahon, N. Melab and E-G. Talbi. ParadisEO: A Framework for the Reusable Design of Parallel and Distributed Metaheuristics. Journal of Heuristics, Elsevier Science, Vol.10(3), pages , May Evolving Objects framework ( EO ) European project (Geneura Team, INRIA, LIACS) Transparent use

ParadisEO-G4: ParadisEO on Globus 4 Design and implementation Gridification of the parallel models and hybridization mechanisms provided in ParadisEO MPICH-G2 as the communication library Deployment on the computational Grid (Grid5000) Building of system image for Globus 4 including MPICH-G2 Virtual Globus Grid on Grid5000 for the Grid-based deployment of the parallel hybrid meta-heuristics provided in ParadisEO

Design and implementation of Grid-based algorithms … Meta-heuristics (near-optimal) Parallel hybrid design … solving challenging problems in combinatorial optimization Exact algorithms Parallel design Implementation Cooperation Implementation Protein Structure Prediction Flow-Shop scheduling problem Supported by ANR-GRID DOCK Supported by ACI-GRID DOC-G Combinatorial Optimization on the Computational Grid Experiments on Grid5000 Supported by ANR-GRID CHOC

Protein Structure Prediction on the Grid Modelling  The problem consists in finding …  … the ground-state (tertiary stable) conformation of a protein from its primary structure composed of a sequence of amino-acids (residues)  Modelled as a bi-objective optimization problem  Candidate solutions: Molecular conformations (geometries) – vectors of torsion angles  Molecular conformation with lower free energies (bonded atoms and non-bonded atoms)

Protein Structure Prediction on the Grid Complexity and landscape analysis  For a molecule of 40 residues with 10 conformations per residue, conformations are obtained in average …  years are required at conformations explored per second!  Landscape analysis  Multi-modal landscape  Need of parallel hybrid (global and local) meta- heuristics and Grid computing

Parallel evaluation of the population High-level co-evolutionary hybridization Multi-start model High-level co-evolutionary hybridization Cooperative GAs (Island model) Parallel asynchronous hierarchical hybrid meta-heuristic A-A. Tantar, N. Melab, E-G. Talbi, O. Dragos and B. Parain. A Parallel Hybrid Genetic Algorithm for Protein Structure Prediction on the Computational Grid. FGCS, Elsevier Science, Vol.23(3), , ∂1∂1 ∂2∂2 ∂n∂n ∂' 1 ∂' 2 ∂' n Genetic Algorithm Population Local Search Optimized Individual

Grid5000: 7 sites, Avg. 800 CPUs – Execution time: 1h – Cumul. time: 1 month Preliminary experimental results on Grid5000 Implementation with ParadisEO-G4 Protein: Tryptophan-cage from Protein Data Bank (PDB - 1L2Y) Average Quality Improvement: 62%

Interconnection Grid5000-DAS Benefits More resources for dealing with very large proteins with grid-based meta-heuristics New scientific challenge: scalability of ParadisEO-G Requirements Need of a virtual Globus grid between Grid5000 and DAS Common certification authority ? Get longer the default run time of jobs in DAS Deployment time of the virtual Globus grid ~ 10 minutes Only 5 minutes for the combinatorial optimization process on DAS !!

Design and implementation of Grid-based algorithms … Meta-heuristics (near-optimal) Parallel hybrid design … solving challenging problems in combinatorial optimization Exact algorithms Parallel design Implementation Cooperation Implementation Protein Structure Prediction Flow-Shop scheduling problem Supported by ANR-GRID DOCK Supported by ACI-GRID DOC-G Combinatorial Optimization on the Computational Grid Experiments on Grid5000 Supported by ANR-GRID CHOC

Parallel models for exact optimization (B&B inspired) B&B = Exploration + bounding of tree nodes Parallel models Parallel multi-parametric model Parallel exploration of the search tree Parallel evaluation of the bounds Parallel evaluation of a single bound/solution Parallel exploration of the search tree Massive parallelism needing a computational grid Gridification is required

Efficient work distribution during the exploration Need of low cost communications of work units Efficient checkpointing-based Fault tolerance Search of an exact solution in a volatile environment Low cost communication and storage of work units Efficient termination detection May be implicit The proposed approach: objectives

The approach uses a special coding … Node number Work unit (collection of nodes) = an interval Principles of the approach [0,2] [3,5] [0,5]  The approach is Dispatcher-Worker based on the work stealing paradigm  Dispatcher: maintains a pool of work units (intervals) and the global solution found so far  Worker: performs B&B on a given interval and updates the global solution  Work distribution and check-pointing  Communication of intervals (two numbers)  Two efficient operators: folding and unfolding of intervals

Design and implementation of Grid-based algorithms … Meta-heuristics (near-optimal) Parallel hybrid design … solving challenging problems in combinatorial optimization Exact algorithms Parallel design Implementation Cooperation Implementation Protein Structure Prediction Flow-Shop scheduling problem Supported by ANR-GRID DOCK Supported by ACI-GRID DOC-G Combinatorial Optimization on the Computational Grid Experiments on Grid5000 Supported by ANR-GRID CHOC

 N jobs to be scheduled on M machines  Each machine can not be simultaneously assigned to two jobs (colors)  Jobs (colors) must be scheduled in the same order on all machines  One objective must be minimized  Cmax: Makespan (Total completion time) M1M1 M2M2 M3M3 The Flow Shop Scheduling Problem 4 jobs on 3 machines

Network of the campus of Université de Lille1 123 FIL (Lille1) 170 IUT A A grid of more than 2000 processors Grid5000 node at Lille RENATER NR... NR Other sites of GRID’5000 Grid’5000 Front-end  IP forwarding  NAT Dispatcher on a computation node

Experimental results  Standard Taillard ’ s benchmark: Ta jobs on 20 machines  Best known solution: 3681, Ruiz & Stutzle, 2004  Exact solution: 3679, Mezmaz, Melab & Talbi, 2006 Running wall clock time: 25 days 46 minCPU time on a single processor: 22 years 185 days 16 hours Avg. num. of exploited processors: 328Maximum number of exploited processors: Parallel efficiency: 97 %Bordeaux (88), Orsay (360), Sophia (190), Lille (98), Toulouse (112), Rennes (456), Univ. Lille1 (304) M. Mezmaz, N. Melab, E-G. Talbi. A Grid-enabled Branch and Bound Algorithm for Solving Challenging Combinatorial Optimization Problems. Research Report, INRIA 5945, July 2006 (

Interconnection Grid5000-DAS Benefits More resources for solving efficiently and optimally larger problem instances with grid-based combinatorial optimization New scientific challenge: scalability (limits and solutions) The dispatcher has never crashed on Grid5000 (up to 2500 processors) Requirements Avoiding the special configuration of the front-end to allow transparent inter-grid communications between the dispatcher and the workers Viewing DAS as a Grid5000 site and vice versa ? Best-effort reservation mode in DAS Long-running problems Using the nodes as long as they are not requested for reservation