Security-Driven Heuristics and A Fast Genetic Algorithm for Trusted Grid Job Scheduling Shanshan Song, Ricky Kwok, and Kai Hwang University of Southern.

Slides:



Advertisements
Similar presentations
Scheduling in Distributed Systems Gurmeet Singh CS 599 Lecture.
Advertisements

School of Computer Science
1 An Adaptive GA for Multi Objective Flexible Manufacturing Systems A. Younes, H. Ghenniwa, S. Areibi uoguelph.ca.
Using Parallel Genetic Algorithm in a Predictive Job Scheduling
Exact and heuristics algorithms
By: Chris Klumph and Kody Willman. Types of Heuristics References Terminology Static Mappings 6 Example mappings 4 Graph chromosome mappings 1 Tree mapping.
Genetic Algorithms Contents 1. Basic Concepts 2. Algorithm
IBM Labs in Haifa © 2005 IBM Corporation Adaptive Application of SAT Solving Techniques Ohad Shacham and Karen Yorav Presented by Sharon Barner.
All Hands Meeting, 2006 Title: Grid Workflow Scheduling in WOSE (Workflow Optimisation Services for e- Science Applications) Authors: Yash Patel, Andrew.
Workshop on HPC in India Grid Middleware for High Performance Computing Sathish Vadhiyar Grid Applications Research Lab (GARL) Supercomputer Education.
Institute of Intelligent Power Electronics – IPE Page1 Introduction to Basics of Genetic Algorithms Docent Xiao-Zhi Gao Department of Electrical Engineering.
Genetic Algorithms. Some Examples of Biologically Inspired AI Neural networks Evolutionary computation (e.g., genetic algorithms) Immune-system-inspired.
Case Injected Genetic Algorithms Sushil J. Louis Genetic Algorithm Systems Lab (gaslab) University of Nevada, Reno
Hybridization of Search Meta-Heuristics Bob Buehler.
Case Injected Genetic Algorithms Sushil J. Louis Genetic Algorithm Systems Lab (gaslab) University of Nevada, Reno
A new crossover technique in Genetic Programming Janet Clegg Intelligent Systems Group Electronics Department.
1 Kai Hwang, USC, May 2007 Trust Management in P2P and Grid Computing Kai Hwang, University of Southern California Presentation Outline:  Evolution of.
Genetic Algorithm for Variable Selection
Artificial Intelligence Genetic Algorithms and Applications of Genetic Algorithms in Compilers Prasad A. Kulkarni.
On Fairness, Optimizing Replica Selection in Data Grids Husni Hamad E. AL-Mistarihi and Chan Huah Yong IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS,
Using Simulated Annealing and Evolution Strategy scheduling capital products with complex product structure By: Dongping SONG Supervisors: Dr. Chris Hicks.
Genetic Algorithms Overview Genetic Algorithms: a gentle introduction –What are GAs –How do they work/ Why? –Critical issues Use in Data Mining –GAs.
Ch 4. The Evolution of Analytic Scalability
Ranga Rodrigo April 6, 2014 Most of the sides are from the Matlab tutorial. 1.
A Budget Constrained Scheduling of Workflow Applications on Utility Grids using Genetic Algorithms Jia Yu and Rajkumar Buyya Grid Computing and Distributed.
Heterogeneity and Dynamicity of Clouds at Scale: Google Trace Analysis [1] 4/24/2014 Presented by: Rakesh Kumar [1 ]
ROBUST RESOURCE ALLOCATION OF DAGS IN A HETEROGENEOUS MULTI-CORE SYSTEM Luis Diego Briceño, Jay Smith, H. J. Siegel, Anthony A. Maciejewski, Paul Maxwell,
Young Suk Moon Chair: Dr. Hans-Peter Bischof Reader: Dr. Gregor von Laszewski Observer: Dr. Minseok Kwon 1.
Efficient and Scalable Computation of the Energy and Makespan Pareto Front for Heterogeneous Computing Systems Kyle M. Tarplee 1, Ryan Friese 1, Anthony.
Genetic Algorithms by using MapReduce
Zorica Stanimirović Faculty of Mathematics, University of Belgrade
Boltzmann Machine (BM) (§6.4) Hopfield model + hidden nodes + simulated annealing BM Architecture –a set of visible nodes: nodes can be accessed from outside.
An Iterative Heuristic for State Justification in Sequential Automatic Test Pattern Generation Aiman H. El-MalehSadiq M. SaitSyed Z. Shazli Department.
A Survey of Distributed Task Schedulers Kei Takahashi (M1)
Optimization Problems - Optimization: In the real world, there are many problems (e.g. Traveling Salesman Problem, Playing Chess ) that have numerous possible.
Applying Genetic Algorithm to the Knapsack Problem Qi Su ECE 539 Spring 2001 Course Project.
Chapter 4.1 Beyond “Classic” Search. What were the pieces necessary for “classic” search.
Autonomic scheduling of tasks from data parallel patterns to CPU/GPU core mixes Published in: High Performance Computing and Simulation (HPCS), 2013 International.
A genetic approach to the automatic clustering problem Author : Lin Yu Tseng Shiueng Bien Yang Graduate : Chien-Ming Hsiao.
Exact and heuristics algorithms
Learning by Simulating Evolution Artificial Intelligence CSMC February 21, 2002.
C OMPARING T HREE H EURISTIC S EARCH M ETHODS FOR F UNCTIONAL P ARTITIONING IN H ARDWARE -S OFTWARE C ODESIGN Theerayod Wiangtong, Peter Y. K. Cheung and.
Capacity Enhancement with Relay Station Placement in Wireless Cooperative Networks Bin Lin1, Mehri Mehrjoo, Pin-Han Ho, Liang-Liang Xie and Xuemin (Sherman)
O PTIMAL SERVICE TASK PARTITION AND DISTRIBUTION IN GRID SYSTEM WITH STAR TOPOLOGY G REGORY L EVITIN, Y UAN -S HUN D AI Adviser: Frank, Yeong-Sung Lin.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
NASA Space Grant Symposium April 11-12, 2013 Multi-Goal Path Planning Based on the Generalized Traveling Salesman Problem with Neighborhoods by Kevin Vicencio.
A Hyper-heuristic for scheduling independent jobs in Computational Grids Author: Juan Antonio Gonzalez Sanchez Coauthors: Maria Serna and Fatos Xhafa.
EE749 I ntroduction to Artificial I ntelligence Genetic Algorithms The Simple GA.
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
Parameter Sweep and Resources Scaling Automation in Scalarm Data Farming Platform J. Liput, M. Paciorek, M. Wrona, M. Orzechowski, R. Slota, and J. Kitowski.
A Fast Genetic Algorithm Based Static Heuristic For Scheduling Independent Tasks on Heterogeneous Systems Gaurav Menghani Department of Computer Engineering,
Application of the GA-PSO with the Fuzzy controller to the robot soccer Department of Electrical Engineering, Southern Taiwan University, Tainan, R.O.C.
Genetic Search Algorithms Matt Herbster. Why Another Search?  Designed in the 1950s, heavily implemented under John Holland (1970s)  Genetic search.
Multi-Task Assignment for CrowdSensing in Mobile Social Network Mingjun Xiao ∗, Jie Wu†, Liusheng Huang ∗, Yunsheng Wang‡, and Cong Liu§
Artificial Intelligence By Mr. Ejaz CIIT Sahiwal Evolutionary Computation.
1 Comparative Study of two Genetic Algorithms Based Task Allocation Models in Distributed Computing System Oğuzhan TAŞ 2005.
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
Genetic Algorithm(GA)
Genetic Algorithm. Outline Motivation Genetic algorithms An illustrative example Hypothesis space search.
Hirophysics.com The Genetic Algorithm vs. Simulated Annealing Charles Barnes PHY 327.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Intelligent Exploration for Genetic Algorithms Using Self-Organizing.
Application-level Resource Provisioning
Balancing of Parallel Two-Sided Assembly Lines via a GA based Approach
New Workflow Scheduling Techniques Presentation: Anirban Mandal
Who cares about implementation and precision?
Ch 4. The Evolution of Analytic Scalability
Aiman H. El-Maleh Sadiq M. Sait Syed Z. Shazli
Boltzmann Machine (BM) (§6.4)
Md. Tanveer Anwar University of Arkansas
Case Injected Genetic Algorithms
Presentation transcript:

Security-Driven Heuristics and A Fast Genetic Algorithm for Trusted Grid Job Scheduling Shanshan Song, Ricky Kwok, and Kai Hwang University of Southern California Los Angeles, CA USA Presented by Shanshan Song at the IEEE IPDPS’05, Denver, Colorado, April 6, 2005 The work was supported by the NSF ITR Grant

April 6, Presentation Outline: Motivations Motivations The System Model The System Model  Three security-driven scheduling strategies  To bind security to existing time-driven heuristics for parallel job scheduling  A New Space-Time Genetic Algorithm (STGA) Performance Metrics and Workloads Performance Metrics and Workloads NAS and PSA Benchmark Results NAS and PSA Benchmark Results Conclusions Conclusions

April 6, Motivations Highly shared Grid resources create severe insecurity problems and privacy concerns. Highly shared Grid resources create severe insecurity problems and privacy concerns. Most schedulers ignored the ‘risky’ factor when scheduling large number of jobs in a risky real-life Grid environment. Most schedulers ignored the ‘risky’ factor when scheduling large number of jobs in a risky real-life Grid environment.

April 6, … Deterministic … Adaptive Historical database … Security - Driven Model: High secure Low secure site High demand Low demand job Parallel Job Scheduling Scenario in Risky Computational Grids

April 6, (a) Secure (b) Risky (c) f - Risky Historical database The bad thing always could happen --- Murphy’s Law We are scared, Let us just wait … We don’t care, just do it. I am courageous, not a kid anymore … I calculate too, maybe I am lucky … I run a calculated risk, but wait a while …

April 6, Three Scheduling Modes Secure mode – Allocate jobs only to those Grid sites with security level exceeding the job requirement (SD < SL) Secure mode – Allocate jobs only to those Grid sites with security level exceeding the job requirement (SD < SL) Risky mode – Allocate jobs to any available Grid sites without checking the risk level or the job demand Risky mode – Allocate jobs to any available Grid sites without checking the risk level or the job demand f - risky mode – Allocate jobs to those Grid sites taking at most f risk. E.g.: f = 0.5 (50%) f - risky mode – Allocate jobs to those Grid sites taking at most f risk. E.g.: f = 0.5 (50%) Secure f -Risky Risky Risk Scale: 0 f 100% The Failure Model:

April 6, Scheduling Heuristics under Three Risky Modes Min-Min heuristic: Min-Min heuristic:  For each job, the resource site that gives the earliest expected completion time is determined first. The job that has the minimum earliest expected completion time is determined and then assigned to the corresponding site. Sufferage heuristic: Sufferage heuristic:  The Sufferage heuristic is based on the idea that better mappings can be generated by assigning a site to a job that would “suffer” most in terms of expected completion time if that particular site is not assigned to it. Heuristic operational modes: Heuristic operational modes:  Secure, f - Risky, Risky

April 6, Genetic Algorithm (GA) Genetic Algorithm (GA) is a popular technique used for searching large solution spaces Genetic Algorithm (GA) is a popular technique used for searching large solution spaces  It is powerful for generating good solution  It is not widely deployed for its long computation time Number of Evolution Iterations Solution Quality Generate Random Initial Population Good Solution is found STGA Starting Point GA STGA Traditional GA vs. STGA in term of Number of Evolution Iterations

April 6, How does STGA Work? STGA: Space-Time Genetic Algorithm InputSolution (%,**, ###) (423…56) …… (%,****, ###) (368…89) Lookup Table (%%, ***, ####) One batch of jobs (456 … 34) … (167 … 89) Randomly Generated Solutions (123 … 786) GA Final Solution Initial Population

April 6, STGA Convergence Time Converge at 50 iterations, FAST!!!

April 6, Performance Metrics and Workloads Performance Metrics Performance Metrics  Makespan, slowdown ratio, and average response time  Site utilization  Number of failed jobs & number of risk-taking jobs Numerical Aerodynamic Simulation (NAS) Workload Numerical Aerodynamic Simulation (NAS) Workload  A package contains three months worth of sanitized accounting records for the 128-node iPSC/860 located in the Numerical Aerodynamic Simulation (NAS) Systems Division at NASA Ames Research Center. Parameter Sweep Application (PSA) Workload Parameter Sweep Application (PSA) Workload  Contains a set of independent tasks  Each task has some input files for different parameters

April 6, Performance Results (Makespan) NAS trace workload (16000 jobs, 12 sites) NAS trace workload (16000 jobs, 12 sites) Job arrival rate and workload are from trace data Job arrival rate and workload are from trace data STGA evolution iterations: 100 (GA: STGA evolution iterations: 100 (GA: 1000 iterations)

April 6, Performance Results (Response Time) NAS trace workload (16000 jobs, 12 sites) NAS trace workload (16000 jobs, 12 sites) Job arrival rate and workload are from trace data Job arrival rate and workload are from trace data

April 6, Performance Results (Utilization) NAS trace workload (16000 jobs, 12 sites) NAS trace workload (16000 jobs, 12 sites) Job arrival rate and workload are from trace data Job arrival rate and workload are from trace data

April 6, Scalability Analysis The scalability analysis is conducted on Number of simulated jobs (PSA workload) The scalability analysis is conducted on Number of simulated jobs (PSA workload)  N = 1000, 2000, 5000, and 10000

April 6, Conclusions Security binding technique can be applied to improve any time-driven heuristics for online scheduling of parallel jobs in an open risky Grid computing environment. Security binding technique can be applied to improve any time-driven heuristics for online scheduling of parallel jobs in an open risky Grid computing environment. The new STGA algorithm works by swiftly generating good scheduling solutions based on a prior job execution experience on Grid platforms. Both NAS and PSA benchmark results show the superiority of STGA over the heuristics algorithms applied. The new STGA algorithm works by swiftly generating good scheduling solutions based on a prior job execution experience on Grid platforms. Both NAS and PSA benchmark results show the superiority of STGA over the heuristics algorithms applied.

April 6, Min-Min and Sufferage Heuristics Min-Min heuristics: Min-Min heuristics:  For each job, the resource site that gives the earliest expected completion time is determined first. The job that has the minimum earliest expected completion time is determined and then assigned to the corresponding site. Sufferage heuristics: Sufferage heuristics:  The Sufferage heuristic is based on the idea that better mappings can be generated by assigning a site to a job that would “suffer” most in terms of expected completion time if that particular site is not assigned to it. Job1Job2Job3 Site1357 Site2243 Site36910 Expected Time to Complete Matrix Job1Job2Job3 Site1357 Site2243 Site36910 Expected Time to Complete Matrix Suffer value: 1 1 4

April 6, Genetic Algorithm Overview Genetic Algorithms (GAs) are a popular technique used for searching large solution spaces Genetic Algorithms (GAs) are a popular technique used for searching large solution spaces ‘selection’, ‘crossover’, and ‘mutation’ operations ‘selection’, ‘crossover’, and ‘mutation’ operations Selection  keep good solutions Selection  keep good solutions Crossover  global optimization Crossover  global optimization Mutation  local jumping Mutation  local jumping Initial Population After selection After crossover After mutation

April 6, How does GA apply to job scheduling? What we have: What we have:  A set of resource sites  A number of jobs Solution need to generate: Solution need to generate:  Job and site mapping site4site3site5site2site2 Job1 Job2 Job3 Job4 Job5 One solution (chromosome in GA) site4site3site5site2site2 site3site1site5site3site2 site1site3site4site2site6 Initial population (size=200) Job1 Job2 Job3 Job4 Job5