X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, 20-05-2010, Melbourne, Australia Handling Recoverable Temporal Violations in Scientific Workflow.

Slides:



Advertisements
Similar presentations
Page 1 V1 Delta Squared Performance Improvement - background Improve performance of Hotels and Hotel Companies – by optimising revenue generation, sales.
Advertisements

1 An Adaptive GA for Multi Objective Flexible Manufacturing Systems A. Younes, H. Ghenniwa, S. Areibi uoguelph.ca.
SLA-Oriented Resource Provisioning for Cloud Computing
Designing a DTC Verification System Jennifer Mahoney NOAA/ESRL 21 Feb 2007.
All Hands Meeting, 2006 Title: Grid Workflow Scheduling in WOSE (Workflow Optimisation Services for e- Science Applications) Authors: Yash Patel, Andrew.
Resource Management of Grid Computing
Data Management in Cloud Workflow Systems Dong Yuan Faculty of Information and Communication Technology Swinburne University of Technology.
A Cost-Effective Strategy for Intermediate Data Storage in Scientific Cloud Workflow Systems Dong Yuan, Yun Yang, Xiao Liu, Jinjun Chen Swinburne University.
Software Engineering COMP 201
VERTAF: An Application Framework for Design and Verification of Embedded Real-Time Software Pao-Ann Hsiung, Shang-Wei Lin, Chih-Hao Tseng, Trong-Yen Lee,
SE curriculum in CC2001 made by IEEE and ACM: Overview and Ideas for Our Work Katerina Zdravkova Institute of Informatics
A Grid Resource Broker Supporting Advance Reservations and Benchmark- Based Resource Selection Erik Elmroth and Johan Tordsson Reporter : S.Y.Chen.
Genetic Algorithms for multiple resource constraints Production Scheduling with multiple levels of product structure By : Pupong Pongcharoen (Ph.D. Research.
CoLaB 22nd December 2005 Secure Access to Service-based Collaborative Workflow for DAME Duncan Russell Informatics Institute University of Leeds, UK.
Grid Load Balancing Scheduling Algorithm Based on Statistics Thinking The 9th International Conference for Young Computer Scientists Bin Lu, Hongbin Zhang.
1 Software Testing and Quality Assurance Lecture 1 Software Verification & Validation.
A Progressive Fault Tolerant Mechanism in Mobile Agent Systems Michael R. Lyu and Tsz Yeung Wong July 27, 2003 SCI Conference Computer Science Department.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Dr. Xiao Liu Sessional Lecturer, Research Fellow Centre of SUCCESS Swinburne University of Technology Melbourne, Australia Overview: Cloud Computing and.
Introduction of CS3 and Research in Workflow Technology Program Xiao Liu CS3, Swinburne University of Technology Melbourne, Australia.
Self-Organizing Agents for Grid Load Balancing Junwei Cao Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)
Introduction to Parallel Programming MapReduce Except where otherwise noted all portions of this work are Copyright (c) 2007 Google and are licensed under.
June 10, 2007, IDAR’07, Beijing, China 1 Ensuring Consistent Termination of Composite Web Services An Liu 1,2,3 and Qing Li 2,3 1 Department of Computer.
A Budget Constrained Scheduling of Workflow Applications on Utility Grids using Genetic Algorithms Jia Yu and Rajkumar Buyya Grid Computing and Distributed.
KDD 2012, Beijing, China Community Discovery and Profiling with Social Messages Wenjun Zhou Hongxia Jin Yan Liu.
Self-Organizing Agents for Grid Load Balancing Junwei Cao, Ph.D. Research Scientist Center for Space Research Massachusetts Institute of Technology Cambridge,
Workflow Project Luciano Piccoli Illinois Institute of Technology.
SLA-based Resource Allocation for Software as a Service Provider (SaaS) in Cloud Computing Environments Author Linlin Wu, Saurabh Kumar Garg and Rajkumar.
Fault-Tolerant Workflow Scheduling Using Spot Instances on Clouds Deepak Poola, Kotagiri Ramamohanarao, and Rajkumar Buyya Cloud Computing and Distributed.
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
Young Suk Moon Chair: Dr. Hans-Peter Bischof Reader: Dr. Gregor von Laszewski Observer: Dr. Minseok Kwon 1.
Xiao Liu CITR - Centre for Information Technology Research Swinburne University of Technology, Australia Temporal Verification in Grid/
Embedded Multicore processing for Mobile Communications Real-time Contracts and Thread Profiling York Jack Whitham.
Xiao Liu CS3 -- Centre for Complex Software Systems and Services Swinburne University of Technology, Australia Key Research Issues in.
Xiao Liu, Jinjun Chen, Ke Liu, Yun Yang CS3: Centre for Complex Software Systems and Services Swinburne University of Technology, Melbourne, Australia.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Agent-Based Hybrid Intelligent Systems and Their Dynamic Reconfiguration Zili Zhang Faculty of Computer and Information Science Southwest University
Issues in (Financial) High Performance Computing John Darlington Director Imperial College Internet Centre Fast Financial Algorithms and Computing 4th.
Chapter 10 Analysis and Design Discipline. 2 Purpose The purpose is to translate the requirements into a specification that describes how to implement.
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
Job scheduling algorithm based on Berger model in cloud environment Advances in Engineering Software (2011) Baomin Xu,Chunyan Zhao,Enzhao Hua,Bin Hu 2013/1/251.
End-To-End Scheduling Angelo Corsaro & Venkita Subramonian Department of Computer Science Washington University Distributed Systems Seminar, Spring 2003.
Xiao Liu 1, Yun Yang 1, Jinjun Chen 1, Qing Wang 2, and Mingshu Li 2 1 Centre for Complex Software Systems and Services Swinburne University of Technology.
X. Liu, Z. Ni, Z. Wu, D. Yuan, J. Chen, Y. Yang, ICPADS10, , Shanghai, China An Effective Framework for Handling Recoverable Temporal Violations.
Application of Fault Injection to Globus Grid Middleware Nik Looker & Jie Xu University of Leeds, Leeds. LS2 9JT, UK Tianyu Wo & Jinpeng Huai Beihang University,
Author Utility-Based Scheduling for Bulk Data Transfers between Distributed Computing Facilities Xin Wang, Wei Tang, Raj Kettimuthu,
1 Καστοριά Μάρτιος 13, 2009 Efficient Service Task Assignment in Grid Computing Environments Dr Angelos Michalas Technological Educational Institute of.
Multi-Task Assignment for CrowdSensing in Mobile Social Network Mingjun Xiao ∗, Jie Wu†, Liusheng Huang ∗, Yunsheng Wang‡, and Cong Liu§
Xiao Liu, Jinjun Chen, Yun Yang CS3: Centre for Complex Software Systems and Services Swinburne University of Technology, Melbourne, Australia {xliu, jchen,
Agent-Based Grid Load-Balancing Daniel P. Spooner University of Warwick, UK Junwei Cao NEC Europe Ltd., Germany.
Use of Performance Prediction Techniques for Grid Management Junwei Cao University of Warwick April 2002.
1 Comparative Study of two Genetic Algorithms Based Task Allocation Models in Distributed Computing System Oğuzhan TAŞ 2005.
INTELLIGENT TEST SCHEDULING TE-MPE Technical Meeting Michael Galetzka.
IPDPS 2003, Nice, France Agent-Based Grid Load Balancing Using Performance-Driven Task Scheduling Junwei Cao (C&C Research Labs, NEC Europe Ltd., Germany)
18 May 2006CCGrid2006 Dynamic Workflow Management Using Performance Data Lican Huang, David W. Walker, Yan Huang, and Omer F. Rana Cardiff School of Computer.
Jérémie Sublime Sonia Yassa Development of meta-heuristics for workflow scheduling based on quality of service requirements 1.
Agent-Based Grid Load-Balancing
Introduction to Load Balancing:
Astronomical Data Processing & Workflow Scheduling in cloud
DuraStore – Achieving Highly Durable Data Centers
Amity University, Noida, India
Probability-based Evolutionary Algorithms
Paper Report in ECCO group
Dept. of MMME, University of Newcastle upon Tyne
Dept. of MMME, University of Newcastle upon Tyne
Utility-Function based Resource Allocation for Adaptable Applications in Dynamic, Distributed Real-Time Systems Presenter: David Fleeman {
Dept. of MMME, University of Newcastle upon Tyne
Knowledge Management via Workflow System in a Distributed Environment
Automated traffic congestion estimation via public video feeds
Presentation transcript:

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Handling Recoverable Temporal Violations in Scientific Workflow Systems: A Workflow Rescheduling Based Strategy Xiao Liu 1, Jinjun Chen 1, Zhangjun Wu 2, Zhiwei Ni 2, Dong Yuan 1, Yun Yang 1 1 CS3, Swinburne University of Technology Melbourne, Australia 2 Institute of Intelligent Management, Hefei University of Technology Hefei, China

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Outline > Background – Workflow Technology Group – SwinDeW Family – SwinGrid, SwinCloud > Workflow Rescheduling – Workflow Rescheduling – Temporal Violations – A Two Stage Workflow Local Rescheduling Strategy – Evaluation > Summary 2

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Workflow Technology Group Overview >About Us: WT group is a part of CS3 ( The Centre for Complex Software Systems and Services), a Tier-1 university research centre at Swinburne University of Technology. Our group conducts research into workflow technologies for complex software systems and services including peer-to- peer, grid, and cloud computing based e-science, e-business, transactional and inter-organisational workflows. 3 Leader: Prof Yun Yang Visitors (7-8/09): Prof Lee Osterweil Prof. Lori Clarke Researchers: Dr Jinjun Chen Nauman Saeed (PhD) Qiang He (PhD) Ke Liu (PhD) Xiao Liu (PhD) Dong Yuan (PhD) Zhangjun Wu (PhD - visitor ) Others: Prof Ryszard Kowalczyk Prof Chengfei Liu Dr Jun Yan (Wollongong) Prof Hai Jin (HUST) Prof Mingshu Li (ISCAS) Prof Qing Wang (ISCAS) Prof Zhiwei Ni (HFUT) Prof Jinpeng Huai (BUAA)

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia SwinDeW Family SwinDeW – Swin burne De centralised W orkflow - foundation prototype based on p2p >SwinDeW – past >SwinDeW-A (for Agents) – ARC DP06 >SwinDeW-G (for Grid) – past >SwinDeW-V (for Verification) – current (pending ARC DP) >SwinDeW-E (for eScience) – current (pending ARC DP) >SwinDeW-C (for cloud) – current (ARC LP) >Others: SwinDeW-B / -S / -P / -G – past 4

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia SwinGrid to SwinCloud 5

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Outline > Background – Workflow Technology Group – SwinDeW Family – SwinGrid, SwinCloud > Workflow Rescheduling – Workflow Rescheduling – Temporal Violations – A Two Stage Workflow Local Rescheduling Strategy – Evaluation > Summary 6

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Workflow Rescheduling >A workflow scheduling is a process that maps the execution of inter-dependent workflow tasks on the distributed resources. It allocates suitable resources to workflow tasks so that the execution can be completed to satisfy objective functions imposed by users. >Workflow rescheduling is a process to regenerate or modify the current scheduling plan, due to –Changes of system environments: resource pool change, resource performance variance, resource break down, etc. –Contract (constraint) violations: task execution failures, temporal violations, cost violations, etc. –Many others: when the current scheduling plan cannot satisfy the pre-defined functional or non-functional objectives

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Workflow Rescheduling >Workflow rescheduling plays an equally or even more important role as workflow scheduling in highly distributed and dynamic computing environments –As a supplement function to workflow scheduling in the resource management component –As a function in the exception handling component >Example rescheduling strategy: –Rescheduling by Stop and Restart –Rescheduling by Processor Swapping >Simple (little additional programming); high cost for resource reservation

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Temporal Violations >Most scientific workflows are subjected to –Local temporal constraints (milestones) for workflow segments –Global temporal constraints (deadlines) for workflow instance >Violations of local and global temporal constraints –Detection: Temporal Checkpoint Selection (ICSE08, TOSEM10) –Handling: Time deficits compensation (CCPE07, CCGrid10) >Statistical recoverable temporal violations –Probability based temporal consistency model (BPM08, ICSP09) –Based on “3sigm” rule in statistics –Details omitted, please refer the paper for details

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Outline > Background – Workflow Technology Group – SwinDeW Family – SwinGrid, SwinCloud > Workflow Rescheduling – Workflow Rescheduling – Temporal Violations – A Two Stage Workflow Local Rescheduling Strategy – Evaluation > Summary 10

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia A Two Stage Workflow Local Rescheduling Strategy >For handling temporal violations >Key objective: reduce or ideally remove the time deficit at the current checkpoint, i.e. to reduce the execution time of the subsequent activities after the checkpoint in the violated workflow segment as much as possible >Requirement 1: fighting good balance between time deficit compensation and the completion time of other activities (workflow activities and general tasks, with or without temporal constraints) – from the overall makespan perspective >Requirement 2: utilising available resources in the system rather than recruiting additional resources – from the overall cost perspective 11

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Integrated Task Resource List 12

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia 13 Pseudo-code for An Abstract Strategy

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Evaluation >Two example implementation of our strategy –An ant colony optimisation based strategy –A genetic algorithm based strategy

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Optimisation on Total Makespan 15

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Optimisation on Total Cost 16

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Compensation on Violated Workflow Segment 17

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Fitness Value 18

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia CPU Time 19

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Experiment Results on Temporal Violation Rates 20

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Outline > Background – Scientific Workflows – Workflow Scheduling and Rescheduling > Workflow Scheduling – Classification – Representative Scheduling Algorithms > Workflow Rescheduling – Classification – A Two Stage Workflow Local Rescheduling Strategy – Case Study 1:GA Based Rescheduling – Case Study 2: ACO Based Rescheduling – Comparison > Summary 21

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia Summary >Workflow Rescheduling – Exception Handling >Exception Handling on Scientific Workflow Temporal Violations –A Two Stage Workflow Local Rescheduling Strategy ACO, GA –Automatic, Cost-Effective >Future Work –Data movement cost –More scheduling algorithms 22

X. Liu, J. Chen, Z. Wu, Z. Ni, D. Yuan, Y. Yang, CCGrid10, , Melbourne, Australia The End – Thank You! >Any questions or comments? > >Website: >CS3: 23