Contention-aware scheduling with task duplication J. Parallel Distrib. Comput. (2011) Oliver Sinnen ∗, Andrea To, Manpreet Kaur Tai, Yu-Chang 11/23/2012.

Slides:



Advertisements
Similar presentations
Costas Busch Louisiana State University CCW08. Becomes an issue when designing algorithms The output of the algorithms may affect the energy efficiency.
Advertisements

Presented by Zhu Jinshun Season: Shelving Interference and Joint Identification in Large- scale RFID Systems.
U of Houston – Clear Lake
CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems Resource Reclaiming (Contd.)
1 Enhanced EDF Scheduling Algorithms for Orchestrating Network-wide Active Measurements Prasad Calyam, Chang-Gun Lee Phani Kumar Arava, Dima Krymskiy OARnet,
Chapter 9: Graphs Scheduling Networks Mark Allen Weiss: Data Structures and Algorithm Analysis in Java Lydia Sinapova, Simpson College.
1 Advancing Supercomputer Performance Through Interconnection Topology Synthesis Yi Zhu, Michael Taylor, Scott B. Baden and Chung-Kuan Cheng Department.
Price-based Resource Allocation in Wireless Ad Hoc Networks Yuan Xue, Baochun Li and Klara Nahrstedt University of Illinois at Urbana-Champaign University.
National Tsing Hua University Po-Yang Hsu,Hsien-Te Chen,
A system Performance Model Instructor: Dr. Yanqing Zhang Presented by: Rajapaksage Jayampthi S.
Towards a Realistic Scheduling Model Oliver Sinnen, Leonel Sousa, Frode Eika Sandnes IEEE TPDS, Vol. 17, No. 3, pp , 2006.
I.1 ii.2 iii.3 iv.4 1+1=. i.1 ii.2 iii.3 iv.4 1+1=
Outline Max Flow Algorithm Model of Computation Proposed Algorithm Self Stabilization Contribution 1 A self-stabilizing algorithm for the maximum flow.
1 Brief Announcement: Distributed Broadcasting and Mapping Protocols in Directed Anonymous Networks Michael Langberg: Open University of Israel Moshe Schwartz:
Present by Chen, Ting-Wei Adaptive Task Checkpointing and Replication: Toward Efficient Fault-Tolerant Grids Maria Chtepen, Filip H.A. Claeys, Bart Dhoedt,
I.1 ii.2 iii.3 iv.4 1+1=. i.1 ii.2 iii.3 iv.4 1+1=
1 Analysis of Link Reversal Routing Algorithms Srikanta Tirthapura (Iowa State University) and Costas Busch (Renssaeler Polytechnic Institute)
1 IOE/MFG 543 Chapter 7: Job shops Sections 7.1 and 7.2 (skip section 7.3)
DETERMINATION OF THE TOPOLOGY OF HIGH SURVIVAL HF RADIO COMMUNICATION NETWORK Andrea Abrardo.
Scheduling Parallel Task
More Realistic Power Grid Verification Based on Hierarchical Current and Power constraints 2 Chung-Kuan Cheng, 2 Peng Du, 2 Andrew B. Kahng, 1 Grantham.
A Topology-based ECO Routing Methodology for Mask Cost Minimization Po-Hsun Wu, Shang-Ya Bai, and Tsung-Yi Ho Department of Computer Science and Information.
Task Alloc. In Dist. Embed. Systems Murat Semerci A.Yasin Çitkaya CMPE 511 COMPUTER ARCHITECTURE.
CAFE router: A Fast Connectivity Aware Multiple Nets Routing Algorithm for Routing Grid with Obstacles Y. Kohira and A. Takahashi School of Computer Science.
Of 21 1 Low-Cost Task Scheduling for Distributed-Memory Machines Andrei Radulescu and Arjan J.C. Van Gemund Presented by Bahadır Kaan Özütam.
Lecture 13 Graphs. Introduction to Graphs Examples of Graphs – Airline Route Map What is the fastest way to get from Pittsburgh to St Louis? What is the.
1 Scheduling CEG 4131 Computer Architecture III Miodrag Bolic Slides developed by Dr. Hesham El-Rewini Copyright Hesham El-Rewini.
Extreme-scale computing systems – High performance computing systems Current No. 1 supercomputer Tianhe-2 at petaflops Pushing toward exa-scale computing.
Efficient and Scalable Computation of the Energy and Makespan Pareto Front for Heterogeneous Computing Systems Kyle M. Tarplee 1, Ryan Friese 1, Anthony.
1 中華大學資訊工程學系 Ching-Hsien Hsu ( 許慶賢 ) Localization and Scheduling Techniques for Optimizing Communications on Heterogeneous.
Wen-Hao Liu 1, Yih-Lang Li 1, and Kai-Yuan Chao 2 1 Department of Computer Science, National Chiao-Tung University, Hsin-Chu, Taiwan 2 Intel Architecture.
Static Process Schedule Csc8320 Chapter 5.2 Yunmei Lu
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 March 01, 2005 Session 14.
1 Customer-Aware Task Allocation and Scheduling for Multi-Mode MPSoCs Lin Huang, Rong Ye and Qiang Xu CHhk REliable computing laboratory (CURE) The Chinese.
1 Nasser Alsaedi. The ultimate goal for any computer system design are reliable execution of task and on time delivery of service. To increase system.
LATA: A Latency and Throughput- Aware Packet Processing System Author: Jilong Kuang and Laxmi Bhuyan Publisher: DAC 2010 Presenter: Chun-Sheng Hsueh Date:
Rapid Content Distribution Using An Ordered Seeding Torrent Raja Naresh Dr. Javed Khan Media Communications Networking Research Lab Kent State University.
1 Andreea Chis under the guidance of Frédéric Desprez and Eddy Caron Scheduling for a Climate Forecast Application ANR-05-CIGC-11.
An Efficient Algorithm for Scheduling Instructions with Deadline Constraints on ILP Machines Wu Hui Joxan Jaffar School of Computing National University.
Autonomic scheduling of tasks from data parallel patterns to CPU/GPU core mixes Published in: High Performance Computing and Simulation (HPCS), 2013 International.
Static Process Scheduling Section 5.2 CSc 8320 Alex De Ruiter
Resource Mapping and Scheduling for Heterogeneous Network Processor Systems Liang Yang, Tushar Gohad, Pavel Ghosh, Devesh Sinha, Arunabha Sen and Andrea.
Zibin Zheng DR 2 : Dynamic Request Routing for Tolerating Latency Variability in Cloud Applications CLOUD 2013 Jieming Zhu, Zibin.
Advanced Communication Network Joint Throughput Optimization for Wireless Mesh Networks R 戴智斌 R 蔡永斌 Xiang-Yang.
Learning the Structure of Related Tasks Presented by Lihan He Machine Learning Reading Group Duke University 02/03/2006 A. Niculescu-Mizil, R. Caruana.
Basic Linear Algebra Subroutines (BLAS) – 3 levels of operations Memory hierarchy efficiently exploited by higher level BLAS BLASMemor y Refs. FlopsFlops/
Maze Routing Algorithms with Exact Matching Constraints for Analog and Mixed Signal Designs M. M. Ozdal and R. F. Hentschke Intel Corporation ICCAD 2012.
Dzmitry Kliazovich University of Luxembourg, Luxembourg
Static Process Scheduling
Scalable and Topology-Aware Load Balancers in Charm++ Amit Sharma Parallel Programming Lab, UIUC.
Wajid Minhass, Paul Pop, Jan Madsen Technical University of Denmark
Localized Low-Power Topology Control Algorithms in IEEE based Sensor Networks Jian Ma *, Min Gao *, Qian Zhang +, L. M. Ni *, and Wenwu Zhu +
Energy-Efficient Randomized Switching for Maximizing Lifetime in Tree- Based Wireless Sensor Networks Sk Kajal Arefin Imon, Adnan Khan, Mario Di Francesco,
ICPADS '12 Proceedings of the 2012 IEEE 18th International Conference on Parallel and Distributed Systems, Pages Tianyi Wang, Gang Quan, Shangping.
A stochastic scheduling algorithm for precedence constrained tasks on Grid Future Generation Computer Systems (2011) Xiaoyong Tang, Kenli Li, Guiping Liao,
Uses some of the slides for chapters 3 and 5 accompanying “Introduction to Parallel Computing”, Addison Wesley, 2003.
Optimally Modifying Software for Safety and Functionality Sampath Kannan U.Penn (with Arvind Easwaran & Insup Lee)
Genetic algorithms for task scheduling problem J. Parallel Distrib. Comput. (2010) Fatma A. Omara, Mona M. Arafa 2016/3/111 Shang-Chi Wu.
Carnegie Mellon Lecture 8 Software Pipelining I. Introduction II. Problem Formulation III. Algorithm Reading: Chapter 10.5 – 10.6 M. LamCS243: Software.
An Algorithm for Automatically Obtaining Distributed and Fault Tolerant Static Schedules Alain Girault - Hamoudi Kalla - Yves Sorel - Mihaela Sighireanu.
Topological Sort. Sorting technique over DAGs (Directed Acyclic Graphs) It creates a linear sequence (ordering) for the nodes such that: –If u has an.
Dynamic Pickup and Delivery with Transfers
Introduction | Model | Solution | Evaluation
آشنایی با الگوریتم های زمانبندی
Analysis of Link Reversal Routing Algorithms
Topological Sort.
Topological Sort.
Hyunchul Park, Kevin Fan, Manjunath Kudlur,Scott Mahlke
CS223 Advanced Data Structures and Algorithms
Barrier Coverage with Optimized Quality for Wireless Sensor Networks
Presentation transcript:

Contention-aware scheduling with task duplication J. Parallel Distrib. Comput. (2011) Oliver Sinnen ∗, Andrea To, Manpreet Kaur Tai, Yu-Chang 11/23/ /11/231

Outline Introduction Task scheduling background Classic scheduling Contention-aware scheduling Complexity analyze Experimental evaluation Conclusions 2012/11/232

Introduction Classic scheduling ① Contention-aware scheduling ③ Duplication ② Duplication ④ 2012/11/233

Task scheduling background Directed acyclic graph (DAG) G = (V, E,w, c),called a task graph nodes n ∈ V : tasks edge e ij ∈ E : communication from ni to nj w(n) : n’s computation cost c(e ij ) : communication cost of edge eij ∈ E 2012/11/234

Task scheduling background set P t s (n, P) : start time of n on processor P ∈ P t f (n, P) : finish time of n on processor P ∈ P t f (n, P) = t s (n, P) + w(n) proc(n) t f (P) = max n ∈ V:proc(n)=P { t f (n, P) } sl( & ) = max n ∈ V { t f (n, proc(n)) } t f (e ij, P src, P dst ) 2012/11/235

Task scheduling background pred(n i ) succ(n i ) pred(n) = ∅ (source node) succ(n) = ∅ (sink node) nini nxnx nxnx nxnx nxnx nxnx nxnx nxnx nxnx 2012/11/236

Two conditions must be fulfilled Two Constraint (1) (2) (3) (4) if pred(n) = ∅ (n is source node) t dr (n) = t dr (n, P) = 0, for all P ∈ P 2012/11/237

Technique used Insertion technique max{A, t dr (n, P)} + w(n) ≤ B - A free node can be scheduled on processor P within the idle time interval [A, B], A, B ∈ [0,∞], i.e. an interval in which no task is executed End technique [A, B] = [t f (P),∞]. 2012/11/238

Introduction Classic scheduling ① (1) classic model (2) List scheduling Contention-aware scheduling ③ (1) Contention model (2) topology network Duplication ② (1) impact on the formulation Duplication ④ 2012/11/239

Classic model Traditionally, most scheduling algorithms have employed a strongly idealised model of the target parallel system,called the classic model properties: (i) local communication has zero costs (ii) communication is performed by a communication subsystem (iii) communication can be performed concurrently (iv) The communication network is fully connected Edge Finish Time 2012/11/2310

List scheduling Using either the insertion or the end technique t s (n, P) = max{A, t dr (n, P)} t f (n, P) = t s (n, P) + w(n) 2012/11/2311

List scheduling A B C D E F 2012/11/2312

List scheduling with dupication (2)become (3)become exemple 2012/11/2313

Contention Model To make task scheduling contention aware(more realistic) The awareness for contention is achieved by edge scheduling properties: (i) local communication has zero costs (ii) communication is performed by a communication subsystem X (iii) communication can be performed concurrently X (iv) The communication network is fully connected Edge Finish Time 2012/11/2314

topology network 2012/11/2315

Exemple(Contention model) 2012/11/2316

Contention-aware scheduling with task duplication 2012/11/2317

2012/11/2318

exemple 2012/11/2319 Contention-aware scheduling with task duplication A C B D E

Complexity analyze O(VlogV+E) O(V) O(P) O(1) O(|P||V|*Algo3) 2012/11/2320

O(P*E) O(1) O(V) O(P)O(E)O(routing) O(|P||V||E| 2 (routing)) O(E) Insertion technique 2012/11/2321

Complexity analyze O( |P| 2 |V| 2 |E| 2 (routing) ) Be aware that this is the worst-case complexity, which should be significantly higher than the expected average case complexity in this case For comparison, the second part of a contention-aware list scheduling with the insertion technique is O(|V| 2 + |P| |E| 2 (routing)) 2012/11/2322

Experimental evaluation Classic scheduling ① (1) Contention-aware scheduling ③ (3) Duplication ② (2) Duplication ④ (4) 2012/11/2323

Experimental evaluation full-duplex half-duplex 2012/11/2324

vs /11/2325

/11/2326

Conclusions Duplication under the contention model is significantly better than under the classic model Task duplication is even more beneficial under the contention model than under the classic model, and this effect increases for more restricted networks The algorithm was proposed based on state-of-the- art scheduling techniques found in task duplication algorithms and other contention-aware algorithms 2012/11/2327