A system Performance Model Instructor: Dr. Yanqing Zhang Presented by: Rajapaksage Jayampthi S.

Slides:



Advertisements
Similar presentations
Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
Advertisements

Hadi Goudarzi and Massoud Pedram
Distributed Systems CS
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
Martha Garcia.  Goals of Static Process Scheduling  Types of Static Process Scheduling  Future Research  References.
Course Outline Introduction in algorithms and applications Parallel machines and architectures Overview of parallel machines, trends in top-500 Cluster.
Distributed Process Scheduling Summery Distributed Process Scheduling Summery BY:-Yonatan Negash.
A SYSTEM PERFORMANCE MODEL CSC 8320 Advanced Operating Systems Georgia State University Yuan Long.
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
A Multi-Agent Learning Approach to Online Distributed Resource Allocation Chongjie Zhang Victor Lesser Prashant Shenoy Computer Science Department University.
Cross Cluster Migration Remote access support Adianto Wibisono supervised by : Dr. Dick van Albada Kamil Iskra, M. Sc.
Summary for Chapter 5 --Distributed Process Scheduling
Dynamic Load Sharing and Balancing Sig Freund. Outline Introduction Distributed vs. Traditional scheduling Process Interaction models Distributed Systems.
Summary :- Distributed Process Scheduling Prepared BY:- JAYA KALIDINDI.
Chapter 5 Distributed Process Scheduling. 5.1 A System Performance Model --Niharika Muriki.
1 Distributed Operating Systems and Process Scheduling Brett O’Neill CSE 8343 – Group A6.
Parallel Programming Models Jihad El-Sana These slides are based on the book: Introduction to Parallel Computing, Blaise Barney, Lawrence Livermore National.
Designing and Evaluating Parallel Programs Anda Iamnitchi Federated Distributed Systems Fall 2006 Textbook (on line): Designing and Building Parallel Programs.
Low-Power Wireless Sensor Networks
Chapter 3 Parallel Algorithm Design. Outline Task/channel model Task/channel model Algorithm design methodology Algorithm design methodology Case studies.
1 Distributed Process Scheduling: A System Performance Model Vijay Jain CSc 8320, Spring 2007.
Young Suk Moon Chair: Dr. Hans-Peter Bischof Reader: Dr. Gregor von Laszewski Observer: Dr. Minseok Kwon 1.
Y. Kotani · F. Ino · K. Hagihara Springer Science + Business Media B.V Reporter: 李長霖.
Static Process Schedule Csc8320 Chapter 5.2 Yunmei Lu
Performance Model & Tools Summary Hung-Hsun Su UPC Group, HCS lab 2/5/2004.
1 Multiprocessor and Real-Time Scheduling Chapter 10 Real-Time scheduling will be covered in SYSC3303.
Euro-Par, A Resource Allocation Approach for Supporting Time-Critical Applications in Grid Environments Qian Zhu and Gagan Agrawal Department of.
LATA: A Latency and Throughput- Aware Packet Processing System Author: Jilong Kuang and Laxmi Bhuyan Publisher: DAC 2010 Presenter: Chun-Sheng Hsueh Date:
The Owner Share scheduler for a distributed system 2009 International Conference on Parallel Processing Workshops Reporter: 李長霖.
BOF: Megajobs Gracie: Grid Resource Virtualization and Customization Infrastructure How to execute hundreds of thousands tasks concurrently on distributed.
Static Process Scheduling Section 5.2 CSc 8320 Alex De Ruiter
Summary :-Distributed Process Scheduling Prepared By:- Monika Patel.
CSC 8420 Advanced Operating Systems Georgia State University Yi Pan.
Lecture 4 TTH 03:30AM-04:45PM Dr. Jianjun Hu CSCE569 Parallel Computing University of South Carolina Department of.
Shuman Guo CSc 8320 Advanced Operating Systems
Rassul Ayani 1 Performance of parallel and distributed systems  What is the purpose of measurement?  To evaluate a system (or an architecture)  To compare.
Distributed Process Scheduling : A Summary
Lecture 3 : Performance of Parallel Programs Courtesy : MIT Prof. Amarasinghe and Dr. Rabbah’s course note.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
CprE 458/558: Real-Time Systems (G. Manimaran)1 CprE 458/558: Real-Time Systems Distributed Real-Time Systems.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
Data Structures and Algorithms in Parallel Computing Lecture 7.
Static Process Scheduling
CDP Tutorial 3 Basics of Parallel Algorithm Design uses some of the slides for chapters 3 and 5 accompanying “Introduction to Parallel Computing”, Addison.
A System Performance Model Distributed Process Scheduling.
Scheduling MPI Workflow Applications on Computing Grids Juemin Zhang, Waleed Meleis, and David Kaeli Electrical and Computer Engineering Department, Northeastern.
Xi He Golisano College of Computing and Information Sciences Rochester Institute of Technology Rochester, NY THERMAL-AWARE RESOURCE.
Introduction to Real-Time Systems
3/12/2013Computer Engg, IIT(BHU)1 INTRODUCTION-1.
Jamie Unger-Fink John David Eriksen.  Allocation and Scheduling Problem  Better MPSoC optimization tool needed  IP and CP alone not good enough  Communication.
Uses some of the slides for chapters 3 and 5 accompanying “Introduction to Parallel Computing”, Addison Wesley, 2003.
Concurrency and Performance Based on slides by Henri Casanova.
Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,
Genetic algorithms for task scheduling problem J. Parallel Distrib. Comput. (2010) Fatma A. Omara, Mona M. Arafa 2016/3/111 Shang-Chi Wu.
COMP7330/7336 Advanced Parallel and Distributed Computing Task Partitioning Dynamic Mapping Dr. Xiao Qin Auburn University
Parallel Molecular Dynamics A case study : Programming for performance Laxmikant Kale
Pradeep Konduri Static Process Scheduling:  Proceedance process model  Communication system model  Application  Dicussion.
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
Department of Computer Science, Johns Hopkins University Lecture 7 Finding Concurrency EN /420 Instructor: Randal Burns 26 February 2014.
Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.
Optimizing Distributed Actor Systems for Dynamic Interactive Services
Parallel Programming By J. H. Wang May 2, 2017.
Dynamic Graph Partitioning Algorithm
Distributed System Concepts and Architectures
Distributed Process Scheduling: 5.1 A System Performance Model
COMP60621 Fundamentals of Parallel and Distributed Systems
Outline Announcement Distributed scheduling – continued
Parallel Programming in C with MPI and OpenMP
COMP60611 Fundamentals of Parallel and Distributed Systems
Presentation transcript:

A system Performance Model Instructor: Dr. Yanqing Zhang Presented by: Rajapaksage Jayampthi S

Ch.5 Distributed Process Scheduling [Randy Chow, 97]

Introduction Processes (jobs) need to be scheduled before execution. Enhance overall system performance  Process completion time  Processor utilization Performed locally. Performed globally. Processes  executed on remote nodes  migrate from one node to node

Introduction (contd.) Why scheduling is complex?  communication overhead can not be ignored.  effect of the system architecture can not be ignored.  dynamic behavior of the system must be addressed. Chapter presents a model for capturing the effect of communication and system architecture on scheduling.

Section I (Theory) 5.1 A system Performance Model

Outline Process interaction: Example  Precedence Process Model  Communication Process Model  Disjoint Process Model Speedup Refined Speedup Efficiency Loss Workload Sharing Queuing Models

Process interaction: Example We used graph models to describe process communication. Eg: A program computation consisting of 4 processes mapped to a two-processor multiple computer system.  Process interaction is expressed differently in each of the three models.

Precedence Process Model The directed edges denote precedence relationship between processes. may occur communication overhead  If processes connected by an edge are mapped to different processors. model is best applied to the concurrent processes generated by concurrent language construct such as cobegin/coend or fork/join. Scheduling is to minimize the total completion time of the task, includes both computation and communication times.

Communication Process Model processes created to coexist and communicate asynchronously. edges represent the need for communication. the time is not definite, thus the goal of the scheduling may be  Optimize the total cost of communication  and computation. Task is partitioned in such a way that  minimizes the inter-processor communication  and computation costs of processes on processors

Disjoint Process Model process interaction is implicit assume that  processes can run independently  completed in finite time Processes are mapped to the processors  to maximize the utilization of the processors  And minimize the turnaround time of processors. (turnaround time is defined as the sum of services and times due to waiting time of the other processes.)

Speedup Speedup Factor is a function of  Parallel Algorithm  System Architecture  Schedule of execution S can be written as: OSPT= optimal sequential processing time; CPT= concurrent processing time; concurrent algorithm + specific scheduling method OCPT ideal = optimal concurrent processing time on an ideal system; no inter-processor communication overhead + optimal scheduling algorithm. S i = ideal speedup obtained by using a multiple processor system over the best sequential time S d = the degradation of the system due to actual implementation compared to an ideal system

Refined Speedup To distinguished the role of algorithm, system, and scheduling for speedup is further refined. S i can be written as:  n - # processors  m - # tasks in the concurrent algorithm  total computation of the concurrent algorithm > OSPT  RP (Relative Processing) - it shows how much loss of speedup is due to the substitution of the best sequential algorithm by an algorithm better adapted for concurrent implementation.  RP (Relative Concurrency) – measures how for from optimal the usage of the n processor is.  RC=1  best use of the processors  A good concurrent algorithm minimize RP and maximize RC

Refined Speedup (contd.) S d can be written as:   - efficiency loss (loss of parallelism when implemented on real machine) function of scheduling + system architecture.   decompose into two independent terms  =  sched +  syst (not easy scheduling & system are intertwined)  communication overhead can be hidden  A good schedule hides the communication overhead as much as possible.

Efficiency Loss interdependency between scheduling and system factors.  X – multiple computer under investigation  Y’ – scheduling policy that extended for system X from a scheduling policy Y on the corresponding ideal system.   can be expressed as:

Efficiency Loss (contd.) Similarly, for non-ideal system following figure shows decomposition of  due to scheduling and system communication. Impact of communication on system performance must be carefully addressed in the design of distributed scheduling algorithm

Workload Sharing If processes are not constrained by precedence relations and are free to move around. Performance can be further improved by sharing workload. processes can be moved from heavily loaded nodes to idle nodes. Load sharing – static workload distribution Load balancing – dynamic workload distribution Benefits of workload distribution  Increase processor utilization  Improved turnaround time for processors. Migration of processes reduce queuing time – cost additional communication overhead.

Queuing Models TTi – average turnaround time - arrival rate  - service rate

Queuing Models (contd.) Comparison of performance for workload sharing

Section II Recent Work

Distributed Measurements for Estimating and Updating Cellular System Performance [Liang Xiao et al, 2008]  Discuss the number and placement of sensors in a given cell for estimating its signal coverage.  Traditional measurements : signal-to-noise ratio, signal-to- interference ratio, outage probability  New performance model : improve measurement efficiency minimizing the required number of measurement sensors Performance Prediction of Component- and Pattern-based Middleware for Distributed Systems [Shruti Gorappa, 2007]  Design patterns, components, and frameworks are used to build various distributed real-time, and embedded (DRE) systems. ex: high-performance servers, telecommunication systems, and control systems.  way to quantify the performance of components design patterns

Section III Future Work

Develop the comparative performance models for different architectures and validating them experimentally  Eg: Acceptor/Connector, Leader/Follower, Threadpool and publish/subscribe.  Proxy, Pipes/Filters, and Proactor. Develop performance model for large–scale, geographically distributed grid systems  Have to capture resource heterogeneity, infrastructure characteristics.

References [1]Randy Chow & Theodore Johnson, 1997,“Distributed Operating Systems & Algorithms”, (Addison-Wesley), p. 149 to 156. [2] Shruti Gorappa,”Performance Prediction of Component- and Pattern-based Middleware for Distributed Systems”, MDS’07 [3] Liang Xiao et al, “Distributed Measurements for Estimating and Updating Cellular System Performance”, 2008 IEEE [4] Paolo Cremonesi et al,“Performance models for hierarchical grid architectures”, 2006 IEEE