Download presentation
Presentation is loading. Please wait.
1
A Performance Model of Many-to-One Collective Communications for Parallel Computing
Alexey Lastovetsky, Maureen O’Flynn UCD School of Computer Science and Informatics Belfield, Dublin 4, Ireland 23 April 2019
2
Objectives Goal: prediction of the execution time of MPI collective communications on a heterogeneous cluster based on a switched Ethernet Background: performance models of single point-to-point, simultaneous independent point-to-point, one-to-many communications Observation: a significant increase in the execution time of many-to-one communication for medium-sized messages on all platforms and MPI implementations Problem: to model many-to-one communications for medium-sized messages
3
Performance models for point-to-point and one-to-many communications
point-to-point: - execution time - message size - fixed delays - variable delays - transmission rate one-to-many:
4
Many-to-one collective communications: non-linear and non-deterministic escalations
Seconds 0.04 .00001 Message size in KB
5
Parameters of many-to-one model for medium-sized messages
M MC M2 message size where escalations begin escalations stop occuring message size from which escalations occur with 100% certainty probabilities of escalations
6
Probability of escalation
Discrete constant levels of escalation of values of 40, 200 and 250 times Probability of escalation to level is found
7
Many-to-one model for small messages
8
Many-to-one model for large messages
9
Multi-spectral satellite application
A typical real-time satellite imaging application (512x512 bytes) A sequence of raw data images divided into partitions for parallel processing by a cluster M2 Mc M1 n1 n2
10
Redesigning Application
Calculate the number of sub-partitions m of a partition of the medium size M so that: Replace MPI_Gather with sequence of MPI_Gather for smaller messages
11
Conclusion Results previously undocumented non-linear non-deterministic behaviour for medium messages is analysed many-to-one model is built on the empirical data and point-to-point model parallel application is redesigned in accordance with many-to-one model The work was supported by Science Foundation Ireland.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.