Optimization of Collective Communication Operations in MPICH
Argonne National Laboratory · University of Stuttgart · +1 more institution
Abstract
We describe our work on improving the performance of collective communication operations in MPICH for clusters connected by switched networks. For each collective operation, we use multiple algorithms depending on the message size, with the goal of minimizing latency for short messages and minimizing bandwidth use for long messages. Although we have implemented new algorithms for all MPI (Message Passing Interface) collective operations, because of limited space we describe only the algorithms for allgather, broadcast, all-to-all, reduce-scatter, reduce, and allreduce. Performance results on a Myrinet-connected Linux cluster and an IBM SP indicate that, in all cases, the new algorithms significantly outperform…
Citation impact
- FWCI
- 20.85
- Percentile
- 100%
- References
- 38
Authors
3Topics & keywords
- Myrinet
- Computer science
- IBM
- Message Passing Interface
- Latency (audio)
- Computer cluster
- Parallel computing
- Operating system