Towards Efficient HPC: Exploring Overlap Strategies Using MPI Non-Blocking Communication
Uloženo v:
| Vydáno v: | Mathematics vol. 13, no. 11 (2025), p. 1848 |
|---|---|
| Hlavní autor: | |
| Další autoři: | |
| Vydáno: |
MDPI AG
|
| Témata: | |
| On-line přístup: | Citation/Abstract Full Text + Graphics Full Text - PDF |
| Tagy: |
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
|
MARC
| LEADER | 00000nab a2200000uu 4500 | ||
|---|---|---|---|
| 001 | 3217738860 | ||
| 003 | UK-CbPIL | ||
| 022 | |a 2227-7390 | ||
| 024 | 7 | |a 10.3390/math13111848 |2 doi | |
| 035 | |a 3217738860 | ||
| 045 | 2 | |b d20250101 |b d20251231 | |
| 084 | |a 231533 |2 nlm | ||
| 100 | 1 | |a Zheng Yuntian | |
| 245 | 1 | |a Towards Efficient HPC: Exploring Overlap Strategies Using MPI Non-Blocking Communication | |
| 260 | |b MDPI AG |c 2025 | ||
| 513 | |a Journal Article | ||
| 520 | 3 | |a As high-performance computing (HPC) platforms continue to scale up, communication costs have become a critical bottleneck affecting overall application performance. An effective strategy to overcome this limitation is to overlap communication with computation. The Message Passing Interface (MPI), as the de facto standard for communication in HPC, provides non-blocking communication primitives that make such overlapping feasible. By enabling asynchronous communication, non-blocking operations reduce idle time of cores caused by data transfer delays, thereby improving resource utilization. Overlapping communication with computation is particularly important for enhancing the performance of large-scale scientific applications, such as numerical simulations, climate modeling, and other data-intensive tasks. However, achieving efficient overlapping is non-trivial and depends not only on advances in hardware technologies such as Remote Direct Memory Access (RDMA), but also on well-designed and optimized MPI implementations. This paper presents a comprehensive survey on the principles of MPI non-blocking communication, the core techniques for achieving computation–communication overlap, and some representative applications in scientific computing. Alongside the survey, we include a preliminary experimental study evaluating the effectiveness of asynchronous progress mechanism on modern HPC platforms to support the development of parallel programs for HPC researchers and practitioners. | |
| 653 | |a Data transfer (computers) | ||
| 653 | |a Performance enhancement | ||
| 653 | |a Message passing | ||
| 653 | |a Computation | ||
| 653 | |a Parallel programming | ||
| 653 | |a Protocol | ||
| 653 | |a Communication | ||
| 653 | |a Bandwidths | ||
| 653 | |a Climate models | ||
| 653 | |a Optimization | ||
| 653 | |a Weather | ||
| 653 | |a Effectiveness | ||
| 653 | |a Network interface cards | ||
| 653 | |a Weather forecasting | ||
| 653 | |a Resource utilization | ||
| 653 | |a Data transmission | ||
| 700 | 1 | |a Wu, Jianping | |
| 773 | 0 | |t Mathematics |g vol. 13, no. 11 (2025), p. 1848 | |
| 786 | 0 | |d ProQuest |t Engineering Database | |
| 856 | 4 | 1 | |3 Citation/Abstract |u https://www.proquest.com/docview/3217738860/abstract/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch |
| 856 | 4 | 0 | |3 Full Text + Graphics |u https://www.proquest.com/docview/3217738860/fulltextwithgraphics/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch |
| 856 | 4 | 0 | |3 Full Text - PDF |u https://www.proquest.com/docview/3217738860/fulltextPDF/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch |