Towards Efficient HPC: Exploring Overlap Strategies Using MPI Non-Blocking Communication

Uloženo v:
Podrobná bibliografie
Vydáno v:Mathematics vol. 13, no. 11 (2025), p. 1848
Hlavní autor: Zheng Yuntian
Další autoři: Wu, Jianping
Vydáno:
MDPI AG
Témata:
On-line přístup:Citation/Abstract
Full Text + Graphics
Full Text - PDF
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

MARC

LEADER 00000nab a2200000uu 4500
001 3217738860
003 UK-CbPIL
022 |a 2227-7390 
024 7 |a 10.3390/math13111848  |2 doi 
035 |a 3217738860 
045 2 |b d20250101  |b d20251231 
084 |a 231533  |2 nlm 
100 1 |a Zheng Yuntian 
245 1 |a Towards Efficient HPC: Exploring Overlap Strategies Using MPI Non-Blocking Communication 
260 |b MDPI AG  |c 2025 
513 |a Journal Article 
520 3 |a As high-performance computing (HPC) platforms continue to scale up, communication costs have become a critical bottleneck affecting overall application performance. An effective strategy to overcome this limitation is to overlap communication with computation. The Message Passing Interface (MPI), as the de facto standard for communication in HPC, provides non-blocking communication primitives that make such overlapping feasible. By enabling asynchronous communication, non-blocking operations reduce idle time of cores caused by data transfer delays, thereby improving resource utilization. Overlapping communication with computation is particularly important for enhancing the performance of large-scale scientific applications, such as numerical simulations, climate modeling, and other data-intensive tasks. However, achieving efficient overlapping is non-trivial and depends not only on advances in hardware technologies such as Remote Direct Memory Access (RDMA), but also on well-designed and optimized MPI implementations. This paper presents a comprehensive survey on the principles of MPI non-blocking communication, the core techniques for achieving computation–communication overlap, and some representative applications in scientific computing. Alongside the survey, we include a preliminary experimental study evaluating the effectiveness of asynchronous progress mechanism on modern HPC platforms to support the development of parallel programs for HPC researchers and practitioners. 
653 |a Data transfer (computers) 
653 |a Performance enhancement 
653 |a Message passing 
653 |a Computation 
653 |a Parallel programming 
653 |a Protocol 
653 |a Communication 
653 |a Bandwidths 
653 |a Climate models 
653 |a Optimization 
653 |a Weather 
653 |a Effectiveness 
653 |a Network interface cards 
653 |a Weather forecasting 
653 |a Resource utilization 
653 |a Data transmission 
700 1 |a Wu, Jianping 
773 0 |t Mathematics  |g vol. 13, no. 11 (2025), p. 1848 
786 0 |d ProQuest  |t Engineering Database 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3217738860/abstract/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch 
856 4 0 |3 Full Text + Graphics  |u https://www.proquest.com/docview/3217738860/fulltextwithgraphics/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/3217738860/fulltextPDF/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch