Enhancing MPI remote memory access model for distributed-memory systems through one-sided broadcast implementation

Guardat en:

Dades bibliogràfiques
Publicat a:	Journal of Physics: Conference Series vol. 2697, no. 1 (Feb 2024), p. 012035
Autor principal:	Abuelsoud, M M
Altres autors:	Kogutenko, A A
Publicat:	IOP Publishing
Matèries:	Synchronism Algorithms Message passing Distributed memory Performance tests
Accés en línia:	Citation/Abstract Full Text - PDF
Etiquetes:	Afegir etiqueta Sense etiquetes, Sigues el primer a etiquetar aquest registre!

MARC


LEADER	00000nab a2200000uu 4500
001	2924353985
003	UK-CbPIL
022			\|a 1742-6588
022			\|a 1742-6596
024	7		\|a 10.1088/1742-6596/2697/1/012035 \|2 doi
035			\|a 2924353985
045	2		\|b d20240201 \|b d20240229
100	1		\|a Abuelsoud, M M \|u Department of Computer Science and Engineering, Saint Petersburg Electrotechnical University “LETI” , ul. Professora Popova 5, Saint Petersburg 197022 , Russia
245	1		\|a Enhancing MPI remote memory access model for distributed-memory systems through one-sided broadcast implementation
260			\|b IOP Publishing \|c Feb 2024
513			\|a Journal Article
520	3		\|a Efficiently processing vast and expanding data volumes is a pressing challenge. Traditional high-performance computers, utilizing distributed-memory architecture and a message-passing model, grapple with synchronization issues, hampering their ability to keep up with the growing demands. Remote Memory Access (RMA), often referred to as one-sided MPI communications, offers a solution by allowing a process to directly access another process’s memory, eliminating the need for message exchange and significantly boosting performance. Unfortunately, the existing MPI RMA standard lacks a collective operation interface, limiting efficiency. To overcome this constraint, we introduce an algorithm design that enables efficient parallelizable collective operations within the RMA framework. Our study focuses primarily on the advantages of collective operations, using the broadcast algorithm as a case study. Our implementations surpass traditional methods, highlighting the promising potential of this technique, as indicated by initial performance tests.
653			\|a Synchronism
653			\|a Algorithms
653			\|a Message passing
653			\|a Distributed memory
653			\|a Performance tests
700	1		\|a Kogutenko, A A \|u Department of Computer Science and Engineering, Saint Petersburg Electrotechnical University “LETI” , ul. Professora Popova 5, Saint Petersburg 197022 , Russia
773	0		\|t Journal of Physics: Conference Series \|g vol. 2697, no. 1 (Feb 2024), p. 012035
786	0		\|d ProQuest \|t Advanced Technologies & Aerospace Database
856	4	1	\|3 Citation/Abstract \|u https://www.proquest.com/docview/2924353985/abstract/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch
856	4	0	\|3 Full Text - PDF \|u https://www.proquest.com/docview/2924353985/fulltextPDF/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch