Advancing Parallel Programming Through Program Graph Representation and Unsupervised Code Translation

保存先:

書誌詳細
出版年:	ProQuest Dissertations and Theses (2025)
第一著者:	TehraniJamsaz, Ali
出版事項:	ProQuest Dissertations & Theses
主題:	Computer science Artificial intelligence Information science
オンライン･アクセス:	Citation/Abstract Full Text - PDF
タグ:	タグ追加タグなし, このレコードへの初めてのタグを付けませんか!

MARC


LEADER	00000nab a2200000uu 4500
001	3225668777
003	UK-CbPIL
020			\|a 9798286446667
035			\|a 3225668777
045	2		\|b d20250101 \|b d20251231
084			\|a 66569 \|2 nlm
100	1		\|a TehraniJamsaz, Ali
245	1		\|a Advancing Parallel Programming Through Program Graph Representation and Unsupervised Code Translation
260			\|b ProQuest Dissertations & Theses \|c 2025
513			\|a Dissertation/Thesis
520	3		\|a With the advent of multi-core and many-core systems, developers have increasingly focused on creating parallel programs to harness the potential of this hardware. However, developing parallel programs and high-performance kernels presents a unique set of challenges.Simultaneously, advancements in deep learning (DL) and machine learning (ML) have transformed numerous fields, including software engineering and HPC kernel development. Yet, unlike other domains, applying deep learning models to the HPC field poses distinct difficulties. For example, source code typically exhibits a specific structure, syntax, and semantics, making it challenging to train deep learning models to comprehend these characteristics effectively.Moreover, beyond the general challenges of applying deep learning to understand applications, comprehending parallel applications presents even greater difficulties. These types of applications have unique errors and data-sharing complexities that deep learning models must learn to address.This dissertation presents four studies aimed at enabling deep learning models to better understand parallel and HPC applications. Each study introduces a novel technique to enhance the ability of DL models to comprehend parallel programs.The early chapters mostly focus on the graph representation of parallel programs, with more focus on OpenMP applications. In particular, the first study focuses on how to model OpenMP programs to predict configurations for non-uniform memory Access (NUMA) systems and prefetchers. The second study tries to address limitations in the first study by identifying the flaws that exist in the program representation used in the first study and improving it further. The third study focuses on predicting the runtime of OpenMP applications. In this chapter, an augmented graph representation based on Abstract Syntax Tree (AST) is proposed to predict the runtime of OpenMP kernels. The last study leverages Transformers, and it looks at the problem of parallelization from a different angle. It considers the parallelization problem as a translation task, and an encoder-decoder transformer model is developed to learn how to do this translation in an unsupervised way.The techniques developed in these studies aim to address various challenges associated with applying deep learning models in the HPC domain. They focus on effectively modeling parallel programs and enabling the translation between serial and parallel code.I hope these techniques will inspire further research in this field and help mitigate the challenges inherent to the HPC domain.
653			\|a Computer science
653			\|a Artificial intelligence
653			\|a Information science
773	0		\|t ProQuest Dissertations and Theses \|g (2025)
786	0		\|d ProQuest \|t ProQuest Dissertations & Theses Global
856	4	1	\|3 Citation/Abstract \|u https://www.proquest.com/docview/3225668777/abstract/embedded/6A8EOT78XXH2IG52?source=fedsrch
856	4	0	\|3 Full Text - PDF \|u https://www.proquest.com/docview/3225668777/fulltextPDF/embedded/6A8EOT78XXH2IG52?source=fedsrch