Synergizing High Performance Computing and Machine Learning for Co-Designing Computing Systems

Gorde:

Xehetasun bibliografikoak
Argitaratua izan da:	ProQuest Dissertations and Theses (2025)
Egile nagusia:	Pandey, Santosh
Argitaratua:	ProQuest Dissertations & Theses
Gaiak:	Computer engineering Computer science Design
Sarrera elektronikoa:	Citation/Abstract Full Text - PDF
Etiketak:	Etiketa erantsi Etiketarik gabe, Izan zaitez lehena erregistro honi etiketa jartzen!

MARC


LEADER	00000nab a2200000uu 4500
001	3234535465
003	UK-CbPIL
020			\|a 9798290614465
035			\|a 3234535465
045	2		\|b d20250101 \|b d20251231
084			\|a 66569 \|2 nlm
100	1		\|a Pandey, Santosh
245	1		\|a Synergizing High Performance Computing and Machine Learning for Co-Designing Computing Systems
260			\|b ProQuest Dissertations & Theses \|c 2025
513			\|a Dissertation/Thesis
520	3		\|a The rapid evolution of computing demands from Machine Learning (ML) and High Performance Computing (HPC) applications has exposed the limitations of general-purpose architectures, necessitating a shift towards domain-specific computing. This surge in demand is driven by the need to train massive ML models, process large-scale data in real-time, and execute highly parallelized computations, which general-purpose architectures struggle to handle efficiently. This transition, however, presents significant challenges in system design, particularly in terms of complexity, cost, and human efforts. System design encompasses design and optimization of computing stack ranging from computer architecture, architecture-specific compiler, program optimization and co-designing hardware and software that balances unique constraint requirements like latency, throughput and energy consumption. On one hand, designing an efficient computing system is challenging. On the other hand, there are also opportunities to improve the system design itself. This thesis presents two part work towards designing computing systems for domain-specific applications: system design for HPC applications and ML/HPC for the system design.Towards the first thrust, this thesis introduces C-SAW, a GPU-accelerated HPC system for efficient graph sampling and random walks. C-SAW is the first framework to support a diverse set of mainstream and emerging graph sampling algorithms on GPUs. C-SAW devises a MapReduce-style, bias-centric programming interface that generalizes to diverse algorithms. Towards the second thrust, this thesis lays the groundwork for applying ML in the system microarchitecture by introducing an ML-based microarchitecture performance modeling and performance analysis framework. This thesis introduces ML techniques to accurately model the performance of a microarchitecture, builds an HPC framework for making ML-based microarchitecture simulation efficient, and redesigns ML-based simulation to make it more reusable and adaptable.By building a GPU-accelerated HPC system optimized for graph applications and creating an ML-driven tool for microarchitecture performance analysis, this thesis contributes to both high-efficiency computing systems and improving system evaluation methodologies.
653			\|a Computer engineering
653			\|a Computer science
653			\|a Design
773	0		\|t ProQuest Dissertations and Theses \|g (2025)
786	0		\|d ProQuest \|t ProQuest Dissertations & Theses Global
856	4	1	\|3 Citation/Abstract \|u https://www.proquest.com/docview/3234535465/abstract/embedded/L8HZQI7Z43R0LA5T?source=fedsrch
856	4	0	\|3 Full Text - PDF \|u https://www.proquest.com/docview/3234535465/fulltextPDF/embedded/L8HZQI7Z43R0LA5T?source=fedsrch