Synergizing High Performance Computing and Machine Learning for Co-Designing Computing Systems

Gorde:
Xehetasun bibliografikoak
Argitaratua izan da:ProQuest Dissertations and Theses (2025)
Egile nagusia: Pandey, Santosh
Argitaratua:
ProQuest Dissertations & Theses
Gaiak:
Sarrera elektronikoa:Citation/Abstract
Full Text - PDF
Etiketak: Etiketa erantsi
Etiketarik gabe, Izan zaitez lehena erregistro honi etiketa jartzen!

MARC

LEADER 00000nab a2200000uu 4500
001 3234535465
003 UK-CbPIL
020 |a 9798290614465 
035 |a 3234535465 
045 2 |b d20250101  |b d20251231 
084 |a 66569  |2 nlm 
100 1 |a Pandey, Santosh 
245 1 |a Synergizing High Performance Computing and Machine Learning for Co-Designing Computing Systems 
260 |b ProQuest Dissertations & Theses  |c 2025 
513 |a Dissertation/Thesis 
520 3 |a The rapid evolution of computing demands from Machine Learning (ML) and High Performance Computing (HPC) applications has exposed the limitations of general-purpose architectures, necessitating a shift towards domain-specific computing. This surge in demand is driven by the need to train massive ML models, process large-scale data in real-time, and execute highly parallelized computations, which general-purpose architectures struggle to handle efficiently. This transition, however, presents significant challenges in system design, particularly in terms of complexity, cost, and human efforts. System design encompasses design and optimization of computing stack ranging from computer architecture, architecture-specific compiler, program optimization and co-designing hardware and software that balances unique constraint requirements like latency, throughput and energy consumption. On one hand, designing an efficient computing system is challenging. On the other hand, there are also opportunities to improve the system design itself. This thesis presents two part work towards designing computing systems for domain-specific applications: system design for HPC applications and ML/HPC for the system design.Towards the first thrust, this thesis introduces C-SAW, a GPU-accelerated HPC system for efficient graph sampling and random walks. C-SAW is the first framework to support a diverse set of mainstream and emerging graph sampling algorithms on GPUs. C-SAW devises a MapReduce-style, bias-centric programming interface that generalizes to diverse algorithms. Towards the second thrust, this thesis lays the groundwork for applying ML in the system microarchitecture by introducing an ML-based microarchitecture performance modeling and performance analysis framework. This thesis introduces ML techniques to accurately model the performance of a microarchitecture, builds an HPC framework for making ML-based microarchitecture simulation efficient, and redesigns ML-based simulation to make it more reusable and adaptable.By building a GPU-accelerated HPC system optimized for graph applications and creating an ML-driven tool for microarchitecture performance analysis, this thesis contributes to both high-efficiency computing systems and improving system evaluation methodologies. 
653 |a Computer engineering 
653 |a Computer science 
653 |a Design 
773 0 |t ProQuest Dissertations and Theses  |g (2025) 
786 0 |d ProQuest  |t ProQuest Dissertations & Theses Global 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3234535465/abstract/embedded/L8HZQI7Z43R0LA5T?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/3234535465/fulltextPDF/embedded/L8HZQI7Z43R0LA5T?source=fedsrch