Synergizing High Performance Computing and Machine Learning for Co-Designing Computing Systems

Zapisane w:
Opis bibliograficzny
Wydane w:ProQuest Dissertations and Theses (2025)
1. autor: Pandey, Santosh
Wydane:
ProQuest Dissertations & Theses
Hasła przedmiotowe:
Dostęp online:Citation/Abstract
Full Text - PDF
Etykiety: Dodaj etykietę
Nie ma etykietki, Dołącz pierwszą etykiete!
Opis
Streszczenie:The rapid evolution of computing demands from Machine Learning (ML) and High Performance Computing (HPC) applications has exposed the limitations of general-purpose architectures, necessitating a shift towards domain-specific computing. This surge in demand is driven by the need to train massive ML models, process large-scale data in real-time, and execute highly parallelized computations, which general-purpose architectures struggle to handle efficiently. This transition, however, presents significant challenges in system design, particularly in terms of complexity, cost, and human efforts. System design encompasses design and optimization of computing stack ranging from computer architecture, architecture-specific compiler, program optimization and co-designing hardware and software that balances unique constraint requirements like latency, throughput and energy consumption. On one hand, designing an efficient computing system is challenging. On the other hand, there are also opportunities to improve the system design itself. This thesis presents two part work towards designing computing systems for domain-specific applications: system design for HPC applications and ML/HPC for the system design.Towards the first thrust, this thesis introduces C-SAW, a GPU-accelerated HPC system for efficient graph sampling and random walks. C-SAW is the first framework to support a diverse set of mainstream and emerging graph sampling algorithms on GPUs. C-SAW devises a MapReduce-style, bias-centric programming interface that generalizes to diverse algorithms. Towards the second thrust, this thesis lays the groundwork for applying ML in the system microarchitecture by introducing an ML-based microarchitecture performance modeling and performance analysis framework. This thesis introduces ML techniques to accurately model the performance of a microarchitecture, builds an HPC framework for making ML-based microarchitecture simulation efficient, and redesigns ML-based simulation to make it more reusable and adaptable.By building a GPU-accelerated HPC system optimized for graph applications and creating an ML-driven tool for microarchitecture performance analysis, this thesis contributes to both high-efficiency computing systems and improving system evaluation methodologies.
ISBN:9798290614465
Źródło:ProQuest Dissertations & Theses Global