Optimizing shared memory programs for distributed memory architectures

Uloženo v:
Podrobná bibliografie
Vydáno v:ProQuest Dissertations and Theses (2009)
Hlavní autor: Min, Seung-Jai
Vydáno:
ProQuest Dissertations & Theses
Témata:
On-line přístup:Citation/Abstract
Full Text - PDF
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

MARC

LEADER 00000nab a2200000uu 4500
001 304990477
003 UK-CbPIL
020 |a 978-1-109-48841-8 
035 |a 304990477 
045 0 |b d20090101 
084 |a 66569  |2 nlm 
100 1 |a Min, Seung-Jai 
245 1 |a Optimizing shared memory programs for distributed memory architectures 
260 |b ProQuest Dissertations & Theses  |c 2009 
513 |a Dissertation/Thesis 
520 3 |a OpenMP has emerged as the de facto standard for writing parallel programs on shared address space platforms. Programmers can parallelize existing sequential programs in an incremental way with OpenMP directives. In contrast, parallel programming with message-passing for distributed-memory systems is effort intensive. The message-passing approach requires the programmer to parallelize programs as a whole and to explicitly manage data communication between processors. The goal of this dissertation is to extend the high programmer productivity of shared memory programming facilitated by OpenMP to distributed memory architectures, such as clusters. An underlying layer of software Distributed Shared Memory (DSM) is used to enable OpenMP shared memory programs to be run on distributed memory systems. However, the performance of state-of-the-art software DSM systems is impaired by unnecessary communication due to the inefficient mechanisms to detect shared data accesses, especially irregular data accesses. This dissertation introduces the Lean Distributed Shared Memory (LDSM) system to overcome the performance limitations of software DSM systems. LDSM is a thin run-time library layer which is tightly integrated with the compiler. Its region-based shared data access analysis efficiently detects irregular data accesses. Additionally, LDSM includes a run-time overhead reduction technique and communication optimizations. A set of representative regular and irregular benchmarks are used to evaluate the performance of LDSM. The performance results demonstrate that LDSM executes OpenMP applications on distributed memory clusters with a comparable performance to hand-coded message-passing programs. 
653 |a Electrical engineering 
773 0 |t ProQuest Dissertations and Theses  |g (2009) 
786 0 |d ProQuest  |t ProQuest Dissertations & Theses Global 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/304990477/abstract/embedded/Y2VX53961LHR7RE6?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/304990477/fulltextPDF/embedded/Y2VX53961LHR7RE6?source=fedsrch