DRAM Errors in Enterprise Storage Systems: Probabilistic Modeling and Mitigations
| Published in: | ProQuest Dissertations and Theses (2025) |
|---|---|
| Main author: | |
| Published: | ProQuest Dissertations & Theses |
| Subjects: | |
| Online access: | Citation/Abstract Full Text - PDF |
| Abstract: | Memory reliability is a critical concern in modern computing systems, where DRAM errors can significantly impact performance and data integrity. Systems employ Error Correction Codes (ECC) as a protection mechanism against memory errors, but these mechanisms are not capable of correcting all errors. Uncorrectable errors at this stage present a significant challenge in DRAM systems as they result in degraded performance and reliability and require costly memory replacements. To address this, newer mitigation mechanisms have been developed. However, existing research on their effectiveness has primarily focused on operating-system-level mechanisms such as page offlining, and studies on hardware-targeted mechanisms, including Post-Package Repair (PPR) and Adaptive Double Device Data Correction (ADDDC), have been very limited. Additionally, while these actions incur performance and resource overhead, the optimal conditions and timing for triggering them have remained unexplored. We aim to fill this gap by modeling error dynamics with spatial information about error locations, moving towards the ability to predict uncorrectable errors and other events that lead to DRAM replacement, and to select the most efficient mitigation action tailored to each unique situation. By leveraging a rich dataset collected from a substantial population of enterprise storage systems, this work provides insights into the real-world behavior of memory errors and establishes a foundation for optimized application of error mitigation strategies, which results in enhanced reliability and performance in storage systems. |
|---|---|
| ISBN: | 9798290664705 |
| Source: | ProQuest Dissertations & Theses Global |