Javascript must be enabled to continue!
FASA-DRAM: Reducing DRAM Latency with Destructive Activation and Delayed Restoration
View through CrossRef
DRAM memory is a performance bottleneck for many applications, due to its high access latency. Previous work has mainly focused on data locality, introducing small but fast regions to cache frequently accessed data, thereby reducing the average latency. However, these locality-based designs have three challenges in modern multi-core systems: (1) inter-application interference leads to random memory access traffic, (2) fairness issues prevent the memory controller from over-prioritizing data locality, and (3) write-intensive applications have much lower locality and evict substantial dirty entries. With frequent data movement between the fast in-DRAM cache and slow regular arrays, the overhead induced by moving data may even offset the performance and energy benefits of in-DRAM caching.
In this article, we decouple the data movement process into two distinct phases. The first phase is Load-Reduced Destructive Activation (LRDA), which destructively promotes data into the in-DRAM cache. The second phase is Delayed Cycle-Stealing Restoration (DCSR), which restores the original data when the DRAM bank is idle. LRDA decouples the most time-consuming restoration phase from activation, and DCSR hides the restoration latency through prevalent bank-level parallelism. We propose FASA-DRAM, incorporating destructive activation and delayed restoration techniques to enable both in-DRAM caching and proactive latency-hiding mechanisms. Our evaluation shows that FASA-DRAM improves the average performance by 19.9% and reduces average DRAM energy consumption by 18.1% over DDR4 DRAM for four-core workloads, with less than 3.4% extra area overhead. Furthermore, FASA-DRAM outperforms state-of-the-art designs in both performance and energy efficiency.
Title: FASA-DRAM: Reducing DRAM Latency with Destructive Activation and Delayed Restoration
Description:
DRAM memory is a performance bottleneck for many applications, due to its high access latency.
Previous work has mainly focused on data locality, introducing small but fast regions to cache frequently accessed data, thereby reducing the average latency.
However, these locality-based designs have three challenges in modern multi-core systems: (1) inter-application interference leads to random memory access traffic, (2) fairness issues prevent the memory controller from over-prioritizing data locality, and (3) write-intensive applications have much lower locality and evict substantial dirty entries.
With frequent data movement between the fast in-DRAM cache and slow regular arrays, the overhead induced by moving data may even offset the performance and energy benefits of in-DRAM caching.
In this article, we decouple the data movement process into two distinct phases.
The first phase is Load-Reduced Destructive Activation (LRDA), which destructively promotes data into the in-DRAM cache.
The second phase is Delayed Cycle-Stealing Restoration (DCSR), which restores the original data when the DRAM bank is idle.
LRDA decouples the most time-consuming restoration phase from activation, and DCSR hides the restoration latency through prevalent bank-level parallelism.
We propose FASA-DRAM, incorporating destructive activation and delayed restoration techniques to enable both in-DRAM caching and proactive latency-hiding mechanisms.
Our evaluation shows that FASA-DRAM improves the average performance by 19.
9% and reduces average DRAM energy consumption by 18.
1% over DDR4 DRAM for four-core workloads, with less than 3.
4% extra area overhead.
Furthermore, FASA-DRAM outperforms state-of-the-art designs in both performance and energy efficiency.
Related Results
FASA-DRAM: Reducing DRAM Latency with Destructive Activation and Delayed Restoration
FASA-DRAM: Reducing DRAM Latency with Destructive Activation and Delayed Restoration
DRAM memory is a performance bottleneck for many applications, due to its high access latency. Previous work has mainly focused on data locality, introducing small-but-fast regions...
Multiple clone row DRAM
Multiple clone row DRAM
Several previous works have changed DRAM bank structure to reduce memory access latency and have shown performance improvement. However, changes in the area-optimized DRAM bank can...
PENGUJIAN PEMBANGKITAN TEGANGAN GENERATOR AXIAL-FLUX PERMANENT MAGNET (AFPM) TIGA-FASA GANDA
PENGUJIAN PEMBANGKITAN TEGANGAN GENERATOR AXIAL-FLUX PERMANENT MAGNET (AFPM) TIGA-FASA GANDA
Tulisan ini membahas tentang pembuatan generator axial-flux permanent magnet (AFPM) yang dapat menghasilkan dua buah tegangan tiga-fasa dengan pergesera...
Rancang Bangun Modul Pembelajaran Transformator 1 Fasa dan 3 Fasa dengan Daya 120/360 VA
Rancang Bangun Modul Pembelajaran Transformator 1 Fasa dan 3 Fasa dengan Daya 120/360 VA
Transformator adalah salah satu komponen penting dalam sistem tenaga listrik. Maka dengan begitu harus adanya pemahaman yang baik mengenai karakteristik transformator. Sehingga per...
Fault models, test algorithms and embedded test techniques for DRAM circuits
Fault models, test algorithms and embedded test techniques for DRAM circuits
Τις τελευταίες δεκαετίες οι ηλεκτρονικές συσκευές έχουν γίνει αναπόσπαστο κομμάτι της καθημερινότητας. Αυτό οφείλεται κυρίως στη ραγδαία πρόοδο της τεχνολογίας κατασκευής Ολοκληρωμ...
Analisis Karakteristik Arus Inrush pada Transformator 3 Fasa dengan Leg yang Berbeda Berbasis Eksperimen
Analisis Karakteristik Arus Inrush pada Transformator 3 Fasa dengan Leg yang Berbeda Berbasis Eksperimen
Fenomena arus inrush merupakan kondisi transformator pada saat pertama kali dihubungkan dengan sumber tegangan dan menghasilkan lonjakan arus yang tinggi. Arus inrush dapat menyeba...
Mitigating WL-to-WL Disturbance in Dynamic Random-Access Memory (DRAM) through Adopted Spherical Shallow Trench Isolation with Silicon Nitride Layer in the Buried Channel Array Transistor (BCAT)
Mitigating WL-to-WL Disturbance in Dynamic Random-Access Memory (DRAM) through Adopted Spherical Shallow Trench Isolation with Silicon Nitride Layer in the Buried Channel Array Transistor (BCAT)
The Pass Gate Effect (PGE), often referred to as adjacent cell interference, presents a significant challenge in dynamic random-access memory (DRAM). In this study, we investigate ...

