What question did this study set out to answer?

The aim is to enhance the performance of deep neural networks by introducing a reconfigurable architecture for array-based accelerators.

April 10, 2026Open Access

RADI: A High-Performance Reconfigurable Array-Based Accelerator for DNN Implementations

Key Points

The aim is to enhance the performance of deep neural networks by introducing a reconfigurable architecture for array-based accelerators.
Developed a reconfigurable architecture that adapts processing element sizes to match DNN layers.
Conducted simulations on various DNN models to evaluate architecture performance.
Incorporated multithreading to accelerate simulation processing.
Achieved 43% higher speed compared to baseline architecture.
Demonstrated 32% increased resource utilization.
Reduced on-chip memory access rate by 38% compared to traditional implementations.

Abstract

Deep neural networks (DNNs) have gained significant attention due to the rapid growth of learning-based applications. However, the computational demands of DNNs limit their performance in many of these applications. As a result, extensive research has focused on hardware implementations of these networks as accelerators. Array-based accelerators are an efficient architecture type that employs an array of processing elements (PEs) for parallel computations. However, array-based accelerators cannot reach their potential performance due to having fixed dimensions to execute different layers of DNNs. This article proposes a reconfigurable architecture to address this limitation by adaptively selecting the size of PEs to better align with the dimensions of the active DNN layers. Simulations demonstrate significant improvements for various DNN models compared to state-of-the-art architectures. Experimental results show that the proposed architecture achieves, on average, 43% higher speed, 32% more resource utilization, and a 38% reduction in on-chip memory access rate compared to the baseline architecture when executing GoogLeNet model layers. These enhancements are achieved with only a 1.6% area overhead, making the proposed architecture a cost-effective design. Furthermore, by incorporating multithreading into the simulator's source code, we significantly accelerate simulations compared to the basic version.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Mobina Ranjbar Malidareh

Mojtaba Valinataj

Paria Darbani

Journals

ACM Transactions on Design Automation of Electronic Systems

Actions

Institutions

Institute for Research in Fundamental Sciences

Babol Noshirvani University of Technology

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

RADI: A High-Performance Reconfigurable Array-Based Accelerator for DNN Implementations

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study