Reconfigurable CNN accelerator design using dataflow analysis

Limited Access
This item is unavailable until:
2025-03-30

Date

2024-09

Advisor

Güdükbay, Uğur

Co-Advisor

Öztürk, Özcan

Abstract

Dataflow reconfigurability plays a crucial role in Convolutional Neural Network (CNN) acceleration because it determines which dataflow pattern is used for convolution operations. Fully reconfigurable architectures support multiple dataflow options, which gives them versatility and high resource utilization at the cost of increased design complexity and operational overhead. Non-reconfigurable architectures, optimized for a single dataflow pattern, deliver high efficiency for specific tasks but lack adaptability. This thesis introduces a novel intermediate design: a dataflow-reconfigurable CNN accelerator that balances flexibility and efficiency by integrating only the key dataflow patterns, improving adaptability and performance across diverse CNN applications. A detailed analysis identifies the key dataflows, and a dedicated architectural unit is developed for dataflow selection; its selections incur an average of 0.15% excess latency relative to the optimal scenario. A specialized systolic array architecture accommodates various kernel sizes, providing an additional layer of reconfigurability. The proposed architecture requires 39% less area and 35% less power than fully reconfigurable designs, and it delivers an average of 33% better performance than non-reconfigurable architectures. In terms of efficiency, it provides a 7% increase over fully reconfigurable designs and outperforms non-reconfigurable options by up to 3.57×.

Degree Discipline

Computer Engineering

Degree Level

Master's

Degree Name

MS (Master of Science)

Language

English

Type