Configurable Multi-directional Systolic Array Architecture for Convolutional Neural Networks
نویسندگان
چکیده
The systolic array architecture is one of the most popular choices for convolutional neural network hardware accelerators. biggest advantage its simple and efficient design principle. Without complicated control dataflow, accelerators with can calculate traditional convolution very efficiently. However, this also brings new challenges to array. When computing special types convolution, such as small-scale or depthwise processing element (PE) utilization rate decreases sharply. main reason that limits flexibility In article, we a configurable multi-directional (CMSA) address these issues. First, added data path It allows users split through configuration speed up calculation convolution. Second, redesigned PE unit so has multiple transmission modes dataflow strategies. This switch addition, unlike other works, only make few changes modifications existing architecture. avoids additional overheads be easily deployed in application scenarios require small arrays mobile terminals. Based on our evaluation, CMSA increase by 1.6 times compared typical when running last layers ResNet-18. MobileNet, 14.8 times. At same time, are similar area energy consumption.
منابع مشابه
Scalable network-on-chip architecture for configurable neural networks
Providing highly flexible connectivity is a major architectural challenge for hardware implementation of reconfigurable neural networks. We perform an analytical evaluation and comparison of different configurable interconnect architectures (mesh NoC, tree, shared bus and point-to-point) emulating variants of two neural network topologies (having full and random configurable connectivity). We d...
متن کامل3d-filtermap: a Compact Architecture for Deep Convolutional Neural Networks
We present a novel and compact architecture for deep Convolutional Neural Networks (CNNs) in this paper, termed 3D-FilterMap Convolutional Neural Networks (3D-FM-CNNs). The convolution layer of 3D-FM-CNN learns a compact representation of the filters, named 3D-FilterMap, instead of a set of independent filters in the conventional convolution layer. The filters are extracted from the 3D-FilterMa...
متن کاملSimple And Efficient Architecture Search for Convolutional Neural Networks
Neural networks have recently had a lot of success for many tasks. However, neural network architectures that perform well are still typically designed manually by experts in a cumbersome trial-and-error process. We propose a new method to automatically search for well-performing CNN architectures based on a simple hill climbing procedure whose operators apply network morphisms, followed by sho...
متن کاملFPGA-Based Configurable Systolic Architecture for Window-Based Image Processing
Image processing requires more computational power and data throughput than most conventional processors can provide. Designing specific hardware can improve execution time and achieve better performance per unit of silicon area. A fieldprogrammable-gate-array(FPGA-) based configurable systolic architecture specially tailored for real-time window-based image operations is presented in this pape...
متن کاملDesigning A Re-Configurable Fractional Fourier Transform Architecture Using Systolic Array
FRFT (Fractional Fourier Transforms) algorithm, which has been derived from DFT, computes the angular domains within the time and frequency domains. This algorithm is increasingly used in the field of signal filtering, quantum mechanics and optical physics. In this paper we develop an efficient, systolic, reconfigurable architecture for a particular type of FRFT called MA-CDFRFT (Multi Angle Ce...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Architecture and Code Optimization
سال: 2021
ISSN: ['1544-3973', '1544-3566']
DOI: https://doi.org/10.1145/3460776