FiC-RNN: A Multi-FPGA Acceleration Framework for Deep Recurrent Neural Networks

SUN Yuxi, AMANO Hideharu

doi:10.1587/transinf.2020pap0003

抄録

<p>Recurrent neural networks (RNNs) have been proven effective for sequence-based tasks thanks to their capability to process temporal information. In real-world systems, deep RNNs are more widely used to solve complicated tasks such as large-scale speech recognition and machine translation. However, the implementation of deep RNNs on traditional hardware platforms is inefficient due to long-range temporal dependence and irregular computation patterns within RNNs. This inefficiency manifests itself in the proportional increase in the latency of RNN inference with respect to the number of layers of deep RNNs on CPUs and GPUs. Previous work has focused mostly on optimizing and accelerating individual RNN cells. To make deep RNN inference fast and efficient, we propose an accelerator based on a multi-FPGA platform called Flow-in-Cloud (FiC). In this work, we show that the parallelism provided by the multi-FPGA system can be taken advantage of to scale up the inference of deep RNNs, by partitioning a large model onto several FPGAs, so that the latency stays close to constant with respect to increasing number of RNN layers. For single-layer and four-layer RNNs, our implementation achieves 31x and 61x speedup compared with an Intel CPU.</p>

収録刊行物

IEICE Transactions on Information and Systems

IEICE Transactions on Information and Systems E103.D (12), 2457-2462, 2020-12-01

一般社団法人電子情報通信学会

キーワード

詳細情報詳細情報について

CRID: 1390567901490361088

NII論文ID: 130007948494

DOI: 10.1587/transinf.2020pap0003

ISSN: 17451361; 09168532

Web Site: https://www.jstage.jst.go.jp/article/transinf/E103.D/12/E103.D_2020PAP0003/_pdf

本文言語コード: en

データソース種別

JaLC
Crossref
CiNii Articles

抄録ライセンスフラグ: 使用不可

FiC-RNN: A Multi-FPGA Acceleration Framework for Deep Recurrent Neural Networks

抄録

収録刊行物

被引用文献 (3)*注記

参考文献 (15)*注記

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

FiC-RNN: A Multi-FPGA Acceleration Framework for Deep Recurrent Neural Networks

抄録

収録刊行物

被引用文献 (3)*注記

参考文献 (15)*注記

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について