TY - GEN
T1 - Deep learning on high performance FPGA switching boards
T2 - 14th International Symposium on Applied Reconfigurable Computing, ARC 2018
AU - Musha, Kazusa
AU - Kudoh, Tomohiro
AU - Amano, Hideharu
N1 - Funding Information:
This paper is based on results obtained from a project commissioned by the New Energy and Industrial Technology Development Organization (NEDO).
Publisher Copyright:
© Springer International Publishing AG, part of Springer Nature 2018.
PY - 2018
Y1 - 2018
N2 - FiC (Flow-in-Cloud)-SW is an FPGA-based switching node for an efficient AI computing system. It is equipped with a number of serial links directly connected to other nodes. Unlike other multi-FPGA systems, the circuit switching fabric with the STDM (Static Time Division Multiplexing) is implemented on the FPGA for predictable communication and cost-efficient data broadcasting. Parallel convolution modules for AlexNet are implemented on FiC-SW1 prototype boards consisting of Kintex Ultrascale FPGA, and evaluation results show that the parallel execution with 20 boards achieved 4.6 times better performance than the state of art implementation on a single Virtex 7 FPGA board.
AB - FiC (Flow-in-Cloud)-SW is an FPGA-based switching node for an efficient AI computing system. It is equipped with a number of serial links directly connected to other nodes. Unlike other multi-FPGA systems, the circuit switching fabric with the STDM (Static Time Division Multiplexing) is implemented on the FPGA for predictable communication and cost-efficient data broadcasting. Parallel convolution modules for AlexNet are implemented on FiC-SW1 prototype boards consisting of Kintex Ultrascale FPGA, and evaluation results show that the parallel execution with 20 boards achieved 4.6 times better performance than the state of art implementation on a single Virtex 7 FPGA board.
UR - http://www.scopus.com/inward/record.url?scp=85046291072&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85046291072&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-78890-6_4
DO - 10.1007/978-3-319-78890-6_4
M3 - Conference contribution
AN - SCOPUS:85046291072
SN - 9783319788890
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 43
EP - 54
BT - Applied Reconfigurable Computing
A2 - Voros, Nikolaos
A2 - Keramidas, Georgios
A2 - Antonopoulos, Christos
A2 - Huebner, Michael
A2 - Diniz, Pedro C.
A2 - Goehringer, Diana
PB - Springer Verlag
Y2 - 2 May 2018 through 4 May 2018
ER -