TY - JOUR
T1 - Analytical performance estimation for large-scale reconfigurable dataflow platforms
AU - Yasudo, Ryota
AU - Coutinho, José G.F.
AU - Varbanescu, Ana Lucia
AU - Luk, Wayne
AU - Amano, Hideharu
AU - Becker, Tobias
AU - Guo, Ce
N1 - Funding Information:
This work was partially supported by the EU H2020 Research and Innovation Programme under grant agreement number 671653, the UK EPSRC (grants EP/P010040/1, EP/N031768/1, and EP/L016796/1), the JST/CREST program “Research and Development on Unified Environment of Accelerated Computing and Interconnection for Post-Petascale Era” in the research area of “Development of System Software Technologies for post-Peta Scale High Performance Computing,” and JSPS KAKENHI grant 20K19770. Authors’ addresses: R. Yasudo, Hiroshima University, 1-4-1 Kagamiyama, Higashi-hiroshima, Hiroshima, 739-8527, Japan; email: yasudo@cs.hiroshima-u.ac.jp; J. G. F. Coutinho, W. Luk, and C. Guo, Imperial College London, 180 Queen’s Gate, London SW7 2BZ, UK; emails: {gabriel.figueiredo, w.luk, c.guo}@imperial.ac.uk; A.-L. Varbanescu, University of Amsterdam, Postbus 94323 1090 GH Amsterdam, The Netherlands; email: a.l.varbanescu@uva.nl; H. Amano, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama-shi, Kanagawa 223-8522, Japan; email: hunga@am.ics.keio.ac.jp; T. Becker, Maxeler Technologies Ltd., 3 Hammersmith Grove London W6 0ND, UK; email: tbecker@maxeler.com. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org. © 2021 Association for Computing Machinery. 1936-7406/2021/8-ART12 $15.00 https://doi.org/10.1145/3452742
Publisher Copyright:
© 2021 Association for Computing Machinery.
PY - 2021/9
Y1 - 2021/9
N2 - Next-generation high-performance computing platforms will handle extreme data- and compute-intensive problems that are intractable with today's technology. A promising path in achieving the next leap in high-performance computing is to embrace heterogeneity and specialised computing in the form of reconfigurable accelerators such as FPGAs, which have been shown to speed up compute-intensive tasks with reduced power consumption. However, assessing the feasibility of large-scale heterogeneous systems requires fast and accurate performance prediction. This article proposes Performance Estimation for Reconfigurable Kernels and Systems (PERKS), a novel performance estimation framework for reconfigurable dataflow platforms. PERKS makes use of an analytical model with machine and application parameters for predicting the performance of multi-accelerator systems and detecting their bottlenecks. Model calibration is automatic, making the model flexible and usable for different machine configurations and applications, including hypothetical ones. Our experimental results show that PERKS can predict the performance of current workloads on reconfigurable dataflow platforms with an accuracy above 91%. The results also illustrate how the modelling scales to large workloads, and how performance impact of architectural features can be estimated in seconds.
AB - Next-generation high-performance computing platforms will handle extreme data- and compute-intensive problems that are intractable with today's technology. A promising path in achieving the next leap in high-performance computing is to embrace heterogeneity and specialised computing in the form of reconfigurable accelerators such as FPGAs, which have been shown to speed up compute-intensive tasks with reduced power consumption. However, assessing the feasibility of large-scale heterogeneous systems requires fast and accurate performance prediction. This article proposes Performance Estimation for Reconfigurable Kernels and Systems (PERKS), a novel performance estimation framework for reconfigurable dataflow platforms. PERKS makes use of an analytical model with machine and application parameters for predicting the performance of multi-accelerator systems and detecting their bottlenecks. Model calibration is automatic, making the model flexible and usable for different machine configurations and applications, including hypothetical ones. Our experimental results show that PERKS can predict the performance of current workloads on reconfigurable dataflow platforms with an accuracy above 91%. The results also illustrate how the modelling scales to large workloads, and how performance impact of architectural features can be estimated in seconds.
KW - FPGAs
KW - Heterogeneous systems
KW - Performance modelling
KW - Reconfigurable dataflow platforms
UR - http://www.scopus.com/inward/record.url?scp=85122625544&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85122625544&partnerID=8YFLogxK
U2 - 10.1145/3452742
DO - 10.1145/3452742
M3 - Article
AN - SCOPUS:85122625544
SN - 1936-7406
VL - 14
JO - ACM Transactions on Reconfigurable Technology and Systems
JF - ACM Transactions on Reconfigurable Technology and Systems
IS - 3
M1 - 3452742
ER -