TY - JOUR
T1 - An 8 Bit 12.4 TOPS/W Phase-Domain MAC Circuit for Energy-Constrained Deep Learning Accelerators
AU - Toyama, Yosuke
AU - Yoshioka, Kentaro
AU - Ban, Koichiro
AU - Maya, Shigeru
AU - Sai, Akihide
AU - Onizuka, Kohei
N1 - Funding Information:
The authors would like to thank X. Yang, E. Lee, D. Bankman, and M. Horowitz of Stanford University and D. Miyashita and J. Deguchi of Toshiba Memory for valuable discussion on DLA architectures.
Publisher Copyright:
© 1966-2012 IEEE.
PY - 2019/10
Y1 - 2019/10
N2 - A small-gate-count 8 bit bidirectional phase-domain MAC (PMAC) circuit is proposed to minimize both area and energy consumption of extremely energy-efficient deep neural network (DNN) accelerators, targeting the Internet-of-Things (IoT) edge devices operating with very strict power budgets (e.g., energy harvesting). PMAC consumes significantly less energy than standard fully digital MACs, due to its efficient analog accumulation nature based on gated-ring oscillator (GRO). The architectural analysis of energy-efficient accelerators is performed, and the energy budget analysis is disclosed. Furthermore, theoretical analysis of PMAC is conducted and comparisons with the conventional analog approaches are shown. By exploiting the DNN digital quantization noise, we further improve the PMAC energy efficiency by designing the internal gain. The bidirectional architecture proposed in this paper achieves an area comparable to those of digital MACs and up to a fivefold improvement in power efficiency. Moreover, the system design constraints are relaxed by eliminating the phase error originating in leakage currents. An asynchronous readout technique and a two-step digital-to-time converter (DTC) to enhance system throughput and compact implementation, respectively, are presented for the first time. We also present DNN hardware-software co-training procedures to show further scaling of the PMAC efficiency. Utilizing such techniques, the measured PMAC achieves peak efficiency of 12.4 TOPS/W in 28 nm CMOS.
AB - A small-gate-count 8 bit bidirectional phase-domain MAC (PMAC) circuit is proposed to minimize both area and energy consumption of extremely energy-efficient deep neural network (DNN) accelerators, targeting the Internet-of-Things (IoT) edge devices operating with very strict power budgets (e.g., energy harvesting). PMAC consumes significantly less energy than standard fully digital MACs, due to its efficient analog accumulation nature based on gated-ring oscillator (GRO). The architectural analysis of energy-efficient accelerators is performed, and the energy budget analysis is disclosed. Furthermore, theoretical analysis of PMAC is conducted and comparisons with the conventional analog approaches are shown. By exploiting the DNN digital quantization noise, we further improve the PMAC energy efficiency by designing the internal gain. The bidirectional architecture proposed in this paper achieves an area comparable to those of digital MACs and up to a fivefold improvement in power efficiency. Moreover, the system design constraints are relaxed by eliminating the phase error originating in leakage currents. An asynchronous readout technique and a two-step digital-to-time converter (DTC) to enhance system throughput and compact implementation, respectively, are presented for the first time. We also present DNN hardware-software co-training procedures to show further scaling of the PMAC efficiency. Utilizing such techniques, the measured PMAC achieves peak efficiency of 12.4 TOPS/W in 28 nm CMOS.
KW - Analog computing
KW - deep learning
KW - deep neural network (DNN) accelerator
KW - gated-ring oscillator (GRO)
KW - phase-domain MAC (PMAC)
UR - http://www.scopus.com/inward/record.url?scp=85072777796&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85072777796&partnerID=8YFLogxK
U2 - 10.1109/JSSC.2019.2926649
DO - 10.1109/JSSC.2019.2926649
M3 - Article
AN - SCOPUS:85072777796
SN - 0018-9200
VL - 54
SP - 2730
EP - 2742
JO - IEEE Journal of Solid-State Circuits
JF - IEEE Journal of Solid-State Circuits
IS - 10
M1 - 8771205
ER -