TY - GEN
T1 - A lightweight fault-tolerant mechanism for network-on-chip
AU - Koibuchi, Michihiro
AU - Matsutani, Hiroki
AU - Amano, Hideharu
AU - Pinkston, Timothy Mark
PY - 2008/5/28
Y1 - 2008/5/28
N2 - Survival capability is becoming a crucial factor in designing multicore processors built with on-chip packet networks, or networks on chip (NoCs). In this paper, we propose a lightweight fault-tolerant mechanism for NoCs based on default backup paths (DBPs) designed to maintain, in the presence of failures, network connectivity of both nonfaulty routers as well as healthy processor cores which may be connected to faulty routers. The mechanism provides default paths as backup between certain router ports which serve as alternative datapaths to circumvent failed components within a faulty router. Along with a minimal subset of normal network channels, the set of default backup paths internal to faulty routers form - in the worst case - a unidirectional ring topology that provides network-wide connectivity to all processor cores. Routing using the DBP mechanism is proved to be deadlock-free with only two virtual channels even for fault scenarios in which regular networks degrade to irregular (arbitrary) topologies. Evaluation results show that, for a 2-D mesh wormhole NoC, only 12.6% additional hardware resources are needed to implement the proposed DBP mechanism in order to provide graceful performance degradation without chip-wide failure as the number of faults increases to the maximum needed to form ring.
AB - Survival capability is becoming a crucial factor in designing multicore processors built with on-chip packet networks, or networks on chip (NoCs). In this paper, we propose a lightweight fault-tolerant mechanism for NoCs based on default backup paths (DBPs) designed to maintain, in the presence of failures, network connectivity of both nonfaulty routers as well as healthy processor cores which may be connected to faulty routers. The mechanism provides default paths as backup between certain router ports which serve as alternative datapaths to circumvent failed components within a faulty router. Along with a minimal subset of normal network channels, the set of default backup paths internal to faulty routers form - in the worst case - a unidirectional ring topology that provides network-wide connectivity to all processor cores. Routing using the DBP mechanism is proved to be deadlock-free with only two virtual channels even for fault scenarios in which regular networks degrade to irregular (arbitrary) topologies. Evaluation results show that, for a 2-D mesh wormhole NoC, only 12.6% additional hardware resources are needed to implement the proposed DBP mechanism in order to provide graceful performance degradation without chip-wide failure as the number of faults increases to the maximum needed to form ring.
UR - http://www.scopus.com/inward/record.url?scp=44149126468&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=44149126468&partnerID=8YFLogxK
U2 - 10.1109/NOCS.2008.4492721
DO - 10.1109/NOCS.2008.4492721
M3 - Conference contribution
AN - SCOPUS:44149126468
SN - 0769530982
SN - 9780769530987
T3 - Proceedings - Second IEEE International Symposium on Networks-on-Chip, NOCS 2008
SP - 13
EP - 22
BT - Proceedings - Second IEEE International Symposium on Networks-on-Chip, NOCS 2008
T2 - 2nd IEEE International Symposium on Networks-on-Chip, NOCS 2008
Y2 - 7 April 2008 through 11 April 2008
ER -