TY - GEN
T1 - Ultra fine-grained run-time power gating of on-chip routers for CMPs
AU - Matsutani, Hiroki
AU - Koibuchi, Michihiro
AU - Ikebuchi, Daisuk
AU - Usami, Kimiyoshi
AU - Nakamura, Hiroshi
AU - Amano, Hideharu
N1 - Copyright:
Copyright 2010 Elsevier B.V., All rights reserved.
PY - 2010
Y1 - 2010
N2 - This paper proposes an ultra fine-grained run-time power gating of on-chip router, in which power supply to each router component (e.g., VC queue, crossbar MUX, and output latch) can be individually controlled in response to the applied workload. As only the router components which are just transferring a packet are activated, the leakage power of the on-chip network can be reduced to the near-optimal level. However, a certain amount of wakeup latency is required to activate the sleeping components, and the application performance will be degraded. In this paper, we estimate the wakeup latency for each component based on circuit simulations using a 65nm process. Then we propose four early wakeup methods to overcome the wakeup latency. The proposed router with the early wakeup methods is evaluated in terms of the application performance, area, and leakage power. As a result, it reduces the leakage power by 78.9%, at the expense of the 4.3% area and 4.0% performance when we assume a 1GHz operation.
AB - This paper proposes an ultra fine-grained run-time power gating of on-chip router, in which power supply to each router component (e.g., VC queue, crossbar MUX, and output latch) can be individually controlled in response to the applied workload. As only the router components which are just transferring a packet are activated, the leakage power of the on-chip network can be reduced to the near-optimal level. However, a certain amount of wakeup latency is required to activate the sleeping components, and the application performance will be degraded. In this paper, we estimate the wakeup latency for each component based on circuit simulations using a 65nm process. Then we propose four early wakeup methods to overcome the wakeup latency. The proposed router with the early wakeup methods is evaluated in terms of the application performance, area, and leakage power. As a result, it reduces the leakage power by 78.9%, at the expense of the 4.3% area and 4.0% performance when we assume a 1GHz operation.
UR - http://www.scopus.com/inward/record.url?scp=77955109876&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77955109876&partnerID=8YFLogxK
U2 - 10.1109/NOCS.2010.16
DO - 10.1109/NOCS.2010.16
M3 - Conference contribution
AN - SCOPUS:77955109876
SN - 9780769540535
T3 - NOCS 2010 - The 4th ACM/IEEE International Symposium on Networks-on-Chip
SP - 61
EP - 68
BT - NOCS 2010 - The 4th ACM/IEEE International Symposium on Networks-on-Chip
T2 - 4th ACM/IEEE International Symposium on Networks on Chip, NOCS 2010
Y2 - 3 May 2010 through 6 May 2010
ER -