TY - GEN
T1 - Randomizing Packet Memory Networks for Low-Latency Processor-Memory Communication
AU - Fujiki, Daichi
AU - Matsutani, Hiroki
AU - Koibuchi, Michihiro
AU - Amano, Hideharu
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/3/31
Y1 - 2016/3/31
N2 - Three-dimensional stacked memory is considered to be one of the innovative elements for the next-generation computing system, for it provides high bandwidth and energy efficiency. Particularly, packet routing ability of Hybrid Memory Cubes (HMCs) enables new interconnects for the memories, giving flexibility to its topological design space. Since memory-processor communication is latency-sensitive, our challenge is to alleviate latency of the memory interconnection network, which is subject to high overheads from hop-count increase. Interestingly, random network topologies are known to have remarkably low diameter that is even comparable to theoretical Moore graph. In this context, we first propose to exploit the random topologies for the memory networks. Second, we also propose several optimizations to leverage the random topologies to be further adaptive to the latency-sensitive memory-processor communication: communication path length based selection, deterministic minimal routing, and page-size granularity memory mapping. Finally, we present interesting results of our evaluation: the random networks with universal memory access outperformed non-random networks of which memory access was optimally localized.
AB - Three-dimensional stacked memory is considered to be one of the innovative elements for the next-generation computing system, for it provides high bandwidth and energy efficiency. Particularly, packet routing ability of Hybrid Memory Cubes (HMCs) enables new interconnects for the memories, giving flexibility to its topological design space. Since memory-processor communication is latency-sensitive, our challenge is to alleviate latency of the memory interconnection network, which is subject to high overheads from hop-count increase. Interestingly, random network topologies are known to have remarkably low diameter that is even comparable to theoretical Moore graph. In this context, we first propose to exploit the random topologies for the memory networks. Second, we also propose several optimizations to leverage the random topologies to be further adaptive to the latency-sensitive memory-processor communication: communication path length based selection, deterministic minimal routing, and page-size granularity memory mapping. Finally, we present interesting results of our evaluation: the random networks with universal memory access outperformed non-random networks of which memory access was optimally localized.
KW - 3D stacked memory
KW - Memory network
KW - Random topology
UR - http://www.scopus.com/inward/record.url?scp=84968911894&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84968911894&partnerID=8YFLogxK
U2 - 10.1109/PDP.2016.18
DO - 10.1109/PDP.2016.18
M3 - Conference contribution
AN - SCOPUS:84968911894
T3 - Proceedings - 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2016
SP - 168
EP - 175
BT - Proceedings - 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2016
A2 - Cotronis, Yiannis
A2 - Daneshtalab, Masoud
A2 - Papadopoulos, George Angelos
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2016
Y2 - 17 February 2016 through 19 February 2016
ER -