Accelerating Distributed Deep Reinforcement Learning by In-Network Experience Sampling

Masaki Furukawa, Hiroki Matsutani

研究成果: Conference contribution

抄録

A computing cluster that interconnects multiple compute nodes is used to accelerate distributed reinforcement learning based on DQN (Deep Q-Network). In distributed reinforcement learning, Actor nodes acquire experiences by interacting with a given environment and a Learner node optimizes their DQN model. Since data transfer between Actor and Learner nodes increases depending on the number of Actor nodes and their experience size, communication overhead between them is one of major performance bottlenecks. In this paper, their communication performance is optimized by using DPDK (Data Plane Development Kit). Specifically, DPDK-based low-latency experience replay memory server is deployed between Actor and Learner nodes interconnected with a 40GbE (40Gbit Ethernet) network. Evaluation results show that, as a network optimization technique, kernel bypassing by DPDK reduces network access latencies to a shared memory server by 32.7% to 58.9%. As another network optimization technique, an in-network experience replay memory server between Actor and Learner nodes reduces access latencies to the experience replay memory by 11.7% to 28.1% and communication latencies for prioritized experience sampling by 21.9% to 29.1%.

本文言語English
ホスト出版物のタイトルProceedings - 30th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2022
編集者Arturo Gonzalez-Escribano, Jose Daniel Garcia, Massimo Torquati, Amund Skavhaug
出版社Institute of Electrical and Electronics Engineers Inc.
ページ75-82
ページ数8
ISBN(電子版)9781665469586
DOI
出版ステータスPublished - 2022
イベント30th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2022 - Valladolid, Spain
継続期間: 2022 3月 92022 3月 11

出版物シリーズ

名前Proceedings - 30th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2022

Conference

Conference30th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2022
国/地域Spain
CityValladolid
Period22/3/922/3/11

ASJC Scopus subject areas

  • コンピュータ ネットワークおよび通信
  • ハードウェアとアーキテクチャ
  • 情報システムおよび情報管理

フィンガープリント

「Accelerating Distributed Deep Reinforcement Learning by In-Network Experience Sampling」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル