Tug-of-war model for multi-armed bandit problem

Song Ju Kim, Masashi Aono, Masahiko Hara

Research output: Conference contribution

17 citations (Scopus)

Abstract

We propose the "tug-of-war (TOW) model", which conducts unique parallel searches using many nonlocally correlated search agents. The model is based on a property of a single-celled amoeba, the true slime mold Physarum, which maintains a constant intracellular resource volume while collecting environmental information by concurrently expanding and shrinking its branches. This conservation law entails a "nonlocal correlation" among the branches, i.e., a volume increment in one branch is immediately compensated by volume decrement(s) in the other branch(es). This nonlocal correlation was shown to be useful for decision making in the case of a dilemma. The multi-armed bandit problem is to determine the optimal strategy for maximizing the total reward under incompatible demands. Our model manages this "exploration-exploitation dilemma" efficiently and performs well: its average accuracy rate is higher than those of well-known algorithms such as the modified ε-greedy algorithm and the modified softmax algorithm.
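
The conservation constraint described in the abstract can be illustrated with a toy sketch. This is not the paper's formulation: the function names `displacements` and `run_toy_bandit`, the score update (+1 on a reward, minus a hypothetical `penalty` otherwise), the Gaussian noise term, and all parameter values are assumptions chosen only to show how a zero-sum coupling between branches can drive arm selection.

```python
import random


def displacements(scores):
    """Zero-sum displacement per branch: own score minus the mean of the others.

    Raising one branch's score by d raises its displacement by d and lowers
    every other displacement by d / (n - 1), so the total stays at zero.
    """
    n = len(scores)
    if n < 2:
        raise ValueError("need at least two arms")
    total = sum(scores)
    return [s - (total - s) / (n - 1) for s in scores]


def run_toy_bandit(reward_probs, steps, penalty=1.0, noise=0.5, seed=0):
    """Play a Bernoulli bandit by pulling the arm with the largest noisy displacement.

    All update rules here are illustrative assumptions, not the published model.
    """
    rng = random.Random(seed)
    n = len(reward_probs)
    scores = [0.0] * n   # running score per arm
    pulls = [0] * n      # how often each arm was played
    total_reward = 0
    for _ in range(steps):
        x = displacements(scores)
        # Zero-mean noise keeps some exploration alive.
        noisy = [xk + rng.gauss(0.0, noise) for xk in x]
        arm = max(range(n), key=noisy.__getitem__)
        rewarded = rng.random() < reward_probs[arm]
        scores[arm] += 1.0 if rewarded else -penalty
        pulls[arm] += 1
        total_reward += int(rewarded)
    return pulls, total_reward


if __name__ == "__main__":
    pulls, total = run_toy_bandit([0.2, 0.8], steps=1000)
    print("pulls per arm:", pulls, "total reward:", total)
```

Because the displacements always sum to zero, evidence in favor of one arm automatically counts against all the others, which is the coupling the abstract calls a nonlocal correlation.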

Original language: English
Title of host publication: Unconventional Computation - 9th International Conference, UC 2010, Proceedings
Pages: 69-80
Number of pages: 12
Publication status: Published - 2010
Externally published: Yes
Event: 9th International Conference on Unconventional Computation, UC 2010 - Tokyo, Japan
Duration: 21 Jun 2010 to 25 Jun 2010

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 6079 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 9th International Conference on Unconventional Computation, UC 2010
Country/Territory: Japan
City: Tokyo
Period: 21 Jun 2010 to 25 Jun 2010

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science
