Advantage Mapping: Learning Operation Mapping for User-Preferred Manipulation by Extracting Scenes with Advantage Function

Rintaro Hasegawa, Yosuke Fukuchi, Kohei Okuoka, Michita Imai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

When a user manipulates a system, a user input through an interface, or an operation, is converted to the user's intended action according to the mapping that links operations and actions, which we call "operation mapping". Although many operation mappings are created by designers assuming how a typical user would operate the system, the optimal operation mapping may vary from user to user. The designer cannot prepare in advance all possible operation mappings. One approach to solve this problem involves autonomous learning of an operation mapping during the operation. However, existing methods require manual preparation of scenes for learning mappings. We propose advantage mapping, which enables the efficient learning of operation mappings. Working from the idea that scenes in which the user's desired action is predictable are useful for learning operation mappings, advantage mapping extracts scenes according to the magnitude of entropy in the output of the action value function acquired from reinforcement learning. In our experiment, the user's ideal operation mapping was more accurately obtained from the scenes selected by advantage mapping than from learning through actual play.

Original languageEnglish
Title of host publicationHAI 2022 - Proceedings of the 10th Conference on Human-Agent Interaction
PublisherAssociation for Computing Machinery, Inc
Pages95-103
Number of pages9
ISBN (Electronic)9781450393232
DOIs
Publication statusPublished - 2022 Dec 5
Event10th Conference on Human-Agent Interaction, HAI 2022 - Christchurch, New Zealand
Duration: 2022 Dec 52022 Dec 8

Publication series

NameHAI 2022 - Proceedings of the 10th Conference on Human-Agent Interaction

Conference

Conference10th Conference on Human-Agent Interaction, HAI 2022
Country/TerritoryNew Zealand
CityChristchurch
Period22/12/522/12/8

Keywords

  • adaptive systems
  • intelligent user interfaces
  • reinforcement learning

ASJC Scopus subject areas

  • Artificial Intelligence
  • Human-Computer Interaction
  • Software

Fingerprint

Dive into the research topics of 'Advantage Mapping: Learning Operation Mapping for User-Preferred Manipulation by Extracting Scenes with Advantage Function'. Together they form a unique fingerprint.

Cite this