Pose-aware Disentangled Multiscale Transformer for Pose Guided Person Image Generation

Kei Shibasaki, Masaaki Ikehara

研究成果: Conference contribution

1 被引用数 (Scopus)

抄録

Pose Guided Person Image Generation (PGPIG) is the task that transforms the pose of a person's image from the source image, its pose information, and the target pose information. Most existing PGPIG methods require additional pose information or tasks, which limits their application. Moreover, they use CNNs, which can only extract features from neighboring pixels and cannot consider the consistency of the entire image. This paper proposes a PGPIG network solving these problems by using a module containing Axial Transformers with large receptive field. The proposed method disentangles the PGPIG task into two subtasks: “rough pose transformation” and “detailed texture generation”. In the former task, low-resolution feature maps are transformed by blocks containing Axial Transformer. The latter task uses a CNN network with Adaptive Instance Normalization. Experiments show that the proposed method has competitive performance with other state-of-the-art methods. Furthermore, despite achieving excellent performance, the proposed network has a significantly fewer parameters than existing methods.

本文言語English
ホスト出版物のタイトル31st European Signal Processing Conference, EUSIPCO 2023 - Proceedings
出版社European Signal Processing Conference, EUSIPCO
ページ506-510
ページ数5
ISBN(電子版)9789464593600
DOI
出版ステータスPublished - 2023
イベント31st European Signal Processing Conference, EUSIPCO 2023 - Helsinki, Finland
継続期間: 2023 9月 42023 9月 8

出版物シリーズ

名前European Signal Processing Conference
ISSN(印刷版)2219-5491

Conference

Conference31st European Signal Processing Conference, EUSIPCO 2023
国/地域Finland
CityHelsinki
Period23/9/423/9/8

ASJC Scopus subject areas

  • 信号処理
  • 電子工学および電気工学

フィンガープリント

「Pose-aware Disentangled Multiscale Transformer for Pose Guided Person Image Generation」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル