Face Drawing GAN by Channel Attention and Matrix Product Attention

Hideyuki Ogura, Shinya Ezumi, Masaaki Ikehara

研究成果: Conference contribution

抄録

Face photo-sketch synthesis tasks have been developed with Generative Adversarial Networks (GANs) based on Convolutional Neural Network (CNN) and Vision Transformer (ViT). CNN is good at capturing local features, but its locality results in blurred images and contours. ViT is good at capturing global information, but is not as good as CNN in capturing local features, and while it can prevent blurring of contours and other lines, it does not reflect fine texture. Therefore, we propose a Face Drawing GAN, which generates high-quality face sketches by capturing both local and global features. Face Drawing GAN is a CNN-based model and it incorporates Channel Attention, which functionally adjusts the weights of channels, and Matrix Product Attention (MP Attention), which weights pixels based on the similarity between the vertical and horizontal sides of images obtained by matrix product. Through the experiments, we confirmed that our proposed MP Attention assists in capturing global features and Face Drawing GAN is capable of generating face sketches that outperform conventional methods.

本文言語English
ホスト出版物のタイトル2024 IEEE International Conference on Image Processing, ICIP 2024 - Proceedings
出版社IEEE Computer Society
ページ1588-1594
ページ数7
ISBN(電子版)9798350349399
DOI
出版ステータスPublished - 2024
イベント31st IEEE International Conference on Image Processing, ICIP 2024 - Abu Dhabi, United Arab Emirates
継続期間: 2024 10月 272024 10月 30

出版物シリーズ

名前Proceedings - International Conference on Image Processing, ICIP
ISSN(印刷版)1522-4880

Conference

Conference31st IEEE International Conference on Image Processing, ICIP 2024
国/地域United Arab Emirates
CityAbu Dhabi
Period24/10/2724/10/30

ASJC Scopus subject areas

  • ソフトウェア
  • コンピュータ ビジョンおよびパターン認識
  • 信号処理

フィンガープリント

「Face Drawing GAN by Channel Attention and Matrix Product Attention」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル