TY - GEN
T1 - Human Latent Metrics
T2 - 19th ACM Symposium on Applied Perception, SAP 2022
AU - Shimizu, Kye
AU - Ienaga, Naoto
AU - Takada, Kazuma
AU - Sugimoto, Maki
AU - Kasahara, Shunichi
N1 - Funding Information:
This work was supported by JST Moonshot R&D Program (Grant Number JPMJMS2013). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of any funding agencies.
Publisher Copyright:
© 2022 ACM.
PY - 2022/9/22
Y1 - 2022/9/22
N2 - Generative adversarial networks (GANs) generate high-dimensional vector spaces (latent spaces) that can interchangeably represent vectors as images. Advancements have extended their ability to computationally generate images indistinguishable from real images such as faces, and more importantly, to manipulate images using their inherit vector values in the latent space. This interchangeability of latent vectors has the potential to calculate not only the distance in the latent space, but also the human perceptual and cognitive distance toward images, that is, how humans perceive and recognize images. However, it is still unclear how the distance in the latent space correlates with human perception and cognition. Our studies investigated the relationship between latent vectors and human perception or cognition through psycho-visual experiments that manipulates the latent vectors of face images. In the perception study, a change perception task was used to examine whether participants could perceive visual changes in face images before and after moving an arbitrary distance in the latent space. In the cognition study, a face recognition task was utilized to examine whether participants could recognize a face as the same, even after moving an arbitrary distance in the latent space. Our experiments show that the distance between face images in the latent space correlates with human perception and cognition for visual changes in face imagery, which can be modeled with a logistic function. By utilizing our methodology, it will be possible to interchangeably convert between the distance in the latent space and the metric of human perception and cognition, potentially leading to image processing that better reflects human perception and cognition.
AB - Generative adversarial networks (GANs) generate high-dimensional vector spaces (latent spaces) that can interchangeably represent vectors as images. Advancements have extended their ability to computationally generate images indistinguishable from real images such as faces, and more importantly, to manipulate images using their inherit vector values in the latent space. This interchangeability of latent vectors has the potential to calculate not only the distance in the latent space, but also the human perceptual and cognitive distance toward images, that is, how humans perceive and recognize images. However, it is still unclear how the distance in the latent space correlates with human perception and cognition. Our studies investigated the relationship between latent vectors and human perception or cognition through psycho-visual experiments that manipulates the latent vectors of face images. In the perception study, a change perception task was used to examine whether participants could perceive visual changes in face images before and after moving an arbitrary distance in the latent space. In the cognition study, a face recognition task was utilized to examine whether participants could recognize a face as the same, even after moving an arbitrary distance in the latent space. Our experiments show that the distance between face images in the latent space correlates with human perception and cognition for visual changes in face imagery, which can be modeled with a logistic function. By utilizing our methodology, it will be possible to interchangeably convert between the distance in the latent space and the metric of human perception and cognition, potentially leading to image processing that better reflects human perception and cognition.
KW - change perception
KW - face cognition
KW - generative adversarial networks
UR - http://www.scopus.com/inward/record.url?scp=85139408825&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85139408825&partnerID=8YFLogxK
U2 - 10.1145/3548814.3551460
DO - 10.1145/3548814.3551460
M3 - Conference contribution
AN - SCOPUS:85139408825
T3 - Proceedings - SAP 2022: ACM Symposium on Applied Perception
BT - Proceedings - SAP 2022
A2 - Spencer, Stephen N.
PB - Association for Computing Machinery, Inc
Y2 - 22 September 2022 through 23 September 2022
ER -