TY - JOUR
T1 - Stimulus-specific random effects inflate false-positive classification accuracy in multivariate-voxel-pattern-analysis
T2 - A solution with generalized mixed-effects modelling
AU - Kajimura, Shogo
AU - Hoshino, Takahiro
AU - Murayama, Kou
N1 - Funding Information:
This research was supported by the Leverhulme Trust (Grant Number RL-2016-030); Jacobs Foundation Research Fellowship; and the Alexander von Humboldt Foundation (the Alexander von Humboldt Professorship endowed by the German Federal Ministry of Education and Research).
Publisher Copyright:
© 2023
PY - 2023/4/1
Y1 - 2023/4/1
N2 - When conducting multivariate-voxel pattern analysis (MVPA), researchers typically compute the average accuracy for each subject and statistically test if the average accuracy is different from the chance level across subjects (by-subject analysis). We argue that this traditional by-subject analysis leads to inflated Type-1 error rates, regardless of the type of machine learning method used (e.g., support vector machine). This is because by-subject analysis does not consider the variance attributed to the idiosyncratic features of the stimuli that have a common influence on all subjects (i.e., the random stimulus effect). As a solution, we proposed the use of generalized linear mixed-effects modelling to evaluate average accuracy. This method only requires post-classification data (i.e., it does not consider the type of classification methods used) and is easily implemented in the analysis pipeline with common statistical software (SPSS, R, Python, etc.). Using both statistical simulation and real fMRI data analysis, we demonstrated that the traditional by-subject method indeed increases Type-1 error rates to a considerable degree, while generalized mixed-effects modelling that incorporates random stimulus effects can indeed maintain the nominal Type-1 error rates.
AB - When conducting multivariate-voxel pattern analysis (MVPA), researchers typically compute the average accuracy for each subject and statistically test if the average accuracy is different from the chance level across subjects (by-subject analysis). We argue that this traditional by-subject analysis leads to inflated Type-1 error rates, regardless of the type of machine learning method used (e.g., support vector machine). This is because by-subject analysis does not consider the variance attributed to the idiosyncratic features of the stimuli that have a common influence on all subjects (i.e., the random stimulus effect). As a solution, we proposed the use of generalized linear mixed-effects modelling to evaluate average accuracy. This method only requires post-classification data (i.e., it does not consider the type of classification methods used) and is easily implemented in the analysis pipeline with common statistical software (SPSS, R, Python, etc.). Using both statistical simulation and real fMRI data analysis, we demonstrated that the traditional by-subject method indeed increases Type-1 error rates to a considerable degree, while generalized mixed-effects modelling that incorporates random stimulus effects can indeed maintain the nominal Type-1 error rates.
KW - Generalized mixed-effects modelling
KW - Group-level analysis
KW - Multivariate-voxel-pattern-analysis
KW - Random stimulus effect
KW - Type-1 error
UR - http://www.scopus.com/inward/record.url?scp=85147216729&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85147216729&partnerID=8YFLogxK
U2 - 10.1016/j.neuroimage.2023.119901
DO - 10.1016/j.neuroimage.2023.119901
M3 - Article
C2 - 36706939
AN - SCOPUS:85147216729
SN - 1053-8119
VL - 269
JO - NeuroImage
JF - NeuroImage
M1 - 119901
ER -