TY - GEN
T1 - Dynamic Dual Sparse Topic Model
T2 - 16th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2024
AU - Masuda, Tatsuki
AU - Nakagawa, Kei
AU - Hoshino, Takahiro
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Topic modeling is a statistical technique that identifies underlying abstract 'topics' in a corpus of texts. Notable among current topic modeling methodologies are the Dynamic Topic Model (DTM), which captures the temporal dynamics of topics, and the Dynamic Sparse Model (DSM), which emphasizes the sparsity in the distributions of topics and words. However, no existing model appropriately integrates both the temporal progression and the sparsity within these distributions. Therefore, in this study, we propose the Dynamic Dual Sparse Topic Model (DDSTM), which employs the Spike and Slab Prior as its prior distribution, thereby extending the capabilities of both DTM and DSM. The DDSTM is distinct in its ability to simultaneously manage the temporal dynamics and sparsity in topic distributions, significantly enhancing the model's interpretability. Theoretically, we prove that the DDSTM maintains the same level of sparsity as the existing DSM models. In the empirical analysis, we use real-world text data to examine the characteristics of the DDSTM compared to existing topic models.
AB - Topic modeling is a statistical technique that identifies underlying abstract 'topics' in a corpus of texts. Notable among current topic modeling methodologies are the Dynamic Topic Model (DTM), which captures the temporal dynamics of topics, and the Dynamic Sparse Model (DSM), which emphasizes the sparsity in the distributions of topics and words. However, no existing model appropriately integrates both the temporal progression and the sparsity within these distributions. Therefore, in this study, we propose the Dynamic Dual Sparse Topic Model (DDSTM), which employs the Spike and Slab Prior as its prior distribution, thereby extending the capabilities of both DTM and DSM. The DDSTM is distinct in its ability to simultaneously manage the temporal dynamics and sparsity in topic distributions, significantly enhancing the model's interpretability. Theoretically, we prove that the DDSTM maintains the same level of sparsity as the existing DSM models. In the empirical analysis, we use real-world text data to examine the characteristics of the DDSTM compared to existing topic models.
KW - dynamic topic model
KW - sparse topic model
KW - spike and slab prior
KW - topic model
UR - https://www.scopus.com/pages/publications/85208102944
UR - https://www.scopus.com/inward/citedby.url?scp=85208102944&partnerID=8YFLogxK
U2 - 10.1109/IIAI-AAI63651.2024.00063
DO - 10.1109/IIAI-AAI63651.2024.00063
M3 - Conference contribution
AN - SCOPUS:85208102944
T3 - Proceedings - 2024 16th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2024
SP - 299
EP - 304
BT - Proceedings - 2024 16th IIAI International Congress on Advanced Applied Informatics, IIAI-AAI 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 6 July 2024 through 12 July 2024
ER -