Steering time-dependent estimation of posteriors with hyperparameter indexing in Bayesian topic models
Abstract
This paper provides a new approach to topical trend analysis. Our aim is to improve the generalization power of latent Dirichlet allocation (LDA) by using document timestamps. Many previous works model topical trends by making latent topic distributions time-dependent. We propose a straightforward approach by preparing a different word multinomial distribution for each time point. Since this approach increases the number of parameters, overfitting becomes a critical issue. Our contribution to this issue is two-fold. First, we propose an effective way of defining Dirichlet priors over the word multinomials. Second, we propose a special scheduling of variational Bayesian (VB) inference. Comprehensive experiments with six datasets prove that our approach can improve LDA and also Topics over Time, a well-known variant of LDA, in terms of test data perplexity in the framework of VB inference.
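The evaluation measure named in the abstract, test data perplexity, can be illustrated for a model that keeps a separate word multinomial per topic and time point. The following is a minimal sketch under stated assumptions, not the authors' implementation: the array layouts (`counts[t, k, v]`, `theta[d, k]`, `phi[t, k, v]`), the function names, and the symmetric Dirichlet smoothing parameter `beta` are all hypothetical, and the paper's own prior construction and VB scheduling are not reproduced here.

```python
import numpy as np

def time_indexed_word_dists(counts, beta=0.1):
    """Posterior-mean word multinomials, one per (time point, topic).

    counts[t, k, v] is a hypothetical array of word counts assigned to
    topic k at time point t. A symmetric Dirichlet(beta) prior is assumed
    purely for illustration; it is not the prior proposed in the paper.
    """
    smoothed = counts + beta
    return smoothed / smoothed.sum(axis=-1, keepdims=True)

def perplexity(docs, timestamps, theta, phi):
    """Per-token perplexity of held-out documents.

    docs: list of integer arrays of word ids
    timestamps[d]: time-point index of document d
    theta[d, k]: topic proportions of document d
    phi[t, k, v]: time-indexed word distributions
    """
    log_lik, n_tokens = 0.0, 0
    for d, words in enumerate(docs):
        # p(w | d) = sum_k theta[d, k] * phi[t_d, k, w]
        word_probs = theta[d] @ phi[timestamps[d]][:, words]
        log_lik += np.log(word_probs).sum()
        n_tokens += len(words)
    return float(np.exp(-log_lik / n_tokens))
```

Because `phi` carries one distribution per time point and topic, the parameter count grows linearly with the number of time points, which is the overfitting risk the abstract points to; lower perplexity on held-out documents indicates better generalization.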
Lecture Notes in Computer Science, 6634 LNAI(PART 1), pp.435-447; 2011
Published in
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 6634 LNAI (PART 1), 435-447, 2011
Springer Verlag
Details
- CRID: 1050850247193151360
- NII Article ID: 120006985071
- NII Bibliographic ID: AA0071599X
- ISSN: 03029743, 16113349
- HANDLE: 10069/25516
- Text language code: ja
- Resource type: journal article
- Data source type: IRDB, CiNii Articles