Steering time-dependent estimation of posteriors with hyperparameter indexing in Bayesian topic models

この論文をさがす

抄録

This paper provides a new approach to topical trend analysis. Our aim is to improve the generalization power of latent Dirichlet allocation (LDA) by using document timestamps. Many previous works model topical trends by making latent topic distributions time-dependent. We propose a straightforward approach by preparing a different word multinomial distribution for each time point. Since this approach increases the number of parameters, overfitting becomes a critical issue. Our contribution to this issue is two-fold. First, we propose an effective way of defining Dirichlet priors over the word multinomials. Second, we propose a special scheduling of variational Bayesian (VB) inference. Comprehensive experiments with six datasets prove that our approach can improve LDA and also Topics over Time, a well-known variant of LDA, in terms of test data perplexity in the framework of VB inference.

Lecture Notes in Computer Science, 6634 LNAI(PART 1), pp.435-447; 2011

収録刊行物

詳細情報 詳細情報について

問題の指摘

ページトップへ