Modifying Existing Analogy-based N-gram Language Model

この論文をさがす

抄録

By investigating the occurrence of different proportional analogies in corpora, this paper describes an approach to increase the performance of existing analogy-based N-gram language models evaluated by perplexity. Our approach consists in using analogy to reconstruct N-grams from the test data so as to give higher probabilities to these N-grams. By giving different weights to different patterns, we also except that some N-grams which can be reconstructed by different patterns will get more accurate probabilities. The use of suffix arrays for data searching leads to a lesser computation time on text scoring tasks.

収録刊行物

詳細情報 詳細情報について

  • CRID
    1573950402681238016
  • NII論文ID
    110009659641
  • NII書誌ID
    AN10115061
  • 本文言語コード
    en
  • データソース種別
    • CiNii Articles

問題の指摘

ページトップへ