Improvement in Domain Specific Word Segmentation by Symbol Grounding

Bibliographic Information

Other Title
  • シンボルグラウンディングによる分野特有の単語分割の精度向上
  • シンボルグラウンディング ニ ヨル ブンヤ トクユウ ノ タンゴ ブンカツ ノ セイド コウジョウ

Search this article

Abstract

<p>We propose a novel framework for improving a word segmenter using information acquired from symbol grounding. The framework uses a dataset consisting of pairs of non-textual information and a commentary. We generate a pseudo-stochastically segmented corpus from the commentaries, and then build a neural network to predict relationships between non-textual information and the words. We generate a domain specific term dictionary by using the neural network for word segmenter. We applied our method to game records of Japanese chess with commentaries. The experimental results show that the accuracy of a word segmenter can be improved by incorporating the generated dictionary. </p>

Journal

References(13)*help

See more

Related Projects

See more

Details 詳細情報について

Report a problem

Back to top