Unsupervised Domain Adaptations for Word Sense Disambiguation by Learning under Covariate Shift

Shinnou Hiroyuki, Sasaki Minoru

doi:10.5715/jnlp.21.1011

Bibliographic Information

Other Title

共変量シフト下の学習による語義曖昧性解消の教師なし領域適応
キョウヘンリョウシフトカノガクシュウニヨルゴギアイマイセイカイショウノキョウシナシリョウイキテキオウ

Search this article

Abstract

In this paper, we apply the learning under covariate shift to the problem of unsupervised domain adaptation for word sense disambiguation (WSD). This learning is a type of weighted learning method, in which the probability density ratio w(x) = P_T(x)/P_S(x) is used as the weight of an instance. However, w(x) tends to be small in WSD tasks. In order to address this problem, we calculate w(x) by estimating P_T(x) and P_S(x), where P_S(x) is estimating by regarding the corpus combining the source domain corpus and target domain corpus as the source domain corpus. In the experiment, we use three domains -OC (Yahoo! Chiebukuro), PB (books) and PN (news papers)- in BCCWJ, and 16 target words provided by the Japanese WSD task in SemEval-2. For calculating w(x), we also use uLSIF, which directly estimates w(x) without estimating P_T(x) or P_S(x). Moreover, we use the “p power” method and the “relative probability density ratio” method to boost the obtained probability density ratio. These experiments prove our method to be effective.

Journal

Journal of Natural Language Processing

Journal of Natural Language Processing 21 (5), 1011-1035, 2014

The Association for Natural Language Processing

Keywords

Details 詳細情報について

Export

Unsupervised Domain Adaptations for Word Sense Disambiguation by Learning under Covariate Shift

Bibliographic Information

Search this article

Abstract

Journal

Citations (1)*help

References(6)*help

Related Projects

Keywords

Details 詳細情報について

Export

Report a problem

Unsupervised Domain Adaptations for Word Sense Disambiguation by Learning under Covariate Shift

Bibliographic Information

Search this article

Abstract

Journal

Citations (1)*help

References(6)*help

Related Projects

Keywords

Details 詳細情報について

Export

Report a problem

Project list