Balancing up Efficiency and Accuracy in Translation Retrieval.

Baldwin Timothy, Tanaka Hozumi

doi:10.5715/jnlp.8.2_19

この論文をさがす

抄録

This research looks at the effects of segment order and segmentation on translation retrieval performance for an experimental Japanese-English translation memory system. We implement a number of both bag-of-words and segment order-sensitive string comparison methods, and test each over character-based and word-based indexing. The translation retrieval performance of each system configuration is evaluated empirically through the notion of segment edit distance between the translation output and model translation. Our results indicate that character-based indexing is consistently superior to word-based indexing in terms of raw accuracy, although segmentation does have an accelerating effect on TM search times in combination with a number of retrieval optimisation techniques. Segment order-sensitive approaches are demonstrated to generally outperform bag-of-words methods, with 3-operation edit distance proving the most effective comparison method. We additionally reproduced the same basic results over alphabetised data as for lexically differentiated data containing kanji characters.

収録刊行物

自然言語処理

自然言語処理 8 (2), 19-37, 2001

一般社団法人　言語処理学会

キーワード

詳細情報詳細情報について

CRID: 1390282679450639232

NII論文ID: 130004292152; 10008830400

NII書誌ID: AN10472659

DOI: 10.5715/jnlp.8.2_19

ISSN: 21858314; 13407619

NDL書誌ID: 5759148

Web Site: https://ndlsearch.ndl.go.jp/books/R000000004-I5759148; http://www.jstage.jst.go.jp/article/jnlp1994/8/2/8_2_19/_pdf

本文言語コード: en

データソース種別

JaLC
NDL
Crossref
CiNii Articles

抄録ライセンスフラグ: 使用不可

Balancing up Efficiency and Accuracy in Translation Retrieval.

この論文をさがす

抄録

収録刊行物

被引用文献 (1)*注記

参考文献 (16)*注記

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

Balancing up Efficiency and Accuracy in Translation Retrieval.

この論文をさがす

抄録

収録刊行物

被引用文献 (1)*注記

参考文献 (16)*注記

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について