Utilizing a Phrase Translation Table and Portion of Patent Families with No Parallel Sentences Extracted in Estimating Translation of Technical Terms

  • Itsuki Toyota
    Graduate School of Systems and Information Engineering, University of Tsukuba
  • Zi Long
    Graduate School of Systems and Information Engineering, University of Tsukuba
  • Lijuan Dong
    Graduate School of Systems and Information Engineering, University of Tsukuba
  • Takehito Utsuro
    Faculty of Engineering, Information and Systems, University of Tsukuba
  • Mikio Yamamoto
    Faculty of Engineering, Information and Systems, University of Tsukuba

Bibliographic Information

Other Title
  • パテントファミリーを用いた専門用語訳語獲得における対訳文対非抽出部分およびフレーズテーブルの利用

Abstract

In previous methods for generating a bilingual lexicon from parallel patent sentences extracted from patent families, parallel sentences are extracted from only about 30% of the "Background" and "Embodiment" parts, and the remaining 70% or so goes unused. This paper therefore proposes generating a bilingual lexicon for technical terms not only from that 30% but also from the remaining 70% of the "Background" and "Embodiment" parts. The proposed method applies the compositional translation estimation technique and uses the remaining 70% as a comparable corpus for validating translation candidates. As bilingual constituent lexicons for compositional translation, we use an existing bilingual lexicon together with a phrase translation table trained on the parallel patent sentences extracted from the 30%. Finally, we show that about 7,300 technical term translation pairs can be acquired from 1,000 patent families.
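As a rough illustration of the idea in the abstract, the following minimal Python sketch composes translation candidates for a multi-constituent term from a constituent lexicon and then keeps only the candidates attested in target-side comparable text. It is not the authors' implementation. The function names, the toy Japanese-English entries, the fixed constituent order, and the plain substring check used for validation are all assumptions made for this example; in the paper the constituent lexicon combines an existing bilingual lexicon with a phrase translation table, which a single dictionary stands in for here.

```python
from itertools import product


def compose_candidates(constituents, lexicon):
    """Build target-language candidates by translating each constituent
    independently and joining the translations in the original order.

    Assumption: constituent order is preserved in the target language.
    """
    options = []
    for constituent in constituents:
        translations = lexicon.get(constituent)
        if not translations:
            # A constituent with no known translation blocks composition.
            return []
        options.append(translations)
    return [" ".join(parts) for parts in product(*options)]


def validate(candidates, comparable_text):
    """Keep only candidates that occur in the target-side comparable text.

    Assumption: a simple substring test stands in for corpus validation.
    """
    return [c for c in candidates if c in comparable_text]


# Hypothetical constituent lexicon (toy entries, not from the paper).
lexicon = {
    "光": ["optical", "light"],
    "ファイバ": ["fiber"],
    "増幅器": ["amplifier", "booster"],
}

# Hypothetical target-side text from the part of a patent family
# where no parallel sentences were extracted.
comparable_text = "the optical fiber amplifier is placed after the light source"

constituents = ["光", "ファイバ", "増幅器"]
candidates = compose_candidates(constituents, lexicon)
accepted = validate(candidates, comparable_text)
print(accepted)
```

With this toy data, four candidates are composed and only the one that appears in the comparable text survives validation. A real system would also need term segmentation, candidate scoring, and handling of reordering, none of which is shown here.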

Journal

  • IPSJ SIG Notes

    IPSJ SIG Notes 2013 (12), 1-8, 2013-07-11

    Information Processing Society of Japan (IPSJ)

Details

  • CRID
    1573387452695535360
  • NII Article ID
    110009586063
  • NII Book ID
    AN10115061
  • Text Lang
    ja
  • Data Source
    • CiNii Articles
