Speeding up Support Vector Machines for Named Entity Recognition (固有表現抽出のためのSVMの高速化)

Hideki Isozaki (磯崎 秀樹), Hideto Kazawa (賀沢 秀人)

Bibliographic Information

Alternative Titles

Koyū hyōgen chūshutsu no tame no SVM no kōsokuka
Speeding up Support Vector Machines for Named Entity Recognition
Natural language

Abstract

The Support Vector Machine (SVM) is a powerful new machine learning method. However, it is well known that its classification speed is orders of magnitude slower than that of conventional systems. First, we show experimentally that a Named Entity (NE) recognizer based on SVMs gives better scores than conventional systems. Named Entity recognition is a task in which proper nouns and numerical information are extracted from documents and classified into categories such as location, person, organization, and date. It is a key technology for Information Extraction and Open-Domain Question Answering. Then, we present an algorithm that makes the system substantially faster by exploiting characteristics of NE data. This algorithm should be applicable to various other tasks in Natural Language Processing.

Journal

IPSJ Journal (情報処理学会論文誌)

IPSJ Journal 44 (3), 970-979, 2003-03-15

Tokyo: Information Processing Society of Japan (情報処理学会)

Details

CRID: 1050001337883882880

NII Article ID: 110002765076; 10011463698

NII Book ID: AN00116647

ISSN: 18827764; 03875806

NDL BIB ID: 6492691

Web Site: http://id.nii.ac.jp/1001/00011306/; https://ndlsearch.ndl.go.jp/books/R000000004-I6492691

Text language code: ja

Material type: journal article

Data Source

IRDB
NDL
CiNii Articles
