-
- 本郷 保夫
- 富士電機
書誌事項
- タイトル別名
-
- Character Segmentation and Recognition of Alphanumeric-mixed Documents Based on Pattern Recognition Information
- ニンシキ ジョウホウ オ リヨウ シタ エイスウジ コンザイ ブンショ カラ ノ モジ キリダシ ト ニンシキ
この論文をさがす
抄録
Generally speaking, Japanese OCR cannot easily read Japanese documents that also contain alphanumeric data, bacause of the proportional pitch setting of alphanumeric characters displaced in the fixed pitch setting of the Japanese document.<br> This paper describes how to extract character candidates from combinations of small patterns that may be components of separable Japanese characters or slim patterns as alphanumeric characters, and how to select true character patterns from character candidates. We propose a new segmentation and recognition method for alphanumeric-mixed documents based on pattern recognition information such as similarities, pattern sizes and character kinds.<br> The method was tested on alphanumeric-mixed documents, which were 51 pages of technical journals and transactions containing 68, 867 characters. The resulting segmentation rate was 99.75% and the recognition rate was 99.05%, so we conclude that this method may be applied to Japanese OCR.
収録刊行物
-
- 電気学会論文誌C(電子・情報・システム部門誌)
-
電気学会論文誌C(電子・情報・システム部門誌) 122 (6), 928-935, 2002
一般社団法人 電気学会
- Tweet
詳細情報 詳細情報について
-
- CRID
- 1390001204609919104
-
- NII論文ID
- 130006845174
- 10008508962
-
- NII書誌ID
- AN10065950
-
- ISSN
- 13488155
- 03854221
-
- NDL書誌ID
- 6174783
-
- データソース種別
-
- JaLC
- NDL
- Crossref
- CiNii Articles
-
- 抄録ライセンスフラグ
- 使用不可