Character Segmentation and Recognition of Alphanumeric-mixed Documents Based on Pattern Recognition Information

Hongo Yasuo

doi:10.1541/ieejeiss1987.122.6_928

【Created on October 31, 2023】 Integration of CiNii Dissertations and CiNii Books into CiNii Research

Impact of the Release of the New "NDL Search" on CiNii Services

Character Segmentation and Recognition of Alphanumeric-mixed Documents Based on Pattern Recognition Information

DOI Web Site Web Site 18 References

Hongo Yasuo

Fuji Electric Co., Ltd.

Bibliographic Information

Other Title

認識情報を利用した英数字混在文書からの文字切出しと認識
ニンシキジョウホウオリヨウシタエイスウジコンザイブンショカラノモジキリダシトニンシキ

Search this article

Abstract

Generally speaking, Japanese OCR cannot easily read Japanese documents that also contain alphanumeric data, bacause of the proportional pitch setting of alphanumeric characters displaced in the fixed pitch setting of the Japanese document.<br> This paper describes how to extract character candidates from combinations of small patterns that may be components of separable Japanese characters or slim patterns as alphanumeric characters, and how to select true character patterns from character candidates. We propose a new segmentation and recognition method for alphanumeric-mixed documents based on pattern recognition information such as similarities, pattern sizes and character kinds.<br> The method was tested on alphanumeric-mixed documents, which were 51 pages of technical journals and transactions containing 68, 867 characters. The resulting segmentation rate was 99.75% and the recognition rate was 99.05%, so we conclude that this method may be applied to Japanese OCR.

Journal

IEEJ Transactions on Electronics, Information and Systems

IEEJ Transactions on Electronics, Information and Systems 122 (6), 928-935, 2002

The Institute of Electrical Engineers of Japan

References(18)*help

Keywords

Details 詳細情報について

CRID

1390001204609919104
NII Article ID

130006845174

10008508962
NII Book ID

AN10065950
DOI

10.1541/ieejeiss1987.122.6_928
ISSN

13488155

03854221
NDL BIB ID

6174783
Web Site

https://ndlsearch.ndl.go.jp/books/R000000004-I6174783

https://www.jstage.jst.go.jp/article/ieejeiss1987/122/6/122_6_928/_pdf
Data Source
- JaLC
- NDL
- Crossref
- CiNii Articles
Abstract License Flag
Disallowed

Character Segmentation and Recognition of Alphanumeric-mixed Documents Based on Pattern Recognition Information

Bibliographic Information

Search this article

Abstract

Journal

References(18)*help

Keywords

Details 詳細情報について

Export

Report a problem

Character Segmentation and Recognition of Alphanumeric-mixed Documents Based on Pattern Recognition Information

Bibliographic Information

Search this article

Abstract

Journal

References(18)*help

Keywords

Details 詳細情報について

Export

Report a problem

Project list