A Method for Extracting Logical Structure from a Document Image
-
- Tateishi Yuka
- IBM Japan Ltd
-
- Itoh Nobuyasu
- IBM Japan Ltd
Bibliographic Information
- Other Title
-
- 文書の論理構造を解釈する一手法
Search this article
Abstract
A method of stochastic syntactic analysis is applied to extracting the logical structure of a printed document from their physical layout and keywords indicating logical components.The document is parsed as a sentence consisting of text lines and graphic objects according to a stochastic regular grammar with attributes.By using stochastic analysis,the parser can retain possible results in order of their probability,so that it selects an optimal result more appropriately than deterministic systems if ambiguity occurs.
Journal
-
- IEICE technical report. Natural language understanding and models of communication
-
IEICE technical report. Natural language understanding and models of communication 94 (291), 25-32, 1994-10-20
The Institute of Electronics, Information and Communication Engineers
- Tweet
Details 詳細情報について
-
- CRID
- 1573950402206333696
-
- NII Article ID
- 110003278414
-
- NII Book ID
- AN10091225
-
- Text Lang
- ja
-
- Data Source
-
- CiNii Articles