Spectral Fluctuation Method: A Texture-Based Method to Extract Text Regions in General Scene Images

  • BABA Yoichiro
    Department of Electronic Engineering, The University of Tokyo
  • HIROSE Akira
    Department of Electronic Engineering, The University of Tokyo

Search this article

Abstract

To obtain text information included in a scene image, we first need to extract text regions from the image before recognizing the text. In this paper, we examine human vision and propose a novel method to extract text regions by evaluating textural variation. Human beings are often attracted by textural variation in scenes, which causes foveation. We frame a hypothesis that texts also have similar property that distinguishes them from the natural background. In our method, we calculate spatial variation of texture to obtain the distribution of the degree of likelihood of text region. Here we evaluate the changes in local spatial spectrum as the textural variation. We investigate two options to evaluate the spectrum, that is, those based on one- and two-dimensional Fourier transforms. In particular, in this paper, we put emphasis on the one-dimensional transform, which functions like the Gabor filter. The proposal can be applied to a wide range of characters mainly because it employs neither templates nor heuristics concerning character size, aspect ratio, specific direction, alignment, and so on. We demonstrate that the method effectively extracts text regions contained in various general scene images. We present quantitative evaluation of the method by using databases open to the public.

Journal

References(27)*help

See more

Related Projects

See more

Details 詳細情報について

Report a problem

Back to top