Multi-script text versus non-text classification of regions in scene images

Sriman, B. & Schomaker, L., Jul-2019, In: Journal of Visual Communication and Image Representation. 62, p. 23-42 (20 p.)

Research output: Contribution to journal › Article › Academic › peer-review


  • Multi-script text versus non-text classification of regions in scene images

    Final author's version, 2 MB, PDF document

    Embargo ends: 15/04/2021

  • Multi-script text versus non-text classification of regions in scene images

    Final publisher's version, 3 MB, PDF document



Text versus non-text region classification is an essential but difficult step in scene-image analysis due to the considerable shape complexity of text and background patterns. There is a high probability of confusion between background elements and letter parts. This paper proposes a feature-based classification of image blocks using the color autocorrelation histogram (CAH) and the scale-invariant feature transform (SIFT) algorithm, yielding a combined scale- and color-invariant feature suitable for scene-text classification. For the evaluation, features were extracted from different color spaces by applying color-histogram autocorrelation. The color features are combined with a SIFT descriptor. Parameter tuning is performed and evaluated. For the classification, a standard nearest-neighbor (1NN) classifier and a support-vector machine (SVM) were compared. The proposed method appears to perform robustly and is especially suitable for Asian scripts such as Kannada and Thai, where urban scene-text fonts are characterized by high curvature and salient color variations.
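
The pipeline summarized in the abstract (a color feature per image block, joined to a local descriptor, then classified by nearest neighbor) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the authors' implementation: the function names are hypothetical, the color feature is one plausible reading of "color-histogram autocorrelation" (a per-channel histogram followed by its circular autocorrelation), and the SIFT descriptor is represented only by a placeholder vector supplied by the caller.

```python
import numpy as np

def color_hist_autocorr(block, bins=16):
    """Per-channel histogram followed by its circular autocorrelation.

    Assumption: one plausible reading of "color-histogram autocorrelation";
    the paper's exact CAH definition may differ.
    block: uint8 array of shape (H, W, 3).
    """
    feats = []
    for c in range(block.shape[2]):
        hist, _ = np.histogram(block[..., c], bins=bins, range=(0, 256))
        hist = hist.astype(float) / max(hist.sum(), 1)
        # Circular autocorrelation via the power spectrum (Wiener-Khinchin).
        spectrum = np.fft.rfft(hist)
        acf = np.fft.irfft(np.abs(spectrum) ** 2, n=bins)
        feats.append(acf)
    return np.concatenate(feats)

def combine_features(color_feat, local_desc):
    """Concatenate the L2-normalized color and local-descriptor vectors."""
    def l2(v):
        n = np.linalg.norm(v)
        return v / n if n > 0 else v
    local_desc = np.asarray(local_desc, dtype=float)
    return np.concatenate([l2(color_feat), l2(local_desc)])

def predict_1nn(train_X, train_y, x):
    """Return the label of the training vector closest to x (Euclidean)."""
    dists = np.linalg.norm(train_X - x, axis=1)
    return train_y[int(np.argmin(dists))]
```

In use, each training block would be mapped to a combined vector, with a 128-dimensional SIFT descriptor computed separately in place of the placeholder, and `predict_1nn` would label a new block as text or non-text. The bin count and the equal weighting of the two feature parts are illustrative choices, standing in for the parameter tuning the abstract mentions.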

Original language: English
Pages (from-to): 23-42
Number of pages: 20
Journal: Journal of Visual Communication and Image Representation
Publication status: Published - Jul-2019


  • Text detection in scene images, Text/non-text classification, Color features, Color histogram autocorrelation, SCALE, RECOGNITION

ID: 80412001