Arabic Handwritten Text to Line Segmentation
Published in: | 2021 International Conference on Information Systems and Advanced Technologies (ICISAT) pp. 1 - 5 |
---|---|
Main Authors: | , , , , , |
Format: | Conference Proceeding |
Language: | English |
Published: | IEEE, 27-12-2021 |
Summary: | Text-to-line segmentation is a crucial phase in character recognition systems, since segmentation errors affect recognition accuracy. In this work we present a novel and simple method for segmenting Arabic handwritten text images into text lines. After converting the grayscale images to binary ones, the proposed method combines three approaches, based on the horizontal projection profile (HPP), on connected components (CC), and on the skeleton. First, we apply the smoothed horizontal projection profile to detect the approximate beginning and end of each line. Then, we identify the connected components in each line by computing their centroids, and cluster them to form individual text lines. Finally, where characters touch vertically, we use the skeleton to separate them by calculating its intersection point. The experiments are performed on 100 text images from the KHATT database. The approach is evaluated using the MatchScore criterion. The obtained results demonstrate the efficiency of our method. |
---|---|
DOI: | 10.1109/ICISAT54145.2021.9678458 |
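
The first two stages described in the summary (smoothed HPP, then grouping components by centroid) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the moving-average smoothing, the `window` and `ratio` parameters, and the nearest-band assignment rule are all assumptions made for illustration, and the skeleton-based separation of touching characters is not shown. The input is assumed to be a binary NumPy array in which ink pixels are 1.

```python
import numpy as np

def smoothed_hpp(binary, window=3):
    """Count ink pixels per row, then smooth with a moving average (assumed smoother)."""
    profile = binary.sum(axis=1).astype(float)
    kernel = np.ones(window) / window
    return np.convolve(profile, kernel, mode="same")

def line_bands(profile, ratio=0.5):
    """Return (start, end) row ranges, end exclusive, where the profile
    exceeds ratio * max(profile). The threshold rule is an assumption."""
    active = profile > ratio * profile.max()
    padded = np.concatenate(([False], active, [False]))
    # Indices where the mask flips: even entries are starts, odd entries are ends.
    changes = np.flatnonzero(padded[1:] != padded[:-1])
    return [(int(s), int(e)) for s, e in zip(changes[0::2], changes[1::2])]

def assign_to_line(centroid_row, bands):
    """Index of the band whose vertical centre is closest to a component centroid."""
    centres = [(s + e) / 2 for s, e in bands]
    return int(np.argmin([abs(centroid_row - c) for c in centres]))

# Synthetic page: two horizontal bars of ink standing in for two text lines.
page = np.zeros((20, 10), dtype=np.uint8)
page[3:6, :] = 1
page[12:16, :] = 1

bands = line_bands(smoothed_hpp(page))
first = assign_to_line(4.0, bands)    # a component centred in the upper bar
second = assign_to_line(13.5, bands)  # a component centred in the lower bar
```

On real handwriting the lines are skewed and overlapping, so a fixed threshold on the profile gives only approximate bands, which is why the summary follows this step with centroid-based clustering and a skeleton-based split for touching characters.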