This paper presents a textline detection method for degraded historical documents. Our method follows a\nconventional two-step procedure that the binarization is first performed and then the textlines are extracted from the\nbinary image. In order to address the challenges in historical documents such as document degradation, structure\nnoise, and skews, we develop new methods for the binarization and textline extraction. First, we improve the\nperformance of binarization by detecting the non-text regions and processing only text regions. We also improve the\ntextline detection method by extracting main textblock and compensating the skew angle and writing style.\nExperimental results show that the proposed method yields the state-of-the-art performance for several datasets.
Loading....