CRAFT Powered Line-Segmentation

Overview

Robustly segmenting the text lines from a wide variety of Page Layouts (eg: newspapers, books, comics, receipts, manuscripts, stone inscriptions, coffee table books etc) is a hard problem. On the other hand, segmenting text lines from fully standardized page layouts has been solved (eg: Form-4A, Government of XYZ).

Our method builds on the CRAFT model and finds a middle ground - it enables us to segment text lines from documents with variations (curvy lines, handwritten lines, handwriting style, page size, calligraphy, ornamentation, page texture, and page color), as long as they meet a layout based selection criteria. We first bin the target documents based on their layouts into different categories, and then write specific text line segmentation methods for each category.

We learn that it is easier segment text lines using projection-profile based approches when the raw images are first passed through CRAFT to convert them into heatmaps. In other words, it is easier to work with CRAFT outputs(heatmaps) as compared to raw images.

Given the lack of robustness in Neural Networks, end-to-end fully generalized layout analysis for text line segmentation is not currently feasible.

Most methods use some combination of AI (matrix multiplication followed by non-linearities) and algorithmic reasoning with thresholds (if/else statement) to segement text lines.

We believe that CRAFT should be the AI part of the solution - whose job is to robustly convert any characters present on a page to heat maps - with the presence of a character being hot.
We can then work with these heatmaps and flexibly tweak the algorithmic logic based on what the selection criteria of the target documents is (eg: single column, two column, news paper, right to left script).

Below we find the selection criteria for the manuscripts which the code in this repository will robustly handle:

we restrict ourselves to working with non-pictorial, single-column text manuscripts
we require each line of the manuscript to run from left all the way to the right, and not stop somewhere in between.

It is feasible to automatically detect pages which do not meet this selection criteria by doing anomaly detection on the heatmaps.

Getting Started

Clone this repo: git clone https://github.com/flame-cai/line-segmentation.git
Create conda enviroment conda env create --name envname --file=requirements.yml
Create a folder input_images/YOUR_MANUSCRIPT_NAME
Set the craft_fork_v5.ipynb kernel to the conda environment envname
Update the manuscript name in craft_fork_v5.ipynb with YOUR_MANUSCRIPT_NAME
Run all cells of craft_fork_v5.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
input_images/sample_manuscript		input_images/sample_manuscript
output_images/sample_manuscript		output_images/sample_manuscript
pretrained_craft		pretrained_craft
README.md		README.md
craft_fork_v5.ipynb		craft_fork_v5.ipynb
requirements.yml		requirements.yml
sample_img.png		sample_img.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CRAFT Powered Line-Segmentation

Overview

Getting Started

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CRAFT Powered Line-Segmentation

Overview

Getting Started

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages