AdverseBiNet

Improving Document Binarization via Adversarial Noise-Texture Augmentation (ICIP 2019)

This repository contains the full source code and instructions for using it on the datasets described in the paper. The paper revisits the binarization problem by introducing an adversarial learning approach.

The most significant contribution of our framework is that, unlike other deep learning-based methods [Ronneberger et al., Vo et al., Konwer et al.], it does not require any paired data. To our knowledge, this is the first such approach in the Document Image Analysis community.

Results

In this paper, we propose a two-stage network that first learns to augment document images using a neural style transfer technique. For this purpose, we construct a Texture Augmentation Network that transfers the texture of a degraded reference document image to a clean binary image.

In this way, the network creates multiple versions of the same textual content with various noisy textures, enlarging the available document binarization datasets. Finally, the newly generated images are passed through a Binarization Network to recover the clean version.
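
The data flow of the two stages can be sketched in a few lines of NumPy. This is only a schematic under loud assumptions: `augment_texture` and `binarize` are hypothetical stand-ins (a simple blend and a fixed threshold), not the learned Texture Augmentation Network or Binarization Network from the paper, and the function names, the `strength` and `threshold` parameters, and the synthetic images are all invented for illustration.

```python
import numpy as np


def augment_texture(clean_binary, degraded_reference, strength=0.5):
    """Stand-in for the Texture Augmentation Network (NOT the learned model).

    Blends the reference texture into the background of the clean image,
    so that text pixels (value 0) stay dark.
    """
    clean = clean_binary.astype(np.float32)          # 1 = background, 0 = text
    texture = degraded_reference.astype(np.float32)  # values in [0, 1]
    return clean * ((1.0 - strength) + strength * texture)


def binarize(noisy, threshold=0.25):
    """Stand-in for the Binarization Network: a fixed global threshold."""
    return (noisy > threshold).astype(np.uint8)


def build_augmented_pairs(clean_binary, references):
    """Pair each synthetic noisy image with the same clean ground truth."""
    return [(augment_texture(clean_binary, ref), clean_binary)
            for ref in references]


# Synthetic inputs: one clean page and three degraded reference textures.
rng = np.random.RandomState(0)
clean = np.ones((256, 256), dtype=np.uint8)
clean[100:110, 40:200] = 0  # a dark stroke standing in for text
references = [rng.uniform(0.6, 1.0, size=(256, 256)) for _ in range(3)]

# Stage 1: one clean image yields several noisy versions of the same content.
pairs = build_augmented_pairs(clean, references)

# Stage 2: each noisy version is mapped back to a clean binary image.
recovered = [binarize(noisy) for noisy, _ in pairs]
```

The point of the sketch is the pairing: every generated noisy image shares the same clean ground truth, which is how the augmentation stage enlarges the training data without paired real-world samples.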

Pre-requisites

Python 2.7
TensorFlow
OpenCV
matplotlib

Training

A total of 9 datasets are used in this work: DIBCO 2009, DIBCO 2011, DIBCO 2013, H-DIBCO 2010, [H-DIBCO 2012](http://utopia.duth.gr/~ipratika/HDIBCO2012/benchmark/), H-DIBCO 2014, Bickley diary, PHIDB, and S-MS.
Of these, the DIBCO 2013 dataset is held out for testing. The remaining datasets are used as the training set.
We convert the images from these datasets to patches of size 256 × 256.
Download the VGG weights (the download link is given in the original repository README) and put them in the repository folder.
To train the model, run the 'train.py' file.
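
As a rough illustration of the data preparation steps above, the following sketch holds out DIBCO 2013 and cuts a grayscale image into 256 × 256 patches. The helper `extract_patches` is hypothetical and is not taken from the repository. The handling of image borders (reflect-padding up to a multiple of the patch size) is also an assumption, since the README does not say whether the original code pads, crops, or overlaps patches.

```python
import numpy as np

# Dataset names as listed in this README.
DATASETS = [
    "DIBCO 2009", "DIBCO 2011", "DIBCO 2013", "H-DIBCO 2010", "H-DIBCO 2012",
    "H-DIBCO 2014", "Bickley diary", "PHIDB", "S-MS",
]
TEST_SET = "DIBCO 2013"
TRAIN_SETS = [name for name in DATASETS if name != TEST_SET]


def extract_patches(image, patch_size=256):
    """Cut a 2-D grayscale image into non-overlapping square patches.

    Assumption: borders that do not fill a whole patch are reflect-padded.
    """
    height, width = image.shape
    pad_h = (-height) % patch_size  # rows needed to reach a multiple
    pad_w = (-width) % patch_size   # columns needed to reach a multiple
    padded = np.pad(image, ((0, pad_h), (0, pad_w)), mode="reflect")

    patches = []
    for top in range(0, padded.shape[0], patch_size):
        for left in range(0, padded.shape[1], patch_size):
            patches.append(padded[top:top + patch_size,
                                  left:left + patch_size])
    return np.stack(patches)


# Example with a synthetic 600 x 900 image in place of a real scan.
page = np.zeros((600, 900), dtype=np.uint8)
patches = extract_patches(page)
```

In practice the images would be loaded with OpenCV (listed in the pre-requisites) before patching, and the same patch grid would be applied to each image and its ground-truth mask so that they stay aligned.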

Citation

If you find this code useful in your research, please consider citing:

@article{bhunia2018improving,
  title={Improving Document Binarization via Adversarial Noise-Texture Augmentation},
  author={Bhunia, Ankan Kumar and Bhunia, Ayan Kumar and Sain, Aneeshan and Roy, Partha Pratim},
  journal={arXiv preprint arXiv:1810.11120},
  year={2018}
}

README source: ankanbhunia/AdverseBiNet
