AdverseBiNet Save

Improving Document Binarization via Adversarial Noise-Texture Augmentation (ICIP 2019)

Project README

Improving Document Binarization via Adversarial Noise-Texture Augmentation [paper] [ICIP 2019]

This repository contains the full source codes and instructions to use the codes on the datasets described in the paper. The paper re-visits the binarization problem by introducingan adversarial learning approach.

The most significant contribution of our framework is that it does not require any paired data unlike other Deep Learning-based methods [ronneberger et al., Vo et al.,Konwer et al.].Such a novel approach has never been implemented earlier thus making it the very first of its kind in Document Image Analysis community.

Results

In this paper, we propose a two-stage network that first learns to augment the document images by using neural style transfer technique. For this purpose, we construct a Texture Augmentation Network that transfers the texture element of a degraded reference document image to a clean binary image.

Achitecture In this way, the network creates multiple versions of the same textual content with various noisy textures, enlarging the available document binarization datasets. At last, the newly generated images are passed through a Binarization network to get back the clean version.

Pre-requisites

  • python 2.7
  • Tensorflow
  • OpenCV
  • matplotlib

Training

Citation

If you find this code useful in your research, please consider citing:

@article{bhunia2018improving,
  title={Improving Document Binarization via Adversarial Noise-Texture Augmentation},
  author={Bhunia, Ankan Kumar and Bhunia, Ayan Kumar and Sain, Aneeshan and Roy, Partha Pratim},
  journal={arXiv preprint arXiv:1810.11120},
  year={2018}
}
Open Source Agenda is not affiliated with "AdverseBiNet" Project. README Source: ankanbhunia/AdverseBiNet

Open Source Agenda Badge

Open Source Agenda Rating