VietnameseOCR Save

Vietnamese Optical Character Recognition. It works with Vietnamese and Latin characters as well.

Project README

VietnameseOCR - Vietnamese Optical Character Recognition

Apply Deep Learning ( CNN networks ) to train a model uses for recognizing Vietnamese characters, it works well with Latin characters.

Dataset in big image ( 10.000 samples, 2800 x 2800 pixel)

Requirements

python 3.6.5
tensorflow
PIL

Model Summary

Layer	Shape	Kernel	Stride	Padding
INPUT	[28, 28, 1]
CONV1		[3, 3, 32, 32]	[1, 1]	SAME
POOL1
CONV2		[3, 3, 32, 64]	[1, 1]	SAME
POOL2
CONV3		[3, 3, 64, 128]	[1, 1]	SAME
POOL3
FC1
FC2	[625, 190]

Results

Training...

......
Epoch: 38 cost = 0.312853018
Epoch: 39 cost = 0.298816641
Epoch: 40 cost = 0.293328794

Evaluation
------------------------------
Test Accuracy: 0.974867469544

Training

Prepare dataset for training

git clone https://github.com/miendinh/VietnameseOCR.git
cd VietnameseOCR/data/train/characters
unzip dataset.zip

Let's train.

python train.py

Create you own dataset

Prepare fonts for generating text-image

You could add more fonts

cd VietnameseOCR/data/train/characters
unzip google.zip
unzip win.zip

Create font list, then save it in fonts.list

source ./list.sh

Generate Text Image Dataset

python generate_data.py

Play with pretrained model

All pretrained weights of model is save to file vocr.brain
Let's test with random character in dataset

python predict.py

Further working

Character classification.
Dataset augmentation.
Improve accuracy.
Text location.
Text recognition.
Apply NLP for spell checking.

References

Author mien.hust [at] gmail [dot] com

Open Source Agenda is not affiliated with "VietnameseOCR" Project. README Source: miendinh/VietnameseOCR

Stars

Open Issues

Last Commit

5 years ago

Repository

miendinh/VietnameseOCR

License

Apache-2.0

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/vietnameseocr"><img src="https://www.opensourceagenda.com/projects/vietnameseocr/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022