Ocr Machine Learning Save

OCR Machine Learning in python

Project README

Optical Character Recogniser

Description

This project is meant to demonstrate machine learning algorithms and techniques to implement an OCR with high accuracy by making use of learning techniques and feature reduction algorithms to make it more efficient.

Scenario

Given some PDF pages to test and bounding boxes for letters, the system takes in the data in the training mode and computes what are the features that are more relevant to identifying characters and stores these features to then use on test pages.

Features

Implemented in python
K-Nearest neighbour classifier
PCA (principal components) for feature reduction
Spelling correction in the basic error detection function
Forward sequential search

To run

Training stage: python code/train.py
Evaluation stage: python code/evaluate.py dev

Results

97.4% Accuracy

96.4% Accuracy

86.2% Accuracy

60.4% Accuracy

60.9% Accuracy

50.7% Accuracy

Open Source Agenda is not affiliated with "Ocr Machine Learning" Project. README Source: johnhany97/ocr-machine-learning

Stars

Open Issues

Last Commit

1 year ago

Repository

johnhany97/ocr-machine-learning

Open Source Agenda Badge

<a href="https://www.opensourceagenda.com/projects/ocr-machine-learning"><img src="https://www.opensourceagenda.com/projects/ocr-machine-learning/reviews/badge.svg" alt="Open Source Agenda"></a>

Submit Review Review Your Favorite Project

Submit Resource Articles, Courses, Videos

Submit Article Submit a post to our blog

From the blog

Dec 11, 2022

How to Choose Which Programming Language to Learn First?

From the blog

Dec 11, 2022