Ask Your Question

Revision history [back]

How to pre-process images for OCR ?

I am trying to do an OCR on multiple images, using Tesseract, but i am facing a problem where i can't figure out a way to apply on all images even in different contrast and brightness and shades.

As of now, this is my pipeline of preprocessing an image :

 - FastNLMeansDenoisingClored.
 - Dilation
 - MedianBlur
 - Grayscale
 - AdaptiveThresholding

But for each image, i must manually tune params for each step.

Is there anything that can adjust these params automatically, or is there any way of applying machine learning ?