Using Tesseract via OpenOCR running on Google Compute Engine, I OCR'd your original image and got the following output:

E' ,‘YHwacpMTDCH ; 3?". ‘ V‘L"~m> I shah-r}. I’VMU' i 5: 1“”. A"

I then tried pre-processing it with the Stroke Width Transform, using this Docker image and the following command:

cd /opt/DetectText && ./DetectText 1375472915202212.png out.png 1

This produced the following pre-processed image:

[image: pre-processed output, out.png]

When I re-ran Tesseract on it, I got this output:


which isn't perfect, but is a pretty big improvement.
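
If you want to script the two steps, here is a minimal Python sketch. It assumes the DetectText binary lives in /opt/DetectText as in the command above, and that a local tesseract command-line binary is on the PATH (I actually ran Tesseract through OpenOCR, so that part is a stand-in). The function names are made up for illustration, and the trailing "1" is passed through exactly as in my command.

```python
import subprocess

# Directory from the command above; adjust if your image differs.
DETECT_TEXT_DIR = "/opt/DetectText"


def swt_command(src, dst, mode="1"):
    """Build the DetectText call; `mode` is the trailing argument used above."""
    return [f"{DETECT_TEXT_DIR}/DetectText", src, dst, mode]


def tesseract_command(image):
    """Build a Tesseract CLI call that prints the recognized text to stdout."""
    return ["tesseract", image, "stdout"]


def preprocess_and_ocr(src, dst="out.png"):
    """Run the Stroke Width Transform step, then OCR the resulting image."""
    # Step 1: pre-process; relative file names resolve against DETECT_TEXT_DIR.
    subprocess.run(swt_command(src, dst), cwd=DETECT_TEXT_DIR, check=True)
    # Step 2: OCR the pre-processed image (assumes a local tesseract binary).
    result = subprocess.run(
        tesseract_command(dst),
        cwd=DETECT_TEXT_DIR,
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout
```

Calling preprocess_and_ocr("1375472915202212.png") should then return the text Tesseract finds in out.png, provided both binaries are installed where the sketch expects them.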