Number plate recognition using tessract

numberplateocr

asked 2018-09-27 02:42:26 -0600

aayushkt
1 ●1 ●1

updated 2018-09-29 00:01:34 -0600

I am planning to perform ocr on Indian number plates.I used tessract 4.0 beta which uses LSTM engine for ocr. Although recognized characters are not coming out to be correct. I used cv2.Laplacian() while picking up images without blur and performed noise reduction using cv2.fastNlMeansDenoisingColored() on the image. For preprocessing I performed following steps

1)Sharpened image

2)performed otsu thresholding

3)Eliminated smaller noises/contours

4)Then inverted image

5)OCR using tesseract 4.0 --oem 1(Which uses lstm as detectio module )

Test images look like these.

Test image

Otsu thresholded image

After removing smaller contours

Detected output: OL 1CT 5079 (Which seems ok)

Can you please suggest any other preprocessing required to improve image(Reduce noise)?

Also is there a way to restrict special characters in tesseract?(Could not find it in latest version)

(ps. I am using python)

Thanks in advance

edit retag flag offensive close merge delete

Comments

You can perform dilation/erosion operation to highlight text. If you mean the Tesseract library you can put the special characters in a "black list" to avoid detecting them (I've used the JS version of Tesseract, I don't know what language are you using)

m93c ( 2018-09-27 09:21:54 -0600 )edit

Did you take a look of openalpr. They use Tesseract as OCR backend.

yancey ( 2018-09-27 10:22:10 -0600 )edit

add a comment

Number plate recognition using tessract

Comments

1 answer

Links

Question Tools

Stats

Number plate recognition using tessract edit

Comments

1 answer

Links

Question Tools

Stats

Number plate recognition using tessract