Ask Your Question

adamdylan's profile - activity

2016-05-22 01:28:32 -0600 received badge  Enthusiast
2016-05-18 03:09:22 -0600 commented question What are some good resources for arabic OCR in the wild dataset?

Hey, I try it , i got some error when running the following command: tesseract photo.jpeg out -l ara (I installed the language package) The error is:

Tesseract Open Source OCR Engine v3.04.00 with Leptonica
Cube ERROR (CubeRecoContext::Load): unable to read cube language model params from /opt/local/share/tessdata/ara.cube.lm
Cube ERROR (CubeRecoContext::Create): unable to init CubeRecoContext object
init_cube_objects(false, &tessdata_manager):Error:Assert failed:in file tessedit.cpp, line 205
2016-05-17 02:46:59 -0600 asked a question What are some good resources for arabic OCR in the wild dataset?

Hello there, I've recently started working on a OCR in the wild algorythm using neural networks. My requirements are as follow: Arabic text, Natural images(not scans etc.)

My goal is detecting weather the image has text or not and then extract the text.

I need some help from you, I need large dataset. If there's any, it would be great, otherwise, I would appreciate some help thinking of reasonable methods to create such dataset by my own.

Thank you very much, A Dylan