Hey everyone,
I'm attempting to identify store logos using a haar cascade classifier I trained. It was trained using 60 positive images and ~600 negative images.
My question comes about because the images in which I am attempting to find the textual logo, there also exists other text.
OpenCV is determining this "other" text to in fact be the found trained logo. Sample image sizes were 90x30. Sample image used in detection is 300px wide by whatever length is needed to maintain aspect ratio.
So I'm wondering a few things:
- Is OpenCV distorting the text in my positive samples to the extent that other text is being detected?
- Should I be fine with the number of samples listed above? I followed the tutorial here, though I have heard from other tutorials that 1000's of samples is needed?
- Are my sample image sizes okay? I've read a few things about opencv preferring 25px by 25px ?
- When training the classifier, I limited it to 5 stages to save time. How much more accurate would the classifier be with say 20 stages? Or is the accuracy only relative to the number of positive samples?
Thanks so much for any input you can provide on this. I'm really looking forward to generating this custom haar file and having it work well.
Very much value your comments :)
Thanks