I have a question about preparing the dataset of positive samples for a cascaded classifier that will be used for object detection.
As positive samples, I have been given 3 sets of images:
- a set of colored images in full size (about 1200x600) with a white background and with the object displayed at a different angles in each image
- another set with the same images in grayscale and with a white background, scaled down to the detection window size (60x60)
- another set with the same images in grayscale and with a black background, scaled down to the detection window size (60x60)
My question is that in set 1, should the background really be white? Should it not instead be an environment that the object is likely to be found in in the testing dataset? Or should I have a fourth set where the images are in their natural environments? How does environment figure into the training samples?