1 | initial version |
Even with TBB you'll not see a sufficient workload of CPU cores by OpenCV traincascade application. Almost all the time only one core'll work. It's because only small part of the training code is parallelized by TBB: finding the best split of tree node and precomputing some part of feature values before the training a new stage. But significant time the traincascade is looking for negative samples that was recognized as positive (face) samples by all trained stages (trained part of a cascade) to train next new stage. This pick of samples is not parallelized.