Ask Your Question

Revision history [back]

Training a real usage cascade is not trivial, number of samples and model size are fundamental in creating something meaningful. For example, for an eyes cascade consider:

  • 9,000 positive samples
  • 0.7 B of neg. sub-regions with negatives samples.
  • Features set: 85.550 features allocated.
  • Training time: ~1 days
  • TP: ~ 95.8% of positive training set
  • FN: ~ 04.2% of positive training set
  • FP: ~ 7.51937e-006% of negative training set
  • Training size w=30 h=60 (aspect ratio 1:2)

Full reports:

  1. eyes: http://www.vision-ary.net/2015/11/boost-the-world-eye-detection-lbp-hog-haar-cascade-opencv
  2. faces: http://www.vision-ary.net/2015/03/boost-the-world-face-detection/
  3. cars: http://www.vision-ary.net/2015/06/boost-the-world-car-detection/
  4. pedestrians: http://www.vision-ary.net/2015/03/boost-the-world-pedestrian/