Details of dataset used for training face haarcascades
I want to compare the performance of HoG-SVM (dalal-triggs) detector and Viola-Jones on faces. In order to have fair comparison, I would like to use the same training data for training HoG-SVM detector that was used for OpenCV Viola-Jones cascade. I tried to look into the openCV documentation and could not find the details of the training set used for generating haar_cascade_frontalface.xml ? I also looked into the paper [1] but it only says that around 5000 positive images have been used for training but does not specify the name of the dataset. Could anyone provide the details of the dataset used for Opencv frontal face cascades?
[1] Rainer Lienhart, Alexander Kuranov, Vadim Pisarevsky, Empirical Analysis of Detection Cascades of Boosted Classifiers for Rapid Object Detection, Tech Report, MERL 2002.
Could anyone please reply to this question?
I've ran across a few comments that opencv has not divulged the dataset used for training. One tutorial suggests it's from the FERET database (which I wasn't able to track down).
"Probably, the OpenCV developers used the FERET database. It looks that the FERET database became available to download over internet from Jan. 31, 2008(?)." - http://note.sonots.com/SciSoftware/haartraining.html
ZachGarner, thanks for your comment. Ya, I have looked into the tutorial. Author was just guessing that it could be FERET. He did not confirm exactly on which database OpenCV face detector was trained on.