Ask Your Question

Logistic Regression on MNIST dataset

asked 2016-05-17 11:32:04 -0600

lino gravatar image

In this post you can find a very good tutorial on how to apply SVM classifier to MNIST dataset. I was wondering if I could use logistic regression instead of SVM classifier. So I searhed for Logistic regression in openCV, And I found that the syntax for both classifiers are almost identical. So I guessed that I could just comment out these parts:

    cv::Ptr<cv::ml::SVM> svm = cv::ml::SVM::create();
    svm->setKernel(cv::ml::SVM::POLY);//LINEAR, RBF, SIGMOID, POLY 
    svm->setTermCriteria(cv::TermCriteria(cv::TermCriteria::MAX_ITER, 100, 1e-6));
    svm->train( trainingMat , cv::ml::ROW_SAMPLE , labelsMat );

and replace it with:

    cv::Ptr<cv::ml::LogisticRegression> lr1 = cv::ml::LogisticRegression::create();
    lr1->train( trainingMat, cv::ml::ROW_SAMPLE, labelsMat);

But first I got this error: OpenCV Error: Bad argument(data and labels must be a floating point matrix)

Then I changed

cv::Mat labelsMat(labels.size(), 1, CV_32S, labelsArray);


cv::Mat labelsMat(labels.size(), 1, CV_32F, labelsArray);

And now I get this error: OpenCV Error: bad argument(data should have atleast two classes)

I have 10 classes (0,1,...,9) but I don't know why I get this error. My codes are almost identical with the ones in the mentioned tutorial.

edit retag flag offensive close merge delete

2 answers

Sort by ยป oldest newest most voted

answered 2016-05-17 12:08:38 -0600

berak gravatar image

updated 2016-05-17 12:36:49 -0600

what is your labelsArray ? (if it is an int[] or similar, you cannot simpy change the type flag)

i think, you need 2 steps:

cv::Mat labelsMat(labels.size(), 1, CV_32S, labelsArray); // assuming, labelsArray is int[]
labelsMat.convertTo(labelsMat, CV_32F); // proper float Mat now.
edit flag offensive delete link more

answered 2016-05-17 12:14:40 -0600

I think that the data should be categorical (also refered as one-hot-encoding) , ie for 2 classes: [1,0] instead of [0], then for mnist, it should be: [1,0,0,0,0,0,0,0,0,0] for class 0, [0,1,0,0,0,0,0,0,0,0] for class 1, etc.

You could find a small description here.

There is plenty of Python methods to do that, but none in OpenCV as far as I know it... But if you implement it, feel free to add a pull request for it (has well as a doc update in another pull request if that solve your issue!)

edit flag offensive delete link more


let me add, that this is true for ANN_MLP (but not for other opencv ml classes)

berak gravatar imageberak ( 2016-05-18 11:52:01 -0600 )edit

Question Tools

1 follower


Asked: 2016-05-17 11:32:04 -0600

Seen: 494 times

Last updated: May 17 '16