How are GPU (CUDA) HOG (Histogram of Oriented Gradients) and SVM classification parallelized?
Can someone please explain to me how OpenCV's GPU (CUDA) HOG and SVM classification work? How are they parallelized compared to the CPU version? I would like to understand the process in terms of CUDA threads and kernels.
Thank you
Of course it is parallelized and optimized; otherwise it would not be in the GPU/CUDA folder. Basically, it parallelizes the block processing used to calculate the histograms. By processing multiple blocks in parallel, the speed increase can be enormous. To me this seems kind of obvious, though, since as far as I can tell there is no way of parallelizing the histogram combining :D
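To make the "multiple blocks in parallel" idea a bit more concrete, here is a minimal CPU-side sketch in plain Python. It is only an analogy and is not taken from OpenCV's hog.cu: the names `block_histogram` and `hog_blocks`, the 4-pixel block size and the 9 bins are all assumptions chosen for illustration. The point it tries to show is that each block's histogram depends only on the input image, so the blocks can be handed to independent workers (on a GPU one could imagine something like one group of CUDA threads per block, but that mapping is a guess here).

```python
import math
from concurrent.futures import ThreadPoolExecutor

BLOCK = 4   # hypothetical block size in pixels (illustrative, not OpenCV's value)
NBINS = 9   # unsigned orientation bins over [0, 180) degrees (assumed)

def block_histogram(img, by, bx):
    """Orientation histogram of one block.

    Reads only the shared input image and returns its own result,
    so calls for different blocks do not depend on each other.
    """
    h, w = len(img), len(img[0])
    hist = [0.0] * NBINS
    for y in range(by * BLOCK, (by + 1) * BLOCK):
        for x in range(bx * BLOCK, (bx + 1) * BLOCK):
            # Central differences, clamped at the image border.
            gx = img[y][min(x + 1, w - 1)] - img[y][max(x - 1, 0)]
            gy = img[min(y + 1, h - 1)][x] - img[max(y - 1, 0)][x]
            mag = math.hypot(gx, gy)
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            # Clamp so that an angle rounding up to 180.0 stays in the last bin.
            b = min(int(angle / (180.0 / NBINS)), NBINS - 1)
            hist[b] += mag  # each pixel votes with its gradient magnitude
    return hist

def hog_blocks(img, workers=4):
    """Compute all block histograms, one independent task per block."""
    rows, cols = len(img) // BLOCK, len(img[0]) // BLOCK
    coords = [(by, bx) for by in range(rows) for bx in range(cols)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        hists = list(pool.map(lambda c: block_histogram(img, *c), coords))
    return dict(zip(coords, hists))
```

Calling `hog_blocks(img)` on a list-of-lists grayscale image returns a dictionary from block coordinates to histograms, and it gives the same result as calling `block_histogram` in a plain loop. Python threads will not actually make this faster; the thread pool is only there to show that no block needs to wait for another one.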
Thank you for the reply, but how is the block processing parallelized? I would like to understand the process in terms of CUDA threads and kernels. I'm looking at the source file hog.cu, but I cannot understand it yet. I also want to know how the SVM classification is parallelized.
I haven't got a clue :) I don't understand CUDA code myself. You might need someone here with that experience...