Poor OpenCL performance
Hi, I am trying to run the detectMultiScale function on the GPU using the OpenCL module. It is supposed to run faster, but it does not. In fact, it is 3-4 times slower than the CPU implementation. I have tested it on both an Intel HD Graphics 4000 and an NVIDIA GT 650M, and I got the same result on both. Has anyone run into the same problem, and is there a solution?
OpenCV version: 2.4.7.2
It would be nice to know which CPU you are comparing those GPUs against. The GPUs you mention aren't really powerful ones.