1 | initial version |
What code are you using to test this?
Make sure you perform at least one iteration of each function before entering the timing loop, and do 1000 iterations of each inside the loop. Also, I suggest keeping the Mat.upload and GPUMat.download outside the timing loop, since you want to do as much on the GPU as possible, and so those should only be done once and re-used for everything you can.