Hi, I'm experimenting with the TAPI for OpenCL with OpenCV version 4.1.0. My main approach is to use UMat rather than Mat and see what performance differences exist. So far, it is very disappointing. Pretty much every image processing function I've tried runs slower with UMat than Mat. The only exception I can find is cvtColor() which runs quicker but pretty much everything else is at least twice as slow using the GPU. I've also tried this on different hardware with similar results and on prebuilt and source built OpenCV.
OpenCL is definitely present and running and I can observe the GPU loading when enabled, so I am reasonably certain that my configuration is fine. I'm running Windows 7, MSVC2017, Intel i7-4790 CPU, NVS 510 GPU running OpenCL 1.2 CUDA.
So my question is: Which OpenCV functions actually have a GPU implementation and which types of OpenCV functions are likely to benefit from GPU performance over the CPU?
Ta, Dave