How opencv_gpu multiplication works for big images
Hi, I would like to know if someone can explain me how OpenCV computes multiplication for very big image (1920*1080). I have to create a specific image processing but my kernel is very slow. So I have created a kernel for just a multiplication. It takes 45ms against 0ms (1ms) with OpenCV_gpu multiplication. How to optimize the number of thread and wrap for my application (for my real application I use 7 ints and 9 floats for my computation)?