First CUDA Call Takes Over 45 Seconds
All, the title describes my problem. I know the that the first CUDA call initializes a ton of CUDA contexts, but on my CARMA Devboard this takes over 45 second. The first call to a cv::gpu::Mat
which actually constructs the memory blocks for a very long time.
- Is this normal?
- Is there a way to speed it up? Is this caused by JIT compiling cuda object files?
Thank you