1 | initial version |
As for ARM NEON acceleration, please refer to the answer given in open-source NEON optimizations.
To summarize, currently only a few functions have been accelerated for ARM NEON. Volunteering efforts and Github pull requests are welcome.
The locations of the source files are
Currently, these accelerations are available:
Source code for NVIDIA CUDA accelerated algorithms are in a separate directory:
Source code for OpenCL accelerated algorithms are in a separate directory:
To check for preprocessor directives for each type of acceleration:
cv::GpuMat
matrix type, which is capable of copying data between CPU and GPU memory.checkHardwareSupport(CV_CPU_SSE2)
. If an instruction set higher than SSE2 is used (such as SSE3, SSSE3, SSE4.1, etc), a check must be performed on each of these, because the presence of higher instruction set may not imply the presence of all lower instruction sets.