You can compile the CUDA code for a single compute capability only. This reduces the size of the library and probably the load time as well. Set the following CMake variables:
CUDA_ARCH_BIN=2.1(2.0) # build binary code for compute capability 2.1 using the 2.0 instruction set
CUDA_ARCH_PTX="" # empty string, do not generate PTX code
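
As a rough sketch, the same variables could be passed on the command line when configuring the build. This assumes an out-of-source build directory inside the source tree (hence the `..`), which is not stated above; adjust the path and the capability values to your setup.

```shell
# Assumption: run from a build directory one level below the source tree.
# The quotes keep the shell from interpreting the parentheses.
cmake -D CUDA_ARCH_BIN="2.1(2.0)" -D CUDA_ARCH_PTX="" ..
```

Keep in mind that with no PTX code there is nothing for the driver to JIT-compile, so the resulting library will most likely work only on GPUs that can run the binary code you selected.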