You can compile the CUDA code for a single compute capability only. This reduces the size of the output library and will probably shorten load time as well. Set the following CMake variables:

CUDA_ARCH_BIN=2.1(2.0) # binary code for compute capability 2.1, built with the 2.0 instruction set
CUDA_ARCH_PTX=""       # empty string: do not generate PTX code
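For reference, the two variables can be passed on the CMake command line when configuring the build. This is only a sketch under a few assumptions: it is run from an empty build directory, `../opencv` is a placeholder for wherever your source tree lives, and `WITH_CUDA=ON` is added on the assumption that CUDA support is not already enabled in your configuration. The value 2.1 is the one from the answer above; use the compute capability of your own GPU.

```shell
# Run from an empty build directory; ../opencv is a placeholder path (assumption).
# The quotes around 2.1(2.0) keep the shell from interpreting the parentheses.
cmake \
  -D WITH_CUDA=ON \
  -D CUDA_ARCH_BIN="2.1(2.0)" \
  -D CUDA_ARCH_PTX="" \
  ../opencv
```

With `CUDA_ARCH_PTX` empty, no PTX is embedded, so the resulting library most likely cannot be JIT-compiled for a GPU with a different compute capability. That is the trade-off for the smaller binary.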