Samuel Thibault f99ef13c88 Add CUDA kernel submission pipelining, to overlap costs and allow concurrent il y a 11 ans
..
gpu_concurrency.c f99ef13c88 Add CUDA kernel submission pipelining, to overlap costs and allow concurrent il y a 11 ans
long_kernel.cu 092f322b1c Add CUDA concurrent kernel execution support through the STARPU_NWORKER_PER_CUDA environment variable. il y a 11 ans
overlap.c 5fdde967b0 tests: partly revert #11510 and turn all cpu implementations codelets not static so that MIC code is able to successfully call dlsym on the function il y a 12 ans