Samuel Thibault f99ef13c88 Add CUDA kernel submission pipelining, to overlap costs and allow concurrent 10 anos atrás
..
gpu_concurrency.c f99ef13c88 Add CUDA kernel submission pipelining, to overlap costs and allow concurrent 10 anos atrás
long_kernel.cu 092f322b1c Add CUDA concurrent kernel execution support through the STARPU_NWORKER_PER_CUDA environment variable. 11 anos atrás
overlap.c 5fdde967b0 tests: partly revert #11510 and turn all cpu implementations codelets not static so that MIC code is able to successfully call dlsym on the function 11 anos atrás