| .. |
|
gpu_concurrency.c
|
f99ef13c88
Add CUDA kernel submission pipelining, to overlap costs and allow concurrent
|
%!s(int64=11) %!d(string=hai) anos |
|
long_kernel.cu
|
092f322b1c
Add CUDA concurrent kernel execution support through the STARPU_NWORKER_PER_CUDA environment variable.
|
%!s(int64=11) %!d(string=hai) anos |
|
overlap.c
|
5fdde967b0
tests: partly revert #11510 and turn all cpu implementations codelets not static so that MIC code is able to successfully call dlsym on the function
|
%!s(int64=12) %!d(string=hai) anos |