Samuel Thibault f99ef13c88 Add CUDA kernel submission pipelining, to overlap costs and allow concurrent 11 years ago
gpu_concurrency.c f99ef13c88 Add CUDA kernel submission pipelining, to overlap costs and allow concurrent 11 years ago
long_kernel.cu 092f322b1c Add CUDA concurrent kernel execution support through the STARPU_NWORKER_PER_CUDA environment variable. 11 years ago
overlap.c bc3fd88dfd replace 0 with macro part 4 12 years ago