Samuel Thibault f99ef13c88 Add CUDA kernel submission pipelining, to overlap costs and allow concurrent 11 years ago
..
gpu_concurrency.c f99ef13c88 Add CUDA kernel submission pipelining, to overlap costs and allow concurrent 11 years ago
long_kernel.cu 092f322b1c Add CUDA concurrent kernel execution support through the STARPU_NWORKER_PER_CUDA environment variable. 11 years ago
overlap.c bc3fd88dfd replace 0 with macro part 4 12 years ago