Commit History

Autor SHA1 Mensaxe Data
  Nathalie Furmento 1ae61a95f9 CUDA and OpenCL drivers: when STARPU_CUDA_PIPELINE and STARPU_OPENCL_PIPELINE are set to 0, force synchronous execution of all kernels %!s(int64=10) %!d(string=hai) anos
  Samuel Thibault 7ac2c135ad protect worker status with the scheduler mutex %!s(int64=10) %!d(string=hai) anos
  Samuel Thibault 6a0c92daab Avoid void* arithmetic, icc refuses it on windows... %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 1f1954c43b Add OpenCL kernel submission pipelining, to overlap costs %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault f313079113 Factorize code to prepare for OpenCL pipelining %!s(int64=11) %!d(string=hai) anos
  Nathalie Furmento bafe84486c fix #13183 %!s(int64=11) %!d(string=hai) anos
  Nathalie Furmento 7e743c9457 backport branches/perfmodel@13175: src/drivers: use a more meaningful name than args for the worker parameter %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault b25760aa94 fix warnings %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault f14e68dc2c Separate worker and thread state, to see streamed kernel executions %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 778b4c444a * Add codelet size, footprint and tag id in the paje trace. %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 3903429984 Fix build with simgrid and without opencl %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 7832518f4d port r12742 from 1.1: Fix OpenCL device number detection when MAXOPENCLDEVS is not big enough %!s(int64=11) %!d(string=hai) anos
  Nathalie Furmento 7891f63369 src/drivers/opencl/driver_opencl.c: revert #12566 and #12578, and explain why, the function availability would need to be checked at runtime, we do not want to do that %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault d4aeabce96 Also make sure that the header has the declaration %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault d0eab8ef8c Do not rely on the .h file for functions actually available at link time, do check them explicitly %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 83a38dfa85 clEnqueueMarker is deprecated by clEnqueueMarkerWithWaitList. Be able to use both %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 092f322b1c Add CUDA concurrent kernel execution support through the STARPU_NWORKER_PER_CUDA environment variable. %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 5a9ffce947 backport r12506: fix disabling out-of-order for data transfers %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault bf8aa6338d Let the OpenCL driver progress while the GPU is computing %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 8f26d26ad7 Introduce the STARPU_CUDA_ASYNC and STARPU_OPENCL_ASYNC flags. No optimization is done yet for now. %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 8cc1f6c0db Use separate OpenCL command queues for CPU->GPU, GPU->CPU, and GPU->GPU transfers %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault e4c9c8774a Fix GPU memory size in the simgrid case %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 5ec6f8b8a7 Drop unused mask parameter of _starpu_fetch_task_input and _starpu_push_task_output %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault b1dfd30389 Store GPU size in platform file %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault e9e4a6361a Use the same cuda/opencl exclusion mechanism on simgrid, instead of having a bogus assertion %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 6325a34ab3 Move device id lookup into run-driver functions, to avoid paying the cost at each task execution %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault d448281fc3 New STARPU_DISABLE_KERNELS environment variable to disable actual kernel execution. %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 89ac7e59ac Provide hint for the user about missing func pointer %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault b801a66994 port r 11483 from 1.1: Default to using only 90% of the available GPU memory, to avoid seeing cudaMemset run out of memory.. %!s(int64=11) %!d(string=hai) anos
  Samuel Thibault 4ce5fe9027 port r11357 from 1.1: Fix memleak %!s(int64=11) %!d(string=hai) anos