Commit History

Autor SHA1 Mensaxe Data
  Nathalie Furmento f413aaa1cd src/util/starpu_cublas.c: add missing #ifdef %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault b613d58104 Assume that we have at least cuda 3.1, so that we can make our BLAS examples always use streams %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 51a7e8c979 Use cudaMemcpyAsync instead of cudaMemcpy %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 78ac4a07ca follow-up r7018: add missing declarations %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault d2cd1868e2 provide good examples by always using cudaMemsetAsync, not cudaMemset %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 57e59bf2d9 provide good examples by always using cudaMemcpyAsync, not cudaMemcpy %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault f18180c477 synchronize only the transfer stream %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 3e58d92492 Add 2d shadow filters %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault cb492ad84b Keep documentation coherent with C convention %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 972caa3d96 Fix build without OpenGL headers %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 70864cd9ea there are some optimizations with eager, just not wise scheduling %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 1d7dcdc5b6 fix compilation warning %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault c3610e1434 drop unused factor %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 9803389d55 fix number %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault fd06e119aa more comments %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault cf5341ae88 more details on the shadow at work %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 6bc0d82b7c fix non-cuda/opencl build %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault a88b26f1a7 update changelog %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault ee1e7926a3 Put single-combined-worker in nice order for starpu_machine_display %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 3b540f359f document that starpu_machine_display shows combined workers %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 65688ef4d8 print topology before bandwidth %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 1aeeab7392 mergeinfo for backport %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 844951370b Show topology in starpu_machine_display %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault bb9da9185b Fix output alignment %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 8f8194dfa6 forwardport r6996 from 1.0: fix bus calibration for more than 32 cpus %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault ab9b3fcfbd forwardport r6997 from 1.0: fix bus calibration for more than 32 cpus %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault f33693b5d4 Add actual bus measurements in starpu_machine_display output %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault f30a1170aa Replace starpu_force_bus_sampling with bus_calibrate configuration option, as calibration should done carefully during starpu initialization, and not after initialization %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault 09dab6fd3e Drop spurious nbsp %!s(int64=13) %!d(string=hai) anos
  Samuel Thibault c39f87d9cd clearly separate CUDA and OpenCL %!s(int64=13) %!d(string=hai) anos