История коммитов

Автор SHA1 Сообщение Дата
  Samuel Thibault 7205b70968 Fix computation/communication overlap: transfers always have to use the local_transfer stream, not the local stream лет назад: 14
  Samuel Thibault 408e6de3ce take transfer time into account in heft when the task is scheduled to be executed almost immediately: the transfer will never be overlappable in that case. лет назад: 14
  Cyril Roelandt 7332d96c1d Fix typo. лет назад: 14
  Samuel Thibault f1e95134c6 heft should now be fixed for multiple implementations лет назад: 14
  Samuel Thibault 8360d50c2b Fix expected start of tasks: after task execution, the CPU time is already elapsed. лет назад: 14
  Cyril Roelandt 0a185ee4d8 Talking about the scheduler in the "multiple implementations" paragraph. лет назад: 14
  Cyril Roelandt 2a921d3c5e Bugfix : do not assign a particular codelet implementation to a job before selecting an appropriate worker. лет назад: 14
  Samuel Thibault af9be1babd fix comment лет назад: 14
  Nathalie Furmento 288af06552 remove un-needed copyright лет назад: 14
  Nathalie Furmento 2177148f7e add missing licence information лет назад: 14
  Cyril Roelandt 16f070b7cd Updated the documentation with data related to mutliple implementations. лет назад: 14
  Samuel Thibault b58ea76005 Spurious variable лет назад: 14
  Samuel Thibault eb5822de45 Assert rather than do bogus things for non-cuda/opencl case. лет назад: 14
  Samuel Thibault 6af576989c Add missing prototypes лет назад: 14
  Samuel Thibault 8e7b92c6e8 silence warning лет назад: 14
  Samuel Thibault 6b70d00b21 silent warning лет назад: 14
  Samuel Thibault 846df70675 Fix build without fxt лет назад: 14
  Samuel Thibault e9e66b6c78 Mention DriverCopyAsync in the trace. лет назад: 14
  Samuel Thibault 3926b7c7be Fix hack which was wrong with peers: simply record both source and destination, it actually makes the statistics code simpler лет назад: 14
  Samuel Thibault 209ebef2ac Trace async transfers differently to watch for cuda spurious blocking лет назад: 14
  Cyril Roelandt 2787e1fa46 Added an SSE codelet to the vector scaling example. лет назад: 14
  Ludovic Courtès 886913ffe3 gcc: Support interleaved declarations & definitions of task implementations. лет назад: 14
  Ludovic Courtès 68b2f37506 gcc: Simplify `task' attribute handling. лет назад: 14
  Samuel Thibault ed747c4790 typo лет назад: 14
  Samuel Thibault 77007ead6a revert r4190, it's completely bogus, we need to add -L for the AC_HAVE_LIBRARY. AC_HAVE_LIBRARY actually does not add -lcudart to LDFLAGS because it has a non-empty action. Let's thus add CUDA_LDFLAGS before the cublas test лет назад: 14
  Samuel Thibault 89f3df6586 keep -lcudart when checking for libcublas. This is needed for linking when that library path is not in LD_LIBRARY_PATH лет назад: 14
  Olivier Aumage 6c4d62d0b0 - code disabled for years лет назад: 14
  Olivier Aumage 4a747573b4 - add missing ifdefs лет назад: 14
  Samuel Thibault 3d15b9bbb1 drop debugging лет назад: 14
  Nathalie Furmento fd06148167 configure.ac: remove trailing whitespaces лет назад: 14