История коммитов

Автор SHA1 Сообщение Дата
  Cyril Roelandt 2787e1fa46 Added an SSE codelet to the vector scaling example. лет назад: 14
  Ludovic Courtès 886913ffe3 gcc: Support interleaved declarations & definitions of task implementations. лет назад: 14
  Ludovic Courtès 68b2f37506 gcc: Simplify `task' attribute handling. лет назад: 14
  Samuel Thibault ed747c4790 typo лет назад: 14
  Samuel Thibault 77007ead6a revert r4190, it's completely bogus, we need to add -L for the AC_HAVE_LIBRARY. AC_HAVE_LIBRARY actually does not add -lcudart to LDFLAGS because it has a non-empty action. Let's thus add CUDA_LDFLAGS before the cublas test лет назад: 14
  Samuel Thibault 89f3df6586 keep -lcudart when checking for libcublas. This is needed for linking when that library path is not in LD_LIBRARY_PATH лет назад: 14
  Olivier Aumage 6c4d62d0b0 - code disabled for years лет назад: 14
  Olivier Aumage 4a747573b4 - add missing ifdefs лет назад: 14
  Samuel Thibault 3d15b9bbb1 drop debugging лет назад: 14
  Nathalie Furmento fd06148167 configure.ac: remove trailing whitespaces лет назад: 14
  Nathalie Furmento 6db2c372ca merge branch gpumem_prefetch лет назад: 14
  Samuel Thibault 61eda0dcef Fix asynchronicity of the wt mechanism by keeping a read reference on the data лет назад: 14
  Samuel Thibault 4a0ed0dc25 Fix asynchronous prefetch: we also need to notify data dependencies in that case. Allocate the wrapper structure dynamically to permit asynchronous termination. Permit prefetch in callbacks and codelets лет назад: 14
  Samuel Thibault b9f24287b1 Fix replicate reference counting лет назад: 14
  Samuel Thibault d7c53078b1 avoid crashing on no model name лет назад: 14
  Olivier Aumage 1c6a3ad8b0 - fix initialization лет назад: 14
  Samuel Thibault 6f253ca156 do not make _starpu_prefetch_data_on_node_with_mode wait for the request when async is true лет назад: 14
  Samuel Thibault 73123b92d6 Count the number of workers in memory nodes, to avoid scheduling transfers from memory nodes without a worker (e.g. 0 cpus). Fixes at least the wt mask when no CPU is enabled. лет назад: 14
  Samuel Thibault 5c4a4e9c84 Add gdb functions to print data requests лет назад: 14
  Samuel Thibault 73ac11ecf8 provide file and line of errors лет назад: 14
  Samuel Thibault 0e3edeee5c provide file and line of errors лет назад: 14
  Samuel Thibault 96c23945a0 revert 4155, there is another exp_end actually... лет назад: 14
  Samuel Thibault 6847e9ac82 exp_end is not actually the expected end, only estimations with the new task лет назад: 14
  Olivier Aumage 66e35b6222 - add support for complex number cases in the LU example лет назад: 14
  Cyril Roelandt d7f7328c29 src/sched_policies/detect_combined_workers.c: replacing find_combinations_without_hwloc by find_and_assign_combinations_without_hwloc. лет назад: 14
  Nicolas Collin 0e82138ef3 Fixed the functions used for combining cpus in several combined workers, using hwloc data. лет назад: 14
  Nathalie Furmento f641bbcf02 mp/examples/cholesky: code cleaning лет назад: 14
  Cyril Roelandt 2b60aedf3b Replacing _starpu_topology_get_nhwcpu(config) by config->topology.ncpus лет назад: 14
  Cyril Roelandt b29709b352 src/core/perfmodel/perfmodel_bus.c:replacing cudaGetDeviceCount by _starpu_get_cuda_device_count лет назад: 14
  Nathalie Furmento ce10465a8c gcc-plugin/examples/Makefile.am: only enable cholesky example if a blas library is enabled лет назад: 14