Samuel Thibault
|
922ff28a1e
Make datawizard_progress use worker 0 of worker sets as reference for determining which memory nodes to make progress
|
8 years ago |
Samuel Thibault
|
384397546c
Label the CUDA driver just 'CUDA' one using just one thread
|
8 years ago |
Samuel Thibault
|
283d125846
fix build without CUDA
|
8 years ago |
Samuel Thibault
|
7cdeadd447
Add STARPU_CUDA_THREAD_PER_DEV environment variable to support driving all
|
8 years ago |
Samuel Thibault
|
2ea92eb0fb
Always set the current CUDA devid when allocating, it is not very costly and makes things more flexible
|
8 years ago |
Samuel Thibault
|
43a3c904ed
directly lookup destination node instead of relying on the node of the current worker
|
8 years ago |
Samuel Thibault
|
f37e65a6f2
Drop the memory_node thread key, we do not really need it since we can get it from the worker key, and it makes switching more costly
|
8 years ago |
Samuel Thibault
|
c71f05d00b
fix warning
|
8 years ago |
Samuel Thibault
|
62fb4d67b8
comment
|
8 years ago |
Nathalie Furmento
|
422cad39ec
New function starpu_worker_display_names to display the names of all the workers of a specified type.
|
8 years ago |
Nathalie Furmento
|
ca777699be
tests/main/driver_api/init_run_deinit.c: fix number of workers
|
8 years ago |
Samuel Thibault
|
a2d3350e43
Fix file leak
|
8 years ago |
Nathalie Furmento
|
8e8b686bb5
doc: fix text
|
8 years ago |
Samuel Thibault
|
5a5eb593d1
Always reset the local worker key to worker0 for datawizard to get the proper wait queue when using multiple workers in the same driver thread
|
8 years ago |
Samuel Thibault
|
a41080d2a6
Fix setting the proper CUDA stream when using several streams from the same driver thread
|
8 years ago |
Samuel Thibault
|
52d97c6ae9
Fix cublas initialization/shutdown when using one thread per stream with multistream
|
8 years ago |
Samuel Thibault
|
27cfafb119
Move setting worker_set->nworkers to topology, where it makes more sense
|
8 years ago |
Samuel Thibault
|
6dbd5add56
Fix indent
|
8 years ago |
Samuel Thibault
|
d3d32d6451
Fix priorities for schedulers which don't have unbound priority values
|
8 years ago |
Samuel Thibault
|
ace9f6b23e
Use better priorities in cholesky algorithm
|
8 years ago |
Samuel Thibault
|
5278ab7d1b
typo
|
8 years ago |
Samuel Thibault
|
14492fb09a
typo
|
8 years ago |
Nathalie Furmento
|
461ca6a408
mpi/src/starpu_mpi_cache_stats.c: remove internal array which is not needed
|
8 years ago |
Nathalie Furmento
|
d8961fb495
mpi: move cache data in the starpu_data_handle_t
|
8 years ago |
Samuel Thibault
|
6a9541027d
comments
|
8 years ago |
Samuel Thibault
|
5e61b183a5
Fix reporting CUDA workers when using several workers per CUDA
|
8 years ago |
Samuel Thibault
|
8a713e9521
Fix missing include, thanks Léo
|
8 years ago |
Nathalie Furmento
|
4ee7c64768
mpi/tests/insert_task_node_choice.c: use long long to make sure test is valid on a 32-bit architecture
|
8 years ago |
Samuel Thibault
|
cd847ea3d3
Fix build with upcoming simgrid release
|
8 years ago |
Samuel Thibault
|
e71124580f
shut up warning
|
8 years ago |