Cédric Augonnet
|
5556836c9e
- Use libnuma if possible
|
15 jaren geleden |
Cédric Augonnet
|
f9ecd9b4a7
Test if libnuma is available at configure time.
|
15 jaren geleden |
Cédric Augonnet
|
b3caf41960
Get rid of mpi/examples/Makefile.am to avoid a dependency issue.
|
15 jaren geleden |
Samuel Thibault
|
e9acd82e11
double-buffer 12 and 21 contributions
|
15 jaren geleden |
Cédric Augonnet
|
7cec004adf
Do not wait for the termination of TAG*_SAVE when the data was processed localy
|
15 jaren geleden |
Cédric Augonnet
|
722ba734ee
Allocate pinned memory in an existing context rather than from the application
|
15 jaren geleden |
Cédric Augonnet
|
13dd236182
Keep this function public
|
15 jaren geleden |
Nathalie Furmento
|
3c7a6bade8
Rename cuda functions
|
15 jaren geleden |
Samuel Thibault
|
fd11fd4590
TODO: use hwloc for binding for cuda perfs
|
15 jaren geleden |
Cédric Augonnet
|
f838d1aae0
Use an array of temporary buffers for "task 11" instead of a single buffer.
|
15 jaren geleden |
Cédric Augonnet
|
1d37506834
by default, disable debugging messages
|
15 jaren geleden |
Samuel Thibault
|
a378191763
debugging message according to C convention
|
15 jaren geleden |
Samuel Thibault
|
d22f7b3aef
fix tag dependency, seems to work better now
|
15 jaren geleden |
Samuel Thibault
|
34e12b640f
more readable debugging
|
15 jaren geleden |
Samuel Thibault
|
2c2170c9c8
initialize thread support, warn if no thread support is available
|
15 jaren geleden |
Samuel Thibault
|
fd84a2abd0
cosmetic changes
|
15 jaren geleden |
Samuel Thibault
|
31ec191f32
fix parameter name coherencey
|
15 jaren geleden |
Samuel Thibault
|
c5cc01e6d5
Fix MPI scatter
|
15 jaren geleden |
Cédric Augonnet
|
417c14b7f0
use the good parameters for CUDA calls
|
15 jaren geleden |
Samuel Thibault
|
c05e4565d4
only the owner of the block should send it to node 0
|
15 jaren geleden |
Cédric Augonnet
|
578c319b4a
Fix bug and add error checking
|
15 jaren geleden |
Cédric Augonnet
|
4408a54de9
add an experiment to measure the latency between 2 GPUs
|
15 jaren geleden |
Cédric Augonnet
|
c1273c8f02
- add a (pretty naive) test to measure the latency between 2 GPUs
|
15 jaren geleden |
Cédric Augonnet
|
516411357c
forgot to commit the public headers in the previous commit
|
15 jaren geleden |
Cédric Augonnet
|
3ea8462620
- Use events instead of streams to check whether a data transfer is terminated
|
15 jaren geleden |
Cédric Augonnet
|
b308e85842
The user events are typically not executed by a worker, so in case the event is
|
15 jaren geleden |
Cédric Augonnet
|
3946a1e554
Forgot to move the cuda kernel associated to the sync_and_notify_data test.
|
15 jaren geleden |
Cédric Augonnet
|
c7d8e962c0
The progression hook is a mechanism which should be use in very specific cases.
|
15 jaren geleden |
Cédric Augonnet
|
5a38bde976
tag-wait-api is not a microbench either
|
15 jaren geleden |
Cédric Augonnet
|
59a0283279
move the sync_and_notify_data example in a more appropriate directory too
|
15 jaren geleden |