Samuel Thibault
|
fd84a2abd0
cosmetic changes
|
15 years ago |
Samuel Thibault
|
31ec191f32
fix parameter name coherency
|
15 years ago |
Samuel Thibault
|
c5cc01e6d5
Fix MPI scatter
|
15 years ago |
Cédric Augonnet
|
417c14b7f0
use the correct parameters for CUDA calls
|
15 years ago |
Samuel Thibault
|
c05e4565d4
only the owner of the block should send it to node 0
|
15 years ago |
Cédric Augonnet
|
578c319b4a
Fix bug and add error checking
|
15 years ago |
Cédric Augonnet
|
4408a54de9
add an experiment to measure the latency between 2 GPUs
|
15 years ago |
Cédric Augonnet
|
c1273c8f02
- add a (pretty naive) test to measure the latency between 2 GPUs
|
15 years ago |
Cédric Augonnet
|
516411357c
forgot to commit the public headers in the previous commit
|
15 years ago |
Cédric Augonnet
|
3ea8462620
- Use events instead of streams to check whether a data transfer is terminated
|
15 years ago |
Cédric Augonnet
|
b308e85842
The user events are typically not executed by a worker, so in case the event is
|
15 years ago |
Cédric Augonnet
|
3946a1e554
Forgot to move the cuda kernel associated to the sync_and_notify_data test.
|
15 years ago |
Cédric Augonnet
|
c7d8e962c0
The progression hook is a mechanism which should be used in very specific cases.
|
15 years ago |
Cédric Augonnet
|
5a38bde976
tag-wait-api is not a microbench either
|
15 years ago |
Cédric Augonnet
|
59a0283279
move the sync_and_notify_data example in a more appropriate directory too
|
15 years ago |
Cédric Augonnet
|
7363413be1
Move the "dsm stress" example in a more appropriate directory, this is not a
|
15 years ago |
Cédric Augonnet
|
89c1870f00
Define a new header which should contain the public functions that should only
|
15 years ago |
Cédric Augonnet
|
4a8fe3239e
improve error checking
|
15 years ago |
Cédric Augonnet
|
dc7a276f27
add some debug messages in the MPI lib
|
15 years ago |
Cédric Augonnet
|
d247fadc97
add a -display flag
|
15 years ago |
Cédric Augonnet
|
0a02a72fd6
Something got messed up with fortran vs. C ordering, so we temporarily
|
15 years ago |
Cédric Augonnet
|
bbb0e033a8
- yet more debugging
|
15 years ago |
Cédric Augonnet
|
c1f052654a
fix MPI LU
|
15 years ago |
Cédric Augonnet
|
9484d29449
Add various functions to debug the MPI LU code.
|
15 years ago |
Cédric Augonnet
|
681fcbb676
make sure that nvcc finds cuda headers
|
15 years ago |
Cédric Augonnet
|
395d4f8c66
- Add timing in the incrementer example.
|
15 years ago |
Cédric Augonnet
|
ee82317aad
- fix some bugs
|
15 years ago |
Cédric Augonnet
|
3bfae32576
define a poison value to help debugging
|
15 years ago |
Cédric Augonnet
|
eee6350711
handle termination properly
|
15 years ago |
Cédric Augonnet
|
e5a8efbe98
fix multiple bugs (e.g. tag misuse)
|
15 years ago |