Samuel Thibault
|
f313079113
Factorize code to prepare for OpenCL pipelining
|
11 vuotta sitten |
Samuel Thibault
|
62e0ff89af
Fix missing include for windows port of locking
|
11 vuotta sitten |
Samuel Thibault
|
23636fb4f4
Fix simgrid build
|
11 vuotta sitten |
Samuel Thibault
|
77e107e4af
Warn if the user uses concurrent execution on a device which doesn't support it
|
11 vuotta sitten |
Samuel Thibault
|
62194c5c86
Fix filling the pipeline
|
11 vuotta sitten |
Samuel Thibault
|
f99ef13c88
Add CUDA kernel submission pipelining, to overlap costs and allow concurrent
|
11 vuotta sitten |
Samuel Thibault
|
647ccc35a7
Fix estimated allocated memory
|
11 vuotta sitten |
Olivier Aumage
|
c21a7d4669
- make the conf struct global as it may be referenced outside the function
|
11 vuotta sitten |
Samuel Thibault
|
41cfdc75cd
free data outside critical section
|
11 vuotta sitten |
Samuel Thibault
|
3b766361ce
free data outside critical section
|
11 vuotta sitten |
Samuel Thibault
|
206b3adf49
port r13269 from 1.1: free data outside critical section
|
11 vuotta sitten |
Samuel Thibault
|
aa5c3e5c50
Let interfaces declare which transfers they allow with the can_copy methode.
|
11 vuotta sitten |
Samuel Thibault
|
1ec6838c01
Implement perfmodel locking on windows
|
11 vuotta sitten |
Samuel Thibault
|
e5cf638f86
Pass codelet model when computing footprints for the bound computation
|
11 vuotta sitten |
Olivier Aumage
|
01e6cc7d00
- update argument name
|
11 vuotta sitten |
Olivier Aumage
|
3a272669cb
- merge trunk
|
11 vuotta sitten |
Samuel Thibault
|
b294c2f942
Fix hardcoded limitation to 64 MPI nodes in traces
|
11 vuotta sitten |
Samuel Thibault
|
f8ff9c7d1c
Fix build without memcpy_peer
|
11 vuotta sitten |
Samuel Thibault
|
89ece6b14f
changelog
|
11 vuotta sitten |
Samuel Thibault
|
2c6e5a75b0
enable gpu-gpu transfers for matrices
|
11 vuotta sitten |
Samuel Thibault
|
d76f89457d
Separate bitfields set by starpu from bitfields set by user, since the latter are accessed unsafely by starpu, but the former shouldn't
|
11 vuotta sitten |
Samuel Thibault
|
2a1bb7bf49
Update 1.1.2 svn revision
|
11 vuotta sitten |
Samuel Thibault
|
14b8f531a6
Lock performance model files while writing and reading them to avoid issues on parallel launches, MPI runs notably.
|
11 vuotta sitten |
Nathalie Furmento
|
6d015be440
tests/main/{subgraph_repeat_tag.c,subgraph_repeat_regenerate_tag.c}: stop regenerating tasks when all loops are done, this permits to call starpu_data_unregister(), otherwise it would wait for the tasks
|
11 vuotta sitten |
Nathalie Furmento
|
c232f22ddd
examples/spmv/dw_block_spmv.c: fix following #13213
|
11 vuotta sitten |
Samuel Thibault
|
782dddbbca
Do not drop forward declarations
|
11 vuotta sitten |
Samuel Thibault
|
c631e5ed44
Make more examples compatible with simgrid
|
11 vuotta sitten |
Samuel Thibault
|
70af20310c
Make LU example compatible with simgrid
|
11 vuotta sitten |
Nathalie Furmento
|
c803479334
tests/microbenchs/matrix_as_vector.c: fix typo, define matrix_codelet and not vector_codelet
|
11 vuotta sitten |
Nathalie Furmento
|
bafe84486c
fix #13183
|
11 vuotta sitten |