Olivier Aumage
|
0bb92b846b
- more ongoing work on the omp runtime support
|
11 anos atrás |
Olivier Aumage
|
18cb0e9e8c
- merge trunk
|
11 anos atrás |
Olivier Aumage
|
934fa3edfc
- initial thread handling
|
11 anos atrás |
Samuel Thibault
|
c60d91cb3f
Enable asynchronous OpenCL wherever possible
|
11 anos atrás |
Olivier Aumage
|
1712f84e3b
- merge trunk
|
11 anos atrás |
Olivier Aumage
|
09ea840391
- ongoing work on openmp runtime support
|
11 anos atrás |
Samuel Thibault
|
840af37490
push pending requests at the back of the list of pending requests, not the front
|
11 anos atrás |
Samuel Thibault
|
058e113825
port r12533 from 1.1: Do not let non-CUDA workers use non-0 streams, CUDA seems not very threadsafe with that. Make the coherency engine avoid selecting non-CUDA workers to issue transfers, to avoid letting them use the 0 stream.
|
11 anos atrás |
Samuel Thibault
|
9e7c9936ce
Use STARPU_MAIN_RAM where appropriate
|
11 anos atrás |
Samuel Thibault
|
9aa1c5ba6c
Use STARPU_MAIN_RAM where appropriate
|
11 anos atrás |
Samuel Thibault
|
2bafbc890a
fix build
|
11 anos atrás |
Samuel Thibault
|
92d443d920
Make documentation clearer
|
11 anos atrás |
Samuel Thibault
|
ea5d8f596b
Make a copy of the interface to the memchunk only when the latter gets detached from the data, and thus the interface code will not work on it. Drop the copy when the memchunk gets reattached. This allows interfaces to modify pointers in the interface in unpack, notably
|
11 anos atrás |
Samuel Thibault
|
fb03301fdf
port r12523 from 1.1: Fix detaching a memchunk from a handle: the memchunk is not supposed to still have a pointer to the replicate
|
11 anos atrás |
Samuel Thibault
|
1aad553cc2
Fix crash with perfmodels having type common
|
11 anos atrás |
Andra Hugo
|
aed609a645
rollback r12518: don't allow unregister in callback or task codelets
|
11 anos atrás |
Andra Hugo
|
84e77b5ad7
add prolog callbacks to mpi
|
11 anos atrás |
Olivier Aumage
|
84fe3d8fb6
- merge trunk
|
11 anos atrás |
Andra Hugo
|
a78e7bdc6a
allow unregisters in callbacks and codelets when we have only CPUs
|
11 anos atrás |
Olivier Aumage
|
c9029780f2
- merge trunk
|
11 anos atrás |
Olivier Aumage
|
2a38b1c99d
- add test for OpenMP environment parsing
|
11 anos atrás |
Olivier Aumage
|
0e987f4f1d
- add partial support for OpenMP environment parsing
|
11 anos atrás |
Andra Hugo
|
2938dd4805
patch Terry: add a new prologue callback (one executed at pop time)
|
11 anos atrás |
Andra Hugo
|
e991168dec
rollback book_workers
|
11 anos atrás |
Olivier Aumage
|
b661cb806b
- basic test for the openmp runtime support
|
11 anos atrás |
Samuel Thibault
|
5c4a168e72
Do not destroy containers before using them
|
11 anos atrás |
Olivier Aumage
|
f4d932acbf
- merge trunk
|
11 anos atrás |
Samuel Thibault
|
5a9ffce947
backport r12506: fix disabling out-of-order for data transfers
|
11 anos atrás |
Samuel Thibault
|
bf8aa6338d
Let the OpenCL driver progress while the GPU is computing
|
11 anos atrás |
Samuel Thibault
|
7d2bf0c630
Enable asynchronous CUDA kernels wherever possible
|
11 anos atrás |