Cédric Augonnet
|
2c2f40c638
The increment_redux test performs a succession of reduction phases followed by
|
14 роки тому |
Samuel Thibault
|
8dc58afd4d
Fix non-Linux builds.
|
14 роки тому |
Cédric Augonnet
|
04cfc235ec
Destroy the reduction-specific per-handle request list when the handle is
|
14 роки тому |
Cédric Augonnet
|
14ebc94929
Implement cg with filters
|
14 роки тому |
Cédric Augonnet
|
9c959490e7
Start to implement a conjugate gradient
|
14 роки тому |
Cédric Augonnet
|
c0be233412
Define CUBLAS functions
|
14 роки тому |
Cédric Augonnet
|
0828950dd7
Add a wrapper for DDOT
|
14 роки тому |
Cédric Augonnet
|
b910d275eb
The async task overhead now indicates precisely how time is spent
|
14 роки тому |
Cédric Augonnet
|
e2c5593252
Add new fields in the per-task profiling info structure
|
14 роки тому |
Cédric Augonnet
|
b42f38c8b2
Avoid overflow when computing pi
|
14 роки тому |
Cédric Augonnet
|
e4ed360572
When a data handle is accessed in a R/W mode for the first time after it was
|
14 роки тому |
Cédric Augonnet
|
193905f07a
Use starpu_task_wait_for_all instead of condition variables to simplify the
|
14 роки тому |
Cédric Augonnet
|
8d2b6c2760
Rename the STARPU_TASK_BLOCKED_ON_JOB status into STARPU_TASK_BLOCKED_ON_DATA
|
14 роки тому |
Cédric Augonnet
|
23a21d47db
Fix compilation without CUDA
|
14 роки тому |
Cédric Augonnet
|
3e6aadaca8
The pi_redux example is now one of the tests used by make check
|
14 роки тому |
Cédric Augonnet
|
1b0a67230d
No need to lock the header again
|
14 роки тому |
Cédric Augonnet
|
af9ea8975f
StarPU automatically calls starpu_data_start_reduction_mode when the data is
|
14 роки тому |
Cédric Augonnet
|
e4c85b9e07
Implement the Monte Carlo algorithm used to compute Pi by the means of
|
14 роки тому |
Cédric Augonnet
|
9ae657bdb4
Fix compilation for machines without CUDA
|
14 роки тому |
Cédric Augonnet
|
3b5af731f6
Don't use CUDA/CUBLAS if CUDA is not available
|
14 роки тому |
Cédric Augonnet
|
11561287cf
The dot_product example computes the scalar product of two vectors by the means
|
14 роки тому |
Cédric Augonnet
|
8e094a1fec
First draft of implementation of the STARPU_REDUX mode.
|
14 роки тому |
Cédric Augonnet
|
58215b1860
- Add sanity checks
|
14 роки тому |
Cédric Augonnet
|
d3dd8e5bd6
The "manual_reduction" test performs a reduction operation by the means of
|
14 роки тому |
Cédric Augonnet
|
78faf9849a
The starpu_data_unregister_no_coherency does the same as starpu_data_unregister
|
14 роки тому |
Cédric Augonnet
|
461d4d99b7
Provide an implementation of the spinlocks that actually rely on mutexes (with
|
14 роки тому |
Cédric Augonnet
|
b566d09898
Use a rw-lock instead of a mutex to protect the list of progression hooks.
|
14 роки тому |
Cédric Augonnet
|
280aa1e569
Document STARPU_LIMIT_GPU_MEM
|
14 роки тому |
Cédric Augonnet
|
e3df8c20c8
Implement STARPU_LIMIT_GPU_MEM on both CUDA and OpenCL devices.
|
14 роки тому |
Nathalie Furmento
|
a5f704ef6b
mpi_lu example: modifications towards the new API
|
14 роки тому |