Samuel Thibault
|
b195529cd4
Add missing calls to can_execute hook
|
12 vuotta sitten |
Samuel Thibault
|
14016e793f
port r11537 from 1.1: cudaFree takes much more than 125µs on average. 750µs is an average on the cholesky example. This still needs to be somehow tuned
|
12 vuotta sitten |
Samuel Thibault
|
c77d312268
port r 11535 from 1.1: Add missing termination of data transfers for reclaiming write-back
|
12 vuotta sitten |
Samuel Thibault
|
098414fc11
port r11533 from 1.1: Also explicitly show in traces the writing back time during reclaiming
|
12 vuotta sitten |
Samuel Thibault
|
9a05d0c3b8
port r11531 from 1.1: Also show free periods in the trace, to distinguish with data transfers and algorithmic complexity in reclaiming
|
12 vuotta sitten |
Samuel Thibault
|
5f6f2bf55b
port r11529 from 1.1: Ignore data races on statistics
|
12 vuotta sitten |
Samuel Thibault
|
0001dad2f6
port r11527 from 1.1: Do not care about races on number of tasks, this is used as a statistic only
|
12 vuotta sitten |
Nathalie Furmento
|
f3a3538ac7
examples/stencil/stencil-kernels.c: check return value of function clEnqueueCopyBuffer
|
12 vuotta sitten |
Marc Sergent
|
b2f8911c89
Adding configure option --enable-calibration-heuristic which allows the user to set the maximum authorized deviation of the history-based calibrator
|
12 vuotta sitten |
Samuel Thibault
|
0423864961
port r11516 from 1.1: considerably reduce the amount of requests submitted at the same time. Pushing more does not seem to really improve performance, and on the contrary increases the latency of possibly urging requests
|
12 vuotta sitten |
Samuel Thibault
|
631d6e3185
port r 11514 from 1.1: Add unpartition state in trace
|
12 vuotta sitten |
Nathalie Furmento
|
3221332ec5
mpi: minor fixes for function declarations
|
12 vuotta sitten |
Nathalie Furmento
|
5fdde967b0
tests: partly revert #11510 and turn all cpu implementations codelets not static so that MIC code is able to successfully call dlsym on the function
|
12 vuotta sitten |
Nathalie Furmento
|
8ec3cd5612
tests: forgot to add file in previous commit
|
12 vuotta sitten |
Nathalie Furmento
|
0c6c658cb1
tests: turn function static when it makes sense to do so
|
12 vuotta sitten |
Samuel Thibault
|
d80744f490
port r11508 from 1.1: Fix passing key to hasthable: the passed pointer has to be pointing to the key at all time
|
12 vuotta sitten |
Samuel Thibault
|
806d128705
port r11506 from 1.1: Do not remove reused memchunk from the cache, it was already removed in _starpu_memchunk_cache_lookup_locked. Thanks Cyril for managing to find a case where it crashes :)
|
12 vuotta sitten |
Samuel Thibault
|
dc2959f5f2
port r11504 from 1.1: Make run_driver cover the case when the task is not finished when we call starpu_drivers_request_termination. This is actually not working any more in the trunk, and needs fixing
|
12 vuotta sitten |
Nathalie Furmento
|
402e568b27
src/datawizard/memalloc: store interface size of the memchunk as the handle may no longer be valid when reusing the memchunk (thanks to Cyril Bordage for reporting the bug)
|
12 vuotta sitten |
Nathalie Furmento
|
9bb5a09263
src: minor fixes
|
12 vuotta sitten |
Nathalie Furmento
|
e1166e9eb0
include/starpu_bitmap.h: add missing starpu_bitmap_has_next() function prototype
|
12 vuotta sitten |
Nathalie Furmento
|
5f35f4c737
src/datawizard/interfaces/data_interface.c: rename internal function
|
12 vuotta sitten |
Nathalie Furmento
|
59dc8c395f
src/datawizard: add missing include starpu_scheduler.h
|
12 vuotta sitten |
Nathalie Furmento
|
ddf6a00144
src/datawizard/malloc: define prototypes
|
12 vuotta sitten |
Samuel Thibault
|
01764c5ddc
ignore race about the watchdog state, we are ok with it
|
12 vuotta sitten |
Nathalie Furmento
|
15acb4be0f
doc/Makefile.am: add showcheck target
|
12 vuotta sitten |
Nathalie Furmento
|
78f4820844
mpi/src/starpu_mpi.c: initialisation can only be done once mpi itself has been initialised
|
12 vuotta sitten |
Nathalie Furmento
|
a31d46c22b
mpi/src/starpu_mpi.c: make sure all internal structures are initialised before starting mpi communication thread
|
12 vuotta sitten |
Nathalie Furmento
|
7667e0b20a
tests/perfmodels/value_nan.c: delete temporary file
|
12 vuotta sitten |
Samuel Thibault
|
b801a66994
port r 11483 from 1.1: Default to using only 90% of the available GPU memory, to avoid seeing cudaMemset run out of memory..
|
12 vuotta sitten |