Samuel Thibault
|
f3d488b260
Avoid including cublas.h to avoid conflicting with cublas_v2.h
|
10 éve |
Samuel Thibault
|
34248f4d14
explicit versions of the blas
|
10 éve |
Nathalie Furmento
|
36fe1cbc6e
src/core/perfmodel: store at the beginning of the application the names for the different perfmodel directories, there is no need to recalcute them at each function call
|
10 éve |
Olivier Aumage
|
a0c36cc735
- execute Cuda/OpenCL omp target tasks without StarPU/OpenMP wrapper
|
10 éve |
Samuel Thibault
|
353019fea0
Enable integer resolution for gemm, it's not costly here
|
10 éve |
Samuel Thibault
|
c53f16a65d
Add gemm perfmodels
|
10 éve |
Samuel Thibault
|
c66eca1555
increase default matrix size
|
10 éve |
Samuel Thibault
|
be9be0e496
Fix flops computation in gemm
|
10 éve |
Samuel Thibault
|
f7fe21380e
TODO
|
10 éve |
Samuel Thibault
|
8ede268af6
Add bound support to gemm
|
10 éve |
Samuel Thibault
|
ed082a5e41
Integrate bound in lu output
|
10 éve |
Samuel Thibault
|
3192fc5b08
Make some example outputs nicer to plot
|
10 éve |
Samuel Thibault
|
24a6bf49c0
Fix footprint computation without bound
|
10 éve |
Samuel Thibault
|
d6b04d16b6
Fix crash when enabling bounds
|
10 éve |
Samuel Thibault
|
d045a58e37
Add stencil perfmodel
|
10 éve |
Samuel Thibault
|
0630c33945
Fix running stencil in simgrid
|
10 éve |
Samuel Thibault
|
699f63f475
Use bigger tasks by default
|
10 éve |
Samuel Thibault
|
0fee7354fd
Increase the default matrix size to get about 1GiB data
|
10 éve |
Samuel Thibault
|
e4d72271e7
port r14072 from 1.1: Document a bit the mirage architecture
|
10 éve |
Samuel Thibault
|
4b28886ba9
Add more measurements
|
10 éve |
Samuel Thibault
|
c7c9ae1aa7
Add missing flops
|
10 éve |
Samuel Thibault
|
95993904fc
Use approximation when comparing declared flops, since when printing the performance file we get some rounding
|
10 éve |
Samuel Thibault
|
60b85b966b
add lu double perfmodel
|
10 éve |
Samuel Thibault
|
6f02123c98
assume double support is available in simgrid mode
|
10 éve |
Samuel Thibault
|
41d4c8a7fe
Also enable ASYNC in simgrid mode
|
10 éve |
Samuel Thibault
|
e09b976171
Ajout perfmodels lu float
|
10 éve |
Samuel Thibault
|
eb7b6a9fbc
Add performance models for mirage
|
10 éve |
Samuel Thibault
|
ed8c3ac9e6
Fix initialization in case there'd be no data at all: some very-well-known node has to be doing it, for instance 0
|
10 éve |
Samuel Thibault
|
04855f8ec7
Fix initialization in case there'd be no data at all
|
10 éve |
Samuel Thibault
|
90bbc50bb2
harmonize fixes with the distributed version
|
10 éve |