
document more about calibration

Samuel Thibault, 14 years ago
commit 091808305a
1 file changed, 24 insertions, 4 deletions

+ 24 - 4
doc/starpu.texi

@@ -1487,12 +1487,32 @@ to configure a performance model for the codelets of the application (see
 @ref{Performance model example} for instance). History-based performance models
 use on-line calibration.  StarPU will automatically calibrate codelets
 which have never been calibrated yet. To force continuing calibration, use
-@code{export STARPU_CALIBRATE=1} . To drop existing calibration information
-completely and re-calibrate from start, use @code{export STARPU_CALIBRATE=2}.
+@code{export STARPU_CALIBRATE=1}. This may be necessary if your application
+has not-so-stable performance. Details on the current performance model status
+can be obtained with the @code{starpu_perfmodel_display} command: the @code{-l}
+option lists the available performance models, and the @code{-s} option allows
+choosing the performance model to be displayed. The result looks like:
+
+@example
+$ starpu_perfmodel_display -s starpu_dlu_lu_model_22
+performance model for cpu
+# hash		size		mean		dev		n
+5c6c3401	1572864        	1.216300e+04   	2.277778e+03   	1240
+@end example
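
To find which model symbol to pass to @code{-s}, one can first list the available models, as described above. A sketch (it assumes StarPU is installed and at least one application has already run with calibration enabled; the printed symbols depend on that machine):

```shell
# Print the performance model symbols calibrated on this machine; any of
# them can then be passed to "starpu_perfmodel_display -s <symbol>".
starpu_perfmodel_display -l
```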
+
+This shows that for the LU 22 kernel with a 1.5MiB matrix, the average
+execution time on CPUs was about 12ms, with a 2ms standard deviation, over
+1240 samples. It is a good idea to check this before doing actual performance
+measurements.
+
+If a kernel's source code was modified (e.g. for a performance improvement),
+the calibration information is stale and should be dropped so as to
+re-calibrate from scratch. This can be done with @code{export STARPU_CALIBRATE=2}.
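
The recalibration workflow described above might look like the following sketch (@code{./my_app} stands for a hypothetical StarPU application binary; the environment variable values are the ones from the text):

```shell
# After a kernel source change, drop the stale history and re-calibrate
# from scratch (STARPU_CALIBRATE=2), then keep refining the measurements
# on later runs (STARPU_CALIBRATE=1). ./my_app is a hypothetical binary.
export STARPU_CALIBRATE=2
./my_app                 # old calibration data dropped, fresh measurements
export STARPU_CALIBRATE=1
./my_app                 # calibration continues, history keeps being refined
unset STARPU_CALIBRATE   # default: only uncalibrated codelets are measured
```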
+
 Note: due to CUDA limitations, to be able to measure kernel duration,
 calibration mode needs to disable asynchronous data transfers. Calibration thus
 disables data transfer / computation overlapping, and should thus not be used
-for eventual benchmarks. Note 2: history-based performance model get calibrated
+for eventual benchmarks. Note 2: history-based performance models get calibrated
 only if a performance-model-based scheduler is chosen.
 
 @node Task distribution vs Data transfer
@@ -1514,7 +1534,7 @@ the good results that a precise estimation would give.
 @node Data prefetch
 @section Data prefetch
 
-The heft scheduling policy performs data prefetch (see @ref{STARPU_PREFETCH}):
+The heft, dmda and pheft scheduling policies perform data prefetch (see @ref{STARPU_PREFETCH}):
 as soon as a scheduling decision is taken for a task, requests are issued to
 transfer its required data to the target processing unit, if needed, so that
 when the processing unit actually starts the task, its data will hopefully be
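
Enabling prefetch is thus mostly a matter of picking one of these scheduling policies, e.g. via the @code{STARPU_SCHED} environment variable. A sketch (@code{./my_app} is a hypothetical binary; setting @code{STARPU_PREFETCH} explicitly is shown for illustration, see @ref{STARPU_PREFETCH} for its default):

```shell
# Pick a prefetching scheduler (dmda here; heft and pheft also prefetch)
# and run the application: data transfers are then issued as soon as
# tasks are assigned to a processing unit. ./my_app is hypothetical.
export STARPU_SCHED=dmda
export STARPU_PREFETCH=1
./my_app
```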