
Note that the type and symbol fields of perfmodel are mandatory

Samuel Thibault, 14 years ago
commit 559996dfe1
1 changed file with 14 additions and 9 deletions:
    doc/starpu.texi

+ 14 - 9
doc/starpu.texi

@@ -1228,16 +1228,16 @@ Partitioning can be applied several times, see
 @section Performance model example
 
 To achieve good scheduling, StarPU scheduling policies need to be able to
-estimate in advance the duration of a task. This is done by giving to codelets a
-performance model. There are several kinds of performance models.
+estimate in advance the duration of a task. This is done by giving codelets
+a performance model: define a @code{starpu_perfmodel_t} structure and
+provide its address in the @code{model} field of the @code{starpu_codelet}
+structure. The @code{symbol} and @code{type} fields of @code{starpu_perfmodel_t}
+are mandatory: they give the model a name and specify its type, since
+there are several kinds of performance models.
 
 @itemize
 @item
-Providing an estimation from the application itself (@code{STARPU_COMMON} model type and @code{cost_model} field),
-see for instance
-@code{examples/common/blas_model.h} and @code{examples/common/blas_model.c}. It can also be provided for each architecture (@code{STARPU_PER_ARCH} model type and @code{per_arch} field)
-@item
-Measured at runtime (STARPU_HISTORY_BASED model type). This assumes that for a
+Measured at runtime (@code{STARPU_HISTORY_BASED} model type). This assumes that for a
 given set of data input/output sizes, the performance will always be about the
 same. This is very true for regular kernels on GPUs for instance (<0.1% error),
 and just a bit less true on CPUs (~=1% error). This also assumes that there are
@@ -1277,7 +1277,7 @@ starpu_codelet cl = @{
 @end cartouche
 
 @item
-Measured at runtime and refined by regression (STARPU_REGRESSION_*_BASED
+Measured at runtime and refined by regression (@code{STARPU_REGRESSION_*_BASED}
 model type). This still assumes performance regularity, but can work
 with various data input sizes, by applying regression over observed
 execution times. STARPU_REGRESSION_BASED uses an a*n^b regression
@@ -1287,7 +1287,12 @@ STARPU_REGRESSION_BASED, but costs a lot more to compute). For instance,
 model for the @code{memset} operation.
 
 @item
-Provided explicitly by the application (STARPU_PER_ARCH model type): the
+Provided as an estimation from the application itself (@code{STARPU_COMMON} model type and @code{cost_model} field),
+see for instance
+@code{examples/common/blas_model.h} and @code{examples/common/blas_model.c}.
+
+@item
+Provided explicitly by the application (@code{STARPU_PER_ARCH} model type): the
 @code{.per_arch[i].cost_model} fields have to be filled with pointers to
 functions which return the expected duration of the task in micro-seconds, one
 per architecture.