@@ -96,7 +96,7 @@ func <<<grid,block,0,starpu_cuda_get_local_stream()>>> (foo, bar);
cudaStreamSynchronize(starpu_cuda_get_local_stream());
\endcode
-as well as the use of cudaMemcpyAsync(), etc. for each CUDA operation one needs
+as well as the use of \c cudaMemcpyAsync(), etc. for each CUDA operation one needs
to use a version that takes the a stream parameter.
Unfortunately, some CUDA libraries do not have stream variants of
@@ -145,7 +145,7 @@ The scheduling algorithms worth trying are thus <code>dmdar</code> and
<code>lws</code>, which privilege data locality over priorities. There will be
work on this area in the coming future.
-\section Feedback figures
+\section FeedBackFigures Feedback Figures
Beyond pure performance feedback, some figures are interesting to have a look at.