瀏覽代碼

Document that using cudaThreadSynchronize() instead of streams will affect
performance.

Cédric Augonnet 14 年之前
父節點
當前提交
eda488ace3
共有 1 個文件被更改,包括 2 次插入0 次删除
  1. 2 0
      doc/starpu.texi

+ 2 - 0
doc/starpu.texi

@@ -3192,6 +3192,8 @@ function is only provided for convenience so that programmers can easily use
 asynchronous operations within codelets without having to create a stream by
 hand. Note that the application is not forced to use the stream provided by
 @code{starpu_cuda_get_local_stream} and may also create its own streams.
+Synchronizing with @code{cudaThreadSynchronize()} is allowed, but will reduce
+the likelihood of having all transfers overlapped.
 
 @item @emph{Prototype}:
 @code{cudaStream_t *starpu_cuda_get_local_stream(void);}