configuration.texi 22 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539540541542543544545546547548549550551552553554555556557558559560561562563564565566567568569570571
  1. @c -*-texinfo-*-
  2. @c This file is part of the StarPU Handbook.
  3. @c Copyright (C) 2009--2011 Universit@'e de Bordeaux 1
  4. @c Copyright (C) 2010, 2011, 2012 Centre National de la Recherche Scientifique
  5. @c Copyright (C) 2011, 2012 Institut National de Recherche en Informatique et Automatique
  6. @c See the file starpu.texi for copying conditions.
  7. @menu
  8. * Compilation configuration::
  9. * Execution configuration through environment variables::
  10. @end menu
  11. @node Compilation configuration
  12. @section Compilation configuration
  13. The following arguments can be given to the @code{configure} script.
  14. @menu
  15. * Common configuration::
  16. * Configuring workers::
  17. * Extension configuration::
  18. * Advanced configuration::
  19. @end menu
  20. @node Common configuration
  21. @subsection Common configuration
  22. @defvr {Configure option} --enable-debug
  23. Enable debugging messages.
  24. @end defvr
  25. @defvr {Configure option} --enable-debug
  26. Enable debugging messages.
  27. @end defvr
  28. @defvr {Configure option} --enable-fast
  29. Disable assertion checks, which saves computation time.
  30. @end defvr
  31. @defvr {Configure option} --enable-verbose
  32. Increase the verbosity of the debugging messages. This can be disabled
  33. at runtime by setting the environment variable @code{STARPU_SILENT} to
  34. any value.
  35. @smallexample
  36. % STARPU_SILENT=1 ./vector_scal
  37. @end smallexample
  38. @end defvr
  39. @defvr {Configure option} --enable-coverage
  40. Enable flags for the @code{gcov} coverage tool.
  41. @end defvr
  42. @defvr {Configure option} --enable-quick-check
  43. Specify tests and examples should be run on a smaller data set, i.e
  44. allowing a faster execution time
  45. @end defvr
  46. @defvr {Configure option} --with-hwloc
  47. Specify hwloc should be used by StarPU. hwloc should be found by the
  48. means of the tools @code{pkg-config}.
  49. @end defvr
  50. @defvr {Configure option} --with-hwloc=@var{prefix}
  51. Specify hwloc should be used by StarPU. hwloc should be found in the
  52. directory specified by @var{prefix}.
  53. @end defvr
  54. @defvr {Configure option} --without-hwloc
  55. Specify hwloc should not be used by StarPU.
  56. @end defvr
  57. @defvr {Configure option} --disable-build-doc
  58. Disable the creation of the documentation. This should be done on a
  59. machine which does not have the tools @code{makeinfo} and @code{tex}.
  60. @end defvr
  61. Additionally, the @command{configure} script recognize many variables, which
  62. can be listed by typing @code{./configure --help}. For example,
  63. @code{./configure NVCCFLAGS="-arch sm_13"} adds a flag for the compilation of
  64. CUDA kernels.
  65. @node Configuring workers
  66. @subsection Configuring workers
  67. @defvr {Configure option} --enable-maxcpus=@var{count}
  68. Use at most @var{count} CPU cores. This information is then
  69. available as the @code{STARPU_MAXCPUS} macro.
  70. @end defvr
  71. @defvr {Configure option} --disable-cpu
  72. Disable the use of CPUs of the machine. Only GPUs etc. will be used.
  73. @end defvr
  74. @defvr {Configure option} --enable-maxcudadev=@var{count}
  75. Use at most @var{count} CUDA devices. This information is then
  76. available as the @code{STARPU_MAXCUDADEVS} macro.
  77. @end defvr
  78. @defvr {Configure option} --disable-cuda
  79. Disable the use of CUDA, even if a valid CUDA installation was detected.
  80. @end defvr
  81. @defvr {Configure option} --with-cuda-dir=@var{prefix}
  82. Search for CUDA under @var{prefix}, which should notably contain
  83. @file{include/cuda.h}.
  84. @end defvr
  85. @defvr {Configure option} --with-cuda-include-dir=@var{dir}
  86. Search for CUDA headers under @var{dir}, which should
  87. notably contain @code{cuda.h}. This defaults to @code{/include} appended to the
  88. value given to @code{--with-cuda-dir}.
  89. @end defvr
  90. @defvr {Configure option} --with-cuda-lib-dir=@var{dir}
  91. Search for CUDA libraries under @var{dir}, which should notably contain
  92. the CUDA shared libraries---e.g., @file{libcuda.so}. This defaults to
  93. @code{/lib} appended to the value given to @code{--with-cuda-dir}.
  94. @end defvr
  95. @defvr {Configure option} --disable-cuda-memcpy-peer
  96. Explicitly disable peer transfers when using CUDA 4.0.
  97. @end defvr
  98. @defvr {Configure option} --enable-maxopencldev=@var{count}
  99. Use at most @var{count} OpenCL devices. This information is then
  100. available as the @code{STARPU_MAXOPENCLDEVS} macro.
  101. @end defvr
  102. @defvr {Configure option} --disable-opencl
  103. Disable the use of OpenCL, even if the SDK is detected.
  104. @end defvr
  105. @defvr {Configure option} --with-opencl-dir=@var{prefix}
  106. Search for an OpenCL implementation under @var{prefix}, which should
  107. notably contain @file{include/CL/cl.h} (or @file{include/OpenCL/cl.h} on
  108. Mac OS).
  109. @end defvr
  110. @defvr {Configure option} --with-opencl-include-dir=@var{dir}
  111. Search for OpenCL headers under @var{dir}, which should notably contain
  112. @file{CL/cl.h} (or @file{OpenCL/cl.h} on Mac OS). This defaults to
  113. @code{/include} appended to the value given to @code{--with-opencl-dir}.
  114. @end defvr
  115. @defvr {Configure option} --with-opencl-lib-dir=@var{dir}
  116. Search for an OpenCL library under @var{dir}, which should notably
  117. contain the OpenCL shared libraries---e.g. @file{libOpenCL.so}. This defaults to
  118. @code{/lib} appended to the value given to @code{--with-opencl-dir}.
  119. @end defvr
  120. @defvr {Configure option} --enable-maximplementations=@var{count}
  121. Allow for at most @var{count} codelet implementations for the same
  122. target device. This information is then available as the
  123. @code{STARPU_MAXIMPLEMENTATIONS} macro.
  124. @end defvr
  125. @defvr {Configure option} --disable-asynchronous-copy
  126. Disable asynchronous copies between CPU and GPU devices.
  127. The AMD implementation of OpenCL is known to
  128. fail when copying data asynchronously. When using this implementation,
  129. it is therefore necessary to disable asynchronous data transfers.
  130. @end defvr
  131. @defvr {Configure option} --disable-asynchronous-cuda-copy
  132. Disable asynchronous copies between CPU and CUDA devices.
  133. @end defvr
  134. @defvr {Configure option} --disable-asynchronous-opencl-copy
  135. Disable asynchronous copies between CPU and OpenCL devices.
  136. The AMD implementation of OpenCL is known to
  137. fail when copying data asynchronously. When using this implementation,
  138. it is therefore necessary to disable asynchronous data transfers.
  139. @end defvr
  140. @node Extension configuration
  141. @subsection Extension configuration
  142. @defvr {Configure option} --disable-socl
  143. Disable the SOCL extension (@pxref{SOCL OpenCL Extensions}). By
  144. default, it is enabled when an OpenCL implementation is found.
  145. @end defvr
  146. @defvr {Configure option} --disable-starpu-top
  147. Disable the StarPU-Top interface (@pxref{StarPU-Top}). By default, it
  148. is enabled when the required dependencies are found.
  149. @end defvr
  150. @defvr {Configure option} --disable-gcc-extensions
  151. Disable the GCC plug-in (@pxref{C Extensions}). By default, it is
  152. enabled when the GCC compiler provides a plug-in support.
  153. @end defvr
  154. @defvr {Configure option} --with-mpicc=@var{path}
  155. Use the @command{mpicc} compiler at @var{path}, for starpumpi
  156. (@pxref{StarPU MPI support}).
  157. @end defvr
  158. @node Advanced configuration
  159. @subsection Advanced configuration
  160. @defvr {Configure option} --enable-perf-debug
  161. Enable performance debugging through gprof.
  162. @end defvr
  163. @defvr {Configure option} --enable-model-debug
  164. Enable performance model debugging.
  165. @end defvr
  166. @defvr {Configure option} --enable-stats
  167. @c see ../../src/datawizard/datastats.c
  168. Enable gathering of various data statistics (@pxref{Data statistics}).
  169. @end defvr
  170. @defvr {Configure option} --enable-maxbuffers
  171. Define the maximum number of buffers that tasks will be able to take
  172. as parameters, then available as the @code{STARPU_NMAXBUFS} macro.
  173. @end defvr
  174. @defvr {Configure option} --enable-allocation-cache
  175. Enable the use of a data allocation cache to avoid the cost of it with
  176. CUDA. Still experimental.
  177. @end defvr
  178. @defvr {Configure option} --enable-opengl-render
  179. Enable the use of OpenGL for the rendering of some examples.
  180. @c TODO: rather default to enabled when detected
  181. @end defvr
  182. @defvr {Configure option} --enable-blas-lib
  183. Specify the blas library to be used by some of the examples. The
  184. library has to be 'atlas' or 'goto'.
  185. @end defvr
  186. @defvr {Configure option} --disable-starpufft
  187. Disable the build of libstarpufft, even if fftw or cuFFT is available.
  188. @end defvr
  189. @defvr {Configure option} --with-magma=@var{prefix}
  190. Search for MAGMA under @var{prefix}. @var{prefix} should notably
  191. contain @file{include/magmablas.h}.
  192. @end defvr
  193. @defvr {Configure option} --with-fxt=@var{prefix}
  194. Search for FxT under @var{prefix}.
  195. @url{http://savannah.nongnu.org/projects/fkt, FxT} is used to generate
  196. traces of scheduling events, which can then be rendered them using ViTE
  197. (@pxref{Off-line, off-line performance feedback}). @var{prefix} should
  198. notably contain @code{include/fxt/fxt.h}.
  199. @end defvr
  200. @defvr {Configure option} --with-perf-model-dir=@var{dir}
  201. Store performance models under @var{dir}, instead of the current user's
  202. home.
  203. @end defvr
  204. @defvr {Configure option} --with-goto-dir=@var{prefix}
  205. Search for GotoBLAS under @var{prefix}, which should notably contain @file{libgoto.so} or @file{libgoto2.so}.
  206. @end defvr
  207. @defvr {Configure option} --with-atlas-dir=@var{prefix}
  208. Search for ATLAS under @var{prefix}, which should notably contain
  209. @file{include/cblas.h}.
  210. @end defvr
  211. @defvr {Configure option} --with-mkl-cflags=@var{cflags}
  212. Use @var{cflags} to compile code that uses the MKL library.
  213. @end defvr
  214. @defvr {Configure option} --with-mkl-ldflags=@var{ldflags}
  215. Use @var{ldflags} when linking code that uses the MKL library. Note
  216. that the
  217. @url{http://software.intel.com/en-us/articles/intel-mkl-link-line-advisor/,
  218. MKL website} provides a script to determine the linking flags.
  219. @end defvr
  220. @defvr {Configure option} --disable-build-examples
  221. Disable the build of examples.
  222. @end defvr
  223. @defvr {Configure option} --enable-memory-stats
  224. Enable memory statistics (@pxref{Memory feedback}).
  225. @end defvr
  226. @defvr {Configure option} --enable-simgrid
  227. Enable simulation of execution in simgrid, to allow easy experimentation with
  228. various numbers of cores and GPUs, or amount of memory, etc. Experimental.
  229. @end defvr
  230. @node Execution configuration through environment variables
  231. @section Execution configuration through environment variables
  232. @menu
  233. * Workers:: Configuring workers
  234. * Scheduling:: Configuring the Scheduling engine
  235. * Extensions::
  236. * Misc:: Miscellaneous and debug
  237. @end menu
  238. @node Workers
  239. @subsection Configuring workers
  240. @defvr {Environment variable} STARPU_NCPU
  241. Specify the number of CPU workers (thus not including workers dedicated to control accelerators). Note that by default, StarPU will not allocate
  242. more CPU workers than there are physical CPUs, and that some CPUs are used to control
  243. the accelerators.
  244. @end defvr
  245. @defvr {Environment variable} STARPU_NCPUS
  246. This variable is deprecated. You should use @code{STARPU_NCPU}.
  247. @end defvr
  248. @defvr {Environment variable} STARPU_NCUDA
  249. Specify the number of CUDA devices that StarPU can use. If
  250. @code{STARPU_NCUDA} is lower than the number of physical devices, it is
  251. possible to select which CUDA devices should be used by the means of the
  252. @code{STARPU_WORKERS_CUDAID} environment variable. By default, StarPU will
  253. create as many CUDA workers as there are CUDA devices.
  254. @end defvr
  255. @defvr {Environment variable} STARPU_NOPENCL
  256. OpenCL equivalent of the @code{STARPU_NCUDA} environment variable.
  257. @end defvr
  258. @defvr {Environment variable} STARPU_OPENCL_ON_CPUS
  259. By default, the OpenCL driver only enables GPU and accelerator
  260. devices. By setting the environment variable
  261. @code{STARPU_OPENCL_ON_CPUS} to 1, the OpenCL driver will also enable
  262. CPU devices.
  263. @end defvr
  264. @defvr {Environment variable} STARPU_WORKERS_NOBIND
  265. Setting it to non-zero will prevent StarPU from binding its threads to
  266. CPUs. This is for instance useful when running the testsuite in parallel.
  267. @end defvr
  268. @defvr {Environment variable} STARPU_WORKERS_CPUID
  269. Passing an array of integers (starting from 0) in @code{STARPU_WORKERS_CPUID}
  270. specifies on which logical CPU the different workers should be
  271. bound. For instance, if @code{STARPU_WORKERS_CPUID = "0 1 4 5"}, the first
  272. worker will be bound to logical CPU #0, the second CPU worker will be bound to
  273. logical CPU #1 and so on. Note that the logical ordering of the CPUs is either
  274. determined by the OS, or provided by the @code{hwloc} library in case it is
  275. available.
  276. Note that the first workers correspond to the CUDA workers, then come the
  277. OpenCL workers, and finally the CPU workers. For example if
  278. we have @code{STARPU_NCUDA=1}, @code{STARPU_NOPENCL=1}, @code{STARPU_NCPU=2}
  279. and @code{STARPU_WORKERS_CPUID = "0 2 1 3"}, the CUDA device will be controlled
  280. by logical CPU #0, the OpenCL device will be controlled by logical CPU #2, and
  281. the logical CPUs #1 and #3 will be used by the CPU workers.
  282. If the number of workers is larger than the array given in
  283. @code{STARPU_WORKERS_CPUID}, the workers are bound to the logical CPUs in a
  284. round-robin fashion: if @code{STARPU_WORKERS_CPUID = "0 1"}, the first and the
  285. third (resp. second and fourth) workers will be put on CPU #0 (resp. CPU #1).
  286. This variable is ignored if the @code{use_explicit_workers_bindid} flag of the
  287. @code{starpu_conf} structure passed to @code{starpu_init} is set.
  288. @end defvr
  289. @defvr {Environment variable} STARPU_WORKERS_CUDAID
  290. Similarly to the @code{STARPU_WORKERS_CPUID} environment variable, it is
  291. possible to select which CUDA devices should be used by StarPU. On a machine
  292. equipped with 4 GPUs, setting @code{STARPU_WORKERS_CUDAID = "1 3"} and
  293. @code{STARPU_NCUDA=2} specifies that 2 CUDA workers should be created, and that
  294. they should use CUDA devices #1 and #3 (the logical ordering of the devices is
  295. the one reported by CUDA).
  296. This variable is ignored if the @code{use_explicit_workers_cuda_gpuid} flag of
  297. the @code{starpu_conf} structure passed to @code{starpu_init} is set.
  298. @end defvr
  299. @defvr {Environment variable} STARPU_WORKERS_OPENCLID
  300. OpenCL equivalent of the @code{STARPU_WORKERS_CUDAID} environment variable.
  301. This variable is ignored if the @code{use_explicit_workers_opencl_gpuid} flag of
  302. the @code{starpu_conf} structure passed to @code{starpu_init} is set.
  303. @end defvr
  304. @defvr {Environment variable} @code{STARPU_SINGLE_COMBINED_WORKER}
  305. If set, StarPU will create several workers which won't be able to work
  306. concurrently. It will create combined workers which size goes from 1 to the
  307. total number of CPU workers in the system.
  308. @end defvr
  309. @defvr {Environment variable} STARPU_SYNTHESIZE_ARITY_COMBINED_WORKER
  310. Let the user decide how many elements are allowed between combined workers
  311. created from hwloc information. For instance, in the case of sockets with 6
  312. cores without shared L2 caches, if @code{SYNTHESIZE_ARITY_COMBINED_WORKER} is
  313. set to 6, no combined worker will be synthesized beyond one for the socket
  314. and one per core. If it is set to 3, 3 intermediate combined workers will be
  315. synthesized, to divide the socket cores into 3 chunks of 2 cores. If it set to
  316. 2, 2 intermediate combined workers will be synthesized, to divide the the socket
  317. cores into 2 chunks of 3 cores, and then 3 additional combined workers will be
  318. synthesized, to divide the former synthesized workers into a bunch of 2 cores,
  319. and the remaining core (for which no combined worker is synthesized since there
  320. is already a normal worker for it).
  321. The default, 2, thus makes StarPU tend to building a binary trees of combined
  322. workers.
  323. @end defvr
  324. @defvr {Environment variable} STARPU_DISABLE_ASYNCHRONOUS_COPY
  325. Disable asynchronous copies between CPU and GPU devices.
  326. The AMD implementation of OpenCL is known to
  327. fail when copying data asynchronously. When using this implementation,
  328. it is therefore necessary to disable asynchronous data transfers.
  329. @end defvr
  330. @defvr {Environment variable} STARPU_DISABLE_ASYNCHRONOUS_CUDA_COPY
  331. Disable asynchronous copies between CPU and CUDA devices.
  332. @end defvr
  333. @defvr {Environment variable} STARPU_DISABLE_ASYNCHRONOUS_OPENCL_COPY
  334. Disable asynchronous copies between CPU and OpenCL devices.
  335. The AMD implementation of OpenCL is known to
  336. fail when copying data asynchronously. When using this implementation,
  337. it is therefore necessary to disable asynchronous data transfers.
  338. @end defvr
  339. @defvr {Environment variable} STARPU_DISABLE_CUDA_GPU_GPU_DIRECT
  340. Disable direct CUDA transfers from GPU to GPU, and let CUDA copy through RAM
  341. instead. This permits to test the performance effect of GPU-Direct.
  342. @end defvr
  343. @node Scheduling
  344. @subsection Configuring the Scheduling engine
  345. @defvr {Environment variable} STARPU_SCHED
  346. Choose between the different scheduling policies proposed by StarPU: work
  347. random, stealing, greedy, with performance models, etc.
  348. Use @code{STARPU_SCHED=help} to get the list of available schedulers.
  349. @end defvr
  350. @defvr {Environment variable} STARPU_CALIBRATE
  351. If this variable is set to 1, the performance models are calibrated during
  352. the execution. If it is set to 2, the previous values are dropped to restart
  353. calibration from scratch. Setting this variable to 0 disable calibration, this
  354. is the default behaviour.
  355. Note: this currently only applies to @code{dm}, @code{dmda} and @code{heft} scheduling policies.
  356. @end defvr
  357. @defvr {Environment variable} STARPU_BUS_CALIBRATE
  358. If this variable is set to 1, the bus is recalibrated during intialization.
  359. @end defvr
  360. @defvr {Environment variable} STARPU_PREFETCH
  361. @anchor{STARPU_PREFETCH}
  362. This variable indicates whether data prefetching should be enabled (0 means
  363. that it is disabled). If prefetching is enabled, when a task is scheduled to be
  364. executed e.g. on a GPU, StarPU will request an asynchronous transfer in
  365. advance, so that data is already present on the GPU when the task starts. As a
  366. result, computation and data transfers are overlapped.
  367. Note that prefetching is enabled by default in StarPU.
  368. @end defvr
  369. @defvr {Environment variable} STARPU_SCHED_ALPHA
  370. To estimate the cost of a task StarPU takes into account the estimated
  371. computation time (obtained thanks to performance models). The alpha factor is
  372. the coefficient to be applied to it before adding it to the communication part.
  373. @end defvr
  374. @defvr {Environment variable} STARPU_SCHED_BETA
  375. To estimate the cost of a task StarPU takes into account the estimated
  376. data transfer time (obtained thanks to performance models). The beta factor is
  377. the coefficient to be applied to it before adding it to the computation part.
  378. @end defvr
  379. @defvr {Environment variable} STARPU_SCHED_GAMMA
  380. Define the execution time penalty of a joule (@pxref{Power-based scheduling}).
  381. @end defvr
  382. @defvr {Environment variable} STARPU_IDLE_POWER
  383. Define the idle power of the machine (@pxref{Power-based scheduling}).
  384. @end defvr
  385. @defvr {Environment variable} STARPU_PROFILING
  386. Enable on-line performance monitoring (@pxref{Enabling on-line performance monitoring}).
  387. @end defvr
  388. @node Extensions
  389. @subsection Extensions
  390. @defvr {Environment variable} SOCL_OCL_LIB_OPENCL
  391. THE SOCL test suite is only run when the environment variable
  392. @code{SOCL_OCL_LIB_OPENCL} is defined. It should contain the location
  393. of the libOpenCL.so file of the OCL ICD implementation.
  394. @end defvr
  395. @defvr {Environment variable} STARPU_COMM_STATS
  396. @anchor{STARPU_COMM_STATS}
  397. Communication statistics for starpumpi (@pxref{StarPU MPI support})
  398. will be enabled when the environment variable @code{STARPU_COMM_STATS}
  399. is defined to an value other than 0.
  400. @end defvr
  401. @defvr {Environment variable} STARPU_MPI_CACHE
  402. @anchor{STARPU_MPI_CACHE}
  403. Communication cache for starpumpi (@pxref{StarPU MPI support}) will be
  404. disabled when the environment variable @code{STARPU_MPI_CACHE} is set
  405. to 0. It is enabled by default or for any other values of the variable
  406. @code{STARPU_MPI_CACHE}.
  407. @end defvr
  408. @node Misc
  409. @subsection Miscellaneous and debug
  410. @defvr {Environment variable} STARPU_OPENCL_PROGRAM_DIR
  411. @anchor{STARPU_OPENCL_PROGRAM_DIR}
  412. This specifies the directory where the OpenCL codelet source files are
  413. located. The function @ref{starpu_opencl_load_program_source} looks
  414. for the codelet in the current directory, in the directory specified
  415. by the environment variable @code{STARPU_OPENCL_PROGRAM_DIR}, in the
  416. directory @code{share/starpu/opencl} of the installation directory of
  417. StarPU, and finally in the source directory of StarPU.
  418. @end defvr
  419. @defvr {Environment variable} STARPU_SILENT
  420. This variable allows to disable verbose mode at runtime when StarPU
  421. has been configured with the option @code{--enable-verbose}. It also
  422. disables the display of StarPU information and warning messages.
  423. @end defvr
  424. @defvr {Environment variable} STARPU_LOGFILENAME
  425. This variable specifies in which file the debugging output should be saved to.
  426. @end defvr
  427. @defvr {Environment variable} STARPU_FXT_PREFIX
  428. This variable specifies in which directory to save the trace generated if FxT is enabled. It needs to have a trailing '/' character.
  429. @end defvr
  430. @defvr {Environment variable} STARPU_LIMIT_GPU_MEM
  431. This variable specifies the maximum number of megabytes that should be
  432. available to the application on each GPUs. In case this value is smaller than
  433. the size of the memory of a GPU, StarPU pre-allocates a buffer to waste memory
  434. on the device. This variable is intended to be used for experimental purposes
  435. as it emulates devices that have a limited amount of memory.
  436. @end defvr
  437. @defvr {Environment variable} STARPU_GENERATE_TRACE
  438. When set to 1, this variable indicates that StarPU should automatically
  439. generate a Paje trace when @code{starpu_shutdown()} is called.
  440. @end defvr
  441. @defvr {Environment variable} STARPU_MEMORY_STATS
  442. When set to 0, disable the display of memory statistics on data which
  443. have not been unregistered at the end of the execution (@pxref{Memory
  444. feedback}).
  445. @end defvr
  446. @defvr {Environment variable} STARPU_BUS_STATS
  447. When defined, statistics about data transfers will be displayed when calling
  448. @code{starpu_shutdown()} (@pxref{Profiling}).
  449. @end defvr
  450. @defvr {Environment variable} STARPU_WORKER_STATS
  451. When defined, statistics about the workers will be displayed when calling
  452. @code{starpu_shutdown()} (@pxref{Profiling}). When combined with the
  453. environment variable @code{STARPU_PROFILING}, it displays the power
  454. consumption (@pxref{Power-based scheduling}).
  455. @end defvr
  456. @defvr {Environment variable} STARPU_STATS
  457. When set to 0, data statistics will not be displayed at the
  458. end of the execution of an application (@pxref{Data statistics}).
  459. @end defvr