| 123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539540541542543544545546547548549550551552553554555556557558559560561562563564565566567568569570571572573574575576577578579580581582583584585586587588589590591592593594595596597598599600601602603604605606607608609610611612613614615616617618619620621622623624625626627628629630631632633634635636637638639640641642643644645646647648649650651652653654655656657658659660661662663664665666667668669670671672673674675676677678679680681682683684685686687688689690691692693694695696697698699700701702703704705706707708709710711712713714715716717718719720721722723724725726727728729730731732733734735736737738739740741742743744745746747748749750751752753754755756757758759760761762763764765766767768769770771772773774775776777778779780781782783784785786787788789790791792793794795796797798799800801802803804805806807808809810811812813814815816817818819820821822823824825826827828829830831832833834835836837838839840841842843844845846847848849850851852853854855856857858859860861862863864865866867868869870871872873874875876 | /* * This file is part of the StarPU Handbook. * Copyright (C) 2009--2011  Universit@'e de Bordeaux * Copyright (C) 2010, 2011, 2012, 2013, 2014, 2015  CNRS * Copyright (C) 2011, 2012 INRIA * See the file version.doxy for copying conditions. *//*! \page ExecutionConfigurationThroughEnvironmentVariables Execution Configuration Through Environment VariablesThe behavior of the StarPU library and tools may be tuned thanks tothe following environment variables.\section ConfiguringWorkers Configuring Workers<dl><dt>STARPU_NCPU</dt><dd>\anchor STARPU_NCPU\addindex __env__STARPU_NCPUSpecify the number of CPU workers (thus not including workersdedicated to control accelerators). Note that by default, StarPU willnot allocate more CPU workers than there are physical CPUs, and thatsome CPUs are used to control the accelerators.</dd><dt>STARPU_NCPUS</dt><dd>\anchor STARPU_NCPUS\addindex __env__STARPU_NCPUSThis variable is deprecated. You should use \ref STARPU_NCPU.</dd><dt>STARPU_NCUDA</dt><dd>\anchor STARPU_NCUDA\addindex __env__STARPU_NCUDASpecify the number of CUDA devices that StarPU can use. If\ref STARPU_NCUDA is lower than the number of physical devices, it ispossible to select which CUDA devices should be used by the means of theenvironment variable \ref STARPU_WORKERS_CUDAID. By default, StarPU willcreate as many CUDA workers as there are CUDA devices.</dd><dt>STARPU_NWORKER_PER_CUDA</dt><dd>\anchor STARPU_NWORKER_PER_CUDA\addindex __env__STARPU_NWORKER_PER_CUDASpecify the number of workers per CUDA device, and thus the number of kernelswhich will be concurrently running on the devices. The default value is 1.</dd><dt>STARPU_CUDA_PIPELINE</dt><dd>\anchor STARPU_CUDA_PIPELINE\addindex __env__STARPU_CUDA_PIPELINESpecify how many asynchronous tasks are submitted in advance on CUDAdevices. This for instance permits to overlap task management with the executionof previous tasks, but it also allows concurrent execution on Fermi cards, whichotherwise bring spurious synchronizations. The default is 2. Setting the value to 0 forces a synchronousexecution of all tasks.</dd><dt>STARPU_NOPENCL</dt><dd>\anchor STARPU_NOPENCL\addindex __env__STARPU_NOPENCLOpenCL equivalent of the environment variable \ref STARPU_NCUDA.</dd><dt>STARPU_OPENCL_PIPELINE</dt><dd>\anchor STARPU_OPENCL_PIPELINE\addindex __env__STARPU_OPENCL_PIPELINESpecify how many asynchronous tasks are submitted in advance on OpenCLdevices. This for instance permits to overlap task management with the executionof previous tasks, but it also allows concurrent execution on Fermi cards, whichotherwise bring spurious synchronizations. The default is 2. Setting the value to 0 forces a synchronousexecution of all tasks.</dd><dt>STARPU_OPENCL_ON_CPUS</dt><dd>\anchor STARPU_OPENCL_ON_CPUS\addindex __env__STARPU_OPENCL_ON_CPUSBy default, the OpenCL driver only enables GPU and acceleratordevices. By setting the environment variable \refSTARPU_OPENCL_ON_CPUS to 1, the OpenCL driver will also enable CPUdevices.</dd><dt>STARPU_OPENCL_ONLY_ON_CPUS</dt><dd>\anchor STARPU_OPENCL_ONLY_ON_CPUS\addindex __env__STARPU_OPENCL_ONLY_ON_CPUSBy default, the OpenCL driver enables GPU and acceleratordevices. By setting the environment variable \refSTARPU_OPENCL_ONLY_ON_CPUS to 1, the OpenCL driver will ONLY enableCPU devices.</dd><dt>STARPU_NMIC</dt><dd>\anchor STARPU_NMIC\addindex __env__STARPU_NMICMIC equivalent of the environment variable \ref STARPU_NCUDA, i.e. the number ofMIC devices to use.</dd><dt>STARPU_NMICCORES</dt><dd>\anchor STARPU_NMICCORES\addindex __env__STARPU_NMICCORESNumber of cores to use on the MIC devices.</dd><dt>STARPU_NSCC</dt><dd>\anchor STARPU_NSCC\addindex __env__STARPU_NSCCSCC equivalent of the environment variable \ref STARPU_NCUDA.</dd><dt>STARPU_WORKERS_NOBIND</dt><dd>\anchor STARPU_WORKERS_NOBIND\addindex __env__STARPU_WORKERS_NOBINDSetting it to non-zero will prevent StarPU from binding its threads toCPUs. This is for instance useful when running the testsuite in parallel.</dd><dt>STARPU_WORKERS_CPUID</dt><dd>\anchor STARPU_WORKERS_CPUID\addindex __env__STARPU_WORKERS_CPUIDPassing an array of integers in \ref STARPU_WORKERS_CPUIDspecifies on which logical CPU the different workers should bebound. For instance, if <c>STARPU_WORKERS_CPUID = "0 1 4 5"</c>, the firstworker will be bound to logical CPU #0, the second CPU worker will be bound tological CPU #1 and so on.  Note that the logical ordering of the CPUs is eitherdetermined by the OS, or provided by the library <c>hwloc</c> in case it isavailable. Ranges can be provided: for instance, <c>STARPU_WORKERS_CPUID = "1-35"</c> will bind the first three workers on logical CPUs #1, #2, and #3, and thefourth worker on logical CPU #5. Unbound ranges can also be provided:<c>STARPU_WORKERS_CPUID = "1-"</c> will bind the workers starting from logicalCPU #1 up to last CPU.Note that the first workers correspond to the CUDA workers, then come theOpenCL workers, and finally the CPU workers. For example ifwe have <c>STARPU_NCUDA=1</c>, <c>STARPU_NOPENCL=1</c>, <c>STARPU_NCPU=2</c>and <c>STARPU_WORKERS_CPUID = "0 2 1 3"</c>, the CUDA device will be controlledby logical CPU #0, the OpenCL device will be controlled by logical CPU #2, andthe logical CPUs #1 and #3 will be used by the CPU workers.If the number of workers is larger than the array given in \refSTARPU_WORKERS_CPUID, the workers are bound to the logical CPUs in around-robin fashion: if <c>STARPU_WORKERS_CPUID = "0 1"</c>, the firstand the third (resp. second and fourth) workers will be put on CPU #0(resp. CPU #1).This variable is ignored if the fieldstarpu_conf::use_explicit_workers_bindid passed to starpu_init() isset.</dd><dt>STARPU_WORKERS_CUDAID</dt><dd>\anchor STARPU_WORKERS_CUDAID\addindex __env__STARPU_WORKERS_CUDAIDSimilarly to the \ref STARPU_WORKERS_CPUID environment variable, it ispossible to select which CUDA devices should be used by StarPU. On a machineequipped with 4 GPUs, setting <c>STARPU_WORKERS_CUDAID = "1 3"</c> and<c>STARPU_NCUDA=2</c> specifies that 2 CUDA workers should be created, and thatthey should use CUDA devices #1 and #3 (the logical ordering of the devices isthe one reported by CUDA).This variable is ignored if the fieldstarpu_conf::use_explicit_workers_cuda_gpuid passed to starpu_init()is set.</dd><dt>STARPU_WORKERS_OPENCLID</dt><dd>\anchor STARPU_WORKERS_OPENCLID\addindex __env__STARPU_WORKERS_OPENCLIDOpenCL equivalent of the \ref STARPU_WORKERS_CUDAID environment variable.This variable is ignored if the fieldstarpu_conf::use_explicit_workers_opencl_gpuid passed to starpu_init()is set.</dd><dt>STARPU_WORKERS_MICID</dt><dd>\anchor STARPU_WORKERS_MICID\addindex __env__STARPU_WORKERS_MICIDMIC equivalent of the \ref STARPU_WORKERS_CUDAID environment variable.This variable is ignored if the fieldstarpu_conf::use_explicit_workers_mic_deviceid passed to starpu_init()is set.</dd><dt>STARPU_WORKERS_SCCID</dt><dd>\anchor STARPU_WORKERS_SCCID\addindex __env__STARPU_WORKERS_SCCIDSCC equivalent of the \ref STARPU_WORKERS_CUDAID environment variable.This variable is ignored if the fieldstarpu_conf::use_explicit_workers_scc_deviceid passed to starpu_init()is set.</dd><dt>STARPU_SINGLE_COMBINED_WORKER</dt><dd>\anchor STARPU_SINGLE_COMBINED_WORKER\addindex __env__STARPU_SINGLE_COMBINED_WORKERIf set, StarPU will create several workers which won't be able to workconcurrently. It will by default create combined workers which size goes from 1to the total number of CPU workers in the system. \ref STARPU_MIN_WORKERSIZEand \ref STARPU_MAX_WORKERSIZE can be used to change this default.</dd><dt>STARPU_MIN_WORKERSIZE</dt><dd>\anchor STARPU_MIN_WORKERSIZE\addindex __env__STARPU_MIN_WORKERSIZE\ref STARPU_MIN_WORKERSIZEpermits to specify the minimum size of the combined workers (instead of the default 2)</dd><dt>STARPU_MAX_WORKERSIZE</dt><dd>\anchor STARPU_MAX_WORKERSIZE\addindex __env__STARPU_MAX_WORKERSIZE\ref STARPU_MAX_WORKERSIZEpermits to specify the minimum size of the combined workers (instead of thenumber of CPU workers in the system)</dd><dt>STARPU_SYNTHESIZE_ARITY_COMBINED_WORKER</dt><dd>\anchor STARPU_SYNTHESIZE_ARITY_COMBINED_WORKER\addindex __env__STARPU_SYNTHESIZE_ARITY_COMBINED_WORKERLet the user decide how many elements are allowed between combined workerscreated from hwloc information. For instance, in the case of sockets with 6cores without shared L2 caches, if \ref STARPU_SYNTHESIZE_ARITY_COMBINED_WORKER isset to 6, no combined worker will be synthesized beyond one for the socketand one per core. If it is set to 3, 3 intermediate combined workers will besynthesized, to divide the socket cores into 3 chunks of 2 cores. If it set to2, 2 intermediate combined workers will be synthesized, to divide the the socketcores into 2 chunks of 3 cores, and then 3 additional combined workers will besynthesized, to divide the former synthesized workers into a bunch of 2 cores,and the remaining core (for which no combined worker is synthesized since thereis already a normal worker for it).The default, 2, thus makes StarPU tend to building a binary trees of combinedworkers.</dd><dt>STARPU_DISABLE_ASYNCHRONOUS_COPY</dt><dd>\anchor STARPU_DISABLE_ASYNCHRONOUS_COPY\addindex __env__STARPU_DISABLE_ASYNCHRONOUS_COPYDisable asynchronous copies between CPU and GPU devices.The AMD implementation of OpenCL is known tofail when copying data asynchronously. When using this implementation,it is therefore necessary to disable asynchronous data transfers.</dd><dt>STARPU_DISABLE_ASYNCHRONOUS_CUDA_COPY</dt><dd>\anchor STARPU_DISABLE_ASYNCHRONOUS_CUDA_COPY\addindex __env__STARPU_DISABLE_ASYNCHRONOUS_CUDA_COPYDisable asynchronous copies between CPU and CUDA devices.</dd><dt>STARPU_DISABLE_ASYNCHRONOUS_OPENCL_COPY</dt><dd>\anchor STARPU_DISABLE_ASYNCHRONOUS_OPENCL_COPY\addindex __env__STARPU_DISABLE_ASYNCHRONOUS_OPENCL_COPYDisable asynchronous copies between CPU and OpenCL devices.The AMD implementation of OpenCL is known tofail when copying data asynchronously. When using this implementation,it is therefore necessary to disable asynchronous data transfers.</dd><dt>STARPU_DISABLE_ASYNCHRONOUS_MIC_COPY</dt><dd>\anchor STARPU_DISABLE_ASYNCHRONOUS_MIC_COPY\addindex __env__STARPU_DISABLE_ASYNCHRONOUS_MIC_COPYDisable asynchronous copies between CPU and MIC devices.</dd><dt>STARPU_ENABLE_CUDA_GPU_GPU_DIRECT</dt><dd>\anchor STARPU_ENABLE_CUDA_GPU_GPU_DIRECT\addindex __env__STARPU_ENABLE_CUDA_GPU_GPU_DIRECTEnable (1) or Disable (0) direct CUDA transfers from GPU to GPU, without copyingthrough RAM. The default is Enabled.This permits to test the performance effect of GPU-Direct.</dd><dt>STARPU_DISABLE_PINNING</dt><dd>\anchor STARPU_DISABLE_PINNING\addindex __env__STARPU_DISABLE_PINNINGDisable (1) or Enable (0) pinning host memory allocated through starpu_malloc, starpu_memory_pinand friends.  The default is Enabled.This permits to test the performance effect of memory pinning.</dd></dl>\section ConfiguringTheSchedulingEngine Configuring The Scheduling Engine<dl><dt>STARPU_SCHED</dt><dd>\anchor STARPU_SCHED\addindex __env__STARPU_SCHEDChoose between the different scheduling policies proposed by StarPU: workrandom, stealing, greedy, with performance models, etc.Use <c>STARPU_SCHED=help</c> to get the list of available schedulers.</dd><dt>STARPU_CALIBRATE</dt><dd>\anchor STARPU_CALIBRATE\addindex __env__STARPU_CALIBRATEIf this variable is set to 1, the performance models are calibrated duringthe execution. If it is set to 2, the previous values are dropped to restartcalibration from scratch. Setting this variable to 0 disable calibration, thisis the default behaviour.Note: this currently only applies to <c>dm</c> and <c>dmda</c> scheduling policies.</dd><dt>STARPU_CALIBRATE_MINIMUM</dt><dd>\anchor STARPU_CALIBRATE_MINIMUM\addindex __env__STARPU_CALIBRATE_MINIMUMThis defines the minimum number of calibration measurements that will be madebefore considering that the performance model is calibrated. The default value is 10.</dd><dt>STARPU_BUS_CALIBRATE</dt><dd>\anchor STARPU_BUS_CALIBRATE\addindex __env__STARPU_BUS_CALIBRATEIf this variable is set to 1, the bus is recalibrated during intialization.</dd><dt>STARPU_PREFETCH</dt><dd>\anchor STARPU_PREFETCH\addindex __env__STARPU_PREFETCHThis variable indicates whether data prefetching should be enabled (0 meansthat it is disabled). If prefetching is enabled, when a task is scheduled to beexecuted e.g. on a GPU, StarPU will request an asynchronous transfer inadvance, so that data is already present on the GPU when the task starts. As aresult, computation and data transfers are overlapped.Note that prefetching is enabled by default in StarPU.</dd><dt>STARPU_SCHED_ALPHA</dt><dd>\anchor STARPU_SCHED_ALPHA\addindex __env__STARPU_SCHED_ALPHATo estimate the cost of a task StarPU takes into account the estimatedcomputation time (obtained thanks to performance models). The alpha factor isthe coefficient to be applied to it before adding it to the communication part.</dd><dt>STARPU_SCHED_BETA</dt><dd>\anchor STARPU_SCHED_BETA\addindex __env__STARPU_SCHED_BETATo estimate the cost of a task StarPU takes into account the estimateddata transfer time (obtained thanks to performance models). The beta factor isthe coefficient to be applied to it before adding it to the computation part.</dd><dt>STARPU_SCHED_GAMMA</dt><dd>\anchor STARPU_SCHED_GAMMA\addindex __env__STARPU_SCHED_GAMMADefine the execution time penalty of a joule (\ref Power-basedScheduling).</dd><dt>STARPU_IDLE_POWER</dt><dd>\anchor STARPU_IDLE_POWER\addindex __env__STARPU_IDLE_POWERDefine the idle power of the machine (\ref Power-basedScheduling).</dd><dt>STARPU_PROFILING</dt><dd>\anchor STARPU_PROFILING\addindex __env__STARPU_PROFILINGEnable on-line performance monitoring (\ref EnablingOn-linePerformanceMonitoring).</dd></dl>\section Extensions Extensions<dl><dt>SOCL_OCL_LIB_OPENCL</dt><dd>\anchor SOCL_OCL_LIB_OPENCL\addindex __env__SOCL_OCL_LIB_OPENCLTHE SOCL test suite is only run when the environment variable \refSOCL_OCL_LIB_OPENCL is defined. It should contain the locationof the file <c>libOpenCL.so</c> of the OCL ICD implementation.</dd><dt>OCL_ICD_VENDORS</dt><dd>\anchor OCL_ICD_VENDORS\addindex __env__OCL_ICD_VENDORSWhen using SOCL with OpenCL ICD(https://forge.imag.fr/projects/ocl-icd/), this variable may be usedto point to the directory where ICD files are installed. The defaultdirectory is <c>/etc/OpenCL/vendors</c>. StarPU installs ICDfiles in the directory <c>$prefix/share/starpu/opencl/vendors</c>.</dd><dt>STARPU_COMM_STATS</dt><dd>\anchor STARPU_COMM_STATS\addindex __env__STARPU_COMM_STATSCommunication statistics for starpumpi (\ref MPISupport)will be enabled when the environment variable \ref STARPU_COMM_STATSis defined to an value other than 0.</dd><dt>STARPU_MPI_CACHE</dt><dd>\anchor STARPU_MPI_CACHE\addindex __env__STARPU_MPI_CACHECommunication cache for starpumpi (\ref MPISupport) will bedisabled when the environment variable \ref STARPU_MPI_CACHE is setto 0. It is enabled by default or for any other values of the variable\ref STARPU_MPI_CACHE.</dd><dt>STARPU_MPI_COMM</dt><dd>\anchor STARPU_MPI_COMM\addindex __env__STARPU_MPI_COMMCommunication trace for starpumpi (\ref MPISupport) will beenabled when the environment variable \ref STARPU_MPI_COMM is setto 1, and StarPU has been configured with the option\ref enable-verbose "--enable-verbose".</dd><dt>STARPU_MPI_CACHE_STATS</dt><dd>\anchor STARPU_MPI_CACHE_STATS\addindex __env__STARPU_MPI_CACHE_STATSWhen set to 1, statistics are enabled for the communication cache (\ref MPISupport). For now,it prints messages on the standard output when data are added or removed from the receivedcommunication cache.</dd><dt>STARPU_SIMGRID_CUDA_MALLOC_COST</dt><dd>\anchor STARPU_SIMGRID_CUDA_MALLOC_COST\addindex __env__STARPU_SIMGRID_CUDA_MALLOC_COSTWhen set to 1 (which is the default), CUDA malloc costs are taken into accountin simgrid mode.</dd><dt>STARPU_SIMGRID_CUDA_QUEUE_COST</dt><dd>\anchor STARPU_SIMGRID_CUDA_QUEUE_COST\addindex __env__STARPU_SIMGRID_CUDA_QUEUE_COSTWhen set to 1 (which is the default), CUDA task and transfer queueing costs aretaken into account in simgrid mode.</dd></dl>\section MiscellaneousAndDebug Miscellaneous And Debug<dl><dt>STARPU_HOME</dt><dd>\anchor STARPU_HOME\addindex __env__STARPU_HOMEThis specifies the main directory in which StarPU stores itsconfiguration files. The default is <c>$HOME</c> on Unix environments,and <c>$USERPROFILE</c> on Windows environments.</dd><dt>STARPU_PERF_MODEL_DIR</dt><dd>\anchor STARPU_PERF_MODEL_DIR\addindex __env__STARPU_PERF_MODEL_DIRThis specifies the main directory in which StarPU stores itsperformance model files. The default is <c>$STARPU_HOME/.starpu/sampling</c>.</dd><dt>STARPU_HOSTNAME</dt><dd>\anchor STARPU_HOSTNAME\addindex __env__STARPU_HOSTNAMEWhen set, force the hostname to be used when dealing performance modelfiles. Models are indexed by machine name. When running for example ona homogenenous cluster, it is possible to share the models betweenmachines by setting <c>export STARPU_HOSTNAME=some_global_name</c>.</dd><dt>STARPU_OPENCL_PROGRAM_DIR</dt><dd>\anchor STARPU_OPENCL_PROGRAM_DIR\addindex __env__STARPU_OPENCL_PROGRAM_DIRThis specifies the directory where the OpenCL codelet source files arelocated. The function starpu_opencl_load_program_source() looksfor the codelet in the current directory, in the directory specifiedby the environment variable \ref STARPU_OPENCL_PROGRAM_DIR, in thedirectory <c>share/starpu/opencl</c> of the installation directory ofStarPU, and finally in the source directory of StarPU.</dd><dt>STARPU_SILENT</dt><dd>\anchor STARPU_SILENT\addindex __env__STARPU_SILENTThis variable allows to disable verbose mode at runtime when StarPUhas been configured with the option \ref enable-verbose "--enable-verbose". It alsodisables the display of StarPU information and warning messages.</dd><dt>STARPU_LOGFILENAME</dt><dd>\anchor STARPU_LOGFILENAME\addindex __env__STARPU_LOGFILENAMEThis variable specifies in which file the debugging output should be saved to.</dd><dt>STARPU_FXT_PREFIX</dt><dd>\anchor STARPU_FXT_PREFIX\addindex __env__STARPU_FXT_PREFIXThis variable specifies in which directory to save the trace generated if FxT is enabled. It needs to have a trailing '/' character.</dd><dt>STARPU_LIMIT_CUDA_devid_MEM</dt><dd>\anchor STARPU_LIMIT_CUDA_devid_MEM\addindex __env__STARPU_LIMIT_CUDA_devid_MEMThis variable specifies the maximum number of megabytes that should beavailable to the application on the CUDA device with the identifier<c>devid</c>. This variable is intended to be used for experimentalpurposes as it emulates devices that have a limited amount of memory.When defined, the variable overwrites the value of the variable\ref STARPU_LIMIT_CUDA_MEM.</dd><dt>STARPU_LIMIT_CUDA_MEM</dt><dd>\anchor STARPU_LIMIT_CUDA_MEM\addindex __env__STARPU_LIMIT_CUDA_MEMThis variable specifies the maximum number of megabytes that should beavailable to the application on each CUDA devices. This variable isintended to be used for experimental purposes as it emulates devicesthat have a limited amount of memory.</dd><dt>STARPU_LIMIT_OPENCL_devid_MEM</dt><dd>\anchor STARPU_LIMIT_OPENCL_devid_MEM\addindex __env__STARPU_LIMIT_OPENCL_devid_MEMThis variable specifies the maximum number of megabytes that should beavailable to the application on the OpenCL device with the identifier<c>devid</c>. This variable is intended to be used for experimentalpurposes as it emulates devices that have a limited amount of memory.When defined, the variable overwrites the value of the variable\ref STARPU_LIMIT_OPENCL_MEM.</dd><dt>STARPU_LIMIT_OPENCL_MEM</dt><dd>\anchor STARPU_LIMIT_OPENCL_MEM\addindex __env__STARPU_LIMIT_OPENCL_MEMThis variable specifies the maximum number of megabytes that should beavailable to the application on each OpenCL devices. This variable isintended to be used for experimental purposes as it emulates devicesthat have a limited amount of memory.</dd><dt>STARPU_LIMIT_CPU_MEM</dt><dd>\anchor STARPU_LIMIT_CPU_MEM\addindex __env__STARPU_LIMIT_CPU_MEMThis variable specifies the maximum number of megabytes that should beavailable to the application on each CPU device. Setting it enables allocationcache in main memory</dd><dt>STARPU_MINIMUM_AVAILABLE_MEM</dt><dd>\anchor STARPU_MINIMUM_AVAILABLE_MEM\addindex __env__STARPU_MINIMUM_AVAILABLE_MEMThis specifies the minimum percentage of memory that should be available in GPUs(or in main memory, when using out of core), below which a reclaiming pass isperformed. The default is 5%.</dd><dt>STARPU_TARGET_AVAILABLE_MEM</dt><dd>\anchor STARPU_TARGET_AVAILABLE_MEM\addindex __env__STARPU_TARGET_AVAILABLE_MEMThis specifies the target percentage of memory that should be reached inGPUs (or in main memory, when using out of core), when performing a periodicreclaiming pass. The default is 10%.</dd><dt>STARPU_MINIMUM_CLEAN_BUFFERS</dt><dd>\anchor STARPU_MINIMUM_CLEAN_BUFFERS\addindex __env__STARPU_MINIMUM_CLEAN_BUFFERSThis specifies the minimum percentage of number of buffers that should be clean in GPUs(or in main memory, when using out of core), below which asynchronous writebacks will beissued. The default is 5%.</dd><dt>STARPU_TARGET_CLEAN_BUFFERS</dt><dd>\anchor STARPU_TARGET_CLEAN_BUFFERS\addindex __env__STARPU_TARGET_CLEAN_BUFFERSThis specifies the target percentage of number of buffers that should be reached inGPUs (or in main memory, when using out of core), when performing an asynchronouswriteback pass. The default is 10%.</dd><dt>STARPU_DISK_SWAP</dt><dd>\anchor STARPU_DISK_SWAP\addindex __env__STARPU_DISK_SWAPThis specifies a path where StarPU can push data when the main memory is gettingfull.</dd><dt>STARPU_DISK_SWAP_BACKEND</dt><dd>\anchor STARPU_DISK_SWAP_BACKEND\addindex __env__STARPU_DISK_SWAP_BACKENDThis specifies then backend to be used by StarPU to push data when the mainmemory is getting full. The default is unistd (i.e. using read/write functions),other values are stdio (i.e. using fread/fwrite), unistd_o_direct (i.e. usingread/write with O_DIRECT), and leveldb (i.e. using a leveldb database).</dd><dt>STARPU_DISK_SWAP_SIZE</dt><dd>\anchor STARPU_DISK_SWAP_SIZE\addindex __env__STARPU_DISK_SWAP_SIZEThis specifies then size to be used by StarPU to push data when the mainmemory is getting full. The default is unlimited.</dd><dt>STARPU_LIMIT_MAX_NSUBMITTED_TASKS</dt><dd>\anchor STARPU_LIMIT_MAX_NSUBMITTED_TASKS\addindex __env__STARPU_LIMIT_MAX_NSUBMITTED_TASKS    This variable allows the user to control the task submission flow by specifyingto StarPU a maximum number of submitted tasks allowed at a given time, i.e. whenthis limit is reached task submission becomes blocking until enough tasks havecompleted, specified by STARPU_LIMIT_MIN_NSUBMITTED_TASKS.Setting it enables allocation cache buffer reuse in main memory.</dd><dt>STARPU_LIMIT_MIN_NSUBMITTED_TASKS</dt><dd>\anchor STARPU_LIMIT_MIN_NSUBMITTED_TASKS\addindex __env__STARPU_LIMIT_MIN_NSUBMITTED_TASKS    This variable allows the user to control the task submission flow by specifyingto StarPU a submitted task threshold to wait before unblocking task submission. Thisvariable has to be used in conjunction with \refSTARPU_LIMIT_MAX_NSUBMITTED_TASKS which puts the task submission thread tosleep.  Setting it enables allocation cache buffer reuse in main memory.</dd><dt>STARPU_TRACE_BUFFER_SIZE</dt><dd>\anchor STARPU_TRACE_BUFFER_SIZE\addindex __env__STARPU_TRACE_BUFFER_SIZEThis sets the buffer size for recording trace events in MiB. Setting it to a bigsize allows to avoid pauses in the trace while it is recorded on the disk. Thishowever also consumes memory, of course. The default value is 64.</dd><dt>STARPU_GENERATE_TRACE</dt><dd>\anchor STARPU_GENERATE_TRACE\addindex __env__STARPU_GENERATE_TRACEWhen set to <c>1</c>, this variable indicates that StarPU should automaticallygenerate a Paje trace when starpu_shutdown() is called.</dd><dt>STARPU_ENABLE_STATS</dt><dd>\anchor STARPU_ENABLE_STATS\addindex __env__STARPU_ENABLE_STATSWhen defined, enable gathering various data statistics (\ref DataStatistics).</dd><dt>STARPU_MEMORY_STATS</dt><dd>\anchor STARPU_MEMORY_STATS\addindex __env__STARPU_MEMORY_STATSWhen set to 0, disable the display of memory statistics on data whichhave not been unregistered at the end of the execution (\ref MemoryFeedback).</dd><dt>STARPU_BUS_STATS</dt><dd>\anchor STARPU_BUS_STATS\addindex __env__STARPU_BUS_STATSWhen defined, statistics about data transfers will be displayed when callingstarpu_shutdown() (\ref Profiling).</dd><dt>STARPU_WORKER_STATS</dt><dd>\anchor STARPU_WORKER_STATS\addindex __env__STARPU_WORKER_STATSWhen defined, statistics about the workers will be displayed when callingstarpu_shutdown() (\ref Profiling). When combined with theenvironment variable \ref STARPU_PROFILING, it displays the powerconsumption (\ref Power-basedScheduling).</dd><dt>STARPU_STATS</dt><dd>\anchor STARPU_STATS\addindex __env__STARPU_STATSWhen set to 0, data statistics will not be displayed at theend of the execution of an application (\ref DataStatistics).</dd><dt>STARPU_WATCHDOG_TIMEOUT</dt><dd>\anchor STARPU_WATCHDOG_TIMEOUT\addindex __env__STARPU_WATCHDOG_TIMEOUTWhen set to a value other than 0, allows to make StarPU print an errormessage whenever StarPU does not terminate any task for the given time (in µs). Shouldbe used in combination with \ref STARPU_WATCHDOG_CRASH (see \refDetectionStuckConditions).</dd><dt>STARPU_WATCHDOG_CRASH</dt><dd>\anchor STARPU_WATCHDOG_CRASH\addindex __env__STARPU_WATCHDOG_CRASHWhen set to a value other than 0, it triggers a crash when the watchdog is reached, thus allowing to catch the situation in gdb, etc(see \ref DetectionStuckConditions)</dd><dt>STARPU_DISABLE_KERNELS</dt><dd>\anchor STARPU_DISABLE_KERNELS\addindex __env__STARPU_DISABLE_KERNELSWhen set to a value other than 1, it disables actually calling the kernelfunctions, thus allowing to quickly check that the task scheme is workingproperly, without performing the actual application-provided computation.</dd><dt>STARPU_HISTORY_MAX_ERROR</dt><dd>\anchor STARPU_HISTORY_MAX_ERROR\addindex __env__STARPU_HISTORY_MAX_ERRORHistory-based performance models will drop measurements which are really farfroom the measured average. This specifies the allowed variation. The default is50 (%), i.e. the measurement is allowed to be x1.5 faster or /1.5 slower than theaverage.</dd><dt>STARPU_RAND_SEED</dt><dd>\anchor STARPU_RAND_SEED\addindex __env__STARPU_RAND_SEEDThe random scheduler and some examples use random numbers for their ownworking. Depending on the examples, the seed is by default juste always 0 orthe current time() (unless simgrid mode is enabled, in which case it is always0). STARPU_RAND_SEED allows to set the seed to a specific value.</dd></dl>\section ConfiguringTheHypervisor Configuring The Hypervisor<dl><dt>SC_HYPERVISOR_POLICY</dt><dd>\anchor SC_HYPERVISOR_POLICY\addindex __env__SC_HYPERVISOR_POLICYChoose between the different resizing policies proposed by StarPU for the hypervisor: idle, app_driven, feft_lp, teft_lp; ispeed_lp, throughput_lp etc.Use <c>SC_HYPERVISOR_POLICY=help</c> to get the list of available policies for the hypervisor</dd><dt>SC_HYPERVISOR_TRIGGER_RESIZE</dt><dd>\anchor SC_HYPERVISOR_TRIGGER_RESIZE\addindex __env__SC_HYPERVISOR_TRIGGER_RESIZEChoose how should the hypervisor be triggered: <c>speed</c> if the resizing algorithm shouldbe called whenever the speed of the context does not correspond to an optimal precomputed value,<c>idle</c> it the resizing algorithm should be called whenever the workers are idle for a periodlonger than the value indicated when configuring the hypervisor.</dd><dt>SC_HYPERVISOR_START_RESIZE</dt><dd>\anchor SC_HYPERVISOR_START_RESIZE\addindex __env__SC_HYPERVISOR_START_RESIZEIndicate the moment when the resizing should be available. The value correspond to the percentageof the total time of execution of the application. The default value is the resizing frame.</dd><dt>SC_HYPERVISOR_MAX_SPEED_GAP</dt><dd>\anchor SC_HYPERVISOR_MAX_SPEED_GAP\addindex __env__SC_HYPERVISOR_MAX_SPEED_GAPIndicate the ratio of speed difference between contexts that should trigger the hypervisor.This situation may occur only when a theoretical speed could not be computed and the hypervisorhas no value to compare the speed to. Otherwise the resizing of a context is not influenced by the the speed of the other contexts, but only by the the value that a context should have.</dd><dt>SC_HYPERVISOR_STOP_PRINT</dt><dd>\anchor SC_HYPERVISOR_STOP_PRINT\addindex __env__SC_HYPERVISOR_STOP_PRINTBy default the values of the speed of the workers is printed during the executionof the application. If the value 1 is given to this environment variable this printingis not done.</dd><dt>SC_HYPERVISOR_LAZY_RESIZE</dt><dd>\anchor SC_HYPERVISOR_LAZY_RESIZE\addindex __env__SC_HYPERVISOR_LAZY_RESIZEBy default the hypervisor resizes the contexts in a lazy way, that is workers are firstly added to a new contextbefore removing them from the previous one. Once this workers are clearly taken into account into the new context (a task was poped there) we remove them from the previous one. However if the applicationwould like that the change in the distribution of workers should change right away this variable should be set to 0</dd><dt>SC_HYPERVISOR_SAMPLE_CRITERIA</dt><dd>\anchor SC_HYPERVISOR_SAMPLE_CRITERIA\addindex __env__SC_HYPERVISOR_SAMPLE_CRITERIABy default the hypervisor uses a sample of flops when computing the speed of the contexts and of the workers.If this variable is set to <c>time</c> the hypervisor uses a sample of time (10% of an aproximation of the totalexecution time of the application)</dd></dl>*/
 |