| 123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338 | /* * This file is part of the StarPU Handbook. * Copyright (C) 2009--2011  Universit@'e de Bordeaux * Copyright (C) 2010, 2011, 2012, 2013, 2014, 2015  CNRS * Copyright (C) 2011, 2012 INRIA * See the file version.doxy for copying conditions. *//*! \defgroup API_Data_Management Data Management\brief This section describes the data management facilities providedby StarPU. We show how to use existing data interfaces in \refAPI_Data_Interfaces, but developers can design their own data interfaces ifrequired.\typedef starpu_data_handle_t\ingroup API_Data_ManagementStarPU uses ::starpu_data_handle_t as an opaque handle tomanage a piece of data. Once a piece of data has been registered toStarPU, it is associated to a ::starpu_data_handle_t which keeps trackof the state of the piece of data over the entire machine, so that wecan maintain data consistency and locate data replicates for instance.\typedef starpu_arbiter_t\ingroup API_Data_ManagementThis is an arbiter, which implements an advanced but centralized management ofconcurrent data accesses, see \ref ConcurrentDataAccess for the details.\enum starpu_data_access_mode\ingroup API_Data_ManagementThis datatype describes a data access mode.\var starpu_data_access_mode::STARPU_NONE\ingroup API_Data_ManagementTODO\var starpu_data_access_mode::STARPU_R\ingroup API_Data_Managementread-only mode.\var starpu_data_access_mode::STARPU_W\ingroup API_Data_Managementwrite-only mode.\var starpu_data_access_mode::STARPU_RW\ingroup API_Data_Managementread-write mode. This is equivalent to ::STARPU_R|::STARPU_W\var starpu_data_access_mode::STARPU_SCRATCH\ingroup API_Data_ManagementA temporary buffer is allocated for the task, but StarPU does notenforce data consistency---i.e. each device has its own buffer,independently from each other (even for CPUs), and no data transfer isever performed. This is useful for temporary variables to avoidallocating/freeing buffers inside each task. Currently, no behavior isdefined concerning the relation with the ::STARPU_R and ::STARPU_W modesand the value provided at registration --- i.e., the value of thescratch buffer is undefined at entry of the codelet function.  It isbeing considered for future extensions at least to define the initialvalue.  For now, data to be used in ::STARPU_SCRATCH mode should beregistered with node <c>-1</c> and a <c>NULL</c> pointer, since thevalue of the provided buffer is simply ignored for now.\var starpu_data_access_mode::STARPU_REDUX\ingroup API_Data_Managementtodo\var starpu_data_access_mode::STARPU_COMMUTE\ingroup API_Data_ManagementIn addition to that, ::STARPU_COMMUTE can be passed along ::STARPU_Wor ::STARPU_RW to express that StarPU can let tasks commute, which isuseful e.g. when bringing a contribution into some data, which can bedone in any order (but still require sequential consistency againstreads or non-commutative writes).\var starpu_data_access_mode::STARPU_SSEND\ingroup API_Data_Managementused in starpu_mpi_insert_task() to specify the data has to be sentusing a synchronous and non-blocking mode (see starpu_mpi_issend())@name Basic Data Management API\ingroup API_Data_ManagementData management is done at a high-level in StarPU: rather thanaccessing a mere list of contiguous buffers, the tasks may manipulatedata that are described by a high-level construct which we call datainterface.An example of data interface is the "vector" interface which describesa contiguous data array on a spefic memory node. This interface is asimple structure containing the number of elements in the array, thesize of the elements, and the address of the array in the appropriateaddress space (this address may be invalid if there is no valid copyof the array in the memory node). More informations on the datainterfaces provided by StarPU are given in \ref API_Data_Interfaces.When a piece of data managed by StarPU is used by a task, the taskimplementation is given a pointer to an interface describing a validcopy of the data that is accessible from the current processing unit.Every worker is associated to a memory node which is a logicalabstraction of the address space from which the processing unit getsits data. For instance, the memory node associated to the differentCPU workers represents main memory (RAM), the memory node associatedto a GPU is DRAM embedded on the device. Every memory node isidentified by a logical index which is accessible from thefunction starpu_worker_get_memory_node(). When registering a piece ofdata to StarPU, the specified memory node indicates where the piece ofdata initially resides (we also call this memory node the home node ofa piece of data).\fn void starpu_data_register(starpu_data_handle_t *handleptr, unsigned home_node, void *data_interface, struct starpu_data_interface_ops *ops)\ingroup API_Data_ManagementRegister a piece of data into the handle located at the\p handleptr address. The \p data_interface buffer contains the initialdescription of the data in the \p home_node. The \p ops argument is apointer to a structure describing the different methods used tomanipulate this type of interface. See starpu_data_interface_ops formore details on this structure.If \p home_node is -1, StarPU will automatically allocate the memory whenit is used for the first time in write-only mode. Once such datahandle has been automatically allocated, it is possible to access itusing any access mode.Note that StarPU supplies a set of predefined types of interface (e.g.vector or matrix) which can be registered by the means of helperfunctions (e.g. starpu_vector_data_register() orstarpu_matrix_data_register()).\fn void starpu_data_ptr_register(starpu_data_handle_t handle, unsigned node)\ingroup API_Data_ManagementRegister that a buffer for \p handle on \p node will be set. This is typicallyused by starpu_*_ptr_register helpers before setting the interface pointers forthis node, to tell the core that that is now allocated.\fn void starpu_data_register_same(starpu_data_handle_t *handledst, starpu_data_handle_t handlesrc)\ingroup API_Data_ManagementRegister a new piece of data into the handle \p handledst with thesame interface as the handle \p handlesrc.\fn void starpu_data_unregister(starpu_data_handle_t handle)\ingroup API_Data_ManagementThis function unregisters a data handle from StarPU. If thedata was automatically allocated by StarPU because the home node was-1, all automatically allocated buffers are freed. Otherwise, a validcopy of the data is put back into the home node in the buffer that wasinitially registered. Using a data handle that has been unregisteredfrom StarPU results in an undefined behaviour. In case we do not needto update the value of the data in the home node, we can usethe function starpu_data_unregister_no_coherency() instead.\fn void starpu_data_unregister_no_coherency(starpu_data_handle_t handle)\ingroup API_Data_ManagementThis is the same as starpu_data_unregister(), except thatStarPU does not put back a valid copy into the home node, in thebuffer that was initially registered.\fn void starpu_data_unregister_submit(starpu_data_handle_t handle)\ingroup API_Data_ManagementDestroy the data handle once it is not needed anymore by anysubmitted task. No coherency is assumed.\fn void starpu_data_invalidate(starpu_data_handle_t handle)\ingroup API_Data_ManagementDestroy all replicates of the data handle immediately. Afterdata invalidation, the first access to the handle must be performed inwrite-only mode. Accessing an invalidated data in read-mode results inundefined behaviour.\fn void starpu_data_invalidate_submit(starpu_data_handle_t handle)\ingroup API_Data_ManagementSubmits invalidation of the data handle after completion ofpreviously submitted tasks.\fn void starpu_data_set_wt_mask(starpu_data_handle_t handle, uint32_t wt_mask)\ingroup API_Data_ManagementThis function sets the write-through mask of a given data (andits children), i.e. a bitmask of nodes where the data should be alwaysreplicated after modification. It also prevents the data from beingevicted from these nodes when memory gets scarse. When the data ismodified, it is automatically transfered into those memory node. Forinstance a <c>1<<0</c> write-through mask means that the CUDA workerswill commit their changes in main memory (node 0).\fn int starpu_data_fetch_on_node(starpu_data_handle_t handle, unsigned node, unsigned async)\ingroup API_Data_ManagementIssue a fetch request for a given data to a given node, i.e.requests that the data be replicated to the given node as soon as possible, so that it isavailable there for tasks. If the \p async parameter is 0, the call willblock until the transfer is achieved, else the call will return immediately,after having just queued the request. In the latter case, the request willasynchronously wait for the completion of any task writing on the data.\fn int starpu_data_prefetch_on_node(starpu_data_handle_t handle, unsigned node, unsigned async)\ingroup API_Data_ManagementIssue a prefetch request for a given data to a given node, i.e.requests that the data be replicated to the given node when there is room for it, so that it isavailable there for tasks. If the \p async parameter is 0, the call willblock until the transfer is achieved, else the call will return immediately,after having just queued the request. In the latter case, the request willasynchronously wait for the completion of any task writing on the data.\fn int starpu_data_idle_prefetch_on_node(starpu_data_handle_t handle, unsigned node, unsigned async)\ingroup API_Data_ManagementIssue an idle prefetch request for a given data to a given node, i.e.requests that the data be replicated to the given node, so that it isavailable there for tasks, but only when the bus is really idle. If the \p async parameter is 0, the call willblock until the transfer is achieved, else the call will return immediately,after having just queued the request. In the latter case, the request willasynchronously wait for the completion of any task writing on the data.\fn void starpu_data_wont_use(starpu_data_handle_t handle)\ingroup API_Data_ManagementAdvise StarPU that this handle will not be used in the close future, and isthus a good candidate for eviction from GPUs. StarPU will thus write its valueback to its home node when the bus is idle, and select this data in priorityfor eviction when memory gets low.\fn starpu_data_handle_t starpu_data_lookup(const void *ptr)\ingroup API_Data_ManagementReturn the handle corresponding to the data pointed to by the \p ptr host pointer.\fn int starpu_data_request_allocation(starpu_data_handle_t handle, unsigned node)\ingroup API_Data_ManagementExplicitly ask StarPU to allocate room for a piece of data onthe specified memory node.\fn void starpu_data_query_status(starpu_data_handle_t handle, int memory_node, int *is_allocated, int *is_valid, int *is_requested)\ingroup API_Data_ManagementQuery the status of \p handle on the specified \p memory_node.\fn void starpu_data_advise_as_important(starpu_data_handle_t handle, unsigned is_important)\ingroup API_Data_ManagementThis function allows to specify that a piece of data can bediscarded without impacting the application.\fn void starpu_data_set_reduction_methods(starpu_data_handle_t handle, struct starpu_codelet *redux_cl, struct starpu_codelet *init_cl)\ingroup API_Data_ManagementThis sets the codelets to be used for \p handle when it isaccessed in the mode ::STARPU_REDUX. Per-worker buffers will be initialized withthe codelet \p init_cl, and reduction between per-worker buffers will bedone with the codelet \p redux_cl.\fn struct starpu_data_interface_ops* starpu_data_get_interface_ops(starpu_data_handle_t handle)\ingroup API_Data_Managementtodo@name Access registered data from the application\ingroup API_Data_Management\fn int starpu_data_acquire(starpu_data_handle_t handle, enum starpu_data_access_mode mode)\ingroup API_Data_ManagementThe application must call this function prior to accessingregistered data from main memory outside tasks. StarPU ensures thatthe application will get an up-to-date copy of the data in main memorylocated where the data was originally registered, and that allconcurrent accesses (e.g. from tasks) will be consistent with theaccess mode specified in the mode argument. starpu_data_release() mustbe called once the application does not need to access the piece ofdata anymore. Note that implicit data dependencies are also enforcedby starpu_data_acquire(), i.e. starpu_data_acquire() will wait for alltasks scheduled to work on the data, unless they have been disabledexplictly by calling starpu_data_set_default_sequential_consistency_flag() orstarpu_data_set_sequential_consistency_flag(). starpu_data_acquire() is ablocking call, so that it cannot be called from tasks or from theircallbacks (in that case, starpu_data_acquire() returns <c>-EDEADLK</c>). Uponsuccessful completion, this function returns 0.\fn int starpu_data_acquire_cb(starpu_data_handle_t handle, enum starpu_data_access_mode mode, void (*callback)(void *), void *arg)\ingroup API_Data_ManagementAsynchronous equivalent of starpu_data_acquire(). When the dataspecified in \p handle is available in the appropriate accessmode, the \p callback function is executed. The application may accessthe requested data during the execution of this \p callback. The \p callbackfunction must call starpu_data_release() once the application does notneed to access the piece of data anymore. Note that implicit datadependencies are also enforced by starpu_data_acquire_cb() in case theyare not disabled. Contrary to starpu_data_acquire(), this function isnon-blocking and may be called from task callbacks. Upon successfulcompletion, this function returns 0.\fn int starpu_data_acquire_cb_sequential_consistency(starpu_data_handle_t handle, enum starpu_data_access_mode mode, void (*callback)(void *), void *arg, int sequential_consistency)\ingroup API_Data_ManagementEquivalent of starpu_data_acquire_cb() with the possibility of enabling or disabling data dependencies.When the data specified in \p handle is available in the appropriate accessmode, the \p callback function is executed. The application may accessthe requested data during the execution of this \p callback. The \p callbackfunction must call starpu_data_release() once the application does notneed to access the piece of data anymore. Note that implicit datadependencies are also enforced by starpu_data_acquire_cb_sequential_consistency() in case theyare not disabled specifically for the given \p handle or by the parameter \p sequential_consistency.Similarly to starpu_data_acquire_cb(), this function isnon-blocking and may be called from task callbacks. Upon successfulcompletion, this function returns 0.\fn int starpu_data_acquire_on_node(starpu_data_handle_t handle, int node, enum starpu_data_access_mode mode)\ingroup API_Data_ManagementThis is the same as starpu_data_acquire(), except that the datawill be available on the given memory node instead of main memory.\fn int starpu_data_acquire_on_node_cb(starpu_data_handle_t handle, int node, enum starpu_data_access_mode mode, void (*callback)(void *), void *arg)\ingroup API_Data_ManagementThis is the same as starpu_data_acquire_cb(), except that thedata will be available on the given memory node instead of mainmemory.\fn int starpu_data_acquire_on_node_cb_sequential_consistency(starpu_data_handle_t handle, int node, enum starpu_data_access_mode mode, void (*callback)(void *), void *arg, int sequential_consistency)\ingroup API_Data_ManagementThis is the same as starpu_data_acquire_cb_sequential_consistency(), except that thedata will be available on the given memory node instead of mainmemory.\def STARPU_DATA_ACQUIRE_CB(handle, mode, code)\ingroup API_Data_ManagementSTARPU_DATA_ACQUIRE_CB() is the same as starpu_data_acquire_cb(),except that the code to be executed in a callback is directly providedas a macro parameter, and the data \p handle is automatically releasedafter it. This permits to easily execute code which depends on thevalue of some registered data. This is non-blocking too and may becalled from task callbacks.\fn void starpu_data_release(starpu_data_handle_t handle)\ingroup API_Data_ManagementThis function releases the piece of data acquired by theapplication either by starpu_data_acquire() or bystarpu_data_acquire_cb().\fn void starpu_data_release_on_node(starpu_data_handle_t handle, int node)\ingroup API_Data_ManagementThis is the same as starpu_data_release(), except that the datawill be available on the given memory \p node instead of main memory.\fn starpu_arbiter_t starpu_arbiter_create(void)\ingroup API_Data_ManagementThis creates a data access arbiter, see \ref ConcurrentDataAccess for the details\fn void starpu_data_assign_arbiter(starpu_data_handle_t handle, starpu_arbiter_t arbiter)\ingroup API_Data_ManagementThis makes accesses to \p handle managed by \p arbiter\fn void starpu_arbiter_destroy(starpu_arbiter_t arbiter)\ingroup API_Data_ManagementThis destroys the \p arbiter . This must only be called after all data assignedto it have been unregistered.*/
 |