ChangeLog 3.2 KB

12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849505152535455565758596061626364656667686970717273747576777879808182838485868788
  1. StarPU 0.4 (svn revision ????)
  2. ==============================================
  3. The API strengthening release
  4. * Major API improvements
  5. - Provide the STARPU_SCRATCH data access mode
  6. - Rework data filter interface
  7. - Rework data interface structure
  8. - A script that automatically renames old functions to accomodate with the new
  9. API is available from https://scm.gforge.inria.fr/svn/starpu/scripts/renaming
  10. * Implement dependencies between task directly (eg. without tags)
  11. * Implicit data-driven task dependencies simplifies the design of
  12. data-parallel algorithms
  13. * Dynamic profiling capabilities
  14. - starpu_top
  15. * Provide a library to help accelerating MPI applications
  16. * Automate data movement overhead prediction
  17. * Transparently benchmark buses to generate performance models
  18. * Bind accelerator-controlling threads with respect to NUMA locality
  19. * Improve StarPU's portability
  20. - Finalize OpenCL support
  21. - Add support for Windows
  22. - StarPU can be used from C++ or NVCC directly now
  23. StarPU 0.2.901 aka 0.3-rc1 (svn revision 1236)
  24. ==============================================
  25. The asynchronous heterogeneous multi-accelerator release
  26. * Many API changes and code cleanups
  27. - Implement starpu_worker_get_id
  28. - Implement starpu_worker_get_name
  29. - Implement starpu_worker_get_type
  30. - Implement starpu_worker_get_count
  31. - Implement starpu_display_codelet_stats
  32. - Implement starpu_data_prefetch_on_node
  33. - Expose the starpu_data_set_wb_mask function
  34. * Support nvidia (heterogeneous) multi-GPU
  35. * Add the data request mechanism
  36. - All data transfers use data requests now
  37. - Implement asynchronous data transfers
  38. - Implement prefetch mechanism
  39. - Chain data requests to support GPU->RAM->GPU transfers
  40. * Make it possible to bypass the scheduler and to assign a task to a specific
  41. worker
  42. * Support restartable tasks to reinstanciate dependencies task graphs
  43. * Improve performance prediction
  44. - Model data transfer overhead
  45. - One model is created for each accelerator
  46. * Support for CUDA's driver API is deprecated
  47. * The STARPU_WORKERS_CUDAID and STARPU_WORKERS_CPUID env. variables make it possible to
  48. specify where to bind the workers
  49. * Use the hwloc library to detect the actual number of cores
  50. StarPU 0.2.0 (svn revision 1013)
  51. ==============================================
  52. The Stabilizing-the-Basics release
  53. * Various API cleanups
  54. * Mac OS X is supported now
  55. * Add dynamic code loading facilities onto Cell's SPUs
  56. * Improve performance analysis/feedback tools
  57. * Application can interact with StarPU tasks
  58. - The application may access/modify data managed by the DSM
  59. - The application may wait for the termination of a (set of) task(s)
  60. * An initial documentation is added
  61. * More examples are supplied
  62. StarPU 0.1.0 (svn revision 794)
  63. ==============================================
  64. First release.
  65. Status:
  66. * Only supports Linux platforms yet
  67. * Supported architectures
  68. - multicore CPUs
  69. - NVIDIA GPUs (with CUDA 2.x)
  70. - experimental Cell/BE support
  71. Changes:
  72. * Scheduling facilities
  73. - run-time selection of the scheduling policy
  74. - basic auto-tuning facilities
  75. * Software-based DSM
  76. - transparent data coherency management
  77. - High-level expressive interface