ChangeLog 3.3 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990
  1. StarPU 0.4 (svn revision ????)
  2. ==============================================
  3. The API strengthening release
  4. * Major API improvements
  5. - Provide the STARPU_SCRATCH data access mode
  6. - Rework data filter interface
  7. - Rework data interface structure
  8. - A script that automatically renames old functions to accomodate with the new
  9. API is available from https://scm.gforge.inria.fr/svn/starpu/scripts/renaming
  10. (login: anonsvn, password: anonsvn)
  11. * Implement dependencies between task directly (eg. without tags)
  12. * Implicit data-driven task dependencies simplifies the design of
  13. data-parallel algorithms
  14. * Add dynamic profiling capabilities
  15. - Provide per-task feedback
  16. - Provide per-worker feedback
  17. - Provide feedback about memory transfers
  18. * Provide a library to help accelerating MPI applications
  19. * Improve data transfers overhead prediction
  20. - Transparently benchmark buses to generate performance models
  21. - Bind accelerator-controlling threads with respect to NUMA locality
  22. * Improve StarPU's portability
  23. - Add OpenCL support
  24. - Add support for Windows
  25. StarPU 0.2.901 aka 0.3-rc1 (svn revision 1236)
  26. ==============================================
  27. The asynchronous heterogeneous multi-accelerator release
  28. * Many API changes and code cleanups
  29. - Implement starpu_worker_get_id
  30. - Implement starpu_worker_get_name
  31. - Implement starpu_worker_get_type
  32. - Implement starpu_worker_get_count
  33. - Implement starpu_display_codelet_stats
  34. - Implement starpu_data_prefetch_on_node
  35. - Expose the starpu_data_set_wb_mask function
  36. * Support nvidia (heterogeneous) multi-GPU
  37. * Add the data request mechanism
  38. - All data transfers use data requests now
  39. - Implement asynchronous data transfers
  40. - Implement prefetch mechanism
  41. - Chain data requests to support GPU->RAM->GPU transfers
  42. * Make it possible to bypass the scheduler and to assign a task to a specific
  43. worker
  44. * Support restartable tasks to reinstanciate dependencies task graphs
  45. * Improve performance prediction
  46. - Model data transfer overhead
  47. - One model is created for each accelerator
  48. * Support for CUDA's driver API is deprecated
  49. * The STARPU_WORKERS_CUDAID and STARPU_WORKERS_CPUID env. variables make it possible to
  50. specify where to bind the workers
  51. * Use the hwloc library to detect the actual number of cores
  52. StarPU 0.2.0 (svn revision 1013)
  53. ==============================================
  54. The Stabilizing-the-Basics release
  55. * Various API cleanups
  56. * Mac OS X is supported now
  57. * Add dynamic code loading facilities onto Cell's SPUs
  58. * Improve performance analysis/feedback tools
  59. * Application can interact with StarPU tasks
  60. - The application may access/modify data managed by the DSM
  61. - The application may wait for the termination of a (set of) task(s)
  62. * An initial documentation is added
  63. * More examples are supplied
  64. StarPU 0.1.0 (svn revision 794)
  65. ==============================================
  66. First release.
  67. Status:
  68. * Only supports Linux platforms yet
  69. * Supported architectures
  70. - multicore CPUs
  71. - NVIDIA GPUs (with CUDA 2.x)
  72. - experimental Cell/BE support
  73. Changes:
  74. * Scheduling facilities
  75. - run-time selection of the scheduling policy
  76. - basic auto-tuning facilities
  77. * Software-based DSM
  78. - transparent data coherency management
  79. - High-level expressive interface