Skip to content

Releases: rapidsai/rmm

v24.12.00

11 Dec 18:23
v24.12.00
e464797
Compare
Choose a tag to compare

🚨 Breaking Changes

  • Deprecate support for directly accessing logger (#1690) @vyasr

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

[NIGHTLY] v25.02.00

26 Nov 20:23
v25.02.00a
96418d3
Compare
Choose a tag to compare
[NIGHTLY] v25.02.00 Pre-release
Pre-release

🔗 Links

🚨 Breaking Changes

  • Switch to using separate rapids-logger repo (#1774) @vyasr
  • Remove deprecated factory functions from resource adaptors. (#1767) @bdice
  • Remove rmm._lib (#1765) @Matt711
  • Remove memory access flags from cuda_async_memory_resource (#1754) @abellina
  • Create logger wrapper around spdlog that can be easily reused in other libraries (#1722) @vyasr

🐛 Bug Fixes

📖 Documentation

🚀 New Features

  • Remove deprecated factory functions from resource adaptors. (#1767) @bdice
  • Remove rmm._lib (#1765) @Matt711
  • Reduce dependencies on numba. (#1761) @bdice
  • Use ruff, remove isort and black. (#1759) @bdice
  • Use bindings layout for all cuda-python imports. (#1756) @bdice
  • Add configuration for pre-commit.ci, update pre-commit hooks (#1746) @bdice
  • Adds fabric handle and memory protection flags to cuda_async_memory_resource (#1743) @abellina
  • Remove upper bounds on cuda-python to allow 12.6.2 and 11.8.5 (#1729) @bdice

🛠️ Improvements

v24.10.00

09 Oct 14:34
v24.10.00
3223f84
Compare
Choose a tag to compare

🚨 Breaking Changes

  • Inline functions that return static references must have default visibility (#1653) @wence-
  • Hide visibility of non-public symbols (#1644) @jameslamb
  • Deprecate adaptor factories. (#1626) @bdice

🐛 Bug Fixes

📖 Documentation

  • Recommend miniforge for conda install. (#1681) @bdice
  • Fix docs cross reference in DeviceBuffer.prefetch (#1636) @bdice

🚀 New Features

  • [FEA] Allow setting *_pool_size with human-readable string (#1670) @Matt711
  • Update RMM adaptors, containers and tests to use get/set_current_device_resource_ref() (#1661) @harrism
  • Deprecate adaptor factories. (#1626) @bdice
  • Allow testing of earliest/latest dependencies (#1613) @seberg
  • Add resource_ref versions of get/set_current_device_resource (#1598) @harrism

🛠️ Improvements

v24.08.00

07 Aug 15:02
v24.08.00
ba181dd
Compare
Choose a tag to compare

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

v24.06.00

05 Jun 14:38
v24.06.00
d889275
Compare
Choose a tag to compare

🚨 Breaking Changes

  • Refactor polymorphic allocator to use device_async_resource_ref (#1555) @harrism
  • Remove deprecated functionality (#1537) @harrism
  • Remove deprecated cuda_async_memory_resource constructor that takes thrust::optional parameters (#1535) @harrism
  • Remove deprecated supports_streams and get_mem_info methods. (#1519) @harrism

🐛 Bug Fixes

📖 Documentation

  • Allow specifying mr in DeviceBuffer construction, and document ownership requirements in Python/C++ interfacing (#1552) @wence-
  • Fix Python install instruction (#1547) @wence-
  • Update multi-gpu discussion for device_buffer and device_vector dtors (#1524) @wence-
  • Fix ordering / heading levels in README.md and python example in guide.md (#1513) @harrism

🚀 New Features

🛠️ Improvements

v24.04.00

10 Apr 14:23
v24.04.00
8f19c9c
Compare
Choose a tag to compare

🚨 Breaking Changes

  • Accept stream argument in DeviceMemoryResource allocate/deallocate (#1494) @wence-
  • Replace all internal usage of get_upstream with get_upstream_resource (#1491) @miscco
  • Deprecate rmm::mr::device_memory_resource::supports_streams() (#1452) @harrism
  • Remove deprecated rmm::detail::available_device_memory (#1438) @harrism
  • Make device_memory_resource::supports_streams() not pure virtual. Remove derived implementations and calls in RMM (#1437) @harrism
  • Deprecate rmm::mr::device_memory_resource::get_mem_info() and supports_get_mem_info(). (#1436) @harrism

🐛 Bug Fixes

  • Fix search path for torch allocator in editable installs and ensure CUDA support is available (#1498) @vyasr
  • Accept stream argument in DeviceMemoryResource allocate/deallocate (#1494) @wence-
  • Run STATISTICS_TEST and TRACKING_TEST in serial to avoid OOM errors. (#1487) @bdice

📖 Documentation

  • Pin to recent breathe, to prevent getting an unsupported sphinx version. (#1495) @bdice

🚀 New Features

  • Replace all internal usage of get_upstream with get_upstream_resource (#1491) @miscco
  • Add complete set of resource ref aliases (#1479) @nvdbaranec
  • Automate include grouping using clang-format (#1463) @harrism
  • Add get_upstream_resource to resource adaptors (#1456) @miscco
  • Deprecate rmm::mr::device_memory_resource::supports_streams() (#1452) @harrism
  • Remove duplicated memory_resource_tests (#1451) @miscco
  • Change rmm::exec_policy to take async_resource_ref (#1449) @miscco
  • Change device_scalar to take async_resource_ref (#1447) @miscco
  • Add device_async_resource_ref convenience alias (#1441) @harrism
  • Remove deprecated rmm::detail::available_device_memory (#1438) @harrism
  • Make device_memory_resource::supports_streams() not pure virtual. Remove derived implementations and calls in RMM (#1437) @harrism
  • Deprecate rmm::mr::device_memory_resource::get_mem_info() and supports_get_mem_info(). (#1436) @harrism
  • Support CUDA 12.2 (#1419) @jameslamb

🛠️ Improvements

v24.02.00

12 Feb 20:53
v24.02.00
09b406c
Compare
Choose a tag to compare

🚨 Breaking Changes

  • Make device_memory_resource::do_get_mem_info() and supports_get_mem_info() not pure virtual. Remove derived implementations and calls in RMM (#1430) @harrism
  • Deprecate detail::available_device_memory, most detail/aligned.hpp utilities, and optional pool_memory_resource initial size (#1424) @harrism
  • Require explicit pool size in pool_memory_resource and move some things out of detail namespace (#1417) @harrism
  • Remove HTML builds of librmm (#1415) @vyasr
  • Update to CCCL 2.2.0. (#1404) @bdice
  • Switch to scikit-build-core (#1287) @vyasr

🐛 Bug Fixes

📖 Documentation

🚀 New Features

  • Make device_memory_resource::do_get_mem_info() and supports_get_mem_info() not pure virtual. Remove derived implementations and calls in RMM (#1430) @harrism
  • Deprecate detail::available_device_memory, most detail/aligned.hpp utilities, and optional pool_memory_resource initial size (#1424) @harrism
  • Add a host-pinned memory resource that can be used as upstream for pool_memory_resource. (#1392) @harrism

🛠️ Improvements

  • Remove usages of rapids-env-update (#1423) @KyleFromNVIDIA
  • Refactor CUDA versions in dependencies.yaml. (#1422) @bdice
  • Require explicit pool size in pool_memory_resource and move some things out of detail namespace (#1417) @harrism
  • Update dependencies.yaml to support CUDA 12.*. (#1414) @bdice
  • Define python dependency range as a matrix fallback. (#1409) @bdice
  • Use latest cuda-python within CUDA major version. (#1406) @bdice
  • Update to CCCL 2.2.0. (#1404) @bdice
  • Remove RMM_BUILD_WHEELS and standardize Python builds (#1401) @vyasr
  • Update to fmt 10.1.1 and spdlog 1.12.0. (#1374) @bdice
  • Switch to scikit-build-core (#1287) @vyasr

v23.12.00

06 Dec 15:04
v23.12.00
2db5cbb
Compare
Choose a tag to compare

🚨 Breaking Changes

  • Document minimum CUDA version of 11.4 (#1385) @harrism
  • Store and set the correct CUDA device in device_buffer (#1370) @harrism
  • Use cuda::mr::memory_resource instead of raw device_memory_resource (#1095) @miscco

🐛 Bug Fixes

  • Update actions/labeler to v4 (#1397) @raydouglass
  • Backport arena MR fix for simultaneous access by PTDS and other streams (#1396) @bdice
  • Deliberately leak PTDS thread_local events in stream ordered mr (#1375) @wence-
  • Add missing CUDA 12 dependencies and fix dlopen library names (#1366) @vyasr

📖 Documentation

🚀 New Features

  • Make internally stored default argument values public (#1373) @vyasr
  • Store and set the correct CUDA device in device_buffer (#1370) @harrism
  • Update rapids-cmake functions to non-deprecated signatures (#1357) @robertmaynard
  • Generate unified Python/C++ docs (#1324) @vyasr
  • Use cuda::mr::memory_resource instead of raw device_memory_resource (#1095) @miscco

🛠️ Improvements

v23.10.00

11 Oct 14:33
v23.10.00
f8ac6f8
Compare
Choose a tag to compare

🚨 Breaking Changes

🐛 Bug Fixes

  • Compile cdef public functions from torch_allocator with C ABI (#1350) @wence-
  • Make doxygen only a conda dependency. (#1344) @bdice
  • Use conda mambabuild not mamba mambabuild (#1338) @wence-
  • Fix stream_ordered_memory_resource attempt to record event in stream from another device (#1333) @harrism

📖 Documentation

🚀 New Features

🛠️ Improvements

v23.08.00

09 Aug 16:38
v23.08.00
f3af0e8
Compare
Choose a tag to compare

🚨 Breaking Changes

🐛 Bug Fixes

  • Fix typo in wheels-test.yaml. (#1310) @bdice
  • Add a missing '#include <array>' in logger.hpp (#1295) @valgur
  • Use gbench thread_index() accessor to fix replay bench compilation (#1293) @harrism
  • Ensure logger tests don't generate temp directories in build dir (#1289) @robertmaynard

🚀 New Features

  • Remove now-deprecated top-level allocator functions (#1281) @wence-

🛠️ Improvements