[Contrib] Workspace for cuBLAS backend #16413
Conversation
Force-pushed from a102768 to 86a6213.
There is a pass to allocate such a workspace and append it to the arguments of a BYOC function: https://github.com/apache/tvm/blob/unity/python/tvm/relax/transform/transform.py#L1305-L1317. If this can be used, I think that would be preferred.
Force-pushed from 86a6213 to 6e50364.
@masahi Thank you for the great suggestion! Yes, I think this can be a next step, so that the workspace size becomes adjustable. For now, given that I am not particularly familiar with the BYOC flow, I may not be able to quickly make the workspace work with the AllocateWorkspace pass. Adding a fixed-size workspace in the thread entry is the fastest way to enable a workspace. I agree that we can leverage the pass later on.
This PR adds a 32MB workspace for the cuBLAS backend, so that functions like `cublasLtMatmul` can take the workspace as input. The workspace is managed under `CuBlasThreadEntry` so that it is allocated only once per thread.
Force-pushed from 6e50364 to 8edf01f.
@masahi Would you mind sharing more guidance with @MasterJH5574?
```cpp
void* workspace_ptr{nullptr};
// 32MB workspace as suggested by NVIDIA
// https://docs.nvidia.com/cuda/cublas/index.html#cublassetworkspace.
static constexpr const size_t workspace_size = 33554432;
```
I'm assuming that 32MB is also good for pre-Hopper since it is bigger than the recommended size, 4MB. @vinx13
Ok, given the very specific nature of this workspace, I think a fast path like this is reasonable. The `AllocateWorkspace`-based approach can be used in more general settings, but it is overkill when we only need one workspace of a fixed size.