[Hexagon] Use single allocation to back 2-d arrays #10903

Lunderberg · 2022-04-05T16:27:27Z

Currently, each allocation reserves an entire page, so even a relatively small number of allocations can consume a large amount of VTCM. This commit changes calls to `AllocVtcmWorkspace` of shape `[N,M]` from performing `N` allocations of size `M` to performing one allocation of size `N*M`. Since `M` is usually much smaller than a page, this reduces the total amount of memory required.

This is an intermediate step; the long-term solution is to use static planning for VTCM allocations. This change returns the same `void**` type that static planning eventually will, but avoids excess memory use in the meantime.
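The saving described above can be sketched with a little arithmetic. This is an illustration only, not code from the PR: the helper names and the `page_size` parameter are hypothetical, and it assumes that every allocation is rounded up to a whole number of pages and ignores any alignment padding between regions.

```cpp
#include <cassert>
#include <cstddef>

// Round `nbytes` up to a whole number of pages.
// `page_size` is an assumed parameter, not a value taken from the PR.
constexpr std::size_t RoundUpToPage(std::size_t nbytes, std::size_t page_size) {
  return (nbytes + page_size - 1) / page_size * page_size;
}

// Old behavior (as described above): N separate allocations of M bytes,
// each of which reserves at least one full page.
constexpr std::size_t BytesReservedSeparate(std::size_t n, std::size_t m,
                                            std::size_t page_size) {
  return n * RoundUpToPage(m, page_size);
}

// New behavior (as described above): one allocation of N*M bytes,
// rounded up to a whole number of pages only once.
constexpr std::size_t BytesReservedSingle(std::size_t n, std::size_t m,
                                          std::size_t page_size) {
  return RoundUpToPage(n * m, page_size);
}
```

For example, with a hypothetical 4096-byte page, a `[8, 256]` workspace would reserve 8 pages when allocated row by row, but only 1 page when backed by a single allocation.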

Lunderberg · 2022-04-05T16:28:30Z

To maintain the alignment of each individual region, padding may be added to the single allocation. As a result, this PR depends on functionality introduced in #10878.

Lunderberg · 2022-04-05T16:50:35Z

The current CI failures are the expected differences on the C++ side, which are resolved in #10878. No changes should be needed for them here, but CI will need to be relaunched once that PR lands.

[2022-04-05T16:42:41.580Z] 196 - HexagonBuffer.nd_copy_from (Failed)
[2022-04-05T16:42:41.580Z] 198 - HexagonBuffer.2d_copy_from_1d (Failed)
[2022-04-05T16:42:41.580Z] 199 - HexagonBuffer.1d_copy_from_2d (Failed)
[2022-04-05T16:42:41.580Z] 201 - HexagonBuffer.nd_copy_from_nd_smaller_size (Failed)
[2022-04-05T16:42:41.580Z] 202 - HexagonBuffer.md_copy_from_nd (Failed)


Previously, when a single monolithic allocation was used to back a 2-d Hexagon buffer of shape `[nallocs, nbytes_per_allocation]`, the allocation itself was aligned, but each individual region was not. This commit ensures that each individual region also follows the specified alignment.
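A minimal sketch of the idea, assuming a plain heap buffer in place of VTCM: one backing allocation is carved into `nallocs` regions, and the per-region stride is padded up to the alignment so that every region starts on an aligned address. `RegionTable`, `RoundUp`, and `AllocateRegions` are hypothetical names for illustration and are not the `HexagonBuffer` implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical holder: one backing buffer plus a table of region pointers.
// `regions.data()` plays the role of the `void**` handle mentioned above.
struct RegionTable {
  std::unique_ptr<std::uint8_t[]> storage;  // owns the single allocation
  std::vector<void*> regions;               // one aligned pointer per region
};

// Round `value` up to the next multiple of `alignment`.
constexpr std::size_t RoundUp(std::size_t value, std::size_t alignment) {
  return (value + alignment - 1) / alignment * alignment;
}

// Back `nallocs` regions of `nbytes_per_allocation` bytes with one buffer.
// `alignment` must be a power of two.
RegionTable AllocateRegions(std::size_t nallocs,
                            std::size_t nbytes_per_allocation,
                            std::size_t alignment) {
  assert(alignment != 0 && (alignment & (alignment - 1)) == 0);

  // Pad each region so that consecutive regions stay aligned.
  const std::size_t stride = RoundUp(nbytes_per_allocation, alignment);
  // Extra bytes so the first region can be moved to an aligned address.
  const std::size_t total = stride * nallocs + alignment - 1;

  RegionTable table;
  table.storage.reset(new std::uint8_t[total]);

  // Compute the offset of the first aligned address inside the buffer.
  const auto start = reinterpret_cast<std::uintptr_t>(table.storage.get());
  const std::uintptr_t mask = static_cast<std::uintptr_t>(alignment) - 1;
  const std::uintptr_t aligned = (start + mask) & ~mask;
  std::uint8_t* first = table.storage.get() + (aligned - start);

  table.regions.reserve(nallocs);
  for (std::size_t i = 0; i < nallocs; ++i) {
    table.regions.push_back(first + i * stride);
  }
  return table;
}
```

With `nbytes_per_allocation = 100` and `alignment = 128`, each region occupies a 128-byte stride, so 28 bytes of padding follow every region. That padding is the cost of keeping each region aligned rather than only the allocation as a whole.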

Lunderberg · 2022-04-05T21:05:35Z

Rebased onto main to restart CI.

csullivan

This is a great way to handle separate allocations, thanks @Lunderberg.

* [Hexagon] Use single allocation to back 2-d arrays

  Currently, each allocation reserves an entire page, so even a relatively small number of allocations can consume a large amount of VTCM. This commit changes calls to `AllocVtcmWorkspace` of shape `[N,M]` from performing `N` allocations of size `M` to performing one allocation of size `N*M`. Since `M` is usually much smaller than a page, this reduces the total amount of memory required. This is an intermediate step; the long-term solution is to use static planning for VTCM allocations. This change returns the same `void**` type that static planning eventually will, but avoids excess memory use in the meantime.

* [Hexagon] Maintain alignment of allocations

  Previously, when a single monolithic allocation was used to back a 2-d Hexagon buffer of shape `[nallocs, nbytes_per_allocation]`, the allocation itself was aligned, but each individual region was not. This commit ensures that each individual region also follows the specified alignment.

Lunderberg mentioned this pull request Apr 5, 2022

[Hexagon][LLVM] Enable/test tensorized Hexagon DMA on 2d transformed layout #10905

Merged

Lunderberg added 2 commits April 5, 2022 16:05

Lunderberg force-pushed the hexagon_buffer_single_alloc branch from 2db1475 to 42e6e85 Compare April 5, 2022 21:05

Lunderberg marked this pull request as ready for review April 5, 2022 21:05

Lunderberg requested a review from csullivan April 6, 2022 14:30

Lunderberg added the status: need review label Apr 6, 2022

csullivan approved these changes Apr 6, 2022


csullivan merged commit 591a000 into apache:main Apr 6, 2022

Lunderberg deleted the hexagon_buffer_single_alloc branch April 6, 2022 16:00

Lunderberg mentioned this pull request Apr 6, 2022

[Hexagon] Add unit tests executing 2-d VTCM usage #10904

Merged
