[Relay][Strategy] Use x86 dense schedules for arm_cpu #15470

lhutton1 · 2023-08-03T16:50:43Z

Currently the fallback used when compiling a dense operation with targets such as llvm -device=arm_cpu is dense.generic. This results in very poor performance. Although #13775 meant that x86 schedules are used in cases where no strategy is provided by arm_cpu, the dense strategy is registered due to the existence of specialized schedules for arm_cpu e.g. a schedule for embedded devices. This commit ensures x86 schedules are used inplace of a generic schedule which yields much better performance.

The commit also follows the same approach for the dense.generic schedule as the x86 strategy. This will only be used when auto-scheduler is enabled.

A test has been added to check the intended schedules are picked when compiling with arm_cpu.

cc @ekalda @neildhickey

tvm-bot · 2023-08-03T16:50:46Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @shingjan _{See #10317 for details}

_{Generated by tvm-bot}

ekalda

Thanks @lhutton1, LGTM!

Currently the fallback used when compiling a dense operation with targets such as `llvm -device=arm_cpu` is `dense.generic`. This results very poor performance. Although apache#13775 meant that x86 schedules are used in cases where no strategy is provided by arm_cpu, the dense strategy is registered due to the existance of specialized schedules for arm_cpu e.g. a schedule for embedded devices. This commit ensures x86 schedules are used inplace of a generic schedule which yeilds much better performance. The commit also follows the same approach for the `dense.generic` schedule as the x86 strategy. This will only be used when autoscheduler is enabled. A test has been added to check the intended schedules are picked when compiling with `arm_cpu`. Change-Id: I8697f630d4acfab71a9626cf9e0dc3086987f163

leandron

LGTM, thanks! Merging this now, thanks @ekalda @lhutton1!

Similar to apache#15470, x86 schedules are used in place of generic schedules to improve performance. Since the pooling strategy does not use `OpStrategy`, mocking is used to ensure the relevant `schedule_pool` function is called when lowing a Relay pooling operation with respect to a given target. Change-Id: I782fe00e29f9c9cf41b3405d33a82a79cd85a99b

Similar to #15470, x86 schedules are used in place of generic schedules to improve performance. Since the pooling strategy does not use `OpStrategy`, mocking is used to ensure the relevant `schedule_pool` function is called when lowing a Relay pooling operation with respect to a given target.

github-actions bot requested a review from ekalda August 3, 2023 16:51

ekalda approved these changes Aug 4, 2023

View reviewed changes

lhutton1 force-pushed the use-x86-dense branch from b70c662 to c02ff2e Compare August 4, 2023 15:32

leandron approved these changes Aug 7, 2023

View reviewed changes

leandron merged commit ae45b04 into apache:main Aug 7, 2023

lhutton1 deleted the use-x86-dense branch August 7, 2023 14:42

lhutton1 mentioned this pull request Aug 8, 2023

[Relay][Strategy] Use x86 pool schedules for arm_cpu #15506

Merged

ysh329 mentioned this pull request Oct 18, 2023

[Release] v0.14.0 Release Candidate Notes #15948

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Relay][Strategy] Use x86 dense schedules for arm_cpu #15470

[Relay][Strategy] Use x86 dense schedules for arm_cpu #15470

lhutton1 commented Aug 3, 2023 •

edited

Loading

tvm-bot commented Aug 3, 2023

ekalda left a comment

leandron left a comment

[Relay][Strategy] Use x86 dense schedules for arm_cpu #15470

[Relay][Strategy] Use x86 dense schedules for arm_cpu #15470

Conversation

lhutton1 commented Aug 3, 2023 • edited Loading

tvm-bot commented Aug 3, 2023

ekalda left a comment

Choose a reason for hiding this comment

leandron left a comment

Choose a reason for hiding this comment

lhutton1 commented Aug 3, 2023 •

edited

Loading