[Adreno] Adapt reduction schedule for adreno #13100

Merged Oct 24, 2022 (5 commits)

@elvin-n elvin-n commented Oct 17, 2022

The original CUDA schedule uses rfactor, which is 10x-50x slower on Adreno than a schedule without barriers.

For example, mean on a QHD picture on Snapdragon 888 takes 69 ms with the CUDA schedule and 6.2 ms with the newly proposed schedule.
The same holds for argmin: 183 ms -> 3.9 ms.
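
To make the trade-off concrete, here is a minimal pure-Python sketch of the two reduction strategies being compared. It is not TVM code and not the schedule from this PR: the function names and the `num_threads` parameter are hypothetical, and the barrier is only marked with a comment. The first function mimics an rfactor-style reduction, where several threads compute partial results for one output and must synchronize before combining them. The second lets one thread reduce each output sequentially, so no synchronization is needed.

```python
def mean_rfactor_style(row, num_threads=4):
    """Two-stage reduction: per-thread partial sums, then a cross-thread combine."""
    # Stage 1: each simulated thread sums a strided slice of the row.
    partial = [sum(row[t::num_threads]) for t in range(num_threads)]
    # On a GPU, a barrier would be required here before the partial sums are read.
    # Stage 2: combine the partial sums into the final result.
    return sum(partial) / len(row)


def mean_sequential(row):
    """Single-stage reduction: one thread walks the whole row, no barrier."""
    acc = 0.0
    for value in row:
        acc += value
    return acc / len(row)


# Hypothetical input: two short rows standing in for image rows.
rows = [[1.0, 2.0, 3.0, 4.0, 5.0, 6.0], [10.0, 20.0, 30.0, 40.0, 50.0, 60.0]]
rfactor_means = [mean_rfactor_style(r) for r in rows]
sequential_means = [mean_sequential(r) for r in rows]
```

Both functions return the same values and differ only in how the work is split. The timings quoted above suggest that, for these workloads on Adreno, the synchronization cost of the first form outweighs the extra parallelism it offers.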

tvm-bot commented Oct 17, 2022

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

@elvin-n elvin-n force-pushed the amalyshe/adreno_reduction branch from 4aa0dc6 to 5320706 October 17, 2022 15:56
@areusch areusch added the needs-triage label Oct 19, 2022
@areusch areusch removed the needs-triage label Oct 19, 2022

@echuraev echuraev left a comment

LGTM. One minor comment

python/tvm/topi/cuda/reduction.py (resolved)

@echuraev echuraev left a comment

LGTM. Thanks!

elvin-n commented Oct 21, 2022

@masahi could you please review/merge?

@masahi masahi merged commit 03d989f into apache:main Oct 24, 2022
xinetzone pushed a commit to daobook/tvm that referenced this pull request Nov 10, 2022
* [Adreno] Adapt reduction schedule for adreno

The original CUDA schedule uses rfactor, which is 10x-50x slower on
Adreno than a schedule without barriers

* Address PR comments

* Remove copy-paste, start reuse cuda impl

* Address pylint hits

* Extend comment for cuda schedule_reduce_impl
xinetzone pushed a commit to daobook/tvm that referenced this pull request Nov 25, 2022