
[Topi][Unittests] Parametrized tests in test_topi_dense.py, split out gpu-independent implementations #8336

Merged
merged 3 commits into apache:main from Lunderberg:topi_dense_parametrize on Jun 30, 2021

Conversation

Lunderberg
Contributor

[Topi][UnitTests] Parametrized tests in test_topi_dense.py

Tests now run for multiple data types and can be extended with additional data types.
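A minimal sketch of the parametrization pattern (illustrative only, not the PR's actual test code; shapes and names are made up): each value passed to `tvm.testing.parameter` becomes its own test case, and the `target`/`dev` fixture arguments make the test run once per enabled target.

```python
import numpy as np
import tvm
import tvm.testing

# Each listed dtype becomes a separate test case.
in_dtype = tvm.testing.parameter("float16", "float32")


@tvm.testing.fixture
def dense_ref_data(in_dtype):
    # Reference data is regenerated once per dtype.
    a_np = np.random.uniform(size=(4, 16)).astype(in_dtype)
    b_np = np.random.uniform(size=(8, 16)).astype(in_dtype)
    c_np = a_np.astype("float32") @ b_np.astype("float32").T
    return a_np, b_np, c_np


def test_dense(target, dev, in_dtype, dense_ref_data):
    a_np, b_np, c_np = dense_ref_data
    # ... build the dense op for `target`, run it on `dev`, and compare to c_np
```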

[Topi] Separated generic-gpu nn.dense implementations into topi.gpu.dense

As a follow-up to the renaming of "gpu" to "cuda", this separates implementations that require CUDA (e.g. dense_cublas.cuda) from implementations that require any GPU, but not necessarily a CUDA GPU (e.g. dense_small_batch.gpu).

My intent is to pair this migration with the extension of unit tests to cover additional GPU runtimes, migrating only implementations that run correctly on non-CUDA GPU devices.
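
A rough sketch of what the resulting split could look like at the strategy level (names follow the PR description; the exact registration code in TVM's strategy files may differ):

```python
from tvm import topi
from tvm.relay.op import op as _op
from tvm.relay.op.strategy.generic import wrap_compute_dense, wrap_topi_schedule


def dense_strategy_gpu_sketch(attrs, inputs, out_type, target):
    """Sketch of a dense strategy once generic-GPU schedules live in topi.gpu."""
    strategy = _op.OpStrategy()

    # Works on any GPU runtime (cuda, rocm, vulkan, ...), so it can be
    # registered for all GPU-like targets.
    strategy.add_implementation(
        wrap_compute_dense(topi.gpu.dense_small_batch),
        wrap_topi_schedule(topi.gpu.schedule_dense_small_batch),
        name="dense_small_batch.gpu",
    )

    # Requires cuBLAS, so it remains CUDA-only under topi.cuda.
    if target.kind.name == "cuda" and "cublas" in target.libs:
        strategy.add_implementation(
            wrap_compute_dense(topi.cuda.dense_cublas),
            wrap_topi_schedule(topi.cuda.schedule_dense_cublas),
            name="dense_cublas.cuda",
            plevel=25,
        )
    return strategy
```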

@Lunderberg Lunderberg force-pushed the topi_dense_parametrize branch 2 times, most recently from 0dde14d to 98a145b on June 24, 2021 23:04
python/tvm/topi/gpu/dense.py: two review comments (outdated, resolved)
return (a_np, b_np, c_np, d_np)


def test_dense(
Contributor
I'm wondering whether this can test different targets as you expected.

Shouldn't this function be decorated with "@tvm.testing.parametrize_targets"?

Contributor Author
As of #8010, the @tvm.testing.parametrize_targets decorator is only needed if the test should override the default targets. Otherwise, having the target and/or dev fixture arguments is sufficient for the test to run on all enabled targets.
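
A minimal illustration of the two styles (assumes pytest with TVM's testing plugin enabled; not code from this PR):

```python
import tvm.testing


# With target/dev fixture arguments alone, the test runs once per target
# enabled for the test session (behavior from #8010).
def test_runs_on_all_enabled_targets(target, dev):
    assert dev.exist


# The decorator is only needed when the test must override the defaults
# with an explicit target list.
@tvm.testing.parametrize_targets("llvm", "cuda")
def test_runs_on_explicit_targets(target, dev):
    assert dev.exist
```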

Contributor
Oh, my bad. That's a really helpful feature.

Now, tests run for multiple data types, can be extended with
additional datatypes.
@Lunderberg Lunderberg force-pushed the topi_dense_parametrize branch from 98a145b to 4786a21 on June 29, 2021 17:17
@Lunderberg
Contributor Author

Lunderberg commented Jun 29, 2021

Added a few fixes in the codegen needed for the Vulkan tests to run correctly on newer NVIDIA GPUs. (@masahi)

[Topi] Separated generic-gpu nn.dense implementations into topi.gpu.dense

As a follow-up to the renaming of "gpu" to "cuda", separating
implementations that require CUDA (e.g. dense_cublas.cuda) from
implementations that require any GPU, but not necessarily a CUDA GPU
(e.g. dense_small_batch.gpu).

My intent is to pair this migration with the extension of unit tests
to cover additional GPU runtimes, migrating only implementations that
run correctly on non-CUDA GPU devices.
[Vulkan][Codegen] Updated storage sync to avoid incorrect matmul results on some GPUs

- In ThreadAllreduceBuilder, separate out load/store so that they can
  have a memory barrier in-between.

- In Vulkan codegen, added Workgroup memory sync for subgroup thread
  sync, since the different subgroup threads can still access
  workgroup memory.  Longer-term, may need tir enhancements to
  separate out sync of control/memory.
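
For reference, a small sketch (not part of this PR) of a schedule that reaches this code path: binding a reduce axis to a thread axis makes the lowered TIR emit tvm_thread_allreduce, which ThreadAllreduceBuilder expands into the shared-memory loads/stores where the barrier placement above matters.

```python
import tvm
from tvm import te

n = 1024
A = te.placeholder((n,), name="A", dtype="float32")
k = te.reduce_axis((0, n), name="k")
B = te.compute((1,), lambda i: te.sum(A[k], axis=k), name="B")

s = te.create_schedule(B.op)
s[B].bind(B.op.axis[0], te.thread_axis("blockIdx.x"))
s[B].bind(B.op.reduce_axis[0], te.thread_axis("threadIdx.x"))

# The lowered TIR contains a tvm_thread_allreduce call; building for a GPU
# target (e.g. vulkan) then runs the storage-sync passes touched by this commit.
print(tvm.lower(s, [A, B], simple_mode=True))
```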
@Lunderberg Lunderberg force-pushed the topi_dense_parametrize branch from 4786a21 to adb2431 on June 29, 2021 17:25
@jcf94 jcf94 (Contributor) left a comment

Thanks! Looks good to me.
I have a PR, #8234, which also makes some modifications to the topi & cuda strategy. It will likely conflict with this one.
I can rebase it after this has been merged.

@jcf94 jcf94 merged commit ae58f2c into apache:main Jun 30, 2021
@Lunderberg Lunderberg deleted the topi_dense_parametrize branch June 30, 2021 13:31
ylc pushed a commit to ylc/tvm that referenced this pull request Sep 29, 2021
[Topi][Unittests] Parametrized tests in test_topi_dense.py, split out gpu-independent implementations (apache#8336)

Co-authored-by: Eric Lunderberg <[email protected]>
zxy844288792 pushed a commit to zxy844288792/tvm that referenced this pull request Mar 4, 2022
[Topi][Unittests] Parametrized tests in test_topi_dense.py, split out gpu-independent implementations (apache#8336)

Co-authored-by: Eric Lunderberg <[email protected]>