
Re-configure benchmarking devices & add markers to bench_cugraph_uniform_neighbor_sample #4561

Conversation

@nv-rliu (Contributor) commented on Jul 29, 2024

This PR re-enables the benchmarks/cugraph/pytest-based/bench_cugraph_uniform_neighbor_sample.py benchmark in the MNMG nightlies.

This benchmark was previously being skipped because the benchmark runs are configured to select only tests carrying the pytest markers "managedmem_off and poolallocator_on", which this benchmark was missing. Thanks to @jameslamb for spotting this in conftest.py.

By adding the missing markers to the test (and removing some DGX-machine-specific dask-client configs), the benchmark should now run properly in the nightly jobs.
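For illustration, a minimal sketch of the kind of marker change described above. The decorator placement, fixture name, and benchmark body are assumptions made for this sketch; the real benchmark attaches the markers to bench_cugraph_uniform_neighbor_sample and exercises the MNMG sampling code.

```python
# Hypothetical sketch -- not the actual cugraph benchmark code. It only shows how
# pytest markers make a benchmark selectable by a marker expression such as:
#   pytest -m "managedmem_off and poolallocator_on"
import pytest


@pytest.mark.managedmem_off
@pytest.mark.poolallocator_on
def bench_cugraph_uniform_neighbor_sample(benchmark):
    # The real benchmark runs cugraph's uniform_neighbor_sample on an MNMG graph;
    # a trivial callable stands in here so the sketch stays self-contained.
    benchmark(lambda: sum(range(1_000)))
```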

@nv-rliu added the bug (Something isn't working), non-breaking (Non-breaking change), graph-devops (Issues for the graph-devops team), and benchmarks labels on Jul 29, 2024
@nv-rliu added this to the 24.08 milestone on Jul 29, 2024
@nv-rliu requested review from @rlratzel and @jameslamb on Jul 29, 2024
@nv-rliu changed the base branch from branch-24.08 to branch-24.10 on Jul 29, 2024
@nv-rliu (Contributor, Author) commented on Jul 29, 2024

This PR is targeting 24.10 and should not be reviewed until the forward-merge has been completed.

@nv-rliu modified the milestones: 24.08 → 24.10 on Jul 29, 2024
@nv-rliu marked this pull request as ready for review on Aug 1, 2024
@jameslamb (Member) left a comment

I do think the pytest marker changes will help ensure the benchmark actually runs.

For my own understanding ... why is it desirable to remove the ability to configure which GPUs on the system are used? (this is just for my own curiosity... can answer this after you merge)

@nv-rliu (Contributor, Author) commented on Aug 5, 2024

> I do think the pytest marker changes will help ensure the benchmark actually runs.
>
> For my own understanding ... why is it desirable to remove the ability to configure which GPUs on the system are used? (this is just for my own curiosity... can answer this after you merge)

Hi James, thanks for the question.
When we run these benchmarks on our clustered machines, we run them in multiple configurations (2-GPU, 8-GPU, 32-GPU, 64-GPU, etc.) and ideally want to exercise the algorithm on all of the GPUs available to the job (as allocated by the job scheduler).
As for the old code that selected 4 GPUs (devices 1-4), my guess is that it was there for running this benchmark locally on the lab machines.
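For illustration only, a small dask-cuda sketch of the two setups being contrasted here. This is an assumed usage of LocalCUDACluster's CUDA_VISIBLE_DEVICES argument, not the code that was removed in this PR.

```python
# Illustrative sketch, not the removed benchmark code.
from dask.distributed import Client
from dask_cuda import LocalCUDACluster

# Machine-specific pinning (e.g. a fixed 4-GPU subset of a DGX box):
# cluster = LocalCUDACluster(CUDA_VISIBLE_DEVICES="1,2,3,4")

# Scheduler-friendly setup: no explicit device list, so the cluster spans
# every GPU visible to the job (whatever the scheduler allocated).
cluster = LocalCUDACluster()
client = Client(cluster)
```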

@rlratzel (Contributor) commented on Aug 5, 2024

/merge

@rapids-bot merged commit 1be81a4 into rapidsai:branch-24.10 on Aug 5, 2024 (132 checks passed)
@nv-rliu deleted the reenable-bench-cugraph-unif-neighb-sample branch on Aug 5, 2024