`nx-cugraph`: support `should_run` that was added in NetworkX 3.3 #4348

eriknw · 2024-04-16T15:43:41Z

Also, update complete_bipartite_graph for nx dev

edit: ~~This does not yet add per-algorithm should_run, but it makes doing so possible.~~

Add should_run for triangles, clustering, is_isolate

eriknw · 2024-04-17T18:52:13Z

@rlratzel, do you think we should add should_run elsewhere? Maybe is_negatively_weighted or number_of_selfloops?

rlratzel

I'm looking forward to having some should_run()s in place - I think that will result in a decent performance boost for no-code-change cases involving smaller graphs. (well, I should say a performance boost compared to not having should_run()s when nx-cugraph is installed)

rlratzel · 2024-04-18T18:37:40Z

python/nx-cugraph/nx_cugraph/algorithms/bipartite/generators.py

+    if nx.__version__[:3] <= "3.2":
+        name = f"complete_bipartite_graph({orig_n1}, {orig_n2})"
+    else:
+        name = f"complete_bipartite_graph({n1}, {n2})"


I'm curious what name is used for and why it has to be different for newer versions of NX. From looking at from_coo(), I think name is an attr that is applied to the new graph object, but I didn't notice how it's used from there. Would a brief comment be out-of-place here?

Here is context for posterity: networkx/networkx#7399

rlratzel · 2024-04-18T18:48:05Z

python/nx-cugraph/nx_cugraph/algorithms/cluster.py

+def _(G, nodes=None):
+    if nodes is None or nodes not in G:
+        return True
+    return "Fast algorithm when computing for a single node; not worth converting."


For this and other should_runs: does should_run get called if the user passes in a backend graph (which does not need converting)? I remember we honor backend= regardless of can/should_run, but is passing a backend graph treated as if a user set backend= precedence-wise?

should_run is only used when the graph needs to be converted. Hence, should_convert_and_run would be a more accurate name, but I as recall you liked the shorter name should_run. The backend algorithm will always be used if a backend graph if passed or backend= argument is used.

@eriknw and I talked about this offline. The use case I was envisioning was one where should_run implementations can choose whether or not a backend algorithm is not ideally applied to a particular set of inputs, regardless of if the graph has been converted. For example, if a backend is optimized for very large graphs and the graph is already converted but very small, that backend may choose to let other backends run by returning False for should_run.

@eriknw pointed out that a backend graph, in that situation, would have to be converted to another backend graph type (backend-to-backend) or to a NX graph (backend-to-nx) in order to continue, and neither of those are currently supported. So for now, should_run only applies to calls that require a conversion.

rlratzel · 2024-04-25T22:37:49Z

@rlratzel, do you think we should add should_run elsewhere? Maybe is_negatively_weighted or number_of_selfloops?

Yes, those might be good but degree_centrality and descendants_at_distance would definitely benefit:

eriknw · 2024-04-26T10:33:29Z

Thanks @rlratzel. I added should_run to degree centralities and number_of_selfloop.

To try to follow DRY principle, I'm considering an enhancement to add a module that has a collection of should_run or string constants of the reasons. We have a lot of copy/paste right now.

I also wonder about when an input graph already has a pre-converted backend graph. If it does, then should should_run even be run? Should networkx be smart enough to know that no conversion is necessary and not call should_run? Or, should it be the responsibility of backends to look at the cache and decide whether it should be run? I think I know my preference, but I'm curious to hear your thoughts.

rlratzel · 2024-04-26T18:47:40Z

I also wonder about when an input graph already has a pre-converted backend graph. If it does, then should should_run even be run? Should networkx be smart enough to know that no conversion is necessary and not call should_run? Or, should it be the responsibility of backends to look at the cache and decide whether it should be run? I think I know my preference, but I'm curious to hear your thoughts.

I personally like giving more autonomy to backends to make decisions that are best for them. Similar to my initial thought of having the dispatcher call should_run even if the input graph is a backend graph type - e.g. "maybe this particular algo on this particular graph is still not a good choice for this backend despite already being converted, even though it's technically possible", but as discussed above, I understand why we're not currently doing that. So for the question of "if a backend graph conversion is cached, should should_run still be called", my vote is 'yes'.

rlratzel · 2024-04-29T16:10:11Z

/merge

nx-cugraph: support should_run that was added in NetworkX 3.3

e2e5e90

eriknw requested a review from a team as a code owner April 16, 2024 15:43

github-actions bot added the python label Apr 16, 2024

eriknw added 3 commits April 16, 2024 11:06

Merge branch 'branch-24.06' into support_should_run

a57ab7b

Merge branch 'branch-24.06' into support_should_run

a5674ff

Add should_run to triangles, clustering, is_isolate

d18167d

rlratzel reviewed Apr 18, 2024

View reviewed changes

eriknw added 3 commits April 23, 2024 10:11

Merge branch 'branch-24.06' into support_should_run

9e7b354

oops, fix version condition in complete_bipartite_graph

70451d8

Merge branch 'branch-24.06' into support_should_run

38c0968

rlratzel approved these changes Apr 25, 2024

View reviewed changes

Add should_run to degree centralities and number_of_selfloops

7eee6b0

rlratzel added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Apr 26, 2024

rapids-bot bot merged commit 226e7de into rapidsai:branch-24.06 Apr 29, 2024
141 checks passed

Schefflera-Arboricola mentioned this pull request Aug 20, 2024

Incorporating should_run in nx-parallel networkx/nx-parallel#77

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`nx-cugraph`: support `should_run` that was added in NetworkX 3.3 #4348

`nx-cugraph`: support `should_run` that was added in NetworkX 3.3 #4348

eriknw commented Apr 16, 2024 •

edited

Loading

eriknw commented Apr 17, 2024

rlratzel left a comment •

edited

Loading

rlratzel Apr 18, 2024

eriknw Apr 19, 2024

rlratzel Apr 18, 2024

eriknw Apr 19, 2024

rlratzel Apr 25, 2024

rlratzel commented Apr 25, 2024

eriknw commented Apr 26, 2024

rlratzel commented Apr 26, 2024 •

edited

Loading

rlratzel commented Apr 29, 2024

nx-cugraph: support should_run that was added in NetworkX 3.3 #4348

nx-cugraph: support should_run that was added in NetworkX 3.3 #4348

Conversation

eriknw commented Apr 16, 2024 • edited Loading

eriknw commented Apr 17, 2024

rlratzel left a comment • edited Loading

Choose a reason for hiding this comment

rlratzel Apr 18, 2024

Choose a reason for hiding this comment

eriknw Apr 19, 2024

Choose a reason for hiding this comment

rlratzel Apr 18, 2024

Choose a reason for hiding this comment

eriknw Apr 19, 2024

Choose a reason for hiding this comment

rlratzel Apr 25, 2024

Choose a reason for hiding this comment

rlratzel commented Apr 25, 2024

eriknw commented Apr 26, 2024

rlratzel commented Apr 26, 2024 • edited Loading

rlratzel commented Apr 29, 2024

`nx-cugraph`: support `should_run` that was added in NetworkX 3.3 #4348

`nx-cugraph`: support `should_run` that was added in NetworkX 3.3 #4348

eriknw commented Apr 16, 2024 •

edited

Loading

rlratzel left a comment •

edited

Loading

rlratzel commented Apr 26, 2024 •

edited

Loading