nx-cugraph: add k_truss and degree centralities #3945

eriknw · 2023-10-19T21:45:19Z

New algorithms:

degree_centrality
in_degree_centrality
k_truss
number_of_selfloops
out_degree_centrality

Also, rename row_indices, col_indices to src_indices, dst_indices

New algorithms: - `degree_centrality` - `in_degree_centrality` - `k_truss` - `number_of_selfloops` - `out_degree_centrality Also, rename `row_indices, col_indices` to `src_indices, dst_indices`

python/nx-cugraph/nx_cugraph/algorithms/core.py

rlratzel

Thanks Erik, LGTM, I just had two requests/questions.

Also, are we getting good coverage via the NX test suite? We were bit by this for edge BC so I want to make sure we're covered.

rlratzel · 2023-10-24T21:39:36Z

python/nx-cugraph/nx_cugraph/algorithms/core.py

+            "Consider using G.remove_edges_from(nx.selfloop_edges(G))."
+        )
+    # TODO: create renumbering helper function(s)
+    if k < 3:


Can you add a comment as an overview of what's being done differently here vs. the else block and why? (eg. PLC isn't even being called, etc.)

I thought I did leave a comment: we drop nodes with zero degree. No need to call into pylibcugraph just for that. What comment would you find helpful here?

sorry I meant a comment describing the overall goal of the entire k < 3 block vs. the else block. I admittedly don't know much about k_truss (maybe that's the problem?), but I was hoping to get at least a rough idea why they're so different. For instance, I'm curious why plc.k_truss_subgraph() couldn't just be used unconditionally even when k < 3, or why we wouldn't remove zero degree nodes in both cases. However, if this would require an explanation of the k_truss algo itself, then feel free to ignore this.

Gotcha. Take a quick look at k_truss documentation:
https://networkx.org/documentation/stable/reference/algorithms/generated/networkx.algorithms.core.k_truss.html

See: "[...] where every edge is incident to at least k-2 triangles.". So, k < 3 is an edge case.

I'm curious why plc.k_truss_subgraph() couldn't just be used unconditionally even when k < 3

It can--we can delete this entire branch if we wanted. PLC behaves the same as networkx for 0 <= k < 3. We do the branch b/c it is faster, and it lets us check the "stop early" condition if there are no nodes of degree zero. I like fast, and I don't think this branch is excessively complicated.

why we wouldn't remove zero degree nodes in both cases

k_truss always removes zero-degree nodes. PLC's k_truss does this for us when we call it.

I suppose I can add a couple comments to the code like I just explained here ;) --thanks for the discussion.

More detailed code comment added.

Thanks, that explains everything and is really helpful.

Oh, I remember the primary reason I made this code branch in the first place and why I think it should exist for now: I was exploring renumbering and wanted to try multiple approaches. I'm sure we'll be doing more renumbering soon.

python/nx-cugraph/nx_cugraph/algorithms/core.py

eriknw · 2023-10-24T22:33:28Z

Also, are we getting good coverage via the NX test suite? We were bit by this for edge BC so I want to make sure we're covered.

Great question. Yeah, it's pretty good. I think lines 29, 31, 43, 79 aren't covered, which doesn't concern me atm.

After #3954, I think it will be a lot easier to do comparison testing with a variety of input graphs without too much effort.

fwiw, I do like to have solid coverage, but our shared tooling today isn't great. It would help if I could quickly find "nx-cugraph" tests in CI. As is, I test for major coverage during development and occasionally look at coverage reports locally.

rlratzel · 2023-10-24T22:57:06Z

It would help if I could quickly find "nx-cugraph" tests in CI.

I added rapids_logger calls to the CI script to announce in the log when it's running the nx-cugraph tests. You should be able to search for "pytest nx-cugraph" or "pytest networkx" to find them. Would that help, or did I misunderstand?

eriknw · 2023-10-24T23:01:37Z

It would help if I could quickly find "nx-cugraph" tests in CI.

I added rapids_logger calls to the CI script to announce in the log when it's running the nx-cugraph tests. You should be able to search for "pytest nx-cugraph" or "pytest networkx" to find them. Would that help, or did I misunderstand?

I suppose that helps somewhat. Browsing and searching CI is pretty slow and cumbersome, so my preferred UX would to have an exandable/collapsable nx-cugraph section.

rlratzel · 2023-10-24T23:04:09Z

my preferred UX would to have an exandable/collapsable nx-cugraph section.

ah, that would be nice. I'll keep that in mind. I think it would require editing the yaml in the .github folder, which I'm not that familiar with, but I like that idea in general.

rlratzel · 2023-10-25T20:40:45Z

/merge

nx-cugraph: add k_truss and degree centralities

786c1e2

New algorithms: - `degree_centrality` - `in_degree_centrality` - `k_truss` - `number_of_selfloops` - `out_degree_centrality Also, rename `row_indices, col_indices` to `src_indices, dst_indices`

eriknw requested a review from a team as a code owner October 19, 2023 21:45

rlratzel added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Oct 20, 2023

use mapper.dtype

6d43317

eriknw commented Oct 20, 2023

View reviewed changes

python/nx-cugraph/nx_cugraph/algorithms/core.py Show resolved Hide resolved

Merge branch 'branch-23.12' into k_truss_and_degrees

19e6b8f

rlratzel reviewed Oct 24, 2023

View reviewed changes

Add code comments explaining k_truss k<2 branch

bc5849a

rlratzel approved these changes Oct 25, 2023

View reviewed changes

rapids-bot bot merged commit 629e63c into rapidsai:branch-23.12 Oct 25, 2023
71 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nx-cugraph: add k_truss and degree centralities #3945

nx-cugraph: add k_truss and degree centralities #3945

eriknw commented Oct 19, 2023

rlratzel left a comment

rlratzel Oct 24, 2023

eriknw Oct 24, 2023

rlratzel Oct 24, 2023

eriknw Oct 24, 2023

eriknw Oct 25, 2023

rlratzel Oct 25, 2023

eriknw Oct 25, 2023

eriknw commented Oct 24, 2023

rlratzel commented Oct 24, 2023

eriknw commented Oct 24, 2023

rlratzel commented Oct 24, 2023

rlratzel commented Oct 25, 2023

nx-cugraph: add k_truss and degree centralities #3945

nx-cugraph: add k_truss and degree centralities #3945

Conversation

eriknw commented Oct 19, 2023

rlratzel left a comment

Choose a reason for hiding this comment

rlratzel Oct 24, 2023

Choose a reason for hiding this comment

eriknw Oct 24, 2023

Choose a reason for hiding this comment

rlratzel Oct 24, 2023

Choose a reason for hiding this comment

eriknw Oct 24, 2023

Choose a reason for hiding this comment

eriknw Oct 25, 2023

Choose a reason for hiding this comment

rlratzel Oct 25, 2023

Choose a reason for hiding this comment

eriknw Oct 25, 2023

Choose a reason for hiding this comment

eriknw commented Oct 24, 2023

rlratzel commented Oct 24, 2023

eriknw commented Oct 24, 2023

rlratzel commented Oct 24, 2023

rlratzel commented Oct 25, 2023