-
Notifications
You must be signed in to change notification settings - Fork 310
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update GNN sampling post processing functions to optionally consider seed vertices in renumbering #4329
Update GNN sampling post processing functions to optionally consider seed vertices in renumbering #4329
Conversation
…to bug_post_processing
…to bug_post_processing
I'm tentatively approving this, will give final approval once I test |
…to bug_post_processing
…to bug_post_processing
…to bug_post_processing
Should be able to merge once @alexbarghi-nv approves. Let us know how testing goes. |
I'm seeing a bug where memory addresses are getting corrupted. Not sure if it's on my end or if it's something this PR introduced. Still troubleshooting. Will post a MRE if I am able to isolate it to something in this PR. |
Have you set do_expensive_check to true? |
I'll try that |
The error always occurs after the first call. The first call runs without any issues and produces the correct result. All subsequent calls either produce incorrect/garbage results, or fail with a memory access or frontier out of range error. When |
It looks like it may be something I introduced in my PR |
I couldn't replicate this with only the code in this PR |
I have now isolated and fixed the bug. It was an issue with data being improperly copied in my PR. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Small change request. Otherwise good.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Approved, but we can't integrate this with dask+Python until that code gets refactored (#4358)
This is ready to go for the new distributed sampling framework, though, which just uses pylibcugraph
and NCCL.
👍
/merge |
Upon cugraph-pyg requests, this PR updates GNM sampling post processing functions to optionally consider seed vertices in renumbering.
Previously, seed vertices with 0 neighbors (to sample from) are ignored in renumbering. Once this PR is merged and if seed vertices is provided, seed vertices (regardless of whether they appear in the sampled edge list or not) will have a smaller vertex ID than vertices that first appear as seed vertices' neighbors or later hops.
Marked as non-breaking assuming that this functions are only internally used.