Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bulk Sampling #3148

Closed
wants to merge 399 commits into from
Closed

Conversation

alexbarghi-nv
Copy link
Member

@alexbarghi-nv alexbarghi-nv commented Jan 17, 2023

Resolves #3040
Merge after #3101 and #3082

Also resolves two major bugs in MG sampling, one that results in an error when not passing a seed, and another that causes the sampling call to hang on MG.

ChuckHastings and others added 30 commits November 29, 2022 13:58
…art so it fails faster, including the stdout/stderr output in the exception raised or to console depending on how it exited.
… can be more easily shared, bug fix in benchmark extension testing util.
…arkers to pytest.ini, added pythonpath and test paths to pytest.ini, gave benchmark files unique names so pytest can collect them across different bench dirs.
…ception handler for loading extensions, run server from a temp dir to avoid collisions with 'cugraph' namespace package in benchmarks dir.
@alexbarghi-nv alexbarghi-nv removed the request for review from a team January 24, 2023 15:44
@alexbarghi-nv alexbarghi-nv added non-breaking Non-breaking change and removed breaking Breaking change labels Jan 24, 2023
Copy link
Member

@VibhuJawa VibhuJawa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A minor change to prevent recusions. Looks good otherwise.

python/cugraph/cugraph/gnn/data_loading/bulk_sampler.py Outdated Show resolved Hide resolved
@VibhuJawa VibhuJawa mentioned this pull request Jan 25, 2023
8 tasks
VibhuJawa
VibhuJawa previously approved these changes Jan 26, 2023
Copy link
Member

@VibhuJawa VibhuJawa left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks good to me based on testing. Have requested a minor changes but approving assuming change is added.

@@ -250,17 +273,21 @@ def uniform_neighbor_sample(
weight_t=weight_t,
with_edge_properties=with_edge_properties,
# FIXME accept and properly transmute a numpy/cupy random state.
random_state=hash(random_state, i),
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just a comment that catching bugs like this is why we need to get dask even on SG to run in CI ASAP. CC: @rlratzel

Co-authored-by: Vibhu Jawa <[email protected]>
@BradReesWork BradReesWork changed the base branch from branch-23.02 to branch-23.04 January 26, 2023 15:01
@BradReesWork BradReesWork dismissed stale reviews from VibhuJawa and ChuckHastings January 26, 2023 15:01

The base branch was changed.

@rlratzel rlratzel modified the milestones: 23.04, 23.02 Jan 26, 2023
@alexbarghi-nv
Copy link
Member Author

pre-empted by #3170

rapids-bot bot pushed a commit that referenced this pull request Jan 31, 2023
This PR depends upon ~#3148 (#3170)

This PR closes #3154


- [x] Works on SG Homogenous Graph (OBGN-Products) 
- [x] Works on SG Heterogenous Graphs  (OBGN-Mag) 
- [x] Works on MG Homogenous Graphs  (OBGN-Products, OBGN-PAPERS100M (1 Billion edge graph)) 
- [x] Works on MG Heterogenous Graphs (OBGN-Mag) 

Tests to add:
- [x] Tests on SG Homogenous Graphs
- [x] Tests  on SG Heterogenous Graphs
- [x] Tests  on MG Homogenous Graphs
- [x] Tests on MG Heterogenous Graphs

Maybe Todo: 
<s>- [ ] Test Multi-trainer examples and verification </s> (Will do a follow up)

Authors:
  - Vibhu Jawa (https://github.com/VibhuJawa)
  - Rick Ratzel (https://github.com/rlratzel)
  - Alex Barghi (https://github.com/alexbarghi-nv)
  - Chuck Hastings (https://github.com/ChuckHastings)

Approvers:
  - Alex Barghi (https://github.com/alexbarghi-nv)
  - Rick Ratzel (https://github.com/rlratzel)

URL: #3181
rapids-bot bot pushed a commit that referenced this pull request Jan 31, 2023
Closes #3152 
Depends on #3148 and #3159 
This PR does support multiple trainers, even though there is no example for it.

Authors:
  - Alex Barghi (https://github.com/alexbarghi-nv)
  - Chuck Hastings (https://github.com/ChuckHastings)
  - Vibhu Jawa (https://github.com/VibhuJawa)

Approvers:
  - Vibhu Jawa (https://github.com/VibhuJawa)
  - Rick Ratzel (https://github.com/rlratzel)

URL: #3170
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request non-breaking Non-breaking change
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[FEA] Support Prefetching
7 participants