-
Notifications
You must be signed in to change notification settings - Fork 311
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Don't store redundant columns in PropertyGraph Dataframes #2449
Don't store redundant columns in PropertyGraph Dataframes #2449
Conversation
Closes rapidsai#2400 I still need to update MG tests. I'll also remove the in-code assertions, since they won't always be True, because a column name could have previously been used as a property. Nevertheless, seeing these assertions pass should give us warm-fuzzies :)
Codecov Report
@@ Coverage Diff @@
## branch-22.08 #2449 +/- ##
===============================================
Coverage ? 61.15%
===============================================
Files ? 106
Lines ? 5543
Branches ? 0
===============================================
Hits ? 3390
Misses ? 2153
Partials ? 0 Continue to review full report at Codecov.
|
One thing I don't like about the current state of this PR is that it makes the user use our column names for source, dest, etc. in the query string. What if |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good. There might be places in here that will be affected by the outcome of how we want to handle the "internal" column names in output and tests, but no need to change anything now.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Have requested changes to drop inpace for cudf columns. LGTM otherwise.
Co-authored-by: Vibhu Jawa <[email protected]>
Co-authored-by: Vibhu Jawa <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thanks Erik
…into remove_redundant_columns
Test failures are now the same as seen in other PRs:
|
@gpucibot merge |
Updating these tests was missed in rapidsai#2449 Also, MG tests are finally working for me 🎉
Updating these tests was missed in #2449 Also, MG tests are finally working for me 🎉 Authors: - Erik Welch (https://github.com/eriknw) Approvers: - Rick Ratzel (https://github.com/rlratzel) URL: #2511
The main purpose of this is to reduce memory usage.
Closes #2400
I still need to update MG tests.
I'll also remove the in-code assertions, since they won't always be True, because a column name could have previously been used as a property. Nevertheless, seeing these assertions pass should give us warm-fuzzies :)