-
Notifications
You must be signed in to change notification settings - Fork 310
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[REVIEW] Optimize PG.add_data
#2924
Conversation
@@ -909,7 +910,8 @@ def add_edge_data( | |||
if self.__series_type is cudf.Series: | |||
# cudf does not yet support initialization with a scalar | |||
tmp_df[TCN] = cudf.Series( | |||
np.repeat(type_name, len(tmp_df)), index=tmp_df.index, dtype=cat_dtype | |||
cudf.Series([type_name], dtype=cat_dtype).repeat(len(tmp_df)), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice one doing this operation on GPU.
Very nice. LGTM. |
Codecov Report
Additional details and impacted files@@ Coverage Diff @@
## branch-22.12 #2924 +/- ##
===============================================
Coverage ? 60.81%
===============================================
Files ? 122
Lines ? 6891
Branches ? 0
===============================================
Hits ? 4191
Misses ? 2700
Partials ? 0 Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. ☔ View full report at Codecov. |
@gpucibot merge |
This PR depends upon #2924 . We add `Graph-Sage` example to claim `dgl` , `cugraph` integration. Waiting on SWIPAT before merging CC: @TristonC Authors: - Vibhu Jawa (https://github.com/VibhuJawa) Approvers: - Xiaoyun Wang (https://github.com/wangxiaoyunNV) - Rick Ratzel (https://github.com/rlratzel) - Alex Barghi (https://github.com/alexbarghi-nv) - Tingyu Wang (https://github.com/tingyu66) URL: #2925
This PR fixes #2903 .
We reduce the memory foot print by
3.5x
and speeds up the add_data by557x
and also allows us to not be limited in the size of edges we can save. (Time is in seconds vs ms)Before PR:
After PR