-
Notifications
You must be signed in to change notification settings - Fork 223
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Modify files to support distributed mode for XGBoost #553
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, please refer to inline comments for suggestions
@@ -62,7 +62,7 @@ def _find_and_send_split(self, split_ref, tree_num, node_num): | |||
self.model[tree_num][node_num].member = client_id | |||
|
|||
split_ref['feature_idx'] -= accum_dim | |||
split_child = False | |||
split_child = 0 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I suggest preserving True/False
here to make it readable, and only transforming the type when split_child
is in message.content
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
When using distributed mode, we need to support the case where adopted datasets are distributed (load from different files)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
TODO: to enable multiple files in distributed mode, which is provided in the previous version but has bugs now. We will fix the bugs ASAP.
As the title says.