
Prevent training without setting up caches. #4066

Merged: 2 commits into dmlc:master on Feb 3, 2019

Conversation

@trivialfis (Member) commented Jan 19, 2019

* Add warning for internal functions.
* Check number of features.
@trivialfis (Member, Author) commented Jan 19, 2019

Calling `Booster.update` without first initializing the `Booster` with a cache containing `dtrain` results in `num_feature == 0`. I could raise an error in Python, but checking it in C++ seems more reliable. The added documentation effectively marks `Booster.update` and `Booster.boost` as internal methods.

@RAMitchell, @hcho3: could you take a look and see if there is a better approach?
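
For context, a minimal sketch of the failure mode at the C API level — assuming the xgboost C API of the time; error-code checks are omitted and `train.libsvm` is a placeholder file:

```cpp
#include <xgboost/c_api.h>

int main() {
  DMatrixHandle dtrain;
  XGDMatrixCreateFromFile("train.libsvm", /*silent=*/0, &dtrain);

  // Booster created with an EMPTY cache list: the learner never sees
  // dtrain, so its num_feature stays 0.
  BoosterHandle booster;
  XGBoosterCreate(/*dmats=*/nullptr, /*len=*/0, &booster);

  // Before this PR, this silently trained with num_feature == 0 and a
  // later predict could crash; the added C++ check rejects it instead.
  XGBoosterUpdateOneIter(booster, /*iter=*/0, dtrain);

  XGBoosterFree(booster);
  XGDMatrixFree(dtrain);
  return 0;
}
```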

@codecov-io commented Jan 19, 2019

Codecov Report

Merging #4066 into master will decrease coverage by 0.01%.
The diff coverage is 0%.


@@            Coverage Diff             @@
##           master    #4066      +/-   ##
==========================================
- Coverage   60.56%   60.55%   -0.02%     
==========================================
  Files         130      130              
  Lines       11756    11758       +2     
==========================================
  Hits         7120     7120              
- Misses       4636     4638       +2
| Impacted Files | Coverage Δ |
| --- | --- |
| python-package/xgboost/core.py | 77.38% <ø> (ø) ⬆️ |
| src/learner.cc | 26.23% <0%> (-0.14%) ⬇️ |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 1fc37e4...5eb543a.

@RAMitchell (Member) left a comment


Can we really say "don't use this" about something that is documented as part of the public API? Maybe it's better to say 'power users only' or something.

@trivialfis (Member, Author) commented

> Can we really say "don't use this" about something that is documented as part of the public API? Maybe it's better to say 'power users only' or something.

Actually, I don't like my solution. Let me see if I can safely make new datasets part of the caches.

@trivialfis (Member, Author) commented

@RAMitchell It turns out I can't make the incoming training dataset part of the caches, for the following reason:

In the C API, the DMatrix handle is a `std::shared_ptr`, while `Learner::UpdateOneIter` accepts a raw pointer. Every time the C API's `XGBoosterUpdateOneIter` calls `Learner::UpdateOneIter`, it first calls `shared_ptr::get()` to pass the raw pointer. So there is no way for the `Learner`'s caches to obtain ownership of this DMatrix without changing `Learner`'s interface; hence it cannot become part of the caches (which are a vector of shared pointers).
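
To make the ownership mismatch concrete, here is a simplified sketch — the types are reduced stand-ins, not the actual xgboost sources, and `CApiUpdateOneIter` is a hypothetical stand-in for the C API wrapper around `XGBoosterUpdateOneIter`:

```cpp
#include <memory>
#include <vector>

struct DMatrix { /* training data */ };

class Learner {
 public:
  // The interface accepts only a raw pointer, so the Learner cannot
  // take shared ownership of `train` without an interface change.
  void UpdateOneIter(int iter, DMatrix* train) { /* train one iteration */ }

 private:
  // The caches own their DMatrix objects through shared_ptr.
  std::vector<std::shared_ptr<DMatrix>> cache_;
};

// Stand-in for the C API layer: the handle holds a shared_ptr, but
// only the raw pointer crosses into Learner, so ownership can never
// reach cache_.
void CApiUpdateOneIter(Learner* learner, int iter,
                       std::shared_ptr<DMatrix>& handle) {
  learner->UpdateOneIter(iter, handle.get());
}
```

With this shape, the only ways to get the training DMatrix into the caches are to change `UpdateOneIter` to accept a `shared_ptr` (an interface break) or to copy the data — which is the dead end described above.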

And no, it's not "power users only". I'm a power user (I think :) ), and even I don't know how to make it work other than by making a copy of the abstracted APIs.

@trivialfis (Member, Author) commented

I will go ahead and merge this if there are no objections. @RAMitchell @hcho3

@hcho3 (Collaborator) commented Jan 31, 2019

Go ahead.

@trivialfis (Member, Author) commented

@thvasilo Thanks for the pointer.

@hcho3 merged commit 1088dff into dmlc:master on Feb 3, 2019.
@trivialfis deleted the fix/num_feature branch on Feb 3, 2019.
The lock bot locked this conversation as resolved and limited it to collaborators on May 4, 2019.
Labels: none yet
Projects: none yet

Development: successfully merging this pull request may close these issues:
- [Blocking] python kernel failed when call Booster().predict
5 participants