-
Notifications
You must be signed in to change notification settings - Fork 197
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TabPFNRegressor preprocessing fails on bigger datasets #169
Comments
In the preprocessing
I quickly checked the time cost of the second option with this test:
And got:
so surprisingly increasing the subsample seems to be a bit faster 🤔 (for 1K quantiles I get
@noahho would you have an opinion among both options, and on whether changing this parameter after training might be an issue? |
See https://huggingface.co/Prior-Labs/TabPFN-v2-reg/discussions/2
It seems that QuantileTransformer fails on big datasets with message
The number of quantiles cannot be greater than the number of samples used
, which means TabPFN is unusable for these bigger datasets even withignore_pretraining_contraints=True
. Seems to only happen on regression? (not sure)The text was updated successfully, but these errors were encountered: