You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
More control on the generation does make sense. A reasonable way to allow for more control is probably to add more generation args in the GRPOConfig.
Are you willing to contribute?
More control on the generation does make sense. A reasonable way to allow for more control is probably to add more generation args in the GRPOConfig. Are you willing to contribute?
More control on the generation does make sense. A reasonable way to allow for more control is probably to add more generation args in the GRPOConfig. Are you willing to contribute?
Yes I will contribute this feature.
Please add stop_strings or stopping criteria :)
Although I don’t see why not exposing the full generation config to avoid the next issue of this type in a few weeks.
More control on the generation does make sense. A reasonable way to allow for more control is probably to add more generation args in the GRPOConfig. Are you willing to contribute?
Yes I will contribute this feature.
Please add stop_strings or stopping criteria :)
Although I don’t see why not exposing the full generation config to avoid the next issue of this type in a few weeks.
@qgallouedec wdyt? Either we should just directly expose the entire generation config because there are all kinds of tricks that people might want to tune there.
Feature request
Often people need to customize the generation config, now it's embedded in the training loop. Should be easy to extract it out.
Motivation
Customization
Your contribution
I can help to contribute.
The text was updated successfully, but these errors were encountered: