A new implementation (Improvement, Extension)
🚨🚨 Feature Request
Text objects should have a built-in method to convert strings to tokens, according to a user-specified tokenizer.
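To make the request concrete, here is a minimal sketch of what such a method could look like. `TextSample`, its `tokenize` method, and `whitespace_tokenizer` are hypothetical names made up for illustration; none of this is Hub's actual API. The only assumption is that a tokenizer is a callable taking a string and returning a list of tokens.

```python
from typing import Callable, List


class TextSample:
    """Hypothetical stand-in for a text object; not Hub's actual API."""

    def __init__(self, value: str):
        self.value = value

    def tokenize(self, tokenizer: Callable[[str], List[str]]) -> List[str]:
        # Delegate to whatever tokenizer the caller supplies,
        # instead of hard-coding a default one.
        return tokenizer(self.value)


def whitespace_tokenizer(text: str) -> List[str]:
    # Simplest possible tokenizer, used only for illustration.
    return text.lower().split()


sample = TextSample("Streaming data from a dataset")
tokens = sample.tokenize(whitespace_tokenizer)
print(tokens)
```

Because the method only expects a callable, the `tokenize` method of a Hugging Face tokenizer loaded with `AutoTokenizer.from_pretrained(...)` could be passed in the same way, so the choice of tokenizer stays with the user.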
Is your feature request related to a problem?
Partially. Text objects currently use BERT tokenizers by default.
If your feature will improve HUB
A common use case is streaming data from a Hub Dataset to a Hugging Face transformer. This feature would remove friction from that process.
Description of the possible solution
A solution would improve the current syntax: