You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
aacebo opened this issue
Dec 18, 2023
· 0 comments
Assignees
Labels
P0parityJS → dotnet and/or JS → PythonPythonChange/fix applies to Python. If all three, use the 'JS & dotnet & Python' labelsmalltshirt size small (1-4 days)
## Linked issues
closes: #1066
## Details
1. Implement tokenizers for Python based on the JS SDK
2. Changes the underlying coding to `cl100k_base`, which is used by gpt4
and gpt3.5. JS is using `r50k_base` and I have created
#1171 to track this issue.
3. Rename `GPT3Tokenizer` to `GPTTokenizer`, which seems making more
sense for its functionality, as both gpt4 and gpt3.5 can use this
tokenizer.
4. Add unit tests for the code
5. Add docstring for the code
## Attestation Checklist
- [x] My code follows the style guidelines of this project
- I have checked for/fixed spelling, linting, and other errors
- I have commented my code for clarity
- I have made corresponding changes to the documentation (we use
[TypeDoc](https://typedoc.org/) to document our code)
- My changes generate no new warnings
- I have added tests that validates my changes, and provides sufficient
test coverage. I have tested with:
- Local testing
- E2E testing in Teams
- New and existing unit tests pass locally with my changes
P0parityJS → dotnet and/or JS → PythonPythonChange/fix applies to Python. If all three, use the 'JS & dotnet & Python' labelsmalltshirt size small (1-4 days)
implement
tokenizers
functionality of theJS
SDK https://github.com/microsoft/teams-ai/tree/main/js/packages/teams-ai/src/tokenizersThe text was updated successfully, but these errors were encountered: