
Why are the tokens counted differently than OpenAI? #2

Open
jucor opened this issue Jul 24, 2023 · 4 comments
Labels
bug Something isn't working

Comments


jucor commented Jul 24, 2023

Hi @blaze-Youssef!

First, thanks for the tool, it's very useful: I'm blocked in openlimit by issues similar to the ones you hit in shobrook/openlimit#4.

However, I'm a bit stuck trying to understand the code. What does the argument max_tokens correspond to, please, in https://github.com/blaze-Youssef/openai-ratelimiter/blob/main/openai_ratelimiter/defs.py#L9 ?

I'm trying to work it out, but the way you count tokens, which is the same as in openlimit (https://github.com/shobrook/openlimit/blob/master/openlimit/utilities/token_counters.py#L14), differs from OpenAI's cookbook (https://github.com/openai/openai-cookbook/blob/main/examples/How_to_count_tokens_with_tiktoken.ipynb, see section 6).
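To make the difference concrete, here is roughly what I understand the two schemes to compute (a sketch from memory, not code copied from either repo; the exact per-message constants depend on the model):

```python
import tiktoken

encoding = tiktoken.encoding_for_model("gpt-3.5-turbo")

def count_openlimit_style(messages, max_tokens, n=1):
    # openlimit / openai-ratelimiter style (as I read it): a flat 4-token
    # overhead per message, plus the full reserved output budget.
    num_tokens = n * max_tokens
    for message in messages:
        num_tokens += 4
        for value in message.values():
            num_tokens += len(encoding.encode(value))
    return num_tokens + 2  # priming of the assistant reply

def count_cookbook_style(messages):
    # Cookbook notebook style (section 6, gpt-3.5-turbo-0613): 3 tokens per
    # message, +1 per "name" field, and no output budget at all.
    num_tokens = 0
    for message in messages:
        num_tokens += 3
        for key, value in message.items():
            num_tokens += len(encoding.encode(value))
            if key == "name":
                num_tokens += 1
    return num_tokens + 3  # every reply is primed with <|start|>assistant<|message|>

messages = [{"role": "user", "content": "Hello!"}]
print(count_openlimit_style(messages, max_tokens=256))  # includes the output budget
print(count_cookbook_style(messages))                   # prompt tokens only
```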

Would you or @shobrook be able to help clarify this counting, please?

jucor commented Jul 24, 2023

Ah! Found max_tokens in the OpenAI API, as an optional parameter: https://platform.openai.com/docs/api-reference/completions/create#completions/create-max_tokens
Do I understand correctly that `n * max_tokens` is here to preemptively count the maximum possible number of output tokens against the rate limit?
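If so, a minimal sketch of the reservation as I understand it (hypothetical names, not the library's actual API):

```python
# Hypothetical sketch: charge the worst case against the tokens-per-minute
# budget up front, since the API may return up to n completions of up to
# max_tokens tokens each.
def tokens_to_reserve(prompt_tokens: int, max_tokens: int, n: int = 1) -> int:
    return prompt_tokens + n * max_tokens

# e.g. a 100-token prompt with n=2 and max_tokens=256 reserves
# 100 + 2 * 256 = 612 tokens up front
print(tokens_to_reserve(100, 256, n=2))  # 612
```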

jucor commented Jul 24, 2023

Ungh, now I see that OpenAI's batch-API script uses a different token counter from the one in their tutorial notebook: https://github.com/openai/openai-cookbook/blob/main/examples/api_request_parallel_processor.py#L339
The former does indeed take max_tokens into account, but it is also older than the notebook, which uses different token increments for role names. So I don't know which version of their token counter to believe 😅

Shouldn't a rate limiter, in any case, be updated once the API returns the actual completion, to account for the actual number of output tokens?
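Concretely, I would expect something like the following (a hypothetical sketch: `TokenBudget` and its methods are made-up names, and I'm using the pre-v1.0 `openai` client current as of this writing):

```python
import openai

class TokenBudget:
    # Made-up stand-in for whatever bookkeeping the rate limiter does.
    def __init__(self, tokens_per_minute: int):
        self.available = tokens_per_minute

    def reserve(self, tokens: int):
        self.available -= tokens

    def release(self, tokens: int):
        self.available += tokens

budget = TokenBudget(tokens_per_minute=90_000)
messages = [{"role": "user", "content": "Hello!"}]
max_tokens, n = 256, 1

# Pessimistic reservation before the call (prompt estimate + worst-case output).
reserved = 10 + n * max_tokens
budget.reserve(reserved)

response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo", messages=messages, max_tokens=max_tokens, n=n
)

# The response's usage field reports the real consumption, so the unused
# part of the reservation can be returned to the budget.
actual = response["usage"]["total_tokens"]
budget.release(reserved - actual)
```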

Youssefbenhammouda (Owner) commented
Hi,
I will dig into the OpenAI notebooks and update the implementation if necessary.
As far as I know, each request's tokens are calculated this way:
prompt tokens + max_tokens = request total tokens.

jucor commented Jul 25, 2023 via email
