[Text Generation] Avoid mutating the original logits in-place when mapping from logits to token. #1414

dbogunowicz · 2023-11-17T10:17:27Z

Feature Description

TokenGenerator.generate(logits: numpy.ndarray) takes a logits array and, if do_sample=True, may mutate it to enforce the configured sampling strategy.

This produces the correct tokens, but it modifies the logits array in place, so the array is returned to the user in its mutated form. This can be confusing for users who are interested in the logits values, especially when they are returned together with the prompt logits:

First column (before): the generated logits returned to the user are mutated (the distribution is spikier, arguably close to a Dirac distribution).

Second column (after): the generated logits are the original, non-mutated logits, consistent with the prompt logits.
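
To illustrate the before/after behaviour described above, here is a minimal sketch of the copy-before-mutate pattern. It is not the actual TokenGenerator code: the function names, the temperature and top-k transform, and the `copy` flag are all hypothetical and exist only to make the two behaviours comparable.

```python
import numpy as np


def apply_temperature_and_top_k(
    logits: np.ndarray, temperature: float, top_k: int
) -> np.ndarray:
    """Hypothetical sampling transform that edits `logits` in place."""
    logits /= temperature
    if 0 < top_k < logits.shape[-1]:
        # The threshold is the k-th largest value; everything below it is masked.
        kth_largest = np.partition(logits, -top_k)[-top_k]
        logits[logits < kth_largest] = -np.inf
    return logits


def softmax(logits: np.ndarray) -> np.ndarray:
    """Numerically stable softmax over a 1-D array."""
    exp = np.exp(logits - np.max(logits))
    return exp / exp.sum()


def generate_token(
    logits: np.ndarray,
    temperature: float = 0.5,
    top_k: int = 2,
    seed: int = 0,
    copy: bool = True,
) -> int:
    """Sample a token id; with copy=True the caller's array is left untouched."""
    # Assumption: copying up front is the whole fix; the rest is unchanged.
    working = logits.copy() if copy else logits
    working = apply_temperature_and_top_k(working, temperature, top_k)
    rng = np.random.default_rng(seed)
    return int(rng.choice(working.shape[-1], p=softmax(working)))


original = np.array([1.0, 2.0, 3.0, 4.0])

after = original.copy()
generate_token(after, copy=True)    # "after" column: caller's logits preserved

before = original.copy()
generate_token(before, copy=False)  # "before" column: caller's logits mutated
```

In this sketch, `after` still equals `original` once sampling is done, while `before` has been scaled and masked with `-inf`, which is the spikier distribution a user would see if the mutated array were returned.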

src/deepsparse/transformers/utils/token_generator.py

initial copy

0f5f23f

dbogunowicz requested review from mgoin, bfineran and dsikka November 17, 2023 10:19

mgoin reviewed Nov 17, 2023


src/deepsparse/transformers/utils/token_generator.py (resolved)

dbogunowicz requested a review from mgoin November 20, 2023 09:54

mgoin approved these changes Nov 20, 2023


mgoin merged commit 9383653 into main Nov 20, 2023
13 checks passed

mgoin deleted the feature/damian/logits_copy branch November 20, 2023 14:39
