[Fix][Text Generation Pipeline] Fix the erroneous sampling logic #1406

dbogunowicz · 2023-11-15T12:32:53Z

Fix Description

Before: top_k and top_p filtering was applied regardless of whether sampling was True or False.
Now: if sampling=False, we go straight to the argmax function and skip all sampling logic.

@horheynm Could you please validate the rest of the logic in def generate(self, logits: numpy.ndarray)? In the most complex scenario, top_k, top_p, and sampling_temperature are all applied sequentially to the logits. Let's make sure the order in which these sampling functions are applied matches the one defined in HF (I assume that is the original implementation we want to mimic).
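For reference, here is a minimal sketch of the control flow described above. It is not the code in token_generator.py: the helper names (softmax, apply_top_k, apply_top_p), the standalone generate signature, and the defaults are all assumptions made for illustration. The order used here (temperature, then top_k, then top_p) is my understanding of how HF chains its logits warpers and should be checked against the HF source before relying on it.

```python
from typing import Optional

import numpy as np


def softmax(logits: np.ndarray) -> np.ndarray:
    # Subtract the max for numerical stability; -inf entries become probability 0.
    exp = np.exp(logits - np.max(logits))
    return exp / exp.sum()


def apply_top_k(logits: np.ndarray, top_k: int) -> np.ndarray:
    # Hypothetical helper: mask every logit below the k-th largest with -inf.
    # Ties at the k-th value are all kept.
    if top_k <= 0 or top_k >= logits.size:
        return logits
    kth_value = np.partition(logits, -top_k)[-top_k]
    return np.where(logits < kth_value, -np.inf, logits)


def apply_top_p(logits: np.ndarray, top_p: float) -> np.ndarray:
    # Hypothetical helper: keep the smallest set of tokens whose cumulative
    # probability reaches top_p; the most likely token is always kept.
    if top_p >= 1.0:
        return logits
    order = np.argsort(logits)[::-1]  # token indices, most likely first
    probs = softmax(logits[order])
    mass_before = np.cumsum(probs) - probs
    filtered = logits.copy()
    filtered[order[mass_before >= top_p]] = -np.inf
    return filtered


def generate(
    logits: np.ndarray,
    *,
    sampling: bool,
    temperature: float = 1.0,
    top_k: int = 0,
    top_p: float = 1.0,
    rng: Optional[np.random.Generator] = None,
) -> int:
    # Greedy path: no temperature, top_k, or top_p is applied at all.
    if not sampling:
        return int(np.argmax(logits))

    # Sampling path (assumed order): temperature -> top_k -> top_p -> draw.
    logits = logits.astype(np.float64) / temperature
    logits = apply_top_k(logits, top_k)
    logits = apply_top_p(logits, top_p)
    if rng is None:
        rng = np.random.default_rng()
    return int(rng.choice(logits.size, p=softmax(logits)))
```

With sampling=False the result depends only on the raw logits, so passing top_k or top_p has no effect on the chosen token, which is the behaviour this fix is meant to guarantee.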

src/deepsparse/transformers/utils/token_generator.py

rahul-tuli · 2023-11-15T14:35:18Z

Could you add output before and after?

initial commit

369e1cd

dbogunowicz requested review from bfineran, horheynm and dsikka November 15, 2023 12:32

dbogunowicz mentioned this pull request Nov 15, 2023

[Cherry-Pick][Fix][Text Generation Pipeline] Fix the erroneous sampling logic #1407

Merged

dsikka reviewed Nov 15, 2023


src/deepsparse/transformers/utils/token_generator.py

rahul-tuli approved these changes Nov 15, 2023


bfineran approved these changes Nov 15, 2023


dbogunowicz merged commit fec7650 into main Nov 15, 2023
13 checks passed

dbogunowicz deleted the fix/damian/sampling branch November 15, 2023 14:45

bfineran pushed a commit that referenced this pull request Nov 16, 2023

[Fix][Text Generation Pipeline] Fix the erroneous sampling logic(#1406)

df91d97

bfineran pushed a commit that referenced this pull request Nov 16, 2023

[Fix][Text Generation Pipeline] Fix the erroneous sampling logic(#1406)

251b1d2
