[Bugfix] MLPSpeculator: Use ParallelLMHead in tie_weights=False case. #6303

tdoublep · 2024-07-10T11:34:11Z

Fixes #6302

cc @robertgshaw2-neuralmagic

Signed-off-by: Thomas Parnell <[email protected]>

robertgshaw2-redhat · 2024-07-10T12:45:05Z

Thanks for the fix!

…vllm-project#6303) Signed-off-by: Thomas Parnell <[email protected]> (cherry picked from commit c38eba3)

…vllm-project#6303) Signed-off-by: Thomas Parnell <[email protected]>

…vllm-project#6303) Signed-off-by: Thomas Parnell <[email protected]> Signed-off-by: Alvant <[email protected]>

Use ParallelLMHead in tie_weights=False case.

e709e25

Signed-off-by: Thomas Parnell <[email protected]>

robertgshaw2-redhat approved these changes Jul 10, 2024

View reviewed changes

mgoin approved these changes Jul 10, 2024

View reviewed changes

mgoin merged commit c38eba3 into vllm-project:main Jul 10, 2024
70 of 71 checks passed

tdoublep deleted the mlpsepc-notie-fix branch July 10, 2024 13:08

dtrifiro pushed a commit to opendatahub-io/vllm that referenced this pull request Jul 17, 2024

[Bugfix] MLPSpeculator: Use ParallelLMHead in tie_weights=False case. (…

4b182d3

…vllm-project#6303) Signed-off-by: Thomas Parnell <[email protected]>

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024

[Bugfix] MLPSpeculator: Use ParallelLMHead in tie_weights=False case. (…

836ae08

…vllm-project#6303) Signed-off-by: Thomas Parnell <[email protected]>

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Bugfix] MLPSpeculator: Use ParallelLMHead in tie_weights=False case. (…

78cb733

…vllm-project#6303) Signed-off-by: Thomas Parnell <[email protected]> Signed-off-by: Alvant <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bugfix] MLPSpeculator: Use ParallelLMHead in tie_weights=False case. #6303

[Bugfix] MLPSpeculator: Use ParallelLMHead in tie_weights=False case. #6303

tdoublep commented Jul 10, 2024 •

edited

Loading

robertgshaw2-redhat commented Jul 10, 2024

[Bugfix] MLPSpeculator: Use ParallelLMHead in tie_weights=False case. #6303

[Bugfix] MLPSpeculator: Use ParallelLMHead in tie_weights=False case. #6303

Conversation

tdoublep commented Jul 10, 2024 • edited Loading

robertgshaw2-redhat commented Jul 10, 2024

tdoublep commented Jul 10, 2024 •

edited

Loading