Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regional compilation support #595

Merged
merged 1 commit into from
Dec 5, 2024

Conversation

Kacper-Pietkun
Copy link

@Kacper-Pietkun Kacper-Pietkun commented Dec 5, 2024

Add support for regional compilation. It is turned on by default, but can be turned off with VLLM_REGIONAL_COMPILATION env variable. It works only for torch.compile execution mode. It significantly speeds up warmup time and slightly increases throughput.

Change is already merged to habana_main. Creating this PR (to v1.19.0 branch) after request from QA.

Add support for regional compilation. It is turned on by default, but
can be turned off with `VLLM_REGIONAL_COMPILATION` env variable. It
works only for torch.compile execution mode. It significantly speeds up
warmup time and slightly increases throughput.
@Kacper-Pietkun Kacper-Pietkun requested a review from RafLit December 5, 2024 10:03
Copy link

@RafLit RafLit left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@Kacper-Pietkun Kacper-Pietkun merged commit e37d381 into v1.19.0 Dec 5, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants