[Doc] Update description of vLLM support for CPUs #6003

DamonFool · 2024-06-30T14:11:31Z

Hi all,

The description of CPU support in vLLM is out of date.
It would be better to update it.

Thanks.
Best regards,
Jie

mgoin · 2024-07-01T17:00:43Z

docs/source/getting_started/cpu-installation.rst

@@ -20,7 +20,7 @@ Requirements

 * OS: Linux
 * Compiler: gcc/g++>=12.3.0 (optional, recommended)
-* Instruction set architecture (ISA) requirement: AVX512 is required.
+* Instruction set architecture (ISA) requirement: for x86, AVX2 is required; for PowerPC, Power9+ is required.


Since the AVX2 and PowerPC backends don't have active testing and this documentation assumes you are building on a machine with AVX512, I'm a little hesitant to update in this doc. Maybe if you call out specifically where you can get directions for AVX2 and PowerPC, and that this doc is assuming AVX512, that would be more clear.

DamonFool · 2024-07-01

Thanks @mgoin for the review.

The original description, "AVX512 is required", is quite misleading: it implies that vLLM needs the AVX512 ISA to run.
It would limit the use of vLLM if people wrongly got the message that vLLM only supports AVX512.

Actually, we've built and run vLLM on AVX2-only machines.
So I would suggest removing "AVX512 is required" here.

However, if you're against adding the PowerPC CPUs here, I'm fine with that.
We don't have PowerPC CPUs and haven't tested on them at all.

Thanks.

DamonFool · 2024-07-02

> Since the AVX2 and PowerPC backends don't have active testing and this documentation assumes you are building on a machine with AVX512, I'm a little hesitant to update in this doc. Maybe if you call out specifically where you can get directions for AVX2 and PowerPC, and that this doc is assuming AVX512, that would be more clear.

Hi @mgoin, I kept AVX512 in the doc,
but marked it as (optional, recommended), the same way the compiler requirement is described.
What do you think?
Thanks.

DamonFool · 2024-07-09T15:31:03Z

Hi @mgoin , are you fine with the current change in cpu-installation.rst?
Thanks.

mgoin · 2024-07-09T15:37:45Z

Hi @DamonFool, yes this is fine! It would be nice to make a new section for avx2, but I'll leave that up to you.

DamonFool · 2024-07-09T16:04:32Z

> Hi @DamonFool, yes this is fine! It would be nice to make a new section for avx2, but I'll leave that up to you.

The patch aims to tell people that AVX512 is not required for vLLM on x86 CPUs (you can also try it on AVX2).

There seems to be no difference between building and running vLLM on AVX512 and on AVX2.
So I'm not sure what I should add for the AVX2 section.
Any suggestions?
Thanks.
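
To illustrate the AVX512 vs. AVX2 distinction discussed above, here is a minimal sketch (not part of this patch) of how one might check which of the two ISAs an x86 Linux machine reports. It assumes the kernel exposes CPU features on a `flags` line in `/proc/cpuinfo`, where AVX2 appears as `avx2` and the AVX512 foundation as `avx512f`. The function names are hypothetical and are not vLLM APIs.

```python
from pathlib import Path


def read_cpuinfo(path: str = "/proc/cpuinfo") -> str:
    """Return the contents of /proc/cpuinfo, or '' if it is not available."""
    p = Path(path)
    return p.read_text() if p.exists() else ""


def parse_cpu_flags(cpuinfo_text: str) -> set[str]:
    """Return the feature flags from the first 'flags' line of cpuinfo text.

    Non-x86 machines (e.g. PowerPC) have no 'flags' line, so this returns
    an empty set for them.
    """
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            return set(value.split())
    return set()


def x86_isa_level(flags: set[str]) -> str:
    """Classify the flags as 'avx512', 'avx2', or 'neither'."""
    if "avx512f" in flags:
        return "avx512"
    if "avx2" in flags:
        return "avx2"
    return "neither"


if __name__ == "__main__":
    print(x86_isa_level(parse_cpu_flags(read_cpuinfo())))
```

A result of `avx2` corresponds to the AVX2-only machines mentioned in this thread. A result of `neither` only means that no matching flag was found, which is also what a non-x86 machine would report.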

DamonFool · 2024-07-09T22:55:39Z

Hi @WoosukKwon , are you OK with this change?
Thanks.

DamonFool · 2024-07-10T14:27:14Z

Hi @mgoin , @WoosukKwon seems to be busy with other things.
Do you think it's fine to merge this doc-only change, given that there is no objection from the community?
Thanks.

mgoin · 2024-07-10T14:59:44Z

@DamonFool Yes, I am simply waiting for your check to be green. I cannot merge without passing checks. Please try merging with main

DamonFool · 2024-07-10T15:14:36Z

> Please try merging with main

Done.
Thanks.

DamonFool · 2024-07-10T22:39:46Z

All the required CI tests have passed. @mgoin
Thanks.

WoosukKwon · 2024-07-10T22:47:44Z

I'm good with this doc change, but a little bit worried about the potential confusion and complexity as the Intel team will be adding IPEX or other intel-cpu-only optimizations to the cpu backend.

DamonFool · 2024-07-10T23:09:44Z

> I'm good with this doc change, but a little bit worried about the potential confusion and complexity as the Intel team will be adding IPEX or other intel-cpu-only optimizations to the cpu backend.

Thanks @WoosukKwon .

Do you mean that vllm-cpu may run faster on Intel than on other CPUs?
Since vLLM can already run on AMD and PowerPC CPUs, it would be good to let people know that.
Then vLLM would be used more widely.
And we can add more optimizations for other CPUs in the future when necessary, right?

WoosukKwon · 2024-07-11T04:15:08Z

@DamonFool Sorry for misleading you. Yes this PR doesn't have a problem. I just wanted to say we'll need to figure out how to efficiently maintain the Intel and non-Intel CPU backends while not complicating the code.

DamonFool · 2024-07-11T04:53:18Z

> @DamonFool Sorry for misleading you. Yes this PR doesn't have a problem. I just wanted to say we'll need to figure out how to efficiently maintain the Intel and non-Intel CPU backends while not complicating the code.

Got it.
Thank you all for the help.

(cherry picked from commit 439c845)

Signed-off-by: Alvant <[email protected]>

[Doc] Update description of vLLM support for CPUs

e3413eb

DarkLight1337 added the x86 CPU label Jul 1, 2024

DarkLight1337 requested a review from WoosukKwon July 1, 2024 08:20

mgoin reviewed Jul 1, 2024

Address review comments

f9d83d2

mgoin approved these changes Jul 9, 2024

Merge branch 'main' into doc-cpu

6b32453

WoosukKwon approved these changes Jul 10, 2024

WoosukKwon merged commit 439c845 into vllm-project:main Jul 11, 2024
70 of 71 checks passed

DamonFool deleted the doc-cpu branch July 11, 2024 04:53

adityagoel14 pushed a commit to adityagoel14/vllm-torchrun-test that referenced this pull request Jul 11, 2024

[Doc] Update description of vLLM support for CPUs (vllm-project#6003)

616ce78

(cherry picked from commit 439c845)

dtrifiro pushed a commit to opendatahub-io/vllm that referenced this pull request Jul 17, 2024

[Doc] Update description of vLLM support for CPUs (vllm-project#6003)

e84b284

xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024

[Doc] Update description of vLLM support for CPUs (vllm-project#6003)

9b4ff1b

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Doc] Update description of vLLM support for CPUs (vllm-project#6003)

fc5807d

Signed-off-by: Alvant <[email protected]>
