
Please add model quantization #18

Closed
Minami-su opened this issue Aug 27, 2023 · 6 comments

Comments

@Minami-su

As the title says.

@Minami-su
Author

The quantization could be applied to just the LLM part.

@simonJJJ
Contributor

We are working on it.

@Minami-su
Author

> We are working on it.

Thank you for your work.


77h2l commented Aug 28, 2023

A10 (22 GB): out of memory. @simonJJJ


77h2l commented Aug 29, 2023

@simonJJJ Hello, and thanks for your work. May I ask what the minimum memory requirement is to deploy a Qwen-VL model? I have found that a single A10 cannot run it. Is there a way to offload the memory? Thanks.
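For example, would something like transformers' Accelerate device mapping work here? A rough sketch of what I mean (the memory caps are illustrative values, not tuned recommendations):

```python
# Sketch: let Accelerate split the model across GPU and CPU RAM.
# The max_memory caps below are illustrative, not recommendations.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen-VL-Chat",
    device_map="auto",                        # auto-place layers on available devices
    max_memory={0: "20GiB", "cpu": "48GiB"},  # spill the remainder to CPU RAM
    trust_remote_code=True,
).eval()
```

(I understand offloaded layers run on CPU, so inference would be much slower.)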

@ShuaiBai623
Collaborator

The Int4-quantized model for Qwen-VL-Chat, Qwen-VL-Chat-Int4, is now available. It needs about 12 GB of GPU memory.
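A minimal loading sketch, assuming the usual Qwen-VL transformers usage (the `chat()` and `from_list_format()` helpers come from the model's remote code, and the image URL is a placeholder):

```python
# Minimal sketch: load Qwen-VL-Chat-Int4 and run one multimodal query.
# chat() and from_list_format() are provided by the model's remote code
# (trust_remote_code=True), per the Qwen-VL README.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained(
    "Qwen/Qwen-VL-Chat-Int4", trust_remote_code=True
)
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen-VL-Chat-Int4",
    device_map="cuda",        # fits in about 12 GB of GPU memory
    trust_remote_code=True,
).eval()

# Build a query from an image plus a text prompt.
query = tokenizer.from_list_format([
    {"image": "https://example.com/demo.jpeg"},  # placeholder image URL
    {"text": "Describe this image."},
])
response, history = model.chat(tokenizer, query=query, history=None)
print(response)
```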
