When I start a vLLM model with the default settings, the console shows:
Computed max_num_seqs (min(256, 128000 // 131072)) to be less than 1. Setting it to the minimum value of 1.
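To make the arithmetic in that message concrete, here is a minimal Python sketch of the computation as it appears in the log. The function and parameter names are hypothetical, not vLLM's own, and the reading of the numbers is an assumption based only on the message text: 256 as a cap on concurrent sequences, 128000 as a token budget, and 131072 as the tokens reserved per sequence.

```python
def computed_max_num_seqs(seq_cap: int, token_budget: int, tokens_per_seq: int) -> int:
    """Reproduce min(seq_cap, token_budget // tokens_per_seq), clamped to at least 1.

    All names are illustrative; they only mirror the expression in the log line.
    """
    computed = min(seq_cap, token_budget // tokens_per_seq)
    # The log says values below 1 are replaced by the minimum value of 1.
    return max(computed, 1)


# Values taken from the log message.
print(128000 // 131072)                            # 0, floor division
print(computed_max_num_seqs(256, 128000, 131072))  # 1, after clamping

# Hypothetical smaller per-sequence size, for comparison only.
print(computed_max_num_seqs(256, 128000, 16384))   # 7
```

Under this reading, the per-sequence figure (131072) exceeds the budget (128000), so the floor division yields 0 and the value is clamped to 1. Either a smaller per-sequence figure or a larger budget would raise the result.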
I think some setting has increased the footprint of input videos or images compared to Qwen 2 VL. How can I reduce it to speed up the deployment of my vLLM model?