[New Model]: QwQ-32B #257
Update 2025.03.07: according to feedback from community users, QwQ-32B works well with the vLLM Ascend v0.7.3-dev branch!
Verified with the matching pair vllm (v0.7.3) + vllm-ascend (v0.7.3-dev).
The model to consider.
https://huggingface.co/Qwen/QwQ-32B
The closest model vllm already supports.
It appears to use the same architecture as Qwen2: https://huggingface.co/Qwen/QwQ-32B/blob/main/config.json#L3
It should work well.
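The architecture comparison above can be sketched as a quick check on the model's config.json. This is only a rough illustration: the embedded JSON is an abbreviated excerpt assumed to match what the linked config.json declares, and `SUPPORTED_ARCHITECTURES` and `supported_architectures` are hypothetical names used here, not part of vLLM or vLLM Ascend.

```python
import json

# Abbreviated, illustrative excerpt of a Hugging Face config.json.
# Assumption: the linked QwQ-32B config declares the Qwen2 architecture here.
CONFIG_JSON = """
{
  "architectures": [
    "Qwen2ForCausalLM"
  ],
  "model_type": "qwen2"
}
"""

# Hypothetical allow-list for this sketch; this is NOT vLLM's real model registry.
SUPPORTED_ARCHITECTURES = {"Qwen2ForCausalLM"}


def supported_architectures(config_text, supported=SUPPORTED_ARCHITECTURES):
    """Return the architectures declared in config_text that are in `supported`."""
    config = json.loads(config_text)
    declared = config.get("architectures", [])
    return [name for name in declared if name in supported]


matches = supported_architectures(CONFIG_JSON)
if matches:
    print("Declared architecture already supported:", ", ".join(matches))
else:
    print("No declared architecture is in the supported set.")
```

If the declared architecture is one the serving stack already handles, a new checkpoint will probably load through the existing model implementation, which is the reasoning behind expecting QwQ-32B to work.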
What's your difficulty of supporting the model you want?
No response