Issues: vllm-project/vllm-ascend
Issues list
[Feature]: Speculative decoding, chunked prefill, prefix caching
feature request
#289
opened Mar 10, 2025 by
jxz542189
[Bug]: NPU OOM when deploying DeepSeek R1
bug
Something isn't working
#264
opened Mar 7, 2025 by
gameofdimension
[Bug]: When running BAAI/bge-m3, the returned embedding vector is the same regardless of the input
bug
Something isn't working
#255
opened Mar 7, 2025 by
caolicaoli
[Bug]: ERROR 03-06 09:35:19 worker_base.py:572] TypeError: 'NoneType' object is not callable
bug
Something isn't working
#253
opened Mar 6, 2025 by
man-in-sky
[Bug]: After starting the OpenAI server, chatting twice raises the error MQLLMEngine terminated
bug
Something isn't working
#223
opened Mar 3, 2025 by
GenerallyCovetous
[Bug]: Qwen2.5-7B-Instruct outputs differ between vllm-ascend 0.7.1 and 0.7.3; suspected precision issue in 0.7.3
bug
Something isn't working
#221
opened Mar 3, 2025 by
new-TonyWang
[Bug]: Memory Leak or Abnormal Memory Increase When Deploying Fine-Tuned Qwen2VL-72B Model with vLLM Serve
bug
Something isn't working
#216
opened Mar 2, 2025 by
XuyaoWang
[Doc]: Add latest / stable version after first final release
documentation
Improvements or additions to documentation
#214
opened Mar 1, 2025 by
Yikun
[Bug]: TBE Subprocess Task Distribute Failure When TP>1
question
Further information is requested
#198
opened Feb 27, 2025 by
XuyaoWang