I read this comment about how to adapt vLLM implementations for SGLang. Are there any pointers on how to implement vision-language models for SGLang? I've been reading through `llava.py`, and it's not entirely clear to me what I need to change from a typical Hugging Face implementation.
Also, why is there no check for `ForwardMode.PREFILL` in the `forward` function of `LlavaLlamaForCausalLM`?