
[Model]: Support for InternVL2 #6321

Closed · Weiyun1025 opened this issue Jul 11, 2024 · 1 comment · Fixed by #6514
Labels
new model (Requests to new models)

Comments

@Weiyun1025

🚀 The feature, motivation and pitch

InternVL2 is currently the most powerful open-source Multimodal Large Language Model (MLLM). The InternVL2 family includes models ranging from a 2B model, suitable for edge devices, to a 108B model, which is significantly more powerful. With larger-scale language models, InternVL2-Pro demonstrates outstanding multimodal understanding capabilities, matching the performance of commercial closed-source models across various benchmarks.

Given the significant potential of InternVL2, we believe that integrating it with vLLM would greatly benefit both the vLLM community and users of this model. We kindly request your assistance in enabling the deployment of InternVL2 using the vLLM framework.

We look forward to your positive response and are eager to collaborate on this exciting endeavor.

Alternatives

No response

Additional context

Blog: https://internvl.github.io/blog/2024-07-02-InternVL-2.0/
Model Family: https://huggingface.co/collections/OpenGVLab/internvl-20-667d3961ab5eb12c7ed1463e
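
To make the request concrete, here is a minimal sketch of what InternVL2 inference through vLLM could look like once support lands. The `<image>` prompt placeholder and the `multi_modal_data` input format are assumptions modeled on vLLM's existing vision-language models, not a confirmed interface:

```python
# Hypothetical usage sketch for InternVL2 via vLLM, assuming support
# follows the pattern of vLLM's existing vision-language models.
from PIL import Image
from vllm import LLM, SamplingParams

llm = LLM(
    model="OpenGVLab/InternVL2-40B",  # any size in the family
    trust_remote_code=True,           # InternVL2 ships custom modeling code
)

outputs = llm.generate(
    {
        # "<image>" placeholder is an assumption about the prompt format
        "prompt": "<image>\nDescribe this image in detail.",
        "multi_modal_data": {"image": Image.open("example.jpg")},
    },
    SamplingParams(temperature=0.2, max_tokens=256),
)
print(outputs[0].outputs[0].text)
```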

@DarkLight1337 added the new model (Requests to new models) label and removed the feature request label on Jul 11, 2024
@DarkLight1337 changed the title from [Feature]: Suport for InternVL2 to [Model]: Support for InternVL2 on Jul 11, 2024
@ywang96 (Member) commented Jul 14, 2024

Hey @Weiyun1025! Thanks for opening this issue. I took a brief look at the model repo https://huggingface.co/OpenGVLab/InternVL2-40B/tree/main, and it seems that supporting this model should be fairly straightforward (similar to what we did with Phi-3-vision).

Are you planning to open a pull request for this? If so, feel free to look at the other vision-language model implementations in vLLM, and let us know if you run into any issues. We're happy to help you get this model supported.

If you cannot make a pull request, I'll see whether I have the bandwidth to make a PR for this myself. Feel free to check out #4194 for the full roadmap around multi-modality.

Thanks!
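
For reference, a contributed implementation would ultimately be wired into vLLM's model registry so that the architecture name in the Hugging Face config (`InternVLChatModel` for this family) resolves to the vLLM model class. A minimal, hypothetical sketch, where the implementation module and class name are placeholders:

```python
# Hypothetical sketch: registering an out-of-tree InternVL2 implementation
# with vLLM's model registry.
from vllm import ModelRegistry

# Placeholder import: the actual implementation would live in vLLM's
# model directory (or an out-of-tree module) once written.
from my_internvl2_impl import InternVL2ForConditionalGeneration

ModelRegistry.register_model(
    "InternVLChatModel",  # architecture string from the HF config.json
    InternVL2ForConditionalGeneration,
)
```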
