Support qwenvl model for HPU #793

yingjie-han · 2025-02-07T03:30:20Z

This PR aims to support qwenvl vision infer on HPU.

Issue to solve

The function merge_multimodal_embeddings() in utils.py has dynamic problem on HPU.

Solution

Flatten the embeddings tensor , and use index_put_() to merge the multimodal embeddings in qwen.py instead of calling merge_multimodal_embeddings() in utils.py.

Test

Single image
python examples/offline_inference/vision_language.py -m qwen_vl

Multiple images
python examples/offline_inference/vision_language_multi_image.py -m qwen_vl_chat

enable qwenvl on hpu

834ee00

yingjie-han requested review from kzawora-intel, madamczykhabana, michalkuligowski, mgawarkiewicz, vivekgoe and afierka-intel as code owners February 7, 2025 03:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support qwenvl model for HPU #793

Support qwenvl model for HPU #793

yingjie-han commented Feb 7, 2025 •

edited by github-actions bot

Loading

Support qwenvl model for HPU #793

Are you sure you want to change the base?

Support qwenvl model for HPU #793

Conversation

yingjie-han commented Feb 7, 2025 • edited by github-actions bot Loading

Issue to solve

Solution

Test

yingjie-han commented Feb 7, 2025 •

edited by github-actions bot

Loading