Enable qwen2-vl multimodal input on v0.6.1 #43

hzjane · 2024-10-09T06:27:00Z

Refer to this issue. We need to remove the line self.rope_scaling["type"] = "default" from the transformers Qwen2-VL configuration. Using "fp8" is recommended.

pip install transformers==4.45.1
# vim /usr/local/lib/python3.11/dist-packages/transformers/models/qwen2_vl/configuration_qwen2_vl.py line 241
if self.rope_scaling["type"] == "mrope":
    #self.rope_scaling["type"] = "default"
    pass
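The manual edit above can also be scripted. The following is a minimal sketch, not part of this PR: `comment_out_rope_default` is a hypothetical helper, and it assumes the override appears in the file exactly as the single assignment shown in the snippet above.

```python
import re

# Matches an uncommented line of the form: self.rope_scaling["type"] = "default"
# (assumed layout, taken from the snippet above; not verified against other versions)
_ASSIGN = re.compile(r'^(\s*)(self\.rope_scaling\["type"\]\s*=\s*"default")\s*$')


def comment_out_rope_default(source: str) -> str:
    """Return `source` with the mrope -> default override commented out."""
    patched = []
    for line in source.splitlines():
        match = _ASSIGN.match(line)
        if match:
            indent, stmt = match.groups()
            patched.append(f"{indent}#{stmt}")
            patched.append(f"{indent}pass")  # keep the if-body non-empty
        else:
            patched.append(line)
    trailing_newline = "\n" if source.endswith("\n") else ""
    return "\n".join(patched) + trailing_newline
```

To apply it, read configuration_qwen2_vl.py from your own site-packages or dist-packages directory (the path differs per environment), pass its text through the function, and write the result back after making a backup. Running it a second time leaves the file unchanged, because the commented line no longer matches.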

Online test:

curl http://localhost:7999/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Qwen2-VL-7B-Instruct",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "What is in the image?"
          },
          {
            "type": "image_url",
            "image_url": {
              "url": "http://farm6.staticflickr.com/5268/5602445367_3504763978_z.jpg"
            }
          }
        ]
      }
    ],
    "max_tokens": 128,
    "temperature": 0.1,
    "top_p": 0.001,
    "repetition_penalty": 1.05
  }'
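For scripted testing, the same request can be sent from Python. This is a sketch using only the standard library; `build_payload` and `ask` are hypothetical helper names, and the host, port, and model name are simply copied from the curl command above.

```python
import json
import urllib.request


def build_payload(image_url, prompt="What is in the image?", repetition_penalty=1.05):
    """Build the same chat-completions body as the curl example."""
    return {
        "model": "Qwen2-VL-7B-Instruct",
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
        "max_tokens": 128,
        "temperature": 0.1,
        "top_p": 0.001,
        "repetition_penalty": repetition_penalty,
    }


def ask(base_url, payload, timeout=60):
    """POST the payload to <base_url>/v1/chat/completions and return the parsed JSON reply."""
    request = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, `ask("http://localhost:7999", build_payload(url))` returns the decoded response. Assuming the server replies in the usual OpenAI-style shape, the generated text would be under `["choices"][0]["message"]["content"]`. The `repetition_penalty` argument makes it easy to retry with a higher value.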

If you encounter duplicated output, try increasing repetition_penalty to 1.99.

xiangyuT

LGTM

Enable qwen2-vl multimodal input on v0.6.1 (#43)
* enable mrope model
* update minicpm
* update utils
* update qwen2_vl
* update
* update
* enable parallel multimodal input
* update
* remove error

guang11644331 · 2024-11-30T08:46:30Z

It works ^_^!

hzjane added 6 commits October 9, 2024 14:04

enable mrope model

f565d34

update minicpm

3c22db1

update utils

97d89f9

update qwen2_vl

f96541b

update

b199e85

update

d0a03a9

hzjane changed the title ~~Enable qwen2-vl multimodal input~~ Enable qwen2-vl multimodal input on v0.6.1 Oct 9, 2024

hzjane added 2 commits October 9, 2024 15:19

update

e34a17b

enable parallel multimodal input

559d9f2

xiangyuT approved these changes Oct 11, 2024

xiangyuT merged commit 32c883f into analytics-zoo:061_test_0924 Oct 11, 2024
