Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[TRANSFORMATIONS] SDPAToPagedAttention transformation: support decompression case in the Qwen-7b-Chat pattern #28514

Conversation

CuriousPanCake
Copy link
Contributor

Qwen-7b-Chat has a decompression if the model is executed in lower
precision resulting into the model having additional Converts (i.e. FP16
to FP32).

Handle this case of optional Convert in PositionIDsReplacerQwen

Added a unit test for it.

Tickets:

Signed-off-by: Andrii Staikov [email protected]
Signed-off-by: Ivan Tikhonov [email protected]

…ression case in the Qwen-7b-Chat pattern (openvinotoolkit#28493)

Qwen-7b-Chat has a decompression if the model is executed in lower
precision resulting into the model having additional Converts (i.e. FP16
to FP32).

Handle this case of optional Convert in PositionIDsReplacerQwen

Added a unit test for it.

### Tickets:
 - *CVS-157308*

Signed-off-by: Andrii Staikov <[email protected]>
Signed-off-by: Ivan Tikhonov <[email protected]>

---------

Co-authored-by: Ivan Tikhonov <[email protected]>
@github-actions github-actions bot added the category: transformations OpenVINO Runtime library - Transformations label Jan 17, 2025
@itikhono itikhono marked this pull request as ready for review January 17, 2025 11:16
@itikhono itikhono requested a review from a team as a code owner January 17, 2025 11:16
@itikhono itikhono requested review from itikhono and removed request for a team January 17, 2025 11:16
@itikhono itikhono added this to the 2025.0 milestone Jan 17, 2025
@dorloff dorloff added this pull request to the merge queue Jan 20, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jan 20, 2025
@dorloff dorloff added this pull request to the merge queue Jan 20, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jan 20, 2025
@dorloff dorloff added this pull request to the merge queue Jan 20, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jan 20, 2025
@dorloff dorloff added this pull request to the merge queue Jan 20, 2025
Merged via the queue into openvinotoolkit:releases/2025/0 with commit d39d784 Jan 20, 2025
184 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: transformations OpenVINO Runtime library - Transformations Code Freeze
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants