Skip to content

[triton] Support head_dim not 2^n in triton extend and decode attention #162

[triton] Support head_dim not 2^n in triton extend and decode attention

[triton] Support head_dim not 2^n in triton extend and decode attention #162

Triggered via pull request September 9, 2024 08:30
@zhyncszhyncs
closed #1281
Status Success
Total duration 11s
Artifacts

cancel-pr-workflow.yml

on: pull_request_target
Fit to window
Zoom out
Zoom in

Annotations

1 warning
cancel
Unexpected input(s) 'pr_number', valid inputs are ['workflow_id', 'ignore_sha', 'access_token', 'all_but_latest', 'only_status']