[KVCache] Fix kernel dispatch based on attention kinds #18122

MasterJH5574 · 2025-07-07T19:45:24Z

This PR fixes a few kernel dispatch issues due to the recent introduction of mha_sliding as a new attention kind.

Tested on Qwen3 1.7B with MLC-LLM.

This PR fixes a few kernel dispatch issues due to the recent introduction of `mha_sliding` as a new attention kind. Tested on Qwen3 1.7B with MLC-LLM.

* [KVCache] Fix kernel dispatch based on attention kinds This PR fixes a few kernel dispatch issues due to the recent introduction of `mha_sliding` as a new attention kind. Tested on Qwen3 1.7B with MLC-LLM. * Fix lint --------- Co-authored-by: Yong Wu <[email protected]>

[KVCache] Fix kernel dispatch based on attention kinds

2bf7230

This PR fixes a few kernel dispatch issues due to the recent introduction of `mha_sliding` as a new attention kind. Tested on Qwen3 1.7B with MLC-LLM.

tqchen approved these changes Jul 7, 2025

View reviewed changes

Fix lint

6564cf0

yongwww approved these changes Jul 7, 2025

View reviewed changes

yongwww merged commit d9f0838 into apache:main Jul 7, 2025
13 checks passed

ysh329 mentioned this pull request Oct 24, 2025

[Release] v0.22.0 Release Candidate Notes #18391

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[KVCache] Fix kernel dispatch based on attention kinds #18122

[KVCache] Fix kernel dispatch based on attention kinds #18122

Uh oh!

MasterJH5574 commented Jul 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[KVCache] Fix kernel dispatch based on attention kinds #18122

[KVCache] Fix kernel dispatch based on attention kinds #18122

Uh oh!

Conversation

MasterJH5574 commented Jul 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants