[Neuron][Kernel] Support Longer Sequences in NKI-based Flash PagedAttention and Improve Efficiency#12921
Merged
simon-mo merged 7 commits intovllm-project:mainfrom Feb 12, 2025
Merged
Commits
Commits on Feb 7, 2025
- committed
- committed
- committed
- committed
- committed