Skip to content

Pull requests: vllm-project/vllm-ascend

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Cherry-pick][0.11.0] Adapted to torch_npu.npu_fused_infer_attention_score ready read for review ready-for-test start test by label for PR
#4202 opened Nov 14, 2025 by wxsIcey Loading…
[HybridKV] Support KV sharing in mambaspec and fullattnspec module:tests ready read for review ready-for-test start test by label for PR
#4196 opened Nov 14, 2025 by MengqingCao Loading…
[Kernel] add custom moe ops for prefill
#4194 opened Nov 14, 2025 by shiro-zzzz Loading…
optimiation encoder attention
#4192 opened Nov 14, 2025 by Jeaniowang Loading…
test-lora-310p-build
#4187 opened Nov 13, 2025 by liuchenbing Loading…
revise doc documentation Improvements or additions to documentation
#4185 opened Nov 13, 2025 by Angazenn Loading…
[bugfix] add mlapo test script module:tests ready read for review ready-for-test start test by label for PR
#4184 opened Nov 13, 2025 by chenjunyi-dev Loading…
[feature] Mooncake_connector support pcp/dcp
#4183 opened Nov 13, 2025 by wangxiaochao6 Loading…
[P/D] Add readme for PD separation documentation Improvements or additions to documentation
#4182 opened Nov 13, 2025 by wangxiaoteng888 Loading…
[DoNotMerge]fia pa documentation Improvements or additions to documentation module:tests
#4163 opened Nov 13, 2025 by Angazenn Loading…
[P/D] pd proxy support ipv6 documentation Improvements or additions to documentation
#4161 opened Nov 13, 2025 by liziyu179 Loading…
[ops] npu_top_k_top_p supports k or p is None
#4154 opened Nov 12, 2025 by linfeng-yuan Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.