-
Notifications
You must be signed in to change notification settings - Fork 562
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[0.11.0][HybridKV] Support KV sharing in mambaspec and fullattnspec
#4210
opened Nov 14, 2025 by
MengqingCao
Loading…
[long seq feat]GQA support long-prefill-token-threshold
#4209
opened Nov 14, 2025 by
Delphine-Nic
Loading…
[v0.11.0-dev][misc]change default capture size for Qwen3-MoE when using full dp
module:core
#4205
opened Nov 14, 2025 by
Angazenn
Loading…
[Cherry-pick][0.11.0] Adapted to torch_npu.npu_fused_infer_attention_score
ready
read for review
ready-for-test
start test by label for PR
#4202
opened Nov 14, 2025 by
wxsIcey
Loading…
[main][misc]change default capture size for Qwen3-MoE when using pure dp
module:core
module:tests
ready
read for review
ready-for-test
start test by label for PR
#4199
opened Nov 14, 2025 by
Angazenn
Loading…
[MM][Perf] Replace VisionPatchEmbed with that in vllm for better performance
#4198
opened Nov 14, 2025 by
shen-shanshan
Loading…
[HybridKV] Support KV sharing in mambaspec and fullattnspec
module:tests
ready
read for review
ready-for-test
start test by label for PR
#4196
opened Nov 14, 2025 by
MengqingCao
Loading…
[Test] enhance test coverage for model_runner_v1
module:tests
#4189
opened Nov 14, 2025 by
noemotiovon
Loading…
[Feat] Flashcomm2 use o_shared linear
module:core
module:tests
#4188
opened Nov 13, 2025 by
zzhx1
Loading…
revise doc
documentation
Improvements or additions to documentation
#4185
opened Nov 13, 2025 by
Angazenn
Loading…
[bugfix] add mlapo test script
module:tests
ready
read for review
ready-for-test
start test by label for PR
#4184
opened Nov 13, 2025 by
chenjunyi-dev
Loading…
[P/D] Add readme for PD separation
documentation
Improvements or additions to documentation
#4182
opened Nov 13, 2025 by
wangxiaoteng888
Loading…
make vllm-ascend work well in developer mode
module:core
#4179
opened Nov 13, 2025 by
Ronald1995
Loading…
Encoder separation for Encode-Prefill-Decode Disaggregation
#4176
opened Nov 13, 2025 by
amy-why-3459
Loading…
Adopt inductor fusion and define quantization fusion pass
merge-conflicts
module:core
module:ops
#4168
opened Nov 13, 2025 by
wxsIcey
Loading…
[Bug_fix] fix torchair o_proj forward parameter
module:ops
#4166
opened Nov 13, 2025 by
zzhx1
Loading…
qwen3-vl Vit module enable sp and mrope fusion op
module:multimodal
#4165
opened Nov 13, 2025 by
qigangc
Loading…
[DoNotMerge]fia pa
documentation
Improvements or additions to documentation
module:tests
#4163
opened Nov 13, 2025 by
Angazenn
Loading…
[P/D] pd proxy support ipv6
documentation
Improvements or additions to documentation
#4161
opened Nov 13, 2025 by
liziyu179
Loading…
[0.11.0][ops] npu_top_k_top_p supports k and p only
#4153
opened Nov 12, 2025 by
linfeng-yuan
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.