Skip to content

[Bugfix] Revert QKVCrossParallelLinear usage in Mllama to keep BNB quantization work#14498

Merged
DarkLight1337 merged 3 commits intovllm-project:mainfrom
Isotr0py:fix-x-qkv
Mar 9, 2025
Merged

[Bugfix] Revert QKVCrossParallelLinear usage in Mllama to keep BNB quantization work#14498
DarkLight1337 merged 3 commits intovllm-project:mainfrom
Isotr0py:fix-x-qkv

Commits

Commits on Mar 8, 2025