[Bugfix] Revert QKVCrossParallelLinear usage in Mllama to keep BNB quantization work#14498
Merged
DarkLight1337 merged 3 commits intovllm-project:mainfrom Mar 9, 2025
Merged
[Bugfix] Revert QKVCrossParallelLinear usage in Mllama to keep BNB quantization work#14498DarkLight1337 merged 3 commits intovllm-project:mainfrom
DarkLight1337 merged 3 commits intovllm-project:mainfrom