Commit fb22be5

[moe][quant] add weight name case for offset (#15515)
Signed-off-by: Mengqing Cao <[email protected]>
1 parent 7f301dd commit fb22be5

1 file changed: +3 -2 lines changed
  • vllm/model_executor/layers/fused_moe

vllm/model_executor/layers/fused_moe/layer.py

Lines changed: 3 additions & 2 deletions
@@ -699,8 +699,9 @@ def weight_loader(self, param: torch.nn.Parameter,
                                  tp_rank=self.tp_rank)
             return
 
-        # Case weight scales and zero_points
-        if ("scale" in weight_name or "zero" in weight_name):
+        # Case weight scales, zero_points and offset
+        if ("scale" in weight_name or "zero" in weight_name
+                or "offset" in weight_name):
             # load the weight scales and zp based on the quantization scheme
             # supported weight scales/zp can be found in
             # FusedMoeWeightScaleSupported
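
For reviewers, the effect of the widened condition can be sketched in isolation. This is a minimal standalone illustration, not code from `layer.py`: the helper names `matches_before` and `matches_after` and the sample weight names are hypothetical, chosen only to show which names now reach the scale/zero-point loading branch.

```python
def matches_before(weight_name: str) -> bool:
    # Substring check as it stood before this commit.
    return "scale" in weight_name or "zero" in weight_name


def matches_after(weight_name: str) -> bool:
    # Substring check after this commit: "offset" is also accepted.
    return ("scale" in weight_name or "zero" in weight_name
            or "offset" in weight_name)


# Hypothetical parameter names, for illustration only.
sample_names = [
    "w13_weight_scale",
    "w2_weight_zero_point",
    "w13_weight_offset",
    "w2_weight",
]

for name in sample_names:
    print(f"{name}: before={matches_before(name)}, after={matches_after(name)}")
```

Because the check is a plain substring match, any weight name containing `offset` now takes this branch, so the change assumes no unrelated parameter name contains that substring.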
