File tree Expand file tree Collapse file tree 2 files changed +2
-6
lines changed Expand file tree Collapse file tree 2 files changed +2
-6
lines changed Original file line number Diff line number Diff line change 1313/vllm /model_executor /guided_decoding @ mgoin @ russellb
1414/vllm /multimodal @ DarkLight1337 @ ywang96
1515/vllm /vllm_flash_attn @ LucasWilkinson
16+ /vllm /lora @ jeejeelee
1617CMakeLists.txt @ tlrmchlsmth
1718
1819# vLLM V1
@@ -40,3 +41,4 @@ CMakeLists.txt @tlrmchlsmth
4041/tests /v1 /entrypoints /llm /test_struct_output_generate.py @ mgoin @ russellb
4142/tests /v1 /structured_output @ mgoin @ russellb
4243/tests /weight_loading @ mgoin @ youkaichao
44+ /tests /lora @ jeejeelee
Original file line number Diff line number Diff line change 3737 ("yec019/fbopt-350m-8bit" , "read pre-quantized 8-bit opt model" ),
3838]
3939
40- models_pre_quant_8bit_to_test = [
41- ('meta-llama/Llama-Guard-3-8B-INT8' ,
42- 'read pre-quantized llama 8-bit model' ),
43- ("yec019/fbopt-350m-8bit" , "read pre-quantized 8-bit opt model" ),
44- ]
45-
4640
4741@pytest .mark .skipif (not is_quant_method_supported ("bitsandbytes" ),
4842 reason = 'bitsandbytes is not supported on this GPU type.' )
You can’t perform that action at this time.
0 commit comments