The model to consider.
Hi, may I ask whether this model (HuggingFaceM4/siglip-so400m-14-980-flash-attn2-navit) is already supported in vLLM? The most closely related implemented model seems to be Idefics2VisionTransformer, but I am having trouble loading the weights.
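To make the loading problem easier to pin down, here is a minimal sketch of how the checkpoint's tensor names could be compared with the parameter names that the target module expects. The helper `diff_weight_names`, the `strip_prefix` argument, and the example names are all hypothetical and only for illustration; in practice the two name lists would presumably come from the checkpoint file (for example the keys of a safetensors file) and from `named_parameters()` on the instantiated module.

```python
def diff_weight_names(checkpoint_keys, model_param_names, strip_prefix=""):
    """Report which parameter names differ between a checkpoint and a module.

    checkpoint_keys: iterable of tensor names found in the checkpoint.
    model_param_names: iterable of parameter names the module expects.
    strip_prefix: optional leading prefix to drop from checkpoint keys
        (an assumption; whether one is needed depends on the checkpoint).
    """
    normalized = {
        key[len(strip_prefix):] if strip_prefix and key.startswith(strip_prefix) else key
        for key in checkpoint_keys
    }
    expected = set(model_param_names)
    return {
        # Names the module expects but the checkpoint does not provide.
        "missing_in_checkpoint": sorted(expected - normalized),
        # Names the checkpoint provides but the module does not define.
        "unexpected_in_checkpoint": sorted(normalized - expected),
    }


if __name__ == "__main__":
    # Illustrative names only, not taken from the real checkpoint.
    report = diff_weight_names(
        checkpoint_keys=["vision_model.a.weight", "vision_model.b.weight"],
        model_param_names=["a.weight", "c.weight"],
        strip_prefix="vision_model.",
    )
    for kind, names in report.items():
        print(kind, names)
```

A report like this would show whether the failure is a naming or prefix mismatch, or whether the two architectures really define different parameters.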
The closest model vllm already supports.
Idefics2VisionTransformer
What's your difficulty of supporting the model you want?
No response
Before submitting a new issue...