Hi, I'm trying to run Qwen2.5-VL-3B on my smartphone. I want to quantize the weights of the mmproj to 4 bits (e.g. Q4_0 or IQ4_NL), but only Q8_0 and F16 versions are provided on Hugging Face. Currently the Python script convert_hf_to_gguf.py cannot apply 4-bit quantization, and llama-quantize cannot be used directly either, since it does not recognize the separate CLIP architecture.
I'm wondering what the proper way to implement this is. Is there any roadmap for quantization support for ViT encoders?
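For reference, here is a rough sketch of the direction I was considering: re-quantizing the existing F16 mmproj offline with llama.cpp's gguf Python package (gguf-py). I'm not sure this is the intended path; the file names are placeholders, gguf.quants has a NumPy Q4_0 quantizer but may not cover IQ4_NL, and copying the KV metadata is only noted in a comment rather than implemented.

```python
# requantize_mmproj.py -- rough sketch, untested end to end.
# Requires the gguf Python package from llama.cpp (gguf-py).
import numpy as np
from gguf import GGUFReader, GGUFWriter, GGMLQuantizationType
from gguf.quants import quantize, dequantize

SRC = "mmproj-Qwen2.5-VL-3B-f16.gguf"   # placeholder file names
DST = "mmproj-Qwen2.5-VL-3B-q4_0.gguf"
TARGET = GGMLQuantizationType.Q4_0       # IQ4_NL may lack a NumPy quantizer

reader = GGUFReader(SRC)
writer = GGUFWriter(DST, arch="clip")    # mmproj files use the "clip" architecture

# NOTE: the original KV metadata (image size, patch size, projector type, ...)
# must be copied into the writer as well; gguf-py/scripts/gguf_new_metadata.py
# shows how to iterate reader.fields. Omitted here for brevity.

for t in reader.tensors:
    data = dequantize(t.data, t.tensor_type)  # back to float32
    # Only quantize large 2D weight matrices whose row size fits the Q4_0
    # block size (32); keep norms, biases, etc. in full precision, similar
    # to what llama-quantize does for the language-model tensors.
    if data.ndim == 2 and data.shape[-1] % 32 == 0:
        qdata = quantize(data, TARGET)
        writer.add_tensor(t.name, qdata, raw_dtype=TARGET)
    else:
        writer.add_tensor(t.name, data.astype(np.float32))

writer.write_header_to_file()
writer.write_kv_data_to_file()
writer.write_tensors_to_file()
writer.close()
```

Even if something like this produces a valid GGUF, I'm not certain the clip/mtmd loader accepts 4-bit tensor types at runtime, which is part of why I'm asking about the roadmap.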