Runpod, 8mo ago
Bj9000

Selecting an HF quant

Hi, I'm using vLLM serverless. Is there a way to choose a specific quant when pointing at a Hugging Face GGUF model directory URL?