RunpodR
Runpod5mo ago
swousy

How do I set quantization to fp8 in the serverless settings?

If I select 'bitsandbytes' will that automatically change quant level to fp8?
Was this page helpful?