R
Runpod3mo ago
swousy

How do I set quantization to fp8 in the serverless settings?

If I select 'bitsandbytes' will that automatically change quant level to fp8?
3 Replies
Unknown User
Unknown User3mo ago
Message Not Public
Sign In & Join Server To View
swousy
swousyOP3mo ago
What do you mean? I want to force it to use fp4/8 etc
Unknown User
Unknown User3mo ago
Message Not Public
Sign In & Join Server To View

Did you find this page helpful?