TheBloke/goliath-120b-GPTQ with RunPod Kobold AI United

Hi! I got goliath-120b-GPTQ running with 3 A40. But the text generation speed is extremely slow. What is the best option for GPU config and settings to run this model? Thank you in advance!
6 Replies
🆁🅰🅻🅻🅴
It only uses one GPU?
Unknown User
Unknown User17mo ago
Message Not Public
Sign In & Join Server To View
🆁🅰🅻🅻🅴
3 x A40 Why does it only uses 1 gpu for me? Some setting i missed?
Unknown User
Unknown User17mo ago
Message Not Public
Sign In & Join Server To View

Did you find this page helpful?