RunPod · 3mo ago
Adam?

is there any method to deploy bert architecture models serverlessly?

Solution:
@Adam? https://www.runpod.io/console/explore then select this...
nerdylive · 2mo ago
Hi! There is: using Hugging Face's transformers you can cache the model to some path first, then load it from that relative/absolute path. Put the model in the image, or use a network volume.
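A minimal sketch of that caching approach: save the checkpoint once to a fixed path (baked into the image or on a network volume), then load it from that path at runtime. The `/runpod-volume/models` mount point and the `bert-base-uncased` checkpoint here are illustrative assumptions, not anything RunPod mandates.

```python
# Sketch: cache a Hugging Face model to a local path once, load from it later.
# Paths and model name are assumptions for illustration.
from pathlib import Path


def cache_path(base_dir, model_name):
    # Pure helper: where a given checkpoint lives under the cache root.
    # "org/name" model ids are flattened so they stay a single directory.
    return Path(base_dir) / model_name.replace("/", "--")


def ensure_cached(base_dir="/runpod-volume/models", model_name="bert-base-uncased"):
    """Download and save the model/tokenizer if not already on disk."""
    from transformers import AutoModel, AutoTokenizer  # heavy import kept local

    target = cache_path(base_dir, model_name)
    if not target.exists():
        target.mkdir(parents=True, exist_ok=True)
        AutoTokenizer.from_pretrained(model_name).save_pretrained(target)
        AutoModel.from_pretrained(model_name).save_pretrained(target)
    return target


def load_cached(base_dir="/runpod-volume/models", model_name="bert-base-uncased"):
    """Load tokenizer and model from the local cache path (no re-download)."""
    from transformers import AutoModel, AutoTokenizer

    target = cache_path(base_dir, model_name)
    return AutoTokenizer.from_pretrained(target), AutoModel.from_pretrained(target)
```

Running `ensure_cached()` in the Dockerfile build step (or once against the network volume) means cold starts only pay the load-from-disk cost, not the download.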
Adam? · 2mo ago
Hi, thanks for the help. Is there a template for this, or should I write the handler myself?
nerdylive · 2mo ago
Yes there is, use the vLLM template
Solution
nerdylive · 2mo ago
@Adam? https://www.runpod.io/console/explore then select this...
nerdylive · 2mo ago
If you need help configuring that template, there's a setup menu after you click on it
Adam? · 2mo ago
Yeah, I have tried this
nerdylive · 2mo ago
oh wait, BERT isn't compatible with that, is it?
Adam? · 2mo ago
Exactly, it doesn't support BERT models
nerdylive · 2mo ago
Hmm, then you'll have to write a custom handler for BERT first; using the transformers pipeline works too
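A hedged sketch of what that custom handler could look like, assuming the `runpod` SDK and `transformers` are installed in the image. The `{"input": {...}}` envelope is the standard RunPod serverless event shape; the `"text"` key, the `fill-mask` task, and the `bert-base-uncased` checkpoint are assumptions for this example (a fine-tuned BERT would use a task like `text-classification` instead).

```python
# Sketch of a custom RunPod serverless handler for a BERT-family model.
# Assumes `runpod` and `transformers` are installed in the worker image.


def build_input(event):
    """Pull the input sentence out of a RunPod event payload.

    Returns (text, error): exactly one of the two is None.
    The "text" key is an assumption for this sketch.
    """
    payload = event.get("input") or {}
    text = payload.get("text")
    if not text:
        return None, {"error": "missing 'text' in input"}
    return text, None


def handler(event):
    # Lazy import so the module can be loaded (and unit-tested)
    # without the heavy dependency present.
    from transformers import pipeline

    text, err = build_input(event)
    if err:
        return err
    # fill-mask is the natural pipeline task for a plain BERT checkpoint;
    # in production, build the pipeline once at module load, not per request.
    fill = pipeline("fill-mask", model="bert-base-uncased")
    return fill(text)


if __name__ == "__main__":
    import runpod

    # Standard RunPod serverless entry point.
    runpod.serverless.start({"handler": handler})
```

Point the serverless endpoint's Docker image at this file as its entry point, and the worker will invoke `handler` once per queued request.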