Allow multiple cached models and API access

Hi, I love the cached models feature. Great job Runpod team.

I was wondering if you could perhaps add additional model support, for example if I wanted to also pre-load an image embedding model for multimodal embedding within the same vector space.

Additionally, it would be a great feature to have API access so that I can preload the models programmatically instead of having to use the UI.

Other than that, love the serverless product. Thanks!
Was this page helpful?