How to cache model download from HuggingFace - Tips?
Usin Serverless (48gb pro) w Flashboot. Want to optimize for fast cold start
is there a guide somewhere?
it does not seem to be caching the download - it's always re-downloading the model entirely (and slowly)
should i ssh into some persistent storage & download the model there? then reference that local path in the HF model load?
is there a guide somewhere?
it does not seem to be caching the download - it's always re-downloading the model entirely (and slowly)
should i ssh into some persistent storage & download the model there? then reference that local path in the HF model load?






