How to cache model download from HuggingFace - Tips? - Runpod

Runpod • 15mo ago

15 replies

Using Serverless (48 GB Pro) with FlashBoot. I want to optimize for fast cold starts.

Is there a guide somewhere?

It doesn't seem to be caching the download: it re-downloads the entire model every time (and slowly).

Should I SSH into some persistent storage and download the model there, then reference that local path when loading the model from HF?
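
For anyone landing here with the same question, here is a minimal sketch of the "download once to persistent storage, then load from the local path" idea. It assumes a network volume is attached and mounted at `/runpod-volume` (check your own endpoint's mount path), and that `huggingface_hub` is installed in the worker image. The helper names `configure_hf_cache` and `fetch_model` are made up for illustration and are not part of any library.

```python
import os
from pathlib import Path


def configure_hf_cache(volume_root: str = "/runpod-volume") -> Path:
    """Point the Hugging Face cache at a directory on persistent storage.

    ASSUMPTION: volume_root is where the persistent volume is mounted.
    """
    cache_dir = Path(volume_root) / ".cache" / "huggingface"
    cache_dir.mkdir(parents=True, exist_ok=True)
    # HF_HOME is read when huggingface_hub is first imported,
    # so set it before importing huggingface_hub or transformers.
    os.environ["HF_HOME"] = str(cache_dir)
    return cache_dir


def fetch_model(repo_id: str, volume_root: str = "/runpod-volume") -> str:
    """Download a model snapshot into the persistent cache (if missing)
    and return the local snapshot directory."""
    cache_dir = configure_hf_cache(volume_root)
    # Imported lazily so the HF_HOME setting above takes effect.
    from huggingface_hub import snapshot_download

    # Files already present in cache_dir are reused instead of re-downloaded.
    return snapshot_download(repo_id=repo_id, cache_dir=str(cache_dir / "hub"))
```

In a handler you would call `fetch_model("<your-org>/<your-model>")` once at module load, outside the request handler, and pass the returned path to `from_pretrained(...)`. If the cache lives only on the container's ephemeral disk, a fresh worker will likely download everything again, which would match the behaviour described above.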
