Need help putting a 23 GB .pt file in a serverless environment
I have a 23 GB .pt file containing tensors for 36 attention processors for each step, and I cannot reduce its size. I need to get this file into the serverless environment to use it during inference, but I get a "no space left on device" error when building the Docker image. (I understand the image should be small, but I don't have another option.) Can someone please help?