Still waiting
I've opted to deploy my first serverless GPU, I opted for a RTX A5000, and 'bigcode/starcoder'. It's been almost 40 minutes. And I'm still waiting for models to complete. How long does this normal take for a 16B model? My person GPU (4070) was able to load it within seconds.?
