Need help with hosting a vton model on serverless

I'm trying to run the CatVTON model (https://github.com/Zheng-Chong/CatVTON, https://huggingface.co/spaces/zhengchong/CatVTON) using RunPod serverless. (Any vton model is fine as long as I can get the output)

My Goal:

A Flutter app sends an image to a Flask backend.

Flask forwards the image to a RunPod serverless instance running the model.

The processed image is sent back to Flask, then returned to Flutter.


I need a beginner-friendly guide to set this up. It's just for demo purposes, so I prefer serverless to save costs. I previously tried Google Colab Pro but ran into too many errors when trying out AI models.

I’m a bit scared about increasing costs while doing and testing this myself, and I worry I might end up paying a large amount.

Would appreciate insights on Hugging Face, RunPod serverless, and setting up this pipeline properly!
Was this page helpful?