What worked in the end went like this:
-Make new pod
-Choose "vllm:latest" template
-Go to container start command
-Replace existing model url with the one you want to use (from huggingface)
-On the same window scroll down to environment variables
-There will be one called "HF_TOKEN", in the field to the right of this, put your hugginface access key
-Start the pod
-Click on "connect" button and open the https port thing to get your api url
-Ask some AI to give you a test command for command-line or elsewhere so you can test if it works (include also the api key that was specified in the container start command)