Serve & deploy vLLM-accelerated Phi-4-mini-reasoning
vessl service create -f quickstart.yaml
) may temporarily result in unexpected errors. If this occurs, please use VESSL CLI with Python 3.12 for the time being. We are working on it./docs
to your endpoint URL:
api-test.py
script. Replace YOUR-SERVICE-ENDPOINT
with your actual endpoint and execute the command below:
vessl whoami
command to confirm if the default organization matches the one where Service exists.vessl configure --reset
command to change the default organization.ceil[current replicas * ( current CPU metric value / desired CPU metric value )]
[min, max]
range.