Skip to content

Get Deployment Response

AI deployment

Properties

PropertyTypeRequiredDescription
created-atstringyesCreation time

ISO 8601 date-time.
deployment-urlstringyesDeployment inference endpoint URL
gpu-countintegeryesNumber of GPUs

Min: 1.
gpu-typestringyesGPU type family

Min length: 1.
idstringyesDeployment ID

Must be a valid UUID.
inference-engine-parametersarray[string]yesOptional extra inference engine server CLI args
inference-engine-versionstringyesAllowed values: 0.12.0, 0.15.1, 0.16.0, 0.17.0, 0.18.0, 0.18.1, 0.19.0, 0.19.1, 0.20.0, 0.20.1, 0.20.2, 0.21.0, 0.22.0, 0.22.1.

Default: 0.22.1.
modelModel reference. Provide either id or name.yes
namestringyesDeployment name

Min length: 1.
replicasintegeryesNumber of replicas (>=0)

Min: 0.
service-levelstringyesService level

Min length: 1.
statestringyesDeployment state

Allowed values: ready, creating, preparing, error, deploying, scaling, updating.
state-detailsstringyesDeployment state details
updated-atstringyesUpdate time

ISO 8601 date-time.
Last updated on