Skip to content

Get Deployment Response

AI deployment

Properties

PropertyTypeRequiredDescription
created-atstringyesCreation time
deployment-urlstringyesDeployment inference endpoint URL
gpu-countintegeryesNumber of GPUs
gpu-typestringyesGPU type family
idstringyesDeployment ID
inference-engine-parametersarray[string]yesOptional extra inference engine server CLI args
inference-engine-versionstringyesAllowed values: 0.12.0, 0.15.1, 0.16.0, 0.17.0, 0.18.0, 0.18.1, 0.19.0, 0.19.1, 0.20.0, 0.20.1.
modelModel referenceyes
namestringyesDeployment name
replicasintegeryesNumber of replicas (>=0)
service-levelstringyesService level
statestringyesDeployment state

Allowed values: ready, creating, preparing, error, deploying.
state-detailsstringyesDeployment state details
updated-atstringyesUpdate time
Last updated on