Get Deployment Response
AI deployment
Properties
| Property | Type | Required | Description |
|---|---|---|---|
| created-at | string | yes | Creation time |
| deployment-url | string | yes | Deployment inference endpoint URL |
| gpu-count | integer | yes | Number of GPUs |
| gpu-type | string | yes | GPU type family |
| id | string | yes | Deployment ID |
| inference-engine-parameters | array[string] | yes | Optional extra inference engine server CLI arguments |
| inference-engine-version | string | yes | Inference engine version. Allowed values: 0.12.0, 0.15.1, 0.16.0, 0.17.0, 0.18.0, 0.18.1, 0.19.0, 0.19.1, 0.20.0, 0.20.1. |
| model | Model reference | yes | |
| name | string | yes | Deployment name |
| replicas | integer | yes | Number of replicas (>=0) |
| service-level | string | yes | Service level |
| state | string | yes | Deployment state. Allowed values: ready, creating, preparing, error, deploying. |
| state-details | string | yes | Deployment state details |
| updated-at | string | yes | Update time |
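
As an illustration of how a client might consume this response, the sketch below checks a decoded JSON payload against the properties listed above. It is a minimal example, not an official client: the `Deployment` class, the `parse_deployment` helper, and every value in `example` are hypothetical. The timestamp format and the empty `model` object are also assumptions, since this page does not specify them.

```python
from dataclasses import dataclass
from typing import Any

# Allowed values copied from the `state` row of the properties table.
ALLOWED_STATES = {"ready", "creating", "preparing", "error", "deploying"}

# Every property in the table is marked as required.
REQUIRED_KEYS = (
    "created-at", "deployment-url", "gpu-count", "gpu-type", "id",
    "inference-engine-parameters", "inference-engine-version", "model",
    "name", "replicas", "service-level", "state", "state-details",
    "updated-at",
)


@dataclass
class Deployment:
    """Hypothetical client-side view of a subset of the response."""
    id: str
    name: str
    state: str
    deployment_url: str
    gpu_type: str
    gpu_count: int
    replicas: int
    engine_version: str
    engine_parameters: list[str]


def parse_deployment(payload: dict[str, Any]) -> Deployment:
    """Validate a decoded response body and map it to a Deployment."""
    missing = [key for key in REQUIRED_KEYS if key not in payload]
    if missing:
        raise ValueError(f"missing required properties: {missing}")
    if payload["state"] not in ALLOWED_STATES:
        raise ValueError(f"unexpected state: {payload['state']!r}")
    if payload["replicas"] < 0:
        raise ValueError("replicas must be >= 0")
    return Deployment(
        id=payload["id"],
        name=payload["name"],
        state=payload["state"],
        deployment_url=payload["deployment-url"],
        gpu_type=payload["gpu-type"],
        gpu_count=payload["gpu-count"],
        replicas=payload["replicas"],
        engine_version=payload["inference-engine-version"],
        engine_parameters=list(payload["inference-engine-parameters"]),
    )


# Hypothetical payload: all values are made up for illustration.
example = {
    "created-at": "2025-01-01T00:00:00Z",   # timestamp format is assumed
    "deployment-url": "https://example.invalid/v1",
    "gpu-count": 2,
    "gpu-type": "example-gpu",
    "id": "dep-123",
    "inference-engine-parameters": [],
    "inference-engine-version": "0.20.1",
    "model": {},                             # Model reference shape not shown here
    "name": "demo",
    "replicas": 1,
    "service-level": "example-level",
    "state": "ready",
    "state-details": "",
    "updated-at": "2025-01-01T00:05:00Z",
}

deployment = parse_deployment(example)
```

A caller polling for readiness could compare `deployment.state` against `"ready"` and inspect `state-details` in the raw payload when the state is `"error"`.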