Create Deployment Request

Deploy an AI model onto a set of GPUs.

Properties

| Property | Type | Required | Description |
| gpu-count | integer | yes | Number of GPUs (1-8) |
| gpu-type | string | yes | GPU type family (e.g., gpua5000, gpu3080ti) |
| replicas | integer | yes | Number of replicas (>=1) |
| inference-engine-parameters | array[string] | no | Optional extra inference engine server CLI args |
| model | Model Ref | no | |
| name | string | no | Deployment name |
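
Example

The properties above can be assembled into a request body along the following lines. This is a minimal sketch, not the service's client: the helper name build_deployment_request, the deployment name "example-deployment", and the "--example-flag" argument are hypothetical, and the shape of the Model Ref value is not described here, so it is passed through unchanged. The range checks simply mirror the constraints listed in the table (1-8 GPUs, at least one replica); the endpoint and authentication are not covered by this section and are left out.

```python
import json
from typing import Any, Dict, List, Optional


def build_deployment_request(
    gpu_count: int,
    gpu_type: str,
    replicas: int,
    model: Optional[Any] = None,
    name: Optional[str] = None,
    inference_engine_parameters: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Build a Create Deployment Request body (hypothetical helper)."""
    # Required properties, checked against the documented constraints.
    if not 1 <= gpu_count <= 8:
        raise ValueError("gpu-count must be between 1 and 8")
    if not gpu_type:
        raise ValueError("gpu-type is required")
    if replicas < 1:
        raise ValueError("replicas must be at least 1")

    body: Dict[str, Any] = {
        "gpu-count": gpu_count,
        "gpu-type": gpu_type,
        "replicas": replicas,
    }

    # Optional properties are included only when the caller supplies them.
    if model is not None:
        body["model"] = model  # Model Ref: structure assumed, passed as-is
    if name is not None:
        body["name"] = name
    if inference_engine_parameters is not None:
        body["inference-engine-parameters"] = list(inference_engine_parameters)
    return body


payload = build_deployment_request(
    gpu_count=2,
    gpu_type="gpua5000",
    replicas=1,
    name="example-deployment",  # hypothetical name
    inference_engine_parameters=["--example-flag"],  # hypothetical CLI arg
)
print(json.dumps(payload, indent=2))
```

Omitting the optional properties when they are not set keeps the body limited to what the caller actually specified, instead of sending explicit nulls that the service may or may not accept.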