exo dedicated-inference deployment update
Description
This command updates an AI deployment.
exo dedicated-inference deployment update ID or NAME [flags]
Options
| Option | Description |
|---|---|
| --help, -h | help for update |
| --inference-engine-parameter-help | Show inference engine parameters help |
| --inference-engine-params | Space-separated inference engine server CLI arguments (e.g., "--gpu-memory-usage=0.8 --max-tokens=4096") |
| --inference-engine-version | Inference engine version |
| --name | New deployment name |
| --zone, -z | zone |
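As an illustration of how the options above combine, the sketch below renames a deployment and passes new inference engine arguments in one call. The deployment name, new name, and zone are hypothetical placeholders; the engine arguments are the ones quoted in the table. The command is only assembled and printed here, not executed.

```shell
# Hypothetical values; replace with your own deployment, name, and zone.
DEPLOYMENT="my-deployment"
NEW_NAME="my-deployment-v2"
ZONE="ch-gva-2"

# Engine arguments are space-separated and passed as ONE quoted string.
ENGINE_PARAMS="--gpu-memory-usage=0.8 --max-tokens=4096"

# Store the command as positional parameters so each argument stays intact.
set -- exo dedicated-inference deployment update "$DEPLOYMENT" \
  --name "$NEW_NAME" \
  --inference-engine-params "$ENGINE_PARAMS" \
  --zone "$ZONE"

# Print one argument per line for review; run the command with "$@" instead.
printf '%s\n' "$@"
```

Quoting the value of --inference-engine-params matters: without the quotes, the shell would split it and the CLI would see --max-tokens=4096 as a separate argument.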
Options inherited from parent commands
| Option | Description |
|---|---|
| --config, -C | Specify an alternate config file [env EXOSCALE_CONFIG] |
| --output-format, -O | Output format (table\|json\|text), see "exo output --help" for more information |
| --output-template | Template to use if output format is "text" |
| --quiet, -Q | Quiet mode (disable non-essential command output) |
| --use-account, -A | Account to use in config file [env EXOSCALE_ACCOUNT] |
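The inherited options can be appended to the same command. The sketch below selects an account from the config file and requests JSON output; the account name, deployment name, and zone are hypothetical, and what the command prints on success is not documented here, so none is shown. As before, the command is assembled and printed rather than run.

```shell
# Hypothetical values; replace with your own account, deployment, and zone.
ACCOUNT="staging"
DEPLOYMENT="my-deployment"
ZONE="ch-gva-2"

# Command-specific options first, then options inherited from parent commands.
set -- exo dedicated-inference deployment update "$DEPLOYMENT" \
  --name "my-deployment-v2" \
  --zone "$ZONE" \
  --use-account "$ACCOUNT" \
  --output-format json

# Print one argument per line for review; run the command with "$@" instead.
printf '%s\n' "$@"
```

Per the table above, the account can also be supplied through the EXOSCALE_ACCOUNT environment variable instead of --use-account.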
Related Commands
- deployment - Manage AI deployments