Dedicated Inference

Dedicated Inference lets you run Large Language Models (LLMs) on Exoscale GPU infrastructure.

AI Model: Lifecycle of AI Models.
Deployment: Deployments are loaded model instances ready for inference.

Last updated on January 28, 2026