Dedicated Inference

Dedicated Inference lets you run Large Language Models (LLMs) on Exoscale GPU infrastructure.

scale-deployment

[BETA] Scale Deployment

POST /ai/deployment/{id}/scale

Path parameters

  • id in path (required)

Request body

  • application/json
    • replicas (required) (integer): Number of replicas (>=0)

Responses

  • 200: 200
    • application/json
      • id (string): Operation ID
      • reason (string): Operation failure reason
      • reference (object): Related resource reference
      • message (string): Operation message
      • state (string): Operation status

SDK reference for scale-deployment: golang | Python | Java

CLI: exo api scale-deployment
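
The request body is a single field; a minimal Python sketch of building and validating it client-side (the helper name is illustrative, not part of any official SDK):

```python
def scale_request(deployment_id: str, replicas: int) -> tuple[str, dict]:
    """Build the path and JSON body for POST /ai/deployment/{id}/scale."""
    if replicas < 0:  # the schema requires replicas >= 0
        raise ValueError("replicas must be >= 0")
    return f"/ai/deployment/{deployment_id}/scale", {"replicas": replicas}
```

Since the schema allows a replica count of 0, the same call can scale a deployment down to zero replicas.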

create-deployment

[BETA] Create Deployment

POST /ai/deployment

Deploy a model on an inference server

Request body

  • application/json
    • model (object)
    • name (string): Deployment name
    • gpu-type (required) (string): GPU type family (e.g., gpua5000, gpu3080ti)
    • gpu-count (required) (integer): Number of GPUs (1-8)
    • replicas (required) (integer): Number of replicas (>=1)
    • inference-engine-parameters (array[string]): Optional extra inference engine server CLI args

Responses

  • 200: 200
    • application/json
      • id (string): Operation ID
      • reason (string): Operation failure reason
      • reference (object): Related resource reference
      • message (string): Operation message
      • state (string): Operation status
  • 400: 400
    • application/json
      • error (string): Error description

SDK reference for create-deployment: golang | Python | Java

CLI: exo api create-deployment
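
A sketch of assembling the request body with the documented bounds enforced client-side. The shape of the nested model object is not expanded in this reference, so the {"id": ...} form below is an assumption:

```python
def create_deployment_body(model_id, gpu_type, gpu_count, replicas,
                           name=None, engine_params=None):
    """Build the JSON body for POST /ai/deployment."""
    if not 1 <= gpu_count <= 8:
        raise ValueError("gpu-count must be between 1 and 8")
    if replicas < 1:
        raise ValueError("replicas must be >= 1")
    body = {
        "model": {"id": model_id},  # assumed shape; the model schema is not expanded here
        "gpu-type": gpu_type,       # e.g. "gpua5000" or "gpu3080ti"
        "gpu-count": gpu_count,
        "replicas": replicas,
    }
    if name is not None:
        body["name"] = name
    if engine_params:
        body["inference-engine-parameters"] = list(engine_params)
    return body
```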

get-deployment

[BETA] Get Deployment

GET /ai/deployment/{id}

Path parameters

  • id in path (required)

Responses

  • 200: 200
    • application/json
      • gpu-count (integer): Number of GPUs
      • updated-at (string): Update time
      • deployment-url (string): Deployment URL (nullable)
      • service-level (string): Service level
      • name (string): Deployment name
      • status-details (string): Deployment status details
      • gpu-type (string): GPU type family
      • status (string): Deployment status
      • id (string): Deployment ID
      • replicas (integer): Number of replicas (>=0)
      • created-at (string): Creation time
      • inference-engine-parameters (array[string]): Optional extra inference engine server CLI args
      • model (object)
  • 404: 404
    • application/json
      • error (string): Error description

SDK reference for get-deployment: golang | Python | Java

CLI: exo api get-deployment
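
Because deployment-url is nullable, a simple readiness check can key off that field rather than guessing at the status vocabulary, which this reference does not enumerate. A sketch:

```python
def deployment_ready(dep: dict) -> bool:
    """True once GET /ai/deployment/{id} reports a non-null deployment-url."""
    return dep.get("deployment-url") is not None
```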

delete-deployment

[BETA] Delete Deployment

DELETE /ai/deployment/{id}

Path parameters

  • id in path (required)

Responses

  • 200: 200
    • application/json
      • id (string): Operation ID
      • reason (string): Operation failure reason
      • reference (object): Related resource reference
      • message (string): Operation message
      • state (string): Operation status

SDK reference for delete-deployment: golang | Python | Java

CLI: exo api delete-deployment

create-model

[BETA] Create Model

POST /ai/model

Model files are downloaded from Hugging Face.

The name must be the exact model id on Hugging Face (e.g. openai/gpt-oss-120b or ggml-org/gpt-oss-120b-GGUF).

If the model is gated by a license, you must provide a Hugging Face access token for an account that has accepted the license agreement.

Request body

  • application/json
    • huggingface-token (string): Hugging Face access token (required only for gated models)
    • name (string): Model name

Responses

  • 200: 200
    • application/json
      • id (string): Operation ID
      • reason (string): Operation failure reason
      • reference (object): Related resource reference
      • message (string): Operation message
      • state (string): Operation status

SDK reference for create-model: golang | Python | Java

CLI: exo api create-model
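
The naming rule above lends itself to a small client-side check before submitting the request; a sketch (the helper is illustrative, not part of any SDK):

```python
def create_model_body(name: str, huggingface_token: str = "") -> dict:
    """Build the JSON body for POST /ai/model.

    name must be the exact Hugging Face model id, e.g. "openai/gpt-oss-120b".
    A token is only needed for gated models.
    """
    if "/" not in name:  # Hugging Face ids have the form "org/model"
        raise ValueError(f"not a full Hugging Face model id: {name!r}")
    body = {"name": name}
    if huggingface_token:
        body["huggingface-token"] = huggingface_token
    return body
```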

get-model

[BETA] Get Model

GET /ai/model/{id}

Path parameters

  • id in path (required)

Responses

  • 200: 200
    • application/json
      • id (string): Model ID
      • name (string): Model name
      • status (string): Model status
      • model-size (integer): Model size (nullable)
      • created-at (string): Creation time
      • updated-at (string): Update time
  • 404: 404
    • application/json
      • error (string): Error description

SDK reference for get-model: golang | Python | Java

CLI: exo api get-model

delete-model

[BETA] Delete Model

DELETE /ai/model/{id}

Path parameters

  • id in path (required)

Responses

  • 200: 200
    • application/json
      • id (string): Operation ID
      • reason (string): Operation failure reason
      • reference (object): Related resource reference
      • message (string): Operation message
      • state (string): Operation status
  • 412: 412
    • application/json
      • deployments (array[string]): Deployments still using this model

SDK reference for delete-model: golang | Python | Java

CLI: exo api delete-model
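
The 412 response is the interesting case: the model is still referenced by one or more deployments, which must be deleted first. A sketch of handling the two documented outcomes (the helper is illustrative):

```python
def check_delete_model_response(status: int, payload: dict) -> str:
    """Interpret DELETE /ai/model/{id} responses per the reference above."""
    if status == 200:
        # the response describes an asynchronous operation (id, state, ...)
        return f"operation {payload['id']}: {payload['state']}"
    if status == 412:
        # precondition failed: these deployments still use the model
        raise RuntimeError("model still in use by: " + ", ".join(payload["deployments"]))
    raise RuntimeError(f"unexpected status {status}")
```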


Other Operations

reveal-deployment-api-key

[BETA] Reveal Deployment API Key

GET /ai/deployment/{id}/api-key

Path parameters

  • id in path (required)

Responses

  • 200: 200
    • application/json
      • api-key (string)

SDK reference for reveal-deployment-api-key: golang | Python | Java

CLI: exo api reveal-deployment-api-key
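
The revealed key authenticates requests to the deployment's own endpoint (the deployment-url returned by get-deployment). A sketch, assuming the key is passed as a standard Bearer token; verify the expected header against the product documentation:

```python
def inference_headers(api_key: str) -> dict:
    """HTTP headers for calling a deployment endpoint.

    Assumption: the deployment accepts the key as a Bearer token.
    """
    return {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
```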

get-deployment-logs

[BETA] Get Deployment Logs

GET /ai/deployment/{id}/logs

Returns logs for the deployment's vLLM inference server. Append the optional query parameter ?stream=true to request streaming (may not be supported).

Path parameters

  • id in path (required)

Responses

  • 200: 200
    • application/json
  • 400: 400
    • application/json
      • error (string): Error description
  • 404: 404
    • application/json
      • error (string): Error description
  • 500: 500
    • application/json
      • error (string): Error description

SDK reference for get-deployment-logs: golang | Python | Java

CLI: exo api get-deployment-logs
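
A sketch of building the request path, including the optional stream query parameter described above:

```python
def logs_path(deployment_id: str, stream: bool = False) -> str:
    """Path for GET /ai/deployment/{id}/logs.

    ?stream=true requests streaming, which the server may not support.
    """
    path = f"/ai/deployment/{deployment_id}/logs"
    return path + "?stream=true" if stream else path
```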

list-deployments

[BETA] List Deployments

GET /ai/deployment

Responses

SDK reference for list-deployments: golang | Python | Java

CLI: exo api list-deployments

get-inference-engine-help

[BETA] Get inference-engine Help

GET /ai/help/inference-engine-parameters

Returns the list of allowed inference engine parameters, with their descriptions, types, allowed values, and defaults.

Responses

  • 200: 200
  • 500: 500
    • application/json
      • error (string): Error description

SDK reference for get-inference-engine-help: golang | Python | Java

CLI: exo api get-inference-engine-help

list-models

[BETA] List Models

GET /ai/model

Responses

SDK reference for list-models: golang | Python | Java

CLI: exo api list-models
